Update winml version, add llm trt rtx recipes. #335

chinazhangchao · 2025-07-17T09:39:42Z

No description provided.

xieofxie · 2025-07-18T08:33:42Z

model_lab_configs/huggingface/Qwen/Qwen2.5-1.5B-Instruct/1/qwen2_5_trtrtx_config.json

+        "builder": { "type": "ModelBuilder", "precision": "fp16", "search": { "max_length": 2048, "min_length": 0 } },
+        "metadata": {
+            "type": "ModelBuilder",
+            "precision": "fp16",


shall we do this? as amol said today, TRT should use QDQ

let's wait for nvidia

xieofxie · 2025-07-29T09:01:42Z

model_lab_configs/huggingface/Qwen/Qwen2.5-1.5B-Instruct/1/qwen2_5_trtrtx_config.json.config

@@ -0,0 +1,42 @@
+{
+    "name": "Convert to NVIDIA GPU (TRT RTX)",


Convert to NVIDIA TRT for RTX

xieofxie · 2025-08-01T06:16:05Z

model_lab_configs/huggingface/Intel/bert-base-uncased-mrpc/1/inference_sample.ipynb

+    "    site_packages_path = Path(str(metadata.distribution('winrt-runtime').locate_file('')))\n",
+    "    dll_path = site_packages_path / 'winrt' / 'msvcp140.dll'\n",
+    "    if dll_path.exists():\n",
+    "        dll_path.unlink()\n",


could we unload it once we get ep and before register instead of deletion?

chinazhangchao added 3 commits July 17, 2025 17:08

upgrade winml

b43f230

add recipe

539a529

fix inference sample

15632e4

chinazhangchao changed the title ~~Chao/rtx~~ Update winml version, add llm trt rtx recipes. Jul 17, 2025

chinazhangchao added 3 commits July 18, 2025 09:06

use olive main

b31f72e

change requirements

d8cd77c

change requirements

8587543

xieofxie reviewed Jul 18, 2025

View reviewed changes

chinazhangchao added 8 commits July 21, 2025 18:02

add commit id

d9dfe24

change llm trt rtx recipe

dcd7c35

format

26b4a12

merge dev

72a853b

sanitize

859d58b

fix lf

288f571

fix lf

7372a63

change model list

a923298

xieofxie reviewed Jul 29, 2025

View reviewed changes

change inference sample

f2d7b4b

xieofxie reviewed Aug 1, 2025

View reviewed changes

chinazhangchao added 10 commits August 1, 2025 15:15

change inference sample

4591b7f

fix inference sample

e7dec0d

fix optimum version

4cc5729

fix clip recipe

eba7988

merge dev

d62fb51

fix name

2211113

use olive main

694d137

use winml.py

56a5089

fix eol

a9a8fe4

fix eol

2bde33b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update winml version, add llm trt rtx recipes. #335

Update winml version, add llm trt rtx recipes. #335

Uh oh!

chinazhangchao commented Jul 17, 2025

Uh oh!

xieofxie Jul 18, 2025

Uh oh!

xieofxie Jul 21, 2025

Uh oh!

xieofxie Jul 29, 2025

Uh oh!

xieofxie Aug 1, 2025

Uh oh!

Uh oh!

Update winml version, add llm trt rtx recipes. #335

Are you sure you want to change the base?

Update winml version, add llm trt rtx recipes. #335

Uh oh!

Conversation

chinazhangchao commented Jul 17, 2025

Uh oh!

xieofxie Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

xieofxie Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

xieofxie Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

xieofxie Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!