-
Notifications
You must be signed in to change notification settings - Fork 24
Update winml version, add llm trt rtx recipes. #335
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Conversation
"builder": { "type": "ModelBuilder", "precision": "fp16", "search": { "max_length": 2048, "min_length": 0 } }, | ||
"metadata": { | ||
"type": "ModelBuilder", | ||
"precision": "fp16", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
shall we do this? as amol said today, TRT should use QDQ
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's wait for nvidia
@@ -0,0 +1,42 @@ | |||
{ | |||
"name": "Convert to NVIDIA GPU (TRT RTX)", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Convert to NVIDIA TRT for RTX
" site_packages_path = Path(str(metadata.distribution('winrt-runtime').locate_file('')))\n", | ||
" dll_path = site_packages_path / 'winrt' / 'msvcp140.dll'\n", | ||
" if dll_path.exists():\n", | ||
" dll_path.unlink()\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
could we unload it once we get ep and before register instead of deletion?
No description provided.