-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update range of gpu arch #23309
base: main
Are you sure you want to change the base?
Update range of gpu arch #23309
Conversation
If we drop older arch, shall we also drop ort package for cuda 11.8 in next release? |
I highly recommend doing so. Now we only have two people working on build pipelines. We should focus more on the main targets. |
/azp run Win_TRT_Minimal_CUDA_Test_CI |
Azure Pipelines successfully started running 1 pipeline(s). |
After testing, adding sm90 to build arch list is causing issues to cuda 11.8+cudnn8 alt pkg build on windows, To support sm90, we can choose to support cuda12 only, or we might need to update current cuda 11.8 env with cudnn9 |
CUDA 11.8 with cudnn9 doesn't work. I tried. I hit the following compilation error when compiling cudnn_flash_attention.cu
Therefore, I suggest giving up on that. |
Description
Remove deprecated gpu arch and reduce nuget/python package size (latest TRT supports sm75 Turing and newer arch)
Motivation and Context