
Update range of gpu arch #23309

Open

yf711 wants to merge 3 commits into main
Conversation

@yf711 (Contributor) commented Jan 9, 2025

Description

Remove deprecated GPU archs to reduce the NuGet/Python package size (the latest TensorRT supports sm75/Turing and newer archs).

Package size measured on the pkg CI (Python-cuda12 and Nuget-cuda12):

|        | Python-cuda12 (Linux) | Python-cuda12 (Win) | Nuget-cuda12 (Linux) | Nuget-cuda12 (Win) |
|--------|-----------------------|---------------------|----------------------|--------------------|
| Before | 279 MB                | 267 MB              | 247 MB               | 235 MB             |
| After  | 174 MB                | 162 MB              | 168 MB               | 156 MB             |
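Since the trimmed arch list means the slimmer packages only carry code for sm75 (Turing) and newer, a consumer can quickly check whether the local GPU clears that floor. Below is a minimal sketch using the standard CUDA runtime API; the 7.5 minimum is taken from this description and the file name is only illustrative:

```cuda
// check_arch.cu -- minimal sketch: check that the local GPU meets the sm75
// (Turing) floor implied by the trimmed arch list in this PR.
// Build: nvcc check_arch.cu -o check_arch
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int device_count = 0;
    if (cudaGetDeviceCount(&device_count) != cudaSuccess || device_count == 0) {
        std::printf("No CUDA device visible.\n");
        return 1;
    }
    for (int i = 0; i < device_count; ++i) {
        cudaDeviceProp prop{};
        cudaGetDeviceProperties(&prop, i);
        int sm = prop.major * 10 + prop.minor;  // e.g. 75 = Turing, 90 = Hopper
        std::printf("Device %d: %s (sm%d) -> %s\n", i, prop.name, sm,
                    sm >= 75 ? "covered by the trimmed packages"
                             : "needs a build that still includes older archs");
    }
    return 0;
}
```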

Motivation and Context

snnn previously approved these changes Jan 9, 2025
@tianleiwu (Contributor)

If we drop the older archs, shall we also drop the ORT package for CUDA 11.8 in the next release?

@snnn (Member) commented Jan 9, 2025

> If we drop the older archs, shall we also drop the ORT package for CUDA 11.8 in the next release?

I highly recommend doing so. Now we only have two people working on build pipelines. We should focus more on the main targets.

@yf711 requested a review from jywu-msft on January 10, 2025 at 00:32
@snnn (Member) commented Jan 10, 2025

/azp run Win_TRT_Minimal_CUDA_Test_CI


Azure Pipelines successfully started running 1 pipeline(s).

snnn previously approved these changes Jan 10, 2025
@yf711 (Contributor, Author) commented Jan 11, 2025

After testing, adding sm90 to the build arch list causes issues for the CUDA 11.8 + cuDNN 8 alt package build on Windows, likely because cuDNN 8 is deprecated by Blackwell. The CUDA 12 package build is not affected.

To support sm90, we can either support CUDA 12 only, or update the current CUDA 11.8 environment to cuDNN 9.
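As a side note, here is a minimal environment-check sketch (assuming a CUDA toolchain with the cuDNN headers and libraries installed; the file name and the CUDA 12 + cuDNN 9 pairing follow this discussion and are not part of the PR) that reports which toolchain a build is actually compiled and run against:

```cuda
// env_check.cu -- minimal sketch: report compile-time vs. runtime CUDA/cuDNN
// versions, to confirm the environment pairs CUDA 12.x with cuDNN 9.x.
// Build: nvcc env_check.cu -lcudnn -o env_check
#include <cstdio>
#include <cuda_runtime.h>
#include <cudnn.h>

int main() {
    // Versions the binary was compiled against (from the headers).
    std::printf("Compile-time: CUDA %d, cuDNN %d.%d.%d\n",
                CUDART_VERSION, CUDNN_MAJOR, CUDNN_MINOR, CUDNN_PATCHLEVEL);

    // Versions of the libraries actually loaded at runtime.
    int cuda_rt = 0;
    cudaRuntimeGetVersion(&cuda_rt);
    std::printf("Runtime: CUDA %d, cuDNN %zu\n", cuda_rt, cudnnGetVersion());

    // CUDART_VERSION encodes 1000*major + 10*minor, e.g. 12040 for CUDA 12.4.
    const bool ok = (CUDART_VERSION >= 12000) && (CUDNN_MAJOR >= 9);
    std::printf("%s\n", ok ? "CUDA 12 + cuDNN 9: consistent with an sm90 build"
                           : "Toolchain does not match the CUDA 12 + cuDNN 9 combination");
    return ok ? 0 : 1;
}
```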

@snnn (Member) commented Jan 11, 2025

CUDA 11.8 with cudnn9 doesn't work. I tried.

I hit the following compilation error when compiling cudnn_flash_attention.cu

/build/Release/_deps/cudnn_frontend-src/include/cudnn_frontend/graph_interface.h:519:27:   required from here
/build/Release/_deps/cudnn_frontend-src/include/cudnn_frontend/thirdparty/nlohmann/json.hpp:9132:68: error: static assertion failed: Missing/invalid function: bool boolean(bool)
 9132 |     static_assert(is_detected_exact<bool, boolean_function_t, SAX>::value,

Therefore, I suggest giving up on that.
