Add option to set ORT_DISABLE_ALL as optimization #195

casassg · 2023-06-21T00:37:24Z

Please let me know if it would be better to use a different value instead of 2.

This change allows users to set optimization->graph->level to DISABLE_ALL by passing the value 2. The motivation is that, especially when a single Triton server serves multiple models, we prefer users to optimize their models offline rather than online. Disabling optimization at load time greatly speeds up model loading in the Triton onnxruntime backend.

This is my first contribution here, so please let me know if I should change anything or add more tests. Given that the change is only 2 lines of code, I didn't initially add any.
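For readers following the thread, the mapping described above can be sketched roughly as follows. This is a minimal illustration, not the backend's actual code: `OptLevel` is a local stand-in named after ONNX Runtime's `GraphOptimizationLevel` values, `LevelFromConfig` is a hypothetical helper, and only the `2` to `ORT_DISABLE_ALL` case comes from this PR. The handling of `-1`, `1`, and the default is an assumption. In a model's `config.pbtxt` the setting would presumably look like `optimization { graph { level: 2 } }`.

```cpp
#include <cstdint>

// Local stand-in for ONNX Runtime's GraphOptimizationLevel, so the sketch
// compiles without the ONNX Runtime headers.
enum class OptLevel {
  DisableAll,      // ORT_DISABLE_ALL
  EnableBasic,     // ORT_ENABLE_BASIC
  EnableExtended,  // ORT_ENABLE_EXTENDED
  EnableAll        // ORT_ENABLE_ALL
};

// Hypothetical helper: maps the integer read from optimization->graph->level
// in the model config to an optimization level.
inline OptLevel LevelFromConfig(std::int64_t graph_level) {
  switch (graph_level) {
    case 2:
      return OptLevel::DisableAll;      // the value proposed in this PR
    case -1:
      return OptLevel::EnableBasic;     // assumption, not stated in the PR
    case 1:
      return OptLevel::EnableExtended;  // assumption, not stated in the PR
    default:
      return OptLevel::EnableAll;       // assumption: any other value keeps full optimization
  }
}
```

With a mapping like this, a model that was already optimized offline can skip graph optimization at load time by setting the level to 2.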

dyastremsky · 2023-07-13T22:38:51Z

If you haven't already, would you mind following the contribution page here to submit a signed CLA?

Also, can you please update the README documentation for level to reflect this option?

CC: @tanmayv25

dyastremsky · 2023-07-31T17:54:13Z

@casassg Following up about the CLA and README updates.

oandreeva-nv · 2023-08-16T19:00:20Z

Hi @casassg, thank you for this PR. We are looking forward to merging it soon. However, we do need a signed CLA and a short entry in the documentation about this optimization level (for greater visibility of this feature). Would you have time in the near future to take care of these 2 items?

dyastremsky · 2023-09-20T19:32:15Z

@casassg Following up about the CLA. Are you able to submit it? If not, we can merge these in a separate PR on our own.

Also, I am reading this a little more closely and think it would be good to add a comment about this option in the README under Model Config Options.

nv-kmcgill53 · 2023-10-25T20:56:36Z

The Triton project has accepted the CLA for Block Inc. This PR should no longer be blocked on the CLA.

dyastremsky · 2023-10-25T21:48:44Z

Fantastic, thank you Gerard and Kyle! @casassg, would you be able to add the minor documentation updates so we can get this merged? We can re-approve after.

casassg · 2023-12-18T22:24:29Z

@dyastremsky added a small comment (sorry this got deprioritized in my stack and buried down the line)

Allow ORT_DISABLE_ALL value

94c0359

pranavsharma previously approved these changes Jun 26, 2023

oandreeva-nv mentioned this pull request Aug 16, 2023

Newer versions of triton server have a consirable slowdown in start time triton-inference-server/server#6014

Open

update readme

efa593b

casassg dismissed pranavsharma’s stale review via efa593b December 18, 2023 22:23

tanmayv25 approved these changes Dec 18, 2023

tanmayv25 merged commit fa05686 into triton-inference-server:main Dec 18, 2023
3 checks passed

casassg deleted the patch-1 branch January 16, 2024 18:04

casassg mentioned this pull request Jan 17, 2024

BackendMemory::Create must release all errors triton-inference-server/backend#95

Merged
