
What is execution context memory? #2698

Open
wxsms opened this issue Jan 16, 2025 · 1 comment
Labels
triaged Issue has been triaged by maintainers

Comments


wxsms commented Jan 16, 2025

We found that with the same model, the same inference image, and the same configuration, running the model on different devices results in different execution context memory, and the difference can be very large. For example, on a 4090 it takes 2304 MiB, while on an A10 it takes 6384 MiB. Is this expected or configurable?

nv-guomingz (Collaborator) commented

I'd say it's expected but not configurable (it depends on your device hardware).

The value refers to memory that is mainly used for temporary storage required by layer implementations.
Different hardware has its own optimizations, so a difference in this value is expected.
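For reference, TensorRT exposes this per-context device memory requirement on a deserialized engine, so you can inspect it directly on each device. Below is a minimal sketch using the TensorRT Python API; the engine path `model.engine` is a placeholder for your own serialized engine file.

```python
import tensorrt as trt

# Placeholder path to a serialized TensorRT engine; replace with your own.
ENGINE_PATH = "model.engine"

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

with open(ENGINE_PATH, "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

# device_memory_size reports the scratch (activation/workspace) memory, in
# bytes, that an execution context for this engine will allocate. Because
# the builder picks different kernel tactics per GPU architecture, this
# value can differ between, e.g., a 4090 and an A10.
print(f"Execution context memory: {engine.device_memory_size / (1 << 20):.0f} MiB")
```

Comparing this number for engines built on each device should reproduce the gap you observed.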

@nv-guomingz nv-guomingz added the triaged Issue has been triaged by maintainers label Jan 21, 2025