Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Blender with AMD GPU and HIP #710

Open
playday3008 opened this issue Nov 14, 2024 · 13 comments
Open

Blender with AMD GPU and HIP #710

playday3008 opened this issue Nov 14, 2024 · 13 comments

Comments

@playday3008
Copy link

playday3008 commented Nov 14, 2024

Your system information

  • Steam client version (build number or date): 1731433018
  • Distribution: Fedora 41
  • Opted into Steam client beta?: No
  • Have you checked for system updates?: Yes
  • Steam Logs: None
  • GPU: AMD

Please describe your issue in as much detail as possible:

When running Blender through Steam, it uses SteamLinuxRuntime 1.0 (scout), which causes Blender to not be able to load libamdhip64.so (dependency of Cycles Rendering Engine when using AMD GPU as render device (a.k.a. HIP).
Running it directly or with -compat-force-slr off though, works fine.
The problem is, disabling SteamLinuxRuntime globally is not a good idea I guess.

Examples
Running through Steam

image

Running through Steam with -compat-force-slr off

image

Running directly

image

Steps for reproducing this issue:

  1. Start Steam
  2. Open Blender
  3. Go to Edit -> Preferences... -> System -> Cycles Render Devices -> HIP
  4. Says: No compatible GPUs found for Cycles...

Steps to avoid this issue (kinda hack I guess):

  • Run steam with -compat-force-slr off (Disables SteamLinuxRuntime globally)

    OR

  • Run Blender directly, without Steam, eg. bash ~/.local/share/Steam/steamapps/common/Blender/blender-launcher in your terminal

@kisak-valve kisak-valve transferred this issue from ValveSoftware/steam-for-linux Nov 14, 2024
@smcv
Copy link
Contributor

smcv commented Nov 14, 2024

It is expected that apps/games running in a Steam Linux Runtime container cannot load arbitrary libraries from the host system, other than accelerated graphics drivers (OpenGL, Vulkan, VA-API, ...) which have a series of special cases to make them work.

Using the GPU for general-purpose computation (HIP, CUDA, OpenCL, ...) is not exactly graphics, but it's working in approximately the same space, so perhaps ideally the container infrastructure would pick up this library from the host too. I think we currently have similar special cases for some GPU computation APIs (in particular, I think we do have CUDA), but not all of them.

I see that Blender's code to interact with HIP involves running a hipcc executable, under at least some circumstances. That executable is not going to be available in the container either, unless we take very specific steps to make it available - and that would also need to include any files that it opens at runtime, because nothing from the host system is visible in the container unless we arrange for it to be.

To know whether this is feasible, we'll need some information from AMD, or from you if you happen to know the answers:

  • does this library have a long-term-stable API and ABI?
  • where is this library installed / how can we discover it? is it expected to be in the default library search path, or are applications expected to "just know" paths like /opt/rocm/lib?
  • what dependencies does it have?
  • is it x86_64-only, or does it exist in x86_64 and i386 flavours?
  • I see that Blender's code to interact with HIP involves running a hipcc executable, under at least some circumstances. How does this executable work? How are applications expected to discover it? What dependencies does it have?
  • are the library and executable relocatable (can be installed to arbitrary places and can still work) or do they hard-code specific paths for configuration, libraries, resources?

@playday3008
Copy link
Author

I do have answers for some questions, but be aware that some answers might be applicable to Fedora only:

  1. No idea, I don't have much/long time experience with HIP

  2. On my machine, it's located in /usr/lib64/libamdhip64.so, because that's where rocm-hip-devel package is installing it.

    • Package manager on a supported distro (assumption, eg. for Debian there's a libamdhip64-dev package) will put lib into /usr/lib64 (or respective alternative (assumption, eg. for Debian lib goes into /usr/lib/x86_64-linux-gnu/libamdhip64.so)), where ldconfig for example can find it (true for Fedora, assumption for the rest of distros)
    • Manual installation (a.k.a. installation from upstream) will put lib into /opt/rocm/lib
  3. Dependency tree for rocm-hip-devel on Fedora (including hipcc)

    tree

    As we can see, it's impossible to have libamdhip64.so without hipcc installed

  4. It does not have i686 version

  5. hipcc is compiler driver (something like clang-cl for msvc), it depends on rocm-device-libs, rocminfo and clang, below is dependency tree

    hipcc dependency tree

    image

  6. file on lib says it's shared object and not pie executable, but using strace on blender reveals file access sequence after loading libamdhip64.so (in order of accessing)

    /lib64/libamdhip64.so
    /lib64/libamd_comgr.so.2
    /lib64/libhsa-runtime64.so.1
    /lib64/libnuma.so.1
    /usr/lib64/llvm18/lib/liblldELF.so.18.1
    /usr/lib64/llvm18/lib/liblldCommon.so.18.1
    /usr/lib64/llvm18/lib/libclang-cpp.so.18.1
    /usr/lib64/llvm18/lib/libLLVM.so.18.1
    /lib64/libhsakmt.so.1
    (/sys/..., /proc/..., /dev/...)
    /usr/share/libdrm/amdgpu.ids
    (/sys/..., /proc/..., /dev/...)
    ~/.local/share/Steam/steamapps/common/Blender/lib/libOpenImageDenoise_device_hip.so.2.3.0
    

Hope those answers will help somehow, but nice feature would be option for disabling SteamLinuxRuntime per app, something like

that

image

@playday3008
Copy link
Author

Also, seems like related issue from Blender repo: https://projects.blender.org/blender/blender/issues/129895

@smcv
Copy link
Contributor

smcv commented Nov 14, 2024

If libamdhip64.so.5 is following normal Linux library naming conventions, then libamdhip64.so.5 should be long-term ABI-stable, but libamdhip64.so most likely isn't: if AMD released an incompatible version with a different ABI, it would probably be named like libamdhip64.so.6, and libamdhip64.so would probably change to point to that.

@smcv
Copy link
Contributor

smcv commented Nov 14, 2024

5. hipcc is compiler driver (something like clang-cl for msvc), it depends on rocm-device-libs, rocminfo and clang, below is dependency tree

Sorry, that's probably too "big" to be reasonable to pull into the container as part of the graphics stack: we can pick up a few essential libraries like the ones needed by Mesa, but every new library we add involves another roll of the dice on whether it will set up some incompatibility that causes a crash, and sooner or later our luck will run out. So I think whatever happens here will have to involve hipcc running outside the container in some way.

nice feature would be option for disabling SteamLinuxRuntime per app

I can see the appeal of that, but that's outside the Steam Runtime team's control, and would have to come from a Steam client developer.

@playday3008
Copy link
Author

If libamdhip64.so.5 is following normal Linux library naming conventions, then libamdhip64.so.5 should be long-term ABI-stable, but libamdhip64.so most likely isn't: if AMD released an incompatible version with a different ABI, it would probably be named like libamdhip64.so.6, and libamdhip64.so would probably change to point to that.

On my system, it looks like that: /lib64/libamdhip64.so -> /lib64/libamdhip64.so.6 -> /lib64/libamdhip64.so.6.2.41134 (arrows means symlink), and ldconfig have both, ...so.6 and ...so

@playday3008
Copy link
Author

playday3008 commented Nov 14, 2024

nice feature would be option for disabling SteamLinuxRuntime per app

I can see the appeal of that, but that's outside the Steam Runtime team's control, and would have to come from a Steam client developer.

Honestly, that's the only optimal solution I see, the suboptimal but still good one would be, putting some voodoo magic into launch options, so it would escape/go through/disable that container and run natively, something like STEAM_LINUX_RUNTIME=0 %command%, etc.

Or maybe there's already some environment variable or something else that does that, and I'm not aware of it (STEAM_RUNTIME=0 steam does not count, but will it work though?)

@smcv
Copy link
Contributor

smcv commented Nov 14, 2024

In the short term, the best/least-bad workaround is steam -compat-force-slr off (which returns to the pre-November behaviour), together with optionally marking native Linux apps/games that run correctly in SLR (i.e. not Blender, in your case) as Properties → Compatibility → Force the use of... → Steam Linux Runtime 1.0.

STEAM_RUNTIME=0 steam does not count, but will it work though?

No, that won't work. Disabling the use of the older LD_LIBRARY_PATH-based runtime to run the Steam client itself (which is what STEAM_RUNTIME=0 does) is specifically not supported or supportable, and it doesn't have any effect on whether things get run in the Steam Linux Runtime container or not.

For native Linux apps/games that target the scout ABI (which means most native Linux apps/games right now, including Blender), steam -compat-force-slr off is currently the only way to disable the use of the SLR container.

For completeness:

For native Linux apps/games that target the newer sniper ABI, the SLR container is required and cannot be disabled.

For Windows apps/games running under Proton 5.13 or newer, the SLR container is required and cannot be disabled (except by choosing to use a very old old version of Proton instead).

For Windows apps/games running under Proton 5.0 or older, the SLR container is not used and cannot be enabled (except by choosing to use a newer version of Proton instead).

putting some voodoo magic into launch options, so it would escape/go through/disable that container and run natively

This is not something that is supported or supportable. (A developer-oriented mechanism does exist, but should not be recommended to end users as a workaround.)

@playday3008
Copy link
Author

In the short term, the best/least-bad workaround is steam -compat-force-slr off (which returns to the pre-November behaviour), together with optionally marking native Linux apps/games that run correctly in SLR (i.e. not Blender, in your case) as Properties → Compatibility → Force the use of... → Steam Linux Runtime 1.0.

So instead of explicitly disabling SLR per app, we can disable SLR completely and then explicitly enable it per app (if needed, but I guess better enable for all apps and then disable for those one that doesn't work well with it). Why I didn't think of that, that's an option, the junky one, but definitely an option, big thanks for deep dive explanations and for single (I hope for now) solution.

@smcv
Copy link
Contributor

smcv commented Nov 15, 2024

I guess better enable for all apps and then disable for those one that doesn't work well with it

If there was a supported/supportable way to achieve this, then yes I think that would be better, but currently there is not.

A future Steam Client release might add a way to do this (but this is not under my control, so no guarantees). If it does, I'll try to update this issue to mention it.

@DarkDefender
Copy link

DarkDefender commented Nov 15, 2024

I just wanted to chime in here as well. I'm the current Linux maintainer for Blender.

As you guys already figured out it is not really viable for Steam to ship all the needed HIP libraries.
Currently HIP is changing quite a bit between versions and the same is true for Intel's "oneAPI" GPU backend as well.

We (the Blender developers) do not really see a good way for Blender to work inside of the new steam sandbox.
Besides just general hardware interface issues, it also leads to us having a harder time to provide access to the host machines mounted volumes etc:
https://projects.blender.org/blender/blender/issues/129902

We hope that we can reach out and discuss the alternatives with Valve directly and come up with a solution.

@smcv
Copy link
Contributor

smcv commented Nov 28, 2024

A future Steam Client release might add a way to [switch between the Steam Linux Runtime 1.0 container environment and the legacy LD_LIBRARY_PATH-based runtime on a per-title basis]

This feature was added in the 2024-11-27 Steam Client beta update (it didn't make it into the release notes, but it's there). This is a less intrusive workaround than steam -compat-force-slr off because it only affects individual games.

To force use of the legacy LD_LIBRARY_PATH-based runtime, you can use Properties → Compatibility → Force the use of... → Legacy runtime 1.0. This is the same as the old default behaviour (before November 2024), and is still the default if you ran Steam as steam -compat-force-slr off. Apps and games that run successfully in this mode might still regress as a result of a routine OS update.

To force use of the container runtime, you can use Properties → Compatibility → Force the use of... → Steam Linux Runtime 1.0 (scout). This is the default if you ran Steam with no special options. A few apps and games won't work in this mode, but the container runtime means that those that do work in this mode are much more likely to continue to work in years to come.

(The Compatibility tab also allows you to force Steam to run the Windows version of an app/game instead, via Proton, but I suspect that this is unlikely to be practically useful for Blender!)

A future version of Steam might also provide a way for app/game vendors to select the legacy LD_LIBRARY_PATH-based runtime on a per-title basis

As far as I know, this is not yet available. I think this is most likely to be added later, after the changes in the 2024-11-27 Steam Client beta get re-released as stable (until then, it would only work for beta client users).

@DarkDefender
Copy link

Thanks for the update! Good to know that user can now work around the issue themselves in the beta release! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants