Why are the timestep and image_latents (collectively referred to as x) dropped when the kv_cache is recorded, so that only the condition c is kept? As a result, at the first time step of inference x does not perform self-attention over x; it only attends to c, which is effectively cross-attention from x to c.
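To make the concern concrete, here is a toy sketch in plain Python. It is not OmniGen's code: the token vectors, the identity key/value projections, and the helper names `softmax` and `attend` are all assumptions made for illustration. It only contrasts an x query attending over cached c keys alone with the same query attending over c plus x.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    # Single-head scaled dot-product attention for one query vector.
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    out = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return out, weights

# Hypothetical tokens: two condition tokens (c) and two x tokens
# (standing in for the timestep token and one image latent).
c_tokens = [[1.0, 0.0], [0.0, 1.0]]
x_tokens = [[1.0, 1.0], [2.0, 0.0]]

# Assumption: keys and values are the token vectors themselves.
query = x_tokens[0]

# Case A: only c is available as keys/values, so x can attend to c only.
out_c_only, w_c_only = attend(query, c_tokens, c_tokens)

# Case B: x attends over c and over x itself.
kv_full = c_tokens + x_tokens
out_full, w_full = attend(query, kv_full, kv_full)

print("weights over c only:", w_c_only)
print("weights over c + x: ", w_full)
```

In case A all of the attention mass falls on c, while in case B part of it goes to the x tokens. Whether the first step in OmniGen actually behaves like case A is exactly what this issue asks.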
Hi @hrz2000, thank you for pointing out this issue. It appears to be a bug in the inference process, and I will check this part of the code carefully later.
OmniGen/OmniGen/scheduler.py
Line 106 in 8d1647c