The current implementation works with the 15M-parameter version of tinyllamas. Simply dropping in the next larger one (42M) flashes fine but freezes at runtime.
I'd need to look into what's happening here. It could be that the model weights plus the run state exceed the available RAM (63.5MB), or I might have overlooked something about the memory layout. If it's the former, there may be a way to optimize memory usage to fit everything.
Another option would be to train a model between 15M and 42M parameters that just barely fits without any further optimizations.
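
As a quick sanity check on the first hypothesis, the sizes of the weights and of the run state can be computed directly from the checkpoint's `Config` and compared against the 63.5MB budget. The sketch below mirrors the stock llama2.c structures (`Config`, `TransformerWeights`, `RunState`) and assumes float32 weights with a shared classifier; this port may lay memory out differently, and the 42M config values are filled in from memory of the tinyllamas model card, so both should be verified against the actual checkpoint header.

```c
/* Rough footprint estimate for a llama2.c-style checkpoint.
 * Sketch only: assumes the stock llama2.c layout, float32 everywhere,
 * and a classifier shared with the embedding table. */
#include <stdio.h>

typedef struct {
    long dim;        /* transformer dimension */
    long hidden_dim; /* FFN hidden dimension */
    long n_layers;   /* number of transformer layers */
    long n_heads;    /* number of query heads */
    long n_kv_heads; /* number of key/value heads */
    long vocab_size; /* vocabulary size */
    long seq_len;    /* maximum sequence length */
} Config;

/* Float count of the weights (classifier shared with the embedding). */
static long weight_floats(const Config *c) {
    long head_size = c->dim / c->n_heads;
    long kv_dim = c->n_kv_heads * head_size;
    long n = 0;
    n += c->vocab_size * c->dim;                   /* token embedding table */
    n += c->n_layers * c->dim * 2 + c->dim;        /* rmsnorm weights */
    n += c->n_layers * c->dim * c->dim * 2;        /* wq, wo */
    n += c->n_layers * c->dim * kv_dim * 2;        /* wk, wv */
    n += c->n_layers * c->dim * c->hidden_dim * 3; /* w1, w2, w3 */
    return n;
}

/* Float count of the run state (activations plus KV cache). */
static long runstate_floats(const Config *c) {
    long head_size = c->dim / c->n_heads;
    long kv_dim = c->n_kv_heads * head_size;
    long n = 0;
    n += c->dim * 4;                            /* x, xb, xb2, q */
    n += c->hidden_dim * 2;                     /* hb, hb2 */
    n += c->n_layers * c->seq_len * kv_dim * 2; /* key + value cache */
    n += c->n_heads * c->seq_len;               /* attention scores */
    n += c->vocab_size;                         /* logits */
    return n;
}

int main(void) {
    /* 42M config as I recall it from the tinyllamas model card --
     * worth double-checking against the actual checkpoint header. */
    Config c = { .dim = 512, .hidden_dim = 1376, .n_layers = 8,
                 .n_heads = 8, .n_kv_heads = 8,
                 .vocab_size = 32000, .seq_len = 256 };
    const double MB = 1024.0 * 1024.0;
    printf("weights:   %.1f MB\n", weight_floats(&c) * 4 / MB);
    printf("run state: %.1f MB\n", runstate_floats(&c) * 4 / MB);
    return 0;
}
```

Running the same arithmetic on the 15M config and comparing both totals against the 63.5MB budget should show quickly whether it's a plain out-of-memory problem or something else in the layout.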