- Download a GGUF model from HuggingFace and place it in /models
- Update the `<model_name>` in predict.py to match the file in /models
- Create a model on Replicate (https://replicate.com/docs/guides/push-a-transformers-model)
- Run `cog login`
- Run `cog push r8.im/<your-username>/<your-model-name>`
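Before pushing, it can help to confirm that the file named in predict.py really exists in the models folder and looks like a GGUF file. The sketch below is a hypothetical pre-flight check and is not part of this repository: the helper names `is_gguf`, `check_model`, and `push_target` are illustrative assumptions. It relies only on the fact that GGUF files begin with the four ASCII bytes `GGUF`.

```python
from pathlib import Path

# GGUF files begin with these four ASCII bytes.
GGUF_MAGIC = b"GGUF"


def is_gguf(path: Path) -> bool:
    """Return True if the file starts with the GGUF magic bytes."""
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC


def check_model(models_dir: str, model_name: str) -> Path:
    """Hypothetical helper: confirm model_name exists in models_dir and looks like GGUF."""
    path = Path(models_dir) / model_name
    if not path.is_file():
        raise FileNotFoundError(
            f"{path} not found; does <model_name> in predict.py match the file?"
        )
    if not is_gguf(path):
        raise ValueError(f"{path} does not start with the GGUF magic bytes")
    return path


def push_target(username: str, model: str) -> str:
    """Build the image name passed to `cog push` (hypothetical convenience helper)."""
    return f"r8.im/{username}/{model}"
```

For example, `check_model("/models", "my-model.gguf")` raises a descriptive error if the file is missing or is not GGUF, and `push_target("your-username", "your-model-name")` returns the string to pass to `cog push`. The file name `my-model.gguf` is a placeholder.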
forked from lucaswadedavis/funkyllama-7b-chat
tomasmcm/cog-replicate-llama-gguf
How to push any GGUF LLM to Replicate