How to push any GGUF LLM to Replicate

1. Download a GGUF model from HuggingFace and place the file in /models.
2. Update <model_name> in predict.py so it matches the filename of the GGUF file in /models.
3. Create a model on Replicate (https://replicate.com/docs/guides/push-a-transformers-model).
4. Run cog login.
5. Run cog push r8.im/<your-username>/<your-model-name>.
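
Before pushing, it can help to confirm that the file in /models really is a GGUF file and that the name set in predict.py matches it exactly. The following is a minimal sketch of such a check, not part of this repository: the helper names find_gguf_models and check_model_name are hypothetical, and the script assumes the models directory sits next to where it is run. It relies only on the fact that GGUF files begin with the four ASCII bytes "GGUF".

```python
from pathlib import Path

# GGUF files start with these four magic bytes.
GGUF_MAGIC = b"GGUF"


def find_gguf_models(models_dir="models"):
    """Return the sorted names of files in models_dir that start with the GGUF magic.

    Hypothetical helper for illustration; returns an empty list if the
    directory does not exist.
    """
    root = Path(models_dir)
    if not root.is_dir():
        return []
    found = []
    for path in sorted(root.iterdir()):
        if not path.is_file():
            continue
        with path.open("rb") as f:
            # A truncated download or an HTML error page will fail this check.
            if f.read(4) == GGUF_MAGIC:
                found.append(path.name)
    return found


def check_model_name(model_name, models_dir="models"):
    """Return True if model_name is the exact filename of a GGUF file in models_dir."""
    return model_name in find_gguf_models(models_dir)
```

For example, calling check_model_name with the value used for <model_name> in predict.py should return True before running cog push. A False result usually means a filename mismatch or an incomplete download.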
