Reproducing GPTX from scratch using PyTorch
- Implement the model architecture
- The architectures
- LitGPT: https://github.com/Lightning-AI/litgpt
- nanoGPT: https://github.com/karpathy/nanoGPT
- llama3 from scratch: https://github.com/naklecha/llama3-from-scratch
- LLMs from scratch: https://github.com/rasbt/LLMs-from-scratch
- x-transformers: https://github.com/lucidrains/x-transformers
- minbpe: https://github.com/karpathy/minbpe
- RLHF
- Use Case