About | Models | Helpful Notebooks | Requirements | License | Author
My goal is to work my way through key LLM architectures, starting from the original Transformer, to understand how each model works and builds on its predecessors. Once I have a set of models I am interested in, the focus will shift to fine-tuning and optimizing them to run on the cheapest hardware possible.
✅ Transformer
✅ GPT
✅ LLaMA
◻️ LLM Inference Optimization
◻️ In-flight Batching
◻️ Speculative Inference
◻️ Key-Value Caching (see the sketch after this list)
◻️ PagedAttention
◻️ Pipeline Parallelism
◻️ Tensor Parallelism
◻️ Sequence Parallelism
◻️ Flash Attention
◻️ Quantization
◻️ Sparsity
◻️ Distillation
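The inference-optimization topics above are planned rather than written up yet. To give a flavor of what they cover, below is a minimal sketch of key-value caching in plain NumPy; the names (`KVCache`, `attend`, `d_model`) and the random projection matrices are illustrative assumptions, not code from this repo:

```python
import numpy as np

rng = np.random.default_rng(0)
d_model = 8  # toy embedding width (real models use 768+)

def softmax(x):
    x = x - x.max()
    e = np.exp(x)
    return e / e.sum()

class KVCache:
    """Keys/values of already-processed tokens, so each decoding step
    projects only the newest token instead of re-projecting the prefix."""
    def __init__(self):
        self.keys = np.empty((0, d_model))
        self.values = np.empty((0, d_model))

    def append(self, k, v):
        self.keys = np.vstack([self.keys, k])
        self.values = np.vstack([self.values, v])

def attend(q, cache):
    # Scaled dot-product attention of the new token's query against
    # the full cached prefix.
    scores = cache.keys @ q / np.sqrt(d_model)  # shape: (seq_len,)
    return softmax(scores) @ cache.values       # shape: (d_model,)

# Random projections standing in for a trained attention head (assumption).
W_q, W_k, W_v = (rng.standard_normal((d_model, d_model)) for _ in range(3))

cache = KVCache()
for step in range(5):
    x = rng.standard_normal(d_model)   # embedding of the newest token
    cache.append(x @ W_k, x @ W_v)     # project once, reuse on every later step
    out = attend(x @ W_q, cache)       # without the cache, every step would
                                       # re-project the whole prefix through W_k/W_v
    print(f"step {step}: |context| = {np.linalg.norm(out):.3f}")
```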
✅ Transformer Arithmetic (see the worked example below)
✅ Transformer Scaling [WIP]
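For a taste of the kind of arithmetic the first notebook covers, here is a back-of-the-envelope parameter count for a GPT-2-small-sized decoder. This is a rough sketch that ignores biases and LayerNorm parameters, and the variable names are my own:

```python
# Approximate parameter count for a GPT-style decoder
# (GPT-2 small dimensions; assumes tied input/output embeddings).
n_layers, d_model, vocab, ctx = 12, 768, 50257, 1024

attn  = 4 * d_model * d_model        # Q, K, V, and output projections
mlp   = 2 * d_model * (4 * d_model)  # up- and down-projection (4x expansion)
block = attn + mlp                   # ~12 * d_model^2 per layer

embed = vocab * d_model + ctx * d_model  # token + position embeddings
total = n_layers * block + embed
print(f"{total / 1e6:.0f}M parameters")  # ~124M, matching GPT-2 small
```

Each block works out to roughly 12·d_model² parameters, which is why parameter counts grow quadratically with model width.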
Requirements for all the models are stored in a single `requirements.txt` file.
This project is licensed under the MIT License. For more details, see the LICENSE file.
Made with ❤️ by Mukesh Mithrakumar