This repository contains two Python examples designed for fine-tuning, deploying and scaling language models using Modal, Langchain, Fastapi, VLLM and Hugging Face's Transformers.
Prince Canuma - An MLOPs Engineer and founder at Kulissiwa. Previously, he worked as a ML Engineer at neptune.ai. He is passionate about MLOps, Deep Learning, and Software Engineering.
Contributions to this project are welcome. Please follow the standard procedures for submitting issues and pull requests.
This project is licensed under the MIT License - see the LICENSE file for details.