Llama2-Inference

This repository contains example code for performing inference with Llama 2 using 🤗 Transformers on a supercomputer.

The process consists of the following steps:

Create an Apptainer file using the tested llama2_2.def and mamba_llama2_2.yml files.
Develop a Python script using the llama2.ipynb Jupyter Notebook. Jupyter Notebooks are a favourite among data scientists!
Create a Python as llama2.py. My script is quite a mess as I worked alone. No one will adjust it.
Create Slurm scripts 7b_prompt-1.sh and 13b_prompt-1.sh to run the Python script that we created.
Submit the Slurm job to execute it.

Provide feedback

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
13b_prompt-1.sh		13b_prompt-1.sh
7b_prompt-1.sh		7b_prompt-1.sh
README.md		README.md
llama2.ipynb		llama2.ipynb
llama2.py		llama2.py
llama2_2.def		llama2_2.def
mamba_llama2_2.yml		mamba_llama2_2.yml