This repository contains example code for performing inference with Llama 2 using 🤗 Transformers on a supercomputer.
The process consists of the following steps:
- Create an Apptainer file using the tested
llama2_2.def
andmamba_llama2_2.yml
files. - Develop a Python script using the
llama2.ipynb
Jupyter Notebook. Jupyter Notebooks are a favourite among data scientists! - Create a Python as
llama2.py
. My script is quite a mess as I worked alone. No one will adjust it. - Create Slurm scripts
7b_prompt-1.sh
and13b_prompt-1.sh
to run the Python script that we created. - Submit the Slurm job to execute it.