ragas-lab

RAG (Retrieval-Augmented Generation) pipelines consist of two key components:

  1. Retriever: Responsible for extracting the most pertinent information to address the query.
  2. Generator: Tasked with formulating a response using the retrieved information.
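To make this split concrete, here is a minimal, purely illustrative sketch of the two stages. The retrieve and generate helpers are hypothetical stand-ins (a keyword-overlap ranker and a stubbed LLM call), not part of the labs:

```python
# Hypothetical, minimal RAG pipeline -- only to make the two stages concrete.
# A real pipeline would use a vector store for retrieval and an LLM for generation.

def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Retriever: rank documents by naive keyword overlap with the query."""
    terms = set(query.lower().split())
    ranked = sorted(
        corpus,
        key=lambda doc: len(terms & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:k]

def generate(query: str, contexts: list[str]) -> str:
    """Generator: stand-in for an LLM call that answers from the retrieved contexts."""
    return f"Answer to '{query}' based on: {' | '.join(contexts)}"

corpus = [
    "Amazon Bedrock offers foundation models through a single API.",
    "RAGAS scores RAG pipelines with metrics such as faithfulness.",
]
contexts = retrieve("What does Amazon Bedrock offer?", corpus)  # retriever output to evaluate
answer = generate("What does Amazon Bedrock offer?", contexts)  # generator output to evaluate
print(answer)
```

Because the two stages are separate functions, each can be scored in isolation (retrieval quality vs. generation quality) as well as end to end.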
To evaluate a RAG pipeline effectively, it's crucial to assess these components both individually and collectively. This approach yields an overall performance score while also providing metrics for each component, allowing for targeted improvements. For instance:

  • Enhancing the Retriever: This can be achieved through improved chunking strategies or by employing more advanced embedding models.
  • Optimizing the Generator: Experimenting with different language models or refining prompts can lead to better generation outcomes.

However, this raises several important questions: Which metrics should be used to measure and benchmark these components? Which datasets are most suitable for evaluation? How can Amazon Bedrock be integrated with RAGAS for this purpose? In the following labs, we'll delve into these questions and show you how to use the RAGAS framework to evaluate and optimize a RAG pipeline.
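As a preview of that integration, here is a hedged sketch of scoring a RAG pipeline with RAGAS using Bedrock-hosted models as the judge LLM and embedding model. The model IDs and the single sample row are illustrative, and the dataset column names follow the ragas schema at the time of writing (newer releases rename them), so treat this as a sketch rather than the labs' exact code:

```python
# A minimal sketch: scoring retriever and generator metrics with RAGAS,
# using Amazon Bedrock models (via langchain-aws) as the evaluation backend.
from datasets import Dataset
from langchain_aws import ChatBedrock, BedrockEmbeddings
from ragas import evaluate
from ragas.metrics import (
    faithfulness,        # generator: is the answer grounded in the contexts?
    answer_relevancy,    # generator: does the answer address the question?
    context_precision,   # retriever: are the retrieved contexts on topic?
    context_recall,      # retriever: do the contexts cover the ground truth?
)

# Each row pairs a question with the contexts your retriever returned,
# the answer your generator produced, and a reference ground truth.
# This single row is illustrative only.
rows = {
    "question": ["What is Amazon Bedrock?"],
    "contexts": [[
        "Amazon Bedrock is a fully managed service that offers foundation "
        "models from leading AI companies through a single API."
    ]],
    "answer": ["Amazon Bedrock is a managed AWS service that exposes foundation models through one API."],
    "ground_truth": ["Amazon Bedrock is a fully managed AWS service for building with foundation models."],
}
dataset = Dataset.from_dict(rows)

# Bedrock-hosted models act as the judge LLM and the embedding model
# (model IDs are examples; use any Bedrock models enabled in your account).
judge_llm = ChatBedrock(model_id="anthropic.claude-3-sonnet-20240229-v1:0")
embeddings = BedrockEmbeddings(model_id="amazon.titan-embed-text-v2:0")

result = evaluate(
    dataset,
    metrics=[faithfulness, answer_relevancy, context_precision, context_recall],
    llm=judge_llm,
    embeddings=embeddings,
)
print(result)  # per-metric scores covering both the retriever and the generator
```

Note how the metric set itself encodes the component split: context_precision and context_recall score the retriever, while faithfulness and answer_relevancy score the generator, alongside the aggregate view you get from running them together.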