RAG Intelligence is a Retrieval-Augmented Generation (RAG) system that integrates document retrieval, large language models (LLMs), and conversational AI into a single workflow. This README gives an overview of the key scripts in the repository and instructions for running them.

Prerequisites:
- Python 3.7 or higher
- pip
- virtualenv (optional but recommended)
Installation:

Clone the repository:

git clone https://github.com/RSKMN/rag-intelligence.git
cd rag-intelligence

Create and activate a virtual environment (optional but recommended):

python -m venv env
source env/bin/activate  # On Windows, use 'env\Scripts\activate'

Install the required dependencies:

pip install -r requirements.txt
Description: `app.py` sets up a Flask API that handles several endpoints, including processing user queries, handling PDF uploads, and interacting with the LLM for responses.

Usage:

- `/ai`: Accepts POST requests with a JSON payload containing a `query` field.
- `/ask_pdf`: Accepts POST requests with a JSON payload containing a `query` field, retrieves relevant information from the PDF data, and returns an answer.
- `/pdf`: Accepts POST requests with a file upload, processes the PDF, and updates the vector store.
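For illustration, here is how a client might call these endpoints with the `requests` library. The base URL, port, and the `file` form-field name are assumptions, not values confirmed by `app.py`; adjust them to your configuration.

```python
# Hypothetical client calls against the Flask API from app.py.
# BASE is an assumed address; check app.py for the actual host and port.
import requests

BASE = "http://localhost:8080"

# Ask the LLM directly.
resp = requests.post(f"{BASE}/ai", json={"query": "What is retrieval-augmented generation?"})
print(resp.json())

# Upload a PDF so its contents are added to the vector store
# ("file" is an assumed form-field name).
with open("paper.pdf", "rb") as f:
    resp = requests.post(f"{BASE}/pdf", files={"file": f})
print(resp.json())

# Ask a question grounded in the uploaded PDF data.
resp = requests.post(f"{BASE}/ask_pdf", json={"query": "Summarize the main findings."})
print(resp.json())
```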
Description: `server.py` initializes and runs a FastAPI server, providing endpoints for health checks and prompt processing. It dynamically loads and initializes example classes that implement methods such as `ingest_docs`, `rag_chain`, and `llm_chain`.

Usage:

- `/health`: GET endpoint for health checks.
- `/prompt`: POST endpoint that accepts a JSON payload with a list of messages constituting the conversation so far.
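The sketch below shows one way a client could exercise these endpoints. The port (uvicorn's default 8000) and the shape of the message payload (a `messages` list of role/content pairs) are assumptions; check `server.py` for the exact schema.

```python
# Hypothetical client for the FastAPI server from server.py.
# The address and payload schema below are assumptions.
import requests

BASE = "http://localhost:8000"

# Health check.
print(requests.get(f"{BASE}/health").json())

# Send the conversation so far as a list of messages.
payload = {
    "messages": [
        {"role": "user", "content": "Which documents have been ingested?"},
    ]
}
print(requests.post(f"{BASE}/prompt", json=payload).json())
```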
Description: `llm_convo.py` facilitates conversations with the LLM, managing the context and flow of dialogue to maintain coherence and relevance.

Usage: Handles user input, maintains the conversation history, and interacts with the LLM to generate responses.
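The core pattern (append each turn to a running history and send the whole history to the model) looks roughly like the sketch below. This is not the repository's implementation; `call_llm` is a hypothetical stand-in for the project's actual LLM client.

```python
# Minimal sketch of a history-carrying conversation loop.
def call_llm(history: list[dict]) -> str:
    """Hypothetical stand-in; replace with the project's real LLM client."""
    return f"(model reply to: {history[-1]['content']!r})"

def converse() -> None:
    history: list[dict] = []
    while True:
        user_input = input("You: ").strip()
        if user_input.lower() in {"exit", "quit"}:
            break
        history.append({"role": "user", "content": user_input})
        reply = call_llm(history)  # the model sees the full history each turn
        history.append({"role": "assistant", "content": reply})
        print(f"LLM: {reply}")

if __name__ == "__main__":
    converse()
```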
Description: `script.py` fine-tunes the LLaMA model on question-answer pairs, improving the model's performance on specific tasks or datasets.

Usage: Loads the training data, fine-tunes the LLaMA model, and saves the updated model for deployment.
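As a rough illustration of what such a fine-tuning step involves, the sketch below uses Hugging Face `transformers` with a PEFT/LoRA adapter. It is not the repository's script: the checkpoint name, the `qa_pairs.jsonl` data file, its question/answer schema, and all hyperparameters are placeholders.

```python
# Illustrative LoRA fine-tuning on question-answer pairs (placeholder values).
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

MODEL = "meta-llama/Llama-2-7b-hf"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL)
tokenizer.pad_token = tokenizer.eos_token
model = get_peft_model(AutoModelForCausalLM.from_pretrained(MODEL),
                       LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM"))

# Assumed JSONL schema: one {"question": ..., "answer": ...} object per line.
data = load_dataset("json", data_files="qa_pairs.jsonl", split="train")

def tokenize(example):
    text = f"Question: {example['question']}\nAnswer: {example['answer']}"
    return tokenizer(text, truncation=True, max_length=512)

data = data.map(tokenize, remove_columns=data.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="llama-qa-finetuned",
                           per_device_train_batch_size=2,
                           num_train_epochs=1),
    train_dataset=data,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
trainer.save_model("llama-qa-finetuned")  # saves the adapter weights
```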
Description: `pipeline.py` implements the full RAG pipeline, integrating document retrieval and LLM response generation to answer user queries effectively.

Usage: Combines retrievers and LLMs to process user queries, retrieve relevant documents, and generate informed responses.
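Conceptually, retrieve-then-generate reduces to the pattern below. This is a dependency-light sketch, not the project's code; the `embed` and `llm` callables are hypothetical stand-ins for whatever vector store and model the pipeline actually uses.

```python
# Sketch of the retrieve-then-generate core of a RAG pipeline.
from collections.abc import Callable

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def rag_answer(query: str,
               docs: list[str],
               embed: Callable[[str], list[float]],
               llm: Callable[[str], str],
               k: int = 3) -> str:
    # 1. Retrieve: rank documents by similarity to the query embedding.
    q_vec = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(embed(d), q_vec), reverse=True)
    context = "\n\n".join(ranked[:k])
    # 2. Generate: ground the model's answer in the retrieved context.
    prompt = f"Answer using only this context:\n\n{context}\n\nQuestion: {query}"
    return llm(prompt)
```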
To run the Flask API:
python app.py
The server will start, and you can interact with the endpoints as described above.
To run the FastAPI server:
python server.py
The server will start, providing health checks and prompt processing endpoints.
To engage in a conversation with the LLM:
python llm_convo.py
Follow the on-screen prompts to input your queries and receive responses from the LLM.
To fine-tune the LLaMA model:
python script.py
Ensure you have the necessary training data and configurations set up before running this script.
To execute the RAG pipeline:
python pipeline.py
This will process user queries through the retrieval and generation components to produce responses.
Contributions are welcome! Please fork the repository and submit a pull request with your changes. Ensure that your code adheres to the project's coding standards and includes appropriate tests.
This project is licensed under the MIT License. See the LICENSE.md file for more details.