Spaces:

OussemaHassine
/

RAG_HF

Sleeping

RAG_HF / README.md

Oussama

clean deploy — no binary files

77f7bce 23 days ago

2.23 kB

title: Document RAG
emoji: ⚖️
colorFrom: blue
colorTo: purple
sdk: streamlit
sdk_version: 1.56.0
python_version: 3.12
app_file: app.py
pinned: false

📄 DocuRAG — Advanced RAG Pipeline for Document Q&A

Deployed on Hugging Face Spaces — [link coming soon]

Although it might seem like an overkill for a personal project, I wanted to implement advanced and sophisticated approaches to learn the most!

Component	Technique
Chunking	Semantic chunking (sentence-transformers) + Recursive 512-Token
Embeddings	OpenAI `text-embedding-3-small` (dense, 1536d)
Sparse Vectors	BM25 via FastEmbed (`Qdrant/bm25`)
Retrieval	Hybrid search (dense + sparse) with Reciprocal Rank Fusion (RRF)
Reranking	Cohere Rerank v3.5
Generation	GPT-4o-mini with structured prompt engineering
Memory	Sliding window + LLM-based summarization of older turns
Vector Store	Qdrant Cloud (free tier, persistent)
UI	Streamlit with streaming responses

The project includes a RAGAS evaluation pipeline (evaluation/evaluate.py) that measures:

Built by Oussama Hassine as a portfolio project while transitioning into AI Engineering.