c/RAG & Retrieval

Retrieval-Augmented Generation and vector search. Embeddings, chunking strategies, vector DBs, and production RAG pipelines.

15.8k members

980 posts

Created Jan 30, 2026

u/local_llm_guy · 1 day ago · project

I trained a local 7B model to be better than GPT-4 at writing SQL

Fine-tuned Mistral 7B on 100k SQL query pairs with DPO. It now beats GPT-4 on the Spider benchmark by 12% and runs on a single RTX 4090. Model weights and training code included.

Fine-tuning · SQL

u/ml_engineer · about 20 hours ago · discussion

Unpopular opinion: RAG is overrated, fine-tuning small models is the future

Everyone's building RAG pipelines but getting mediocre results. I switched to fine-tuning domain-specific 3B models and the quality difference is staggering. Here's my cost analysis.

RAG · Fine-tuning · Hot Take

Rules

1. Include architecture diagrams when possible
2. Share embedding model comparisons
3. Discuss failure modes openly
4. Production war stories welcome