RAG 检索增强生成
检索增强生成(RAG)从入门到生产:向量数据库选型、Embedding、分块与重排、多向量与混合检索,以及 Pinecone / Weaviate / Qdrant / pgvector 等实战。
RAG 检索增强生成
检索增强生成(RAG)从入门到生产:向量数据库选型、Embedding、分块与重排、多向量与混合检索,以及 Pinecone / Weaviate / Qdrant / pgvector 等实战。
本主题共 100 篇教程
Build a Production RAG System with LlamaIndex and Pinecone
Step-by-step guide to retrieval-augmented generation that works on real data
高级Vector Databases & RAG in Production: Pinecone, Weaviate & pgvector in 2025
Build production-grade retrieval-augmented generation systems with vector search at scale
高级Vector Database Guide 2026: Pinecone vs Qdrant vs pgvector vs Weaviate
Choose the right vector database for your RAG application performance and cost
高级LangChain vs LlamaIndex vs Haystack: RAG Framework 2026
Choose the right RAG framework for production LLM applications
高级Fine-Tuning GPT-4 and Claude: When to Fine-Tune vs RAG 2026
Make the right architectural decision: fine-tuning or RAG for your LLM application
高级Python AI Development Stack 2026: FastAPI + LangChain + Supabase
Build production-ready AI applications with the modern Python AI stack
高级LlamaIndex Tutorial 2026: Build Production RAG Applications
Connect LLMs to your documents with LlamaIndex ingestion pipelines and query engines
高级DSPy Tutorial 2026: Automatic LLM Prompt Optimization
Replace manual prompt engineering with DSPy automatic optimization
进阶OpenAI Assistants API v2 2026: Files, Code Interpreter, and Threads
Build persistent AI assistants with built-in RAG, code execution, function calling
进阶pgvector Tutorial 2026: Vector Similarity Search in PostgreSQL
Add semantic search to your PostgreSQL database for RAG without extra infrastructure
进阶Supabase AI Stack 2026: pgvector + Edge Functions + Realtime Streaming
Build full-stack AI apps with user-scoped RAG, Edge Functions, and streaming
入门Perplexity AI API Guide 2026: Real-Time Web Search for AI Apps
Build AI apps with current web knowledge using Perplexity search API
进阶Mistral AI API Guide 2026: Mixtral, Codestral, Embeddings
Build cost-efficient AI applications with Mistral AI models
进阶Supabase + OpenAI: Build a Semantic Search App in 30 Minutes 2026
Build a full-stack semantic search application using Supabase pgvector, OpenAI embeddings, and Next.js with authentication and real-time updates
高级Building a RAG System from Scratch: Complete Python Tutorial 2026
Build a production-quality Retrieval Augmented Generation system step by step, from document processing to API deployment
进阶Vector Databases Compared 2026: Pinecone vs Weaviate vs Qdrant vs Chroma
Which vector database should you choose for your AI application? Performance benchmarks, pricing, and use case analysis for the top 4 options
进阶LangChain vs LlamaIndex 2026: Which Framework Should You Use for RAG?
An honest technical comparison of LangChain and LlamaIndex for building RAG applications, with benchmarks, use cases, and migration guide
进阶LangChain vs LlamaIndex: Which Framework to Choose in 2025?
In-depth comparison of the two leading LLM frameworks
入门RAG Architecture Cheat Sheet
RAG pipeline components, patterns, and configuration reference
高级LLM Fine-Tuning in 2025: When to Fine-Tune vs. RAG vs. Prompting (With Cost Analysis)
Senior AI engineers explain the decision framework for choosing between fine-tuning, RAG, and prompt engineering
高级Building Production RAG Systems with LangChain: From Prototype to 99.9% Uptime
Engineering teams share battle-tested patterns for reliable retrieval-augmented generation in production
进阶Contextual Retrieval: Advanced RAG Tutorial
Anthropic contextual retrieval for improved chunk context
高级Vector Database Showdown 2025: Pinecone vs. Weaviate vs. Qdrant vs. pgvector
Benchmark results and use case analysis for choosing the right vector database for your AI application
进阶Reranking for RAG: Advanced RAG Tutorial
Using cross-encoder reranking to improve RAG precision
进阶Building AI-Powered Search with Semantic Retrieval
Replace keyword search with intelligent semantic understanding
高级Vector Databases for Production: Architecture, Performance, and Scaling
The complete technical guide to deploying vector databases at enterprise scale
进阶Hybrid Search RAG: Advanced RAG Tutorial
Combining vector and keyword search for better RAG retrieval
高级Corrective RAG: Implementation Guide with Weaviate 2026
Build a self-correcting retrieval with quality assessment RAG system from scratch
进阶AI-Powered Search Engine
Building semantic search with vector database — hands-on project tutorial
进阶Chunking Strategies: Advanced RAG Tutorial
Optimal text chunking strategies for different document types
高级Multi-Vector RAG: Implementation Guide with Weaviate 2026
Build a storing multiple embedding types per document RAG system from scratch
进阶Parent Document Retrieval: Advanced RAG Tutorial
Hierarchical chunking with parent-child document strategy
进阶Milvus Distributed Vectors: Tutorial and Best Practices
Build production AI with Milvus — scalable distributed vector search
进阶Multi-Query Retrieval: Advanced RAG Tutorial
Generating multiple queries for comprehensive RAG retrieval
进阶Supabase + pgvector: How to Add vector search to Supabase apps (2026)
Complete integration guide for Supabase and pgvector
进阶PostgreSQL pgvector vs Dedicated DBs: Side-by-Side Comparison
Comparing vector search in SQL vs purpose-built stores — comparing operational simplicity across postgresql and qdrant
入门Vector Database Comparison Table
Quick comparison of Qdrant, Pinecone, Weaviate, Chroma features
进阶Streaming RAG: Advanced RAG Tutorial
Implementing streaming responses for RAG applications
进阶RAG (Retrieval Augmented Generation): Complete Developer Guide 2026
Master RAG (Retrieval Augmented Generation) with practical examples and production patterns
进阶Voyage AI Embeddings
High-quality embeddings with Voyage AI for RAG
进阶RAG Fusion: Advanced RAG Tutorial
Combining results from multiple retrieval strategies
进阶HyDE Hypothetical Document: Advanced RAG Tutorial
Hypothetical Document Embedding for improved retrieval
进阶Build a RAG Chatbot in 30 Minutes
Quick tutorial building a fully functional RAG chatbot — hands-on project tutorial
高级RAPTOR RAG: Implementation Guide with Pinecone 2026
Build a hierarchical document summarization for better context RAG system from scratch
进阶Cohere Embed v3 Search
Semantic search with Cohere Embed v3 embeddings
进阶RAG with Metadata Filtering: Advanced RAG Tutorial
Using metadata filters for precise document retrieval
高级Hybrid Search RAG: Implementation Guide with Elasticsearch 2026
Build a combining vector and keyword search for maximum recall RAG system from scratch
进阶LLM Text Chunking Best Practices: Practical Tutorial
Optimal strategies for chunking text for RAG applications
进阶Long Context RAG: Advanced RAG Tutorial
Handling long documents with sliding window chunking
进阶PostgreSQL + pgvector: How to Implement vector search in PostgreSQL (2026)
Complete integration guide for PostgreSQL and pgvector
高级Contextual Compression RAG: Implementation Guide with Pinecone 2026
Build a compressing retrieved context to fit LLM window RAG system from scratch
高级Self-Query RAG: Implementation Guide with Qdrant 2026
Build a AI-generated metadata filters for precise retrieval RAG system from scratch
进阶Vector Embeddings Explained: Complete Developer Guide 2026
Master Vector Embeddings Explained with practical examples and production patterns
进阶Qdrant vs Pinecone vs Weaviate: Side-by-Side Comparison
Vector database comparison for production RAG — comparing performance and cost across qdrant and pinecone
进阶Hybrid RAG with BM25 and Vectors: Practical Tutorial
Combining BM25 keyword search with vector search
进阶Adaptive RAG: Advanced RAG Tutorial
Dynamic routing between different retrieval strategies
进阶Qdrant Vector Search: Tutorial and Best Practices
Build production AI with Qdrant — high-performance vector database
高级Graph RAG: Implementation Guide with Neo4j 2026
Build a knowledge graph traversal for multi-hop reasoning RAG system from scratch
入门Pinecone vs Weaviate: Which is Better for production vector search? (2026)
Detailed comparison of Pinecone and Weaviate for production vector search
进阶LangChain vs LlamaIndex Comparison: Complete Developer Guide 2026
Master LangChain vs LlamaIndex Comparison with practical examples and production patterns
高级Parent Document RAG: Implementation Guide with Chroma 2026
Build a retrieving small chunks with large parent context RAG system from scratch
进阶Semantic Search Implementation: Complete Developer Guide 2026
Master Semantic Search Implementation with practical examples and production patterns
进阶ColBERT Late Interaction
Token-level retrieval with ColBERT for better RAG
进阶Cohere Command R+ RAG
Building enterprise RAG with Cohere Command R+
高级Time-Aware RAG: Implementation Guide with Pinecone 2026
Build a weighting recent documents higher in retrieval RAG system from scratch
进阶Building AI Applications with PostgreSQL and pgvector: Complete Guide
Full-stack AI app with Supabase, pgvector, and Next.js for semantic search and RAG
进阶Production RAG Architecture: Advanced RAG Tutorial
Scalable production RAG system design patterns
进阶AI Embedding Models Comparison 2025: OpenAI vs Cohere vs Open Source
Benchmarking text embeddings on MTEB for retrieval, classification, and semantic similarity
高级Advanced RAG: Moving Beyond Naive Retrieval to Production-Grade Systems
Corrective RAG, Self-RAG, adaptive retrieval, and evaluation with RAGAS
入门AI Embeddings Reference
Embedding model comparison, dimensions, and use cases
高级Building Enterprise Semantic Search with AI: Beyond Keyword Matching
Hybrid search, reranking, and personalization for intelligent enterprise knowledge systems
进阶Vector Database Comparison 2025: Pinecone vs Weaviate vs Qdrant vs pgvector
Performance benchmarks, cost analysis, and use case recommendations
高级Cross-Encoder RAG: Implementation Guide with Qdrant 2026
Build a neural reranking for high-precision retrieval RAG system from scratch
进阶Corrective RAG (CRAG): Advanced RAG Tutorial
Self-correcting RAG with web search fallback
入门Chroma vs Qdrant: Which is Better for local vector database? (2026)
Detailed comparison of Chroma and Qdrant for local vector database
进阶Weaviate Hybrid Search: Tutorial and Best Practices
Build production AI with Weaviate — vector + BM25 hybrid search
进阶RAG with SQL: Advanced RAG Tutorial
Querying databases in natural language for structured RAG
进阶RAG Evaluation with RAGAS: Advanced RAG Tutorial
Systematic evaluation of RAG pipeline quality
进阶Multimodal RAG: Advanced RAG Tutorial
Retrieving and generating across text and images
进阶Self-RAG Framework: Advanced RAG Tutorial
Self-reflective RAG that validates its own retrieval
入门RAG Application Boilerplate: Starter Guide
Production-ready RAG application starter with all best practices
进阶Semantic Chunking: Advanced RAG Tutorial
Intelligently splitting documents at semantic boundaries
进阶Pinecone Serverless Vectors: Tutorial and Best Practices
Build production AI with Pinecone — managed serverless vector store
进阶Graph RAG: Advanced RAG Tutorial
Knowledge graph enhanced retrieval for complex reasoning
高级RAG系统从零到生产:2026年完整构建指南
向量数据库选型、分块策略、重排序优化——RAG最佳实践
高级Advanced RAG 高级技巧完整指南 2026:超越基础检索,构建生产级知识库
解决 RAG 幻觉、检索不准、上下文丢失三大核心问题
进阶Building RAG Applications: The Complete Production Guide 2025
From simple document Q&A to enterprise-grade RAG systems that actually work
进阶LlamaIndex 实战指南:RAG 应用开发从入门到生产
LlamaIndex vs LangChain 怎么选?5个真实场景代码示例
进阶向量数据库选型指南:Pinecone vs Weaviate vs Chroma vs Qdrant(2026)
RAG 和 AI Agent 应用的数据库底座怎么选?4款主流向量数据库深度对比
高级RAG高级技术:分块策略、重排序和混合检索的工程优化
从基础RAG到生产级RAG,解决准确率、延迟和成本的工程挑战
进阶Dify 企业私有知识库完全搭建指南:RAG 配置与最佳实践(2026)
从部署到调优,手把手搭建企业级 RAG 知识库问答系统
进阶LlamaIndex vs LangChain:到底该用哪个搭 RAG(2026 实战对比)
两个都用过半年后,我把选型逻辑讲清楚:什么项目用 LlamaIndex,什么项目用 LangChain
进阶Qdrant vs Chroma:向量数据库怎么选(2026 选型指南)
Chroma 适合快速起步,Qdrant 扛得住生产——但分界线没那么绝对
进阶AI in the Food and Beverage Industry: From Recipe Development to Quality Control
How AI is transforming product development, supply chain, and food safety
进阶MongoDB + Atlas Vector Search: How to Add AI search to MongoDB (2026)
Complete integration guide for MongoDB and Atlas Vector Search
进阶E5 Multilingual Embeddings
Multilingual semantic search with E5 embeddings
入门LlamaIndex RAG Template: Starter Guide
Production LlamaIndex RAG project with evaluation
入门Your First RAG Application: Beginner's Complete Guide
Everything a beginner needs to know about your first rag application
进阶RAG System Design Best Practices: 2026 Developer Guide
Essential practices every AI developer should follow for rag system design
入门Understanding Embeddings for Beginners: Beginner's Complete Guide
Everything a beginner needs to know about understanding embeddings for beginners