RAG 检索增强生成

检索增强生成(RAG)从入门到生产:向量数据库选型、Embedding、分块与重排、多向量与混合检索,以及 Pinecone / Weaviate / Qdrant / pgvector 等实战。

全部教程

RAG 检索增强生成

检索增强生成(RAG)从入门到生产:向量数据库选型、Embedding、分块与重排、多向量与混合检索,以及 Pinecone / Weaviate / Qdrant / pgvector 等实战。

本主题共 100 篇教程

高级

Build a Production RAG System with LlamaIndex and Pinecone

Step-by-step guide to retrieval-augmented generation that works on real data

高级

Vector Databases & RAG in Production: Pinecone, Weaviate & pgvector in 2025

Build production-grade retrieval-augmented generation systems with vector search at scale

高级

Vector Database Guide 2026: Pinecone vs Qdrant vs pgvector vs Weaviate

Choose the right vector database for your RAG application performance and cost

高级

LangChain vs LlamaIndex vs Haystack: RAG Framework 2026

Choose the right RAG framework for production LLM applications

高级

Fine-Tuning GPT-4 and Claude: When to Fine-Tune vs RAG 2026

Make the right architectural decision: fine-tuning or RAG for your LLM application

高级

Python AI Development Stack 2026: FastAPI + LangChain + Supabase

Build production-ready AI applications with the modern Python AI stack

高级

LlamaIndex Tutorial 2026: Build Production RAG Applications

Connect LLMs to your documents with LlamaIndex ingestion pipelines and query engines

高级

DSPy Tutorial 2026: Automatic LLM Prompt Optimization

Replace manual prompt engineering with DSPy automatic optimization

进阶

OpenAI Assistants API v2 2026: Files, Code Interpreter, and Threads

Build persistent AI assistants with built-in RAG, code execution, function calling

进阶

pgvector Tutorial 2026: Vector Similarity Search in PostgreSQL

Add semantic search to your PostgreSQL database for RAG without extra infrastructure

进阶

Supabase AI Stack 2026: pgvector + Edge Functions + Realtime Streaming

Build full-stack AI apps with user-scoped RAG, Edge Functions, and streaming

入门

Perplexity AI API Guide 2026: Real-Time Web Search for AI Apps

Build AI apps with current web knowledge using Perplexity search API

进阶

Mistral AI API Guide 2026: Mixtral, Codestral, Embeddings

Build cost-efficient AI applications with Mistral AI models

进阶

Supabase + OpenAI: Build a Semantic Search App in 30 Minutes 2026

Build a full-stack semantic search application using Supabase pgvector, OpenAI embeddings, and Next.js with authentication and real-time updates

高级

Building a RAG System from Scratch: Complete Python Tutorial 2026

Build a production-quality Retrieval Augmented Generation system step by step, from document processing to API deployment

进阶

Vector Databases Compared 2026: Pinecone vs Weaviate vs Qdrant vs Chroma

Which vector database should you choose for your AI application? Performance benchmarks, pricing, and use case analysis for the top 4 options

进阶

LangChain vs LlamaIndex 2026: Which Framework Should You Use for RAG?

An honest technical comparison of LangChain and LlamaIndex for building RAG applications, with benchmarks, use cases, and migration guide

进阶

LangChain vs LlamaIndex: Which Framework to Choose in 2025?

In-depth comparison of the two leading LLM frameworks

入门

RAG Architecture Cheat Sheet

RAG pipeline components, patterns, and configuration reference

高级

LLM Fine-Tuning in 2025: When to Fine-Tune vs. RAG vs. Prompting (With Cost Analysis)

Senior AI engineers explain the decision framework for choosing between fine-tuning, RAG, and prompt engineering

高级

Building Production RAG Systems with LangChain: From Prototype to 99.9% Uptime

Engineering teams share battle-tested patterns for reliable retrieval-augmented generation in production

进阶

Contextual Retrieval: Advanced RAG Tutorial

Anthropic contextual retrieval for improved chunk context

高级

Vector Database Showdown 2025: Pinecone vs. Weaviate vs. Qdrant vs. pgvector

Benchmark results and use case analysis for choosing the right vector database for your AI application

进阶

Reranking for RAG: Advanced RAG Tutorial

Using cross-encoder reranking to improve RAG precision

进阶

Building AI-Powered Search with Semantic Retrieval

Replace keyword search with intelligent semantic understanding

高级

Vector Databases for Production: Architecture, Performance, and Scaling

The complete technical guide to deploying vector databases at enterprise scale

进阶

Hybrid Search RAG: Advanced RAG Tutorial

Combining vector and keyword search for better RAG retrieval

高级

Corrective RAG: Implementation Guide with Weaviate 2026

Build a self-correcting retrieval with quality assessment RAG system from scratch

进阶

AI-Powered Search Engine

Building semantic search with vector database — hands-on project tutorial

进阶

Chunking Strategies: Advanced RAG Tutorial

Optimal text chunking strategies for different document types

高级

Multi-Vector RAG: Implementation Guide with Weaviate 2026

Build a storing multiple embedding types per document RAG system from scratch

进阶

Parent Document Retrieval: Advanced RAG Tutorial

Hierarchical chunking with parent-child document strategy

进阶

Milvus Distributed Vectors: Tutorial and Best Practices

Build production AI with Milvus — scalable distributed vector search

进阶

Multi-Query Retrieval: Advanced RAG Tutorial

Generating multiple queries for comprehensive RAG retrieval

进阶

Supabase + pgvector: How to Add vector search to Supabase apps (2026)

Complete integration guide for Supabase and pgvector

进阶

PostgreSQL pgvector vs Dedicated DBs: Side-by-Side Comparison

Comparing vector search in SQL vs purpose-built stores — comparing operational simplicity across postgresql and qdrant

入门

Vector Database Comparison Table

Quick comparison of Qdrant, Pinecone, Weaviate, Chroma features

进阶

Streaming RAG: Advanced RAG Tutorial

Implementing streaming responses for RAG applications

进阶

RAG (Retrieval Augmented Generation): Complete Developer Guide 2026

Master RAG (Retrieval Augmented Generation) with practical examples and production patterns

进阶

Voyage AI Embeddings

High-quality embeddings with Voyage AI for RAG

进阶

RAG Fusion: Advanced RAG Tutorial

Combining results from multiple retrieval strategies

进阶

HyDE Hypothetical Document: Advanced RAG Tutorial

Hypothetical Document Embedding for improved retrieval

进阶

Build a RAG Chatbot in 30 Minutes

Quick tutorial building a fully functional RAG chatbot — hands-on project tutorial

高级

RAPTOR RAG: Implementation Guide with Pinecone 2026

Build a hierarchical document summarization for better context RAG system from scratch

进阶

Cohere Embed v3 Search

Semantic search with Cohere Embed v3 embeddings

进阶

RAG with Metadata Filtering: Advanced RAG Tutorial

Using metadata filters for precise document retrieval

高级

Hybrid Search RAG: Implementation Guide with Elasticsearch 2026

Build a combining vector and keyword search for maximum recall RAG system from scratch

进阶

LLM Text Chunking Best Practices: Practical Tutorial

Optimal strategies for chunking text for RAG applications

进阶

Long Context RAG: Advanced RAG Tutorial

Handling long documents with sliding window chunking

进阶

PostgreSQL + pgvector: How to Implement vector search in PostgreSQL (2026)

Complete integration guide for PostgreSQL and pgvector

高级

Contextual Compression RAG: Implementation Guide with Pinecone 2026

Build a compressing retrieved context to fit LLM window RAG system from scratch

高级

Self-Query RAG: Implementation Guide with Qdrant 2026

Build a AI-generated metadata filters for precise retrieval RAG system from scratch

进阶

Vector Embeddings Explained: Complete Developer Guide 2026

Master Vector Embeddings Explained with practical examples and production patterns

进阶

Qdrant vs Pinecone vs Weaviate: Side-by-Side Comparison

Vector database comparison for production RAG — comparing performance and cost across qdrant and pinecone

进阶

Hybrid RAG with BM25 and Vectors: Practical Tutorial

Combining BM25 keyword search with vector search

进阶

Adaptive RAG: Advanced RAG Tutorial

Dynamic routing between different retrieval strategies

进阶

Qdrant Vector Search: Tutorial and Best Practices

Build production AI with Qdrant — high-performance vector database

高级

Graph RAG: Implementation Guide with Neo4j 2026

Build a knowledge graph traversal for multi-hop reasoning RAG system from scratch

入门

Pinecone vs Weaviate: Which is Better for production vector search? (2026)

Detailed comparison of Pinecone and Weaviate for production vector search

进阶

LangChain vs LlamaIndex Comparison: Complete Developer Guide 2026

Master LangChain vs LlamaIndex Comparison with practical examples and production patterns

高级

Parent Document RAG: Implementation Guide with Chroma 2026

Build a retrieving small chunks with large parent context RAG system from scratch

进阶

Semantic Search Implementation: Complete Developer Guide 2026

Master Semantic Search Implementation with practical examples and production patterns

进阶

ColBERT Late Interaction

Token-level retrieval with ColBERT for better RAG

进阶

Cohere Command R+ RAG

Building enterprise RAG with Cohere Command R+

高级

Time-Aware RAG: Implementation Guide with Pinecone 2026

Build a weighting recent documents higher in retrieval RAG system from scratch

进阶

Building AI Applications with PostgreSQL and pgvector: Complete Guide

Full-stack AI app with Supabase, pgvector, and Next.js for semantic search and RAG

进阶

Production RAG Architecture: Advanced RAG Tutorial

Scalable production RAG system design patterns

进阶

AI Embedding Models Comparison 2025: OpenAI vs Cohere vs Open Source

Benchmarking text embeddings on MTEB for retrieval, classification, and semantic similarity

高级

Advanced RAG: Moving Beyond Naive Retrieval to Production-Grade Systems

Corrective RAG, Self-RAG, adaptive retrieval, and evaluation with RAGAS

入门

AI Embeddings Reference

Embedding model comparison, dimensions, and use cases

高级

Building Enterprise Semantic Search with AI: Beyond Keyword Matching

Hybrid search, reranking, and personalization for intelligent enterprise knowledge systems

进阶

Vector Database Comparison 2025: Pinecone vs Weaviate vs Qdrant vs pgvector

Performance benchmarks, cost analysis, and use case recommendations

高级

Cross-Encoder RAG: Implementation Guide with Qdrant 2026

Build a neural reranking for high-precision retrieval RAG system from scratch

进阶

Corrective RAG (CRAG): Advanced RAG Tutorial

Self-correcting RAG with web search fallback

入门

Chroma vs Qdrant: Which is Better for local vector database? (2026)

Detailed comparison of Chroma and Qdrant for local vector database

进阶

Weaviate Hybrid Search: Tutorial and Best Practices

Build production AI with Weaviate — vector + BM25 hybrid search

进阶

RAG with SQL: Advanced RAG Tutorial

Querying databases in natural language for structured RAG

进阶

RAG Evaluation with RAGAS: Advanced RAG Tutorial

Systematic evaluation of RAG pipeline quality

进阶

Multimodal RAG: Advanced RAG Tutorial

Retrieving and generating across text and images

进阶

Self-RAG Framework: Advanced RAG Tutorial

Self-reflective RAG that validates its own retrieval

入门

RAG Application Boilerplate: Starter Guide

Production-ready RAG application starter with all best practices

进阶

Semantic Chunking: Advanced RAG Tutorial

Intelligently splitting documents at semantic boundaries

进阶

Pinecone Serverless Vectors: Tutorial and Best Practices

Build production AI with Pinecone — managed serverless vector store

进阶

Graph RAG: Advanced RAG Tutorial

Knowledge graph enhanced retrieval for complex reasoning

高级

RAG系统从零到生产:2026年完整构建指南

向量数据库选型、分块策略、重排序优化——RAG最佳实践

高级

Advanced RAG 高级技巧完整指南 2026:超越基础检索,构建生产级知识库

解决 RAG 幻觉、检索不准、上下文丢失三大核心问题

进阶

Building RAG Applications: The Complete Production Guide 2025

From simple document Q&A to enterprise-grade RAG systems that actually work

进阶

LlamaIndex 实战指南:RAG 应用开发从入门到生产

LlamaIndex vs LangChain 怎么选?5个真实场景代码示例

进阶

向量数据库选型指南:Pinecone vs Weaviate vs Chroma vs Qdrant(2026)

RAG 和 AI Agent 应用的数据库底座怎么选?4款主流向量数据库深度对比

高级

RAG高级技术:分块策略、重排序和混合检索的工程优化

从基础RAG到生产级RAG,解决准确率、延迟和成本的工程挑战

进阶

Dify 企业私有知识库完全搭建指南:RAG 配置与最佳实践(2026)

从部署到调优,手把手搭建企业级 RAG 知识库问答系统

进阶

LlamaIndex vs LangChain:到底该用哪个搭 RAG(2026 实战对比)

两个都用过半年后,我把选型逻辑讲清楚:什么项目用 LlamaIndex,什么项目用 LangChain

进阶

Qdrant vs Chroma:向量数据库怎么选(2026 选型指南)

Chroma 适合快速起步,Qdrant 扛得住生产——但分界线没那么绝对

进阶

AI in the Food and Beverage Industry: From Recipe Development to Quality Control

How AI is transforming product development, supply chain, and food safety

进阶

MongoDB + Atlas Vector Search: How to Add AI search to MongoDB (2026)

Complete integration guide for MongoDB and Atlas Vector Search

进阶

E5 Multilingual Embeddings

Multilingual semantic search with E5 embeddings

入门

LlamaIndex RAG Template: Starter Guide

Production LlamaIndex RAG project with evaluation

入门

Your First RAG Application: Beginner's Complete Guide

Everything a beginner needs to know about your first rag application

进阶

RAG System Design Best Practices: 2026 Developer Guide

Essential practices every AI developer should follow for rag system design

入门

Understanding Embeddings for Beginners: Beginner's Complete Guide

Everything a beginner needs to know about understanding embeddings for beginners