教程中心

AI Agent 从入门到实战:概念理解、MCP 使用、平台实操、工作流自动化

1252

教程总数

234

入门教程

42

实操教程

高级其他

AI Model Quantization (GPTQ, AWQ): Complete Developer Guide 2026

Master AI Model Quantization (GPTQ, AWQ) with practical examples and production patterns

AI 模型量化(GPTQ/AWQ)完全指南(2026):用更少比特存权重以省显存/提速。GPTQ vs AWQ 对比、bitsandbytes/GGUF、4bit 甜点位选择,以及"直接下预量化权重 + vLLM/Ollama 部署"的实战路径。

quantizationgptq
10分钟
高级其他

LLM Fine-tuning with LoRA: Complete Developer Guide 2026

Master LLM Fine-tuning with LoRA with practical examples and production patterns

LoRA 微调大模型完全指南(2026):冻结基座、只训低秩适配器,单卡数小时完成;QLoRA 在 4bit 基座上训练适配器。含 PEFT 真实代码、何时该微调(vs 提示/RAG)、数据质量 > 数量的实战要点。

fine-tuninglora
11分钟
进阶其他

Streaming AI Responses with Server-Sent Events: Complete Developer Guide 2026

Master Streaming AI Responses with Server-Sent Events with practical examples and production patterns

用 SSE 实现 AI 流式响应(2026):为什么用 SSE 而非 WebSocket、FastAPI 服务端 + 浏览器 EventSource 客户端真实代码、关闭代理缓冲/逐 token flush/断连取消等生产要点,以及 Next.js 用 Vercel AI SDK 的更简路径。

streamingsse
9分钟
进阶其他

Semantic Search Implementation: Complete Developer Guide 2026

Master Semantic Search Implementation with practical examples and production patterns

语义搜索实现完全指南(2026):分块→嵌入→向量库存储→近邻检索→重排的完整管线,含真实代码、向量库选型(Chroma/Qdrant/pgvector/Pinecone)、分块/混合检索/重排/元数据过滤等质量杠杆。RAG 的检索底座。

semantic searchembeddings
10分钟
进阶其他

OpenAI Function Calling Complete Guide: Complete Developer Guide 2026

Master OpenAI Function Calling Complete Guide with practical examples and production patterns

OpenAI Function/Tool Calling 完全指南(2026):用 JSON Schema 定义工具→模型返回结构化调用→你执行并回填结果的完整循环,含真实代码、生产模式(校验/tool_choice/并行调用/strict)、与结构化输出的区别,以及它如何支撑 Agent。

openaifunction calling
11分钟
高级其他

Corrective RAG: Implementation Guide with Weaviate 2026

Build a self-correcting retrieval with quality assessment RAG system from scratch

Corrective RAG: Complete Implementation 2026 Overview Corrective RAG is a specialized retrieval pattern that focuses on self-correcting retrieval with quality assessment. This guide shows you how to build a production-ready system using Weaviate.

ragcorrective
30分钟
高级其他

Multi-Vector RAG: Implementation Guide with Weaviate 2026

Build a storing multiple embedding types per document RAG system from scratch

Multi-Vector RAG: Complete Implementation 2026 Overview Multi-Vector RAG is a specialized retrieval pattern that focuses on storing multiple embedding types per document. This guide shows you how to build a production-ready system using Weaviate.

ragmulti-vector
30分钟
高级其他

RAPTOR RAG: Implementation Guide with Pinecone 2026

Build a hierarchical document summarization for better context RAG system from scratch

RAPTOR RAG: Complete Implementation 2026 Overview RAPTOR RAG is a specialized retrieval pattern that focuses on hierarchical document summarization for better context. This guide shows you how to build a production-ready system using Pinecone. Why

ragraptor
30分钟
高级其他

Hybrid Search RAG: Implementation Guide with Elasticsearch 2026

Build a combining vector and keyword search for maximum recall RAG system from scratch

Hybrid Search RAG: Complete Implementation 2026 Overview Hybrid Search RAG is a specialized retrieval pattern that focuses on combining vector and keyword search for maximum recall. This guide shows you how to build a production-ready system using

raghybrid-search
30分钟
高级其他

Contextual Compression RAG: Implementation Guide with Pinecone 2026

Build a compressing retrieved context to fit LLM window RAG system from scratch

Contextual Compression RAG: Complete Implementation 2026 Overview Contextual Compression RAG is a specialized retrieval pattern that focuses on compressing retrieved context to fit LLM window. This guide shows you how to build a production-ready sy

ragcontextual-compression
30分钟
高级其他

Self-Query RAG: Implementation Guide with Qdrant 2026

Build a AI-generated metadata filters for precise retrieval RAG system from scratch

Self-Query RAG: Complete Implementation 2026 Overview Self-Query RAG is a specialized retrieval pattern that focuses on AI-generated metadata filters for precise retrieval. This guide shows you how to build a production-ready system using Qdrant.

ragself-query
30分钟
高级其他

Graph RAG: Implementation Guide with Neo4j 2026

Build a knowledge graph traversal for multi-hop reasoning RAG system from scratch

Graph RAG: Complete Implementation 2026 Overview Graph RAG is a specialized retrieval pattern that focuses on knowledge graph traversal for multi-hop reasoning. This guide shows you how to build a production-ready system using Neo4j. Why Graph RAG

raggraph
30分钟
高级其他

Parent Document RAG: Implementation Guide with Chroma 2026

Build a retrieving small chunks with large parent context RAG system from scratch

Parent Document RAG: Complete Implementation 2026 Overview Parent Document RAG is a specialized retrieval pattern that focuses on retrieving small chunks with large parent context. This guide shows you how to build a production-ready system using C

ragparent-document
30分钟
高级其他

Time-Aware RAG: Implementation Guide with Pinecone 2026

Build a weighting recent documents higher in retrieval RAG system from scratch

Time-Aware RAG: Complete Implementation 2026 Overview Time-Aware RAG is a specialized retrieval pattern that focuses on weighting recent documents higher in retrieval. This guide shows you how to build a production-ready system using Pinecone. Why

ragtime-aware
30分钟
高级其他

Cross-Encoder RAG: Implementation Guide with Qdrant 2026

Build a neural reranking for high-precision retrieval RAG system from scratch

Cross-Encoder RAG: Complete Implementation 2026 Overview Cross-Encoder RAG is a specialized retrieval pattern that focuses on neural reranking for high-precision retrieval. This guide shows you how to build a production-ready system using Qdrant.

ragcross-encoder
30分钟