教程中心

AI Agent 从入门到实战：概念理解、MCP 使用、平台实操、工作流自动化

1252

教程总数

234

入门教程

实操教程

按主题浏览

RAG 检索增强生成 AI Agent 与多智能体模型部署与生产化工作流与自动化 OpenAI 开发实战 Claude / Anthropic 开发 LangChain / LangGraph 模型微调与训练 Prompt 工程 MCP（Model Context Protocol）评估、测试与可观测 AI 安全与合规 API 与集成开发

高级其他

Fine-Tuning LLMs in 2025: When to Do It and How to Do It Right

The practical guide to fine-tuning language models for specific tasks and domains

Fine-tuning is often unnecessary—but when it's the right choice, it delivers significant improvements. This guide covers: when fine-tuning beats prompt engineering (with decision framework), LoRA and QLoRA parameter-efficient fine-tuning explained, preparing training data (quality over quantity), evaluating fine-tuned models, deploying fine-tuned models in production, and cost analysis across fine-tuning providers (OpenAI, Together AI, Fireworks AI, self-hosted). Includes hands-on examples with real training code.

fine-tuningLoRA

40分钟

高级其他

AI Evaluation Frameworks: How to Measure What Actually Matters

Building evaluation systems that catch real-world AI failures before they reach users

AI evaluation is the difference between AI that works in demos and AI that works in production. This guide covers building comprehensive eval suites: metric design for different task types, automated vs. LLM-based evaluation, human evaluation methodology, regression testing for model updates, A/B testing AI systems, and evaluation infrastructure using open source tools (RAGAS, HELM, DeepEval) and cloud platforms.

AI evaluationLLM testing

38分钟

高级其他

AI Agents in Production: Architecture Patterns and Reliability Engineering

Building AI agent systems that work reliably in enterprise production environments

AI agents—autonomous systems that use tools and make decisions to complete multi-step tasks—are moving into production at enterprise scale. This guide covers reliable agent architecture: tool design and error handling, state management for long-running agents, human-in-the-loop patterns, observability and debugging agents, graceful failure modes, security considerations, and testing strategies for non-deterministic systems.

AI agentsLangGraph

42分钟

高级其他

LLM Cost Optimization: Reduce AI API Costs by 80% Without Sacrificing Quality

Practical techniques for optimizing LLM API costs in production applications

LLM API costs can spiral quickly: a production application making 1M requests/day at $0.01 average = $3,000/month. This guide covers comprehensive cost optimization strategies: prompt compression, intelligent model routing (use GPT-4 only when needed), caching strategies, batch processing optimization, output length control, model selection framework, and architecture patterns that dramatically reduce per-request cost without meaningful quality degradation.

LLM costsAI optimization

35分钟

高级其他

Building Multimodal AI Applications: Text, Images, Audio, and Video

Practical guide to building applications that understand and generate multiple modalities

Multimodal AI—systems that understand and generate text, images, audio, and video together—enables a new category of AI applications. This guide covers multimodal model architectures (GPT-4V, Gemini Pro Vision, Claude 3 Vision), building vision-language applications, document intelligence with layout understanding, audio-language models for transcription and analysis, video understanding with temporal reasoning, and production deployment considerations for multimodal systems.

multimodal AIvision language models

38分钟

高级其他

Vector Databases for Production: Architecture, Performance, and Scaling

The complete technical guide to deploying vector databases at enterprise scale

Vector databases power modern AI applications: semantic search, RAG pipelines, recommendation systems, anomaly detection. This deep dive covers vector similarity search algorithms (HNSW, IVF, PQ), index architecture choices and performance tradeoffs, filtering strategies for hybrid search, distributed deployment patterns, benchmarking methodology, and scaling considerations from thousands to billions of vectors. Includes performance comparisons across Pinecone, Weaviate, Qdrant, pgvector, and Milvus.

vector databaseembeddings

40分钟

高级其他

Reducing LLM Hallucinations: Practical Techniques for Production Applications

Engineering solutions to the most persistent reliability problem in deployed AI systems

LLM hallucination—generating confident but false information—is the primary reliability challenge in production AI applications. This guide covers the root causes of hallucination, detection strategies (fact-checking layers, self-consistency checks, confidence calibration), mitigation techniques (RAG, constrained generation, chain-of-thought verification), and monitoring approaches for production systems. Includes benchmark data on hallucination rates across different model and technique combinations.

hallucinationLLM reliability

32分钟

高级其他

LangChain LCEL: Advanced Patterns for Production AI Applications

Master LangChain Expression Language for composable, streaming AI pipelines

LangChain Expression Language (LCEL) is the modern way to build composable LLM pipelines. This guide covers advanced LCEL patterns: parallel execution, streaming, dynamic routing, conditional chains, retry and fallback logic, tool use orchestration, and testing strategies. Includes production patterns for RAG applications, multi-step agents, and complex data transformation pipelines with real performance benchmarks.

LangChainLCEL

38分钟