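Tutorial Center

AI Agents from beginner to hands-on practice: understanding the concepts, using MCP, working on platforms, and automating workflows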

1252

Total tutorials

234

Beginner tutorials

Hands-on tutorials

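Browse by topic

RAG (Retrieval-Augmented Generation); AI Agents and Multi-Agent Systems; Model Deployment and Productionization; Workflows and Automation; OpenAI Development in Practice; Claude / Anthropic Development; LangChain / LangGraph; Model Fine-Tuning and Training; Prompt Engineering; MCP (Model Context Protocol); Evaluation, Testing, and Observability; AI Safety and Compliance; API and Integration Development

Advanced, Other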

Claude API vs OpenAI API: Which Should You Build With in 2026?

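An honest developer comparison for production applications

Claude API vs OpenAI API developer comparison (2026): Claude is stronger at agentic coding, a 1M-token context window at standard pricing, and instruction following; OpenAI is stronger in multimodal breadth and ecosystem size. Covers model lineups and official pricing, API design differences (thinking control, sampling parameters, caching philosophy), and the production-grade answer: use both providers behind a routing gateway.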

claude api, openai api

11 min

Advanced, Other

OpenAI API vs Anthropic API vs Gemini API: Developer Comparison 2026

Compare LLM APIs for developers: pricing, rate limits, SDKs, and production patterns

Complete developer comparison of OpenAI API, Anthropic API, and Google Gemini API for 2026. Covers authentication, streaming, function calling, structured output, rate limits, and cost comparison.

openai api, anthropic api

14 min

Advanced, Other

TypeScript AI Development: Building LLM Apps with Vercel AI SDK 2026

Build streaming AI applications with TypeScript, Next.js, and Vercel AI SDK

Complete TypeScript guide for AI application development using Vercel AI SDK. Covers streaming chat, tool calling, structured generation, multi-model routing, and production deployment.

typescript, vercel ai sdk

18 min

Advanced, Other

AI Application Testing: Evaluation Frameworks and Best Practices

Systematically test and evaluate AI-powered applications

Comprehensive guide to testing AI applications including unit testing LLM calls, evaluation frameworks like RAGAS and DeepEval, regression testing, and continuous evaluation in CI/CD.

testing, evaluation

33 min

Advanced, Other

Real-Time AI Streaming with WebSockets and SSE

Build responsive AI applications with streaming responses

Learn to implement real-time AI response streaming using Server-Sent Events and WebSockets. Build ChatGPT-like streaming UIs with Next.js and FastAPI.

streaming, websockets

30 min

Advanced, Other

Gemini API Tutorial: 15x Cheaper Alternative to GPT-4o

Build multimodal AI apps at a fraction of GPT-4o cost

Complete Gemini API tutorial covering multimodal inputs, function calling, and Google Search grounding. Gemini Flash is 15-20x cheaper than GPT-4o for equivalent quality on many tasks. Includes setup and code examples.

gemini api, google ai

16 min

Advanced, Other

AI Observability: Tracing and Monitoring LLM Applications

Debug, optimize, and monitor production AI systems

Learn to implement comprehensive observability for LLM applications using LangSmith, Langfuse, and Helicone. Monitor latency, costs, errors, and output quality in real-time.

observability, monitoring

32 min

Advanced, Other

Advanced Prompt Engineering: Chain-of-Thought, Few-Shot & Structured Outputs in 2025

Master LLM prompting techniques that reliably produce high-quality, structured outputs

Prompt engineering has evolved from simple instructions to sophisticated techniques that dramatically improve LLM reliability and output quality. This guide covers chain-of-thought prompting, few-shot examples, self-consistency, ReAct (Reasoning + Acting), structured output extraction with Instructor and Pydantic, system prompt design, and building a prompt testing and versioning discipline.

Prompt Engineering, Chain-of-Thought

18 min

Advanced, Other

Multimodal AI: Building Vision-Language Applications with GPT-4V & Gemini in 2025

Leverage vision-language models for document intelligence, visual QA, and real-world automation

Multimodal AI combines vision and language understanding to unlock powerful real-world applications. This guide covers GPT-4V, Gemini 1.5 Pro, Claude 3 Opus vision capabilities, open-source models (LLaVA, Qwen-VL), document intelligence with OCR + LLM, building visual QA systems, video understanding, and deploying multimodal AI applications in production.

Multimodal AI, Vision-Language

20 min

Advanced, Other

AI Inference Cost Optimization: Reduce LLM Costs by 80%

Practical techniques to cut AI API costs dramatically

Learn proven strategies to dramatically reduce AI inference costs including model selection, caching, batching, prompt optimization, and intelligent routing.

cost-optimization, inference

28 min

Advanced, Other

Building AI-Powered Search with Semantic Retrieval

Replace keyword search with intelligent semantic understanding

Learn to build semantic search systems using embeddings, vector databases, and re-ranking. Covers hybrid search combining BM25 with dense retrieval for production search applications.

semantic-search, embeddings

35 min

Advanced, Other

Build an AI ChatOps Bot for Slack: Automate DevOps Tasks with Natural Language

Slash commands, LLM orchestration, and tool integration for intelligent Slack workflows

Build a powerful AI-powered Slack bot for DevOps automation including deployment commands, incident management, on-call queries, and intelligent runbook execution via natural language.

ChatOps, Slack

30 min

Advanced, Other

AI-Powered Test Automation: Intelligent Test Generation and Self-Healing Tests

LLM test generation, visual testing, and auto-healing selectors for robust automation

Modernize QA automation with AI including LLM-generated test cases, visual regression testing with AI comparison, self-healing test selectors, and natural language test specification.

test-automation, QA

22 min

Advanced, Other

Model Context Protocol (MCP): Connect Claude and LLMs to Any Data Source

Building MCP servers for databases, APIs, and tools with Anthropic's protocol

Learn to build Model Context Protocol (MCP) servers to connect Claude and other LLMs to databases, APIs, and custom tools, enabling powerful AI-native integrations for enterprise applications.

MCP, Anthropic

25 min

Advanced, Other

Production Sentiment Analysis: From BERT to LLM-Based Approaches in 2025

Fine-tuning DistilBERT, using LLMs as classifiers, and production deployment patterns

Build production sentiment analysis systems comparing traditional fine-tuned BERT approaches with modern LLM-based classification, including multi-aspect sentiment, emotion detection, and real-time analysis.

sentiment-analysis, NLP

28 min

Advanced, Other

Build a Production RAG Application with LlamaIndex and Qdrant

Document ingestion, hybrid search, reranking, and evaluation with LlamaIndex

Complete guide to building a production RAG application using LlamaIndex for orchestration, Qdrant for vector storage, and comprehensive evaluation with LlamaIndex evaluation modules.

LlamaIndex, RAG

35 min

Advanced, Other

Building AI Translation and Localization Systems for Global Products

Neural machine translation, quality evaluation, and post-editing workflows

Design and implement AI-powered translation systems for global products using neural machine translation, LLM-based localization, quality estimation, and efficient human post-editing workflows.

translation, localization

28 min

Advanced, Other

LLM Structured Output: JSON Schema, Function Calling, and Pydantic Integration

Force reliable structured data extraction from LLMs with zero parsing failures

Master reliable structured output extraction from LLMs using JSON Schema mode, function calling, Pydantic validators, and the Instructor library for zero-failure parsing in production.

structured-output, JSON-schema

25 min

Advanced, Other

Building AI Applications with PostgreSQL and pgvector: Complete Guide

Full-stack AI app with Supabase, pgvector, and Next.js for semantic search and RAG

Build a complete AI application using PostgreSQL with the pgvector extension for vector storage, Supabase for the backend, and Next.js for the frontend, implementing semantic search and RAG functionality.

pgvector, PostgreSQL

35 min

Advanced, Other

Microsoft Semantic Kernel: Building Enterprise AI Applications

Plugins, planners, memory, and .NET/Python integration for enterprise AI orchestration

Build enterprise AI applications with Microsoft Semantic Kernel including plugin architecture, AI planners, memory management, and integration with Azure OpenAI for production-grade orchestration.

Semantic-Kernel, Microsoft

26 min

Advanced, Other

AI Agent Autonomy Levels: From Copilots to Fully Autonomous Systems

Design patterns for different levels of AI agent autonomy in enterprise applications

Understand the spectrum of AI agent autonomy levels and how to design appropriate human-AI collaboration patterns for different business contexts and risk tolerances.

AI-agents, autonomy

25 min

Advanced, Other

AI Embedding Models Comparison 2025: OpenAI vs Cohere vs Open Source

Benchmarking text embeddings on MTEB for retrieval, classification, and semantic similarity

Comprehensive comparison of text embedding models on MTEB benchmark including OpenAI text-embedding-3, Cohere Embed v3, BGE, E5, and other open source models for production RAG systems.

embeddings, MTEB

25 min

Advanced, Other

AI Document Processing: Extract Structured Data from PDFs and Scanned Documents

OCR, layout analysis, entity extraction, and building document intelligence pipelines

Build production document processing pipelines using AI for extracting structured data from PDFs, invoices, contracts, and scanned documents with high accuracy.

document-AI, OCR

26 min
