教程中心

AI Agent 从入门到实战:概念理解、MCP 使用、平台实操、工作流自动化

1252

教程总数

234

入门教程

42

实操教程

进阶其他

Claude API vs OpenAI API: Which Should You Build With in 2026?

A developer honest comparison for production applications

Claude API vs OpenAI API 开发者对比(2026):Claude 强在 Agent 编码/1M 上下文标准价/指令遵循,OpenAI 强在多模态广度/生态体量。含模型阵容与官方定价、API 设计差异(思考控制/采样参数/缓存哲学)、生产级答案:网关路由两家都用。

claude apiopenai api
11分钟
高级其他

Fine-Tuning GPT-4o Mini: OpenAI Fine-Tuning API Complete Guide

When and how to fine-tune LLMs for domain-specific tasks

GPT-4o mini 微调完全指南(2026):用 OpenAI 微调 API 得到格式/风格稳定的托管模型、海量调用降本。含 JSONL 数据准备→上传→训练→调用真实代码、何时微调 vs 提示/RAG、数据质量 > 数量。

fine-tuninggpt-4o-mini
10分钟
高级其他

LangGraph Tutorial: Build Stateful AI Agents with Persistent Memory

Build complex multi-step AI workflows with state management using LangGraph

LangGraph enables AI agents with persistent state, conditional branching, and human-in-the-loop workflows. This tutorial builds a real research agent from scratch with memory, tool use, and error recovery.

langgraphai agents
18分钟
高级其他

Advanced Prompt Engineering: Techniques That Actually Work

Chain-of-thought, tree-of-thoughts, self-consistency, and systematic evaluation methods

Beyond basic prompting: master chain-of-thought, self-consistency sampling, tree-of-thoughts, constitutional AI prompting, and systematic evaluation techniques that reliably improve LLM performance.

prompt engineeringchain of thought
16分钟
高级其他

vLLM Production Deployment: Self-Host Llama 3 at Scale

Deploy Llama 3 with 20x higher throughput than naive serving

Deploy open-source LLMs in production with vLLM. Covers GPU selection, Docker setup, Kubernetes orchestration, AWQ quantization for 75% memory reduction, and cost comparison showing break-even vs OpenAI at 5M tokens/month.

vllmllm deployment
20分钟
高级其他

Technical Architecture for AI Startups: From Prototype to Scale

Build AI infrastructure that grows with your startup

Architecture guide for AI startups covering the evolution from prototype to production scale. Includes cost-effective infrastructure choices, avoiding common pitfalls, and when to invest in custom ML.

architecturestartup
35分钟
高级其他

LLM Fine-Tuning for Production: LoRA, QLoRA & RLHF in 2025

Adapt foundation models to your domain efficiently with parameter-efficient fine-tuning techniques

Fine-tuning LLMs allows adapting powerful foundation models to specific domains without training from scratch. This guide covers LoRA and QLoRA for parameter-efficient fine-tuning, dataset preparation and quality filtering, instruction tuning format, RLHF and DPO for alignment, fine-tuning on consumer GPUs with quantization, evaluation with domain benchmarks, and deploying fine-tuned models with vLLM or TGI for production serving.

Fine-tuningLoRA
24分钟
高级其他

Build a Production RAG System with LlamaIndex and Pinecone

Step-by-step guide to retrieval-augmented generation that works on real data

Most RAG tutorials only show the happy path. This guide builds a production-ready RAG system covering chunking strategies, embedding selection, reranking, evaluation, and edge case handling.

ragllamaindex
20分钟
高级其他

Vector Databases & RAG in Production: Pinecone, Weaviate & pgvector in 2025

Build production-grade retrieval-augmented generation systems with vector search at scale

Retrieval-Augmented Generation (RAG) is the dominant pattern for grounding LLMs with up-to-date knowledge. This guide covers vector database selection (Pinecone, Weaviate, Qdrant, pgvector), embedding model selection and optimization, chunking strategies for documents, hybrid search (vector + keyword), re-ranking, evaluating RAG quality, and deploying production RAG systems that stay accurate over time.

RAGVector Database
23分钟
高级其他

AI Agent Frameworks: LangChain, AutoGen & CrewAI for Production in 2025

Build reliable AI agents that use tools, plan multi-step tasks, and collaborate in teams

AI agents go beyond chatbots—they use tools, maintain memory, plan multi-step tasks, and collaborate with other agents. This guide compares LangChain, LangGraph, AutoGen, and CrewAI for different use cases, covers reliable agent design patterns, tool calling best practices, memory architectures (short-term, long-term, episodic), handling errors and hallucinations, and deploying production agents with observability.

AI AgentsLangChain
21分钟
高级其他

LLM Inference Optimization: vLLM, TensorRT-LLM, and Serving at Scale

PagedAttention, continuous batching, quantization, and production serving strategies

LLM 推理优化:vLLM、TensorRT-LLM 与规模化服务(2026):KV 缓存是瓶颈——PagedAttention + 连续批处理是最大吞吐杠杆。vLLM vs TensorRT-LLM 选型、量化/投机解码/前缀缓存/选小模型等其余手段。

LLM-inferencevLLM
11分钟
高级其他

CrewAI vs AutoGen vs LangGraph: Multi-Agent Framework 2026

Build production multi-agent systems with the right framework

Comprehensive comparison of CrewAI, AutoGen, and LangGraph for multi-agent AI systems. Covers role-based collaboration, conversation agents, state machines, and production deployment patterns.

crewaiautogen
20分钟
高级其他

Vector Database Guide 2026: Pinecone vs Qdrant vs pgvector vs Weaviate

Choose the right vector database for your RAG application performance and cost

Complete 2026 comparison of Pinecone, Qdrant, pgvector, and Weaviate. Includes Python code examples, performance benchmarks at 1M vectors, filtering, and self-hosting setup.

vector databasepinecone
18分钟
进阶其他

OpenAI API vs Anthropic API vs Gemini API: Developer Comparison 2026

Compare LLM APIs for developers: pricing, rate limits, SDKs, and production patterns

Complete developer comparison of OpenAI API, Anthropic API, and Google Gemini API for 2026. Covers authentication, streaming, function calling, structured output, rate limits, and cost comparison.

openai apianthropic api
14分钟
进阶其他

TypeScript AI Development: Building LLM Apps with Vercel AI SDK 2026

Build streaming AI applications with TypeScript, Next.js, and Vercel AI SDK

Complete TypeScript guide for AI application development using Vercel AI SDK. Covers streaming chat, tool calling, structured generation, multi-model routing, and production deployment.

typescriptvercel ai sdk
18分钟
高级其他

Fine-Tuning GPT-4 and Claude: When to Fine-Tune vs RAG 2026

Make the right architectural decision: fine-tuning or RAG for your LLM application

Comprehensive guide to deciding between fine-tuning and RAG for LLM applications. Covers fine-tuning GPT-4o mini, LoRA training with Hugging Face, cost comparison, and use case decision framework.

fine-tuningrag
22分钟
高级其他

AI System Design Patterns 2026: Rate Limiting, Caching, Fallbacks

Production patterns for reliable, cost-efficient AI applications

Essential system design patterns for production AI applications: token budgeting, response caching, fallback chains, circuit breakers, and monitoring. Reduce costs 60-80% while improving reliability.

ai engineeringsystem design
20分钟
高级其他

Python AI Development Stack 2026: FastAPI + LangChain + Supabase

Build production-ready AI applications with the modern Python AI stack

Complete guide to building production AI applications with FastAPI, LangChain, and Supabase in 2026. Covers project setup, async AI endpoints, RAG pipeline, vector search, and deployment.

fastapilangchain
20分钟
进阶其他

AI Application Testing: Evaluation Frameworks and Best Practices

Systematically test and evaluate AI-powered applications

Comprehensive guide to testing AI applications including unit testing LLM calls, evaluation frameworks like RAGAS and DeepEval, regression testing, and continuous evaluation in CI/CD.

testingevaluation
33分钟
进阶其他

Real-Time AI Streaming with WebSockets and SSE

Build responsive AI applications with streaming responses

Learn to implement real-time AI response streaming using Server-Sent Events and WebSockets. Build ChatGPT-like streaming UIs with Next.js and FastAPI.

streamingwebsockets
30分钟
高级其他

Building Real-Time AI Personalization Engines

Deliver hyper-personalized experiences at scale

Design and implement real-time personalization using AI, covering user profiling, content ranking, A/B testing, and multi-armed bandit algorithms for continuous optimization.

personalizationrecommendation
40分钟
高级其他

AI-Powered Code Review: Beyond Static Analysis

Use LLMs to review code for bugs, security, and quality

Build intelligent code review tools using LLMs that go beyond traditional linters. Detect security vulnerabilities, suggest refactoring, explain complex code, and enforce team conventions automatically.

code-reviewsecurity
35分钟
进阶其他

Gemini API Tutorial: 15x Cheaper Alternative to GPT-4o

Build multimodal AI apps at a fraction of GPT-4o cost

Complete Gemini API tutorial with multimodal inputs, function calling, Google Search grounding. Gemini Flash is 15-20x cheaper than GPT-4o for equivalent quality on many tasks. Includes setup and code examples.

gemini apigoogle ai
16分钟
进阶其他

AI Observability: Tracing and Monitoring LLM Applications

Debug, optimize, and monitor production AI systems

Learn to implement comprehensive observability for LLM applications using LangSmith, Langfuse, and Helicone. Monitor latency, costs, errors, and output quality in real-time.

observabilitymonitoring
32分钟
进阶其他

Advanced Prompt Engineering: Chain-of-Thought, Few-Shot & Structured Outputs in 2025

Master LLM prompting techniques that reliably produce high-quality, structured outputs

Prompt engineering has evolved from simple instructions to sophisticated techniques that dramatically improve LLM reliability and output quality. This guide covers chain-of-thought prompting, few-shot examples, self-consistency, ReAct (Reasoning + Acting), structured output extraction with Instructor and Pydantic, system prompt design, and building a prompt testing and versioning discipline.

Prompt EngineeringChain-of-Thought
18分钟
进阶其他

Multimodal AI: Building Vision-Language Applications with GPT-4V & Gemini in 2025

Leverage vision-language models for document intelligence, visual QA, and real-world automation

Multimodal AI combines vision and language understanding to unlock powerful real-world applications. This guide covers GPT-4V, Gemini 1.5 Pro, Claude 3 Opus vision capabilities, open-source models (LLaVA, Qwen-VL), document intelligence with OCR + LLM, building visual QA systems, video understanding, and deploying multimodal AI applications in production.

Multimodal AIVision-Language
20分钟
高级其他

LLM Inference Optimization: vLLM, TensorRT-LLM & Quantization in 2025

Achieve 10-50x throughput improvements for LLM serving through batching, quantization, and GPU optimization

Serving LLMs in production requires careful optimization to achieve cost-effective performance at scale. This guide covers continuous batching with vLLM, NVIDIA TensorRT-LLM for GPU-optimized inference, speculative decoding, flash attention, KV cache optimization, INT4/INT8 quantization with AWQ and GPTQ, and benchmarking LLM serving systems to find the right performance/cost tradeoff.

LLM InferencevLLM
23分钟
进阶其他

AI Inference Cost Optimization: Reduce LLM Costs by 80%

Practical techniques to cut AI API costs dramatically

Learn proven strategies to dramatically reduce AI inference costs including model selection, caching, batching, prompt optimization, and intelligent routing.

cost-optimizationinference
28分钟
进阶其他

Building AI-Powered Search with Semantic Retrieval

Replace keyword search with intelligent semantic understanding

Learn to build semantic search systems using embeddings, vector databases, and re-ranking. Covers hybrid search combining BM25 with dense retrieval for production search applications.

semantic-searchembeddings
35分钟
进阶其他

Build an AI ChatOps Bot for Slack: Automate DevOps Tasks with Natural Language

Slash commands, LLM orchestration, and tool integration for intelligent Slack workflows

Build a powerful AI-powered Slack bot for DevOps automation including deployment commands, incident management, on-call queries, and intelligent runbook execution via natural language.

ChatOpsSlack
30分钟
高级其他

AI-Powered DevOps: Intelligent Infrastructure Management and Incident Resolution

AIOps, automated root cause analysis, capacity planning, and self-healing systems

Implement AIOps practices including ML-powered anomaly detection, automated root cause analysis, predictive capacity planning, and self-healing infrastructure for modern cloud environments.

AIOpsDevOps
28分钟
进阶其他

AI-Powered Test Automation: Intelligent Test Generation and Self-Healing Tests

LLM test generation, visual testing, and auto-healing selectors for robust automation

Modernize QA automation with AI including LLM-generated test cases, visual regression testing with AI comparison, self-healing test selectors, and natural language test specification.

test-automationQA
22分钟
进阶其他

Model Context Protocol (MCP): Connect Claude and LLMs to Any Data Source

Building MCP servers for databases, APIs, and tools with Anthropic protocol

Learn to build Model Context Protocol (MCP) servers to connect Claude and other LLMs to databases, APIs, and custom tools, enabling powerful AI-native integrations for enterprise applications.

MCPAnthropic
25分钟
高级其他

Reducing LLM Hallucinations: Techniques That Actually Work in Production

RAG, self-consistency, chain-of-verification, and calibration for faithful AI outputs

Comprehensive guide to practical techniques for reducing LLM hallucinations in production systems, including RAG, retrieval verification, self-consistency sampling, and chain-of-verification prompting.

hallucinationLLM
30分钟
高级其他

AI-Powered Search and Autocomplete with Elasticsearch and LLMs

Semantic search, neural reranking, personalized suggestions, and query understanding

Build an intelligent search system combining Elasticsearch with AI for semantic understanding, neural reranking, personalized autocomplete, and query expansion for superior search relevance.

searchElasticsearch
32分钟
高级其他

Production Computer Vision with YOLO v11: Object Detection at Scale

Training, optimization, edge deployment, and real-time video processing with YOLO

Build production computer vision systems using YOLO v11 for object detection, including custom training, model optimization with TensorRT, edge deployment, and real-time video stream processing.

computer-visionYOLO
38分钟
高级其他

Production NER Systems: Fine-Tuning spaCy and Transformers for Custom Entities

Training custom NER models, handling low-resource scenarios, and deployment patterns

Build production Named Entity Recognition systems for custom entity types using spaCy and transformer models, covering annotation strategies, active learning, and deployment optimization.

NERNLP
32分钟
进阶其他

Production Sentiment Analysis: From BERT to LLM-Based Approaches in 2025

Fine-tuning DistilBERT, using LLMs as classifiers, and production deployment patterns

Build production sentiment analysis systems comparing traditional fine-tuned BERT approaches with modern LLM-based classification, including multi-aspect sentiment, emotion detection, and real-time analysis.

sentiment-analysisNLP
28分钟
高级其他

Production Document Q&A System: PDF Processing to Enterprise Deployment

Complete guide from PDF parsing to scalable enterprise document intelligence

Build a production document Q&A system from PDF parsing and chunking through vector indexing, RAG-based answering, citation extraction, and enterprise deployment with access controls.

document-QARAG
40分钟
高级其他

AI Anomaly Detection for Time Series: From Statistical to Deep Learning Approaches

Isolation Forest, LSTM Autoencoders, and production anomaly detection systems

Build production anomaly detection systems for time series data using statistical methods, isolation forest, LSTM autoencoders, and modern time series foundation models for infrastructure and IoT monitoring.

anomaly-detectiontime-series
35分钟
进阶其他

Build a Production RAG Application with LlamaIndex and Qdrant

Document ingestion, hybrid search, reranking, and evaluation with LlamaIndex

Complete guide to building a production RAG application using LlamaIndex for orchestration, Qdrant for vector storage, and comprehensive evaluation with LlamaIndex evaluation modules.

LlamaIndexRAG
35分钟
进阶其他

Building AI Translation and Localization Systems for Global Products

Neural machine translation, quality evaluation, and post-editing workflows

Design and implement AI-powered translation systems for global products using neural machine translation, LLM-based localization, quality estimation, and efficient human post-editing workflows.

translationlocalization
28分钟
高级其他

LLM Application Architecture Patterns: From Simple to Complex Systems

Simple chains, RAG, agents, and multi-agent patterns with decision frameworks

Comprehensive guide to LLM application architecture patterns from simple prompt-response to complex multi-agent systems, with a decision framework for choosing the right architecture.

architectureLLM
30分钟
进阶其他

LLM Structured Output: JSON Schema, Function Calling, and Pydantic Integration

Force reliable structured data extraction from LLMs with zero parsing failures

Master reliable structured output extraction from LLMs using JSON Schema mode, function calling, Pydantic validators, and instructor library for zero-failure parsing in production.

structured-outputJSON-schema
25分钟
高级其他

AI Content Moderation at Scale: Building Trust and Safety Systems

Multi-modal content classification, human review workflows, and policy enforcement

Design production-grade AI content moderation systems for text, images, and video, covering classification models, human review workflows, policy management, and appeals processes.

content-moderationtrust-safety
32分钟
进阶其他

Building AI Applications with PostgreSQL and pgvector: Complete Guide

Full-stack AI app with Supabase, pgvector, and Next.js for semantic search and RAG

Build a complete AI application using PostgreSQL with pgvector extension for vector storage, Supabase for backend, and Next.js for frontend, implementing semantic search and RAG functionality.

pgvectorPostgreSQL
35分钟
高级其他

Designing AI-Powered APIs: Best Practices for LLM-Backed Services

Rate limiting, streaming, idempotency, and versioning for AI APIs in production

Design patterns and best practices for building robust AI-powered REST and WebSocket APIs including streaming responses, idempotency, rate limiting, versioning, and managing non-deterministic outputs.

API-designstreaming
30分钟
进阶其他

Microsoft Semantic Kernel: Building Enterprise AI Applications

Plugins, planners, memory, and .NET/Python integration for enterprise AI orchestration

Build enterprise AI applications with Microsoft Semantic Kernel including plugin architecture, AI planners, memory management, and integration with Azure OpenAI for production-grade orchestration.

Semantic-KernelMicrosoft
26分钟
1 / 2下一页