教程中心
AI Agent 从入门到实战:概念理解、MCP 使用、平台实操、工作流自动化
2024
教程总数
368
入门教程
45
实操教程
按主题浏览
Multi-Agent AI System Orchestration: Building Complex AI Workflows
LangGraph, CrewAI, and human-in-the-loop patterns for production AI systems
Design and implement multi-agent AI systems using LangGraph, AutoGen, and CrewAI for complex task decomposition, parallel execution, and human-in-the-loop workflows.
AI API可靠性工程:速率限制、重试策略与故障转移
构建生产级AI应用的容错架构,确保99.9%可用性
详解构建高可靠AI应用的工程实践,包括速率限制处理、指数退避重试、多模型故障转移、断路器模式,以及SLO监控体系。
A/B Testing ML Models
Statistical A/B testing framework for model evaluation
A/B Testing ML Models Overview Statistical A/B testing framework for model evaluation. This guide covers practical implementation for production ML systems. Why This Matters in MLOps Modern ML systems require rigorous operations practices: - **Re
Fine-Tuning LLMs with LoRA and QLoRA: Complete Practical Guide 2025
Hugging Face PEFT, dataset preparation, training and deployment on consumer hardware
Step-by-step guide to fine-tuning large language models using LoRA and QLoRA with Hugging Face PEFT library, covering dataset preparation, training, evaluation, and deployment.
Multi-Provider Fallback: Production AI Architecture Guide 2026
How to implement automatically switching AI providers on failure
Multi-Provider Fallback: Production Architecture 2026 Overview **Multi-Provider Fallback** solves the challenge of automatically switching AI providers on failure. This guide covers the design decisions, implementation details, and trade-offs you n
AI System Design Interview: How to Design ML Systems at Scale
Recommendation systems, fraud detection, and search ranking at tech companies
Comprehensive guide to answering AI/ML system design interview questions at top tech companies. Covers recommendation systems, search ranking, fraud detection, and LLM applications.
AI Output Validation and Guardrails: Building Reliable LLM Pipelines
Pydantic validators, Guardrails AI, and content safety for production systems
Implement robust validation, safety guardrails, and output parsing for production LLM applications using Guardrails AI, Pydantic, and custom validation frameworks.
AI驱动的增长黑客:用机器学习优化用户获取与留存
用AI技术重塑增长策略,实现高效低成本的用户增长
探索如何利用AI和机器学习技术优化用户获取渠道、提升转化率、预测用户生命周期价值,以及建立数据驱动的增长飞轮。
LLM Context Window Management: Strategies for Long Documents
Chunking, hierarchical summarization, and retrieval-augmented approaches
Learn techniques to handle documents longer than LLM context windows including chunking, sliding windows, hierarchical summarization, and retrieval-augmented approaches.
Cross-Encoder RAG: Implementation Guide with Qdrant 2026
Build a neural reranking for high-precision retrieval RAG system from scratch
Cross-Encoder RAG: Complete Implementation 2026 Overview Cross-Encoder RAG is a specialized retrieval pattern that focuses on neural reranking for high-precision retrieval. This guide shows you how to build a production-ready system using Qdrant.
Deploy Llama 3.1 70B on vLLM Production Serving — High-throughput serving
Complete setup guide for running Llama 3.1 70B locally on vLLM Production Serving for high-throughput serving
Deploy Llama 3.1 70B on vLLM Production Serving Overview Run Llama 3.1 70B directly on vLLM Production Serving for high-throughput serving. Local inference offers privacy, zero latency, and no ongoing API costs. **Specs**: NVIDIA A100 · 80GB VRAM
LoRA Fine-tuning Guide: Hands-On Tutorial
Low-Rank Adaptation for efficient LLM fine-tuning — step-by-step implementation guide
LoRA Fine-tuning Guide Overview Low-Rank Adaptation for efficient LLM fine-tuning. This tutorial provides a complete, runnable implementation. Prerequisites ```bash Install required packages pip install transformers datasets peft trl accelerate b
ML Model Monitoring Dashboard
Building real-time model performance dashboards
ML Model Monitoring Dashboard Overview Building real-time model performance dashboards. This guide covers practical implementation for production ML systems. Why This Matters in MLOps Modern ML systems require rigorous operations practices: - **R
AI-First API Design: Production AI Architecture Guide 2026
How to implement designing APIs with AI capabilities as first-class features
AI-First API Design: Production Architecture 2026 Overview **AI-First API Design** solves the challenge of designing APIs with AI capabilities as first-class features. This guide covers the design decisions, implementation details, and trade-offs y
Email Management Agent: Complete Tutorial
Autonomous email reading, writing, and organization agent
Email Management Agent Overview Autonomous email reading, writing, and organization agent. This guide covers architecture, implementation, and production deployment of AI agents. Agent Architecture ``` User Input ↓ Agent Orchestrator ↓ ┌─
System Prompt Mechanics: Technical Deep Dive
How system prompts influence LLM behavior internally
System Prompt Mechanics: Technical Deep Dive Overview How system prompts influence LLM behavior internally. This comprehensive guide covers everything you need to know for production implementation. Why It Matters System Prompt Mechanics: Technic
Function Calling Internals: Technical Deep Dive
How OpenAI function calling works under the hood
Function Calling Internals: Technical Deep Dive Overview How OpenAI function calling works under the hood. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Function Calling Internals: Techn
KV Cache Optimization: Technical Deep Dive
How key-value caching accelerates autoregressive generation
KV Cache Optimization: Technical Deep Dive Overview How key-value caching accelerates autoregressive generation. This comprehensive guide covers everything you need to know for production implementation. Why It Matters KV Cache Optimization: Tech
Tokenization and Vocabulary: Technical Deep Dive
How LLMs tokenize text and why it matters for prompting
Tokenization and Vocabulary: Technical Deep Dive Overview How LLMs tokenize text and why it matters for prompting. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Tokenization and Vocabula
CI/CD for AI Systems: Production Setup Guide
Continuous integration and deployment pipelines for AI
CI/CD for AI Systems Overview Continuous integration and deployment pipelines for AI. This guide provides practical, production-ready implementations. **Category**: ai-infrastructure **Primary Tool**: github-actions **Tags**: infrastructure, d
Thinking Models Deep Dive: 2025 Guide
Extended thinking in Claude and its practical applications
Thinking Models Deep Dive: 2025 Guide Overview Extended thinking in Claude and its practical applications Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI()
API Orchestration Agent: Complete Tutorial
Agent that autonomously calls and chains API operations
API Orchestration Agent Overview Agent that autonomously calls and chains API operations. This guide covers architecture, implementation, and production deployment of AI agents. Agent Architecture ``` User Input ↓ Agent Orchestrator ↓ ┌──
Agentic AI Workflows: 2025 Guide
Long-horizon planning and execution for AI agents
Agentic AI Workflows: 2025 Guide Overview Long-horizon planning and execution for AI agents Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI() class Handle
Testing Automation Agent: Complete Tutorial
AI agent that generates and runs automated tests
Testing Automation Agent Overview AI agent that generates and runs automated tests. This guide covers architecture, implementation, and production deployment of AI agents. Agent Architecture ``` User Input ↓ Agent Orchestrator ↓ ┌────────
Temperature and Sampling: Technical Deep Dive
How temperature, top-p, and top-k affect LLM outputs
Temperature and Sampling: Technical Deep Dive Overview How temperature, top-p, and top-k affect LLM outputs. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Temperature and Sampling: Techn
Zero-Downtime AI Deployments
Rolling updates and blue-green deployments for AI services
Zero-Downtime AI Deployments Overview Rolling updates and blue-green deployments for AI services Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI() class H
Fine-tuning for Code Generation: Hands-On Tutorial
Domain-specific fine-tuning for code completion and generation — step-by-step implementation guide
Fine-tuning for Code Generation Overview Domain-specific fine-tuning for code completion and generation. This tutorial provides a complete, runnable implementation. Prerequisites ```bash Install required packages pip install transformers datasets
AI Cost Governance: Production AI Architecture Guide 2026
How to implement policies and systems to control AI spending
AI Cost Governance: Production Architecture 2026 Overview **AI Cost Governance** solves the challenge of policies and systems to control AI spending. This guide covers the design decisions, implementation details, and trade-offs you need to know.
Prometheus ML Metrics
Instrumenting ML services with Prometheus metrics
Prometheus ML Metrics Overview Instrumenting ML services with Prometheus metrics. This guide covers practical implementation for production ML systems. Why This Matters in MLOps Modern ML systems require rigorous operations practices: - **Reliabi
Hierarchical Agent System: Complete Tutorial
Multi-level agent hierarchy with delegation and reporting
Hierarchical Agent System Overview Multi-level agent hierarchy with delegation and reporting. This guide covers architecture, implementation, and production deployment of AI agents. Agent Architecture ``` User Input ↓ Agent Orchestrator ↓
Conversation State Management: Production AI Architecture Guide 2026
How to implement managing multi-turn chat state in distributed systems
Conversation State Management: Production Architecture 2026 Overview **Conversation State Management** solves the challenge of managing multi-turn chat state in distributed systems. This guide covers the design decisions, implementation details, an
Terraform AI Infrastructure: Production Setup Guide
Infrastructure as code for AI platform deployment
Terraform AI Infrastructure Overview Infrastructure as code for AI platform deployment. This guide provides practical, production-ready implementations. **Category**: ai-infrastructure **Primary Tool**: terraform **Tags**: infrastructure, devo
Hugging Face SFT Trainer: Hands-On Tutorial
Supervised fine-tuning with Hugging Face TRL SFTTrainer — step-by-step implementation guide
Hugging Face SFT Trainer Overview Supervised fine-tuning with Hugging Face TRL SFTTrainer. This tutorial provides a complete, runnable implementation. Prerequisites ```bash Install required packages pip install transformers datasets peft trl acce
Prompt Sensitivity in LLMs: Technical Deep Dive
Why small prompt changes can cause large output variations
Prompt Sensitivity in LLMs: Technical Deep Dive Overview Why small prompt changes can cause large output variations. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Prompt Sensitivity in L
Chain-of-Thought Reasoning: Technical Deep Dive
Why step-by-step prompting dramatically improves LLM accuracy
Chain-of-Thought Reasoning: Technical Deep Dive Overview Why step-by-step prompting dramatically improves LLM accuracy. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Chain-of-Thought Rea
AI Service Mesh Patterns
Advanced service mesh configurations for AI workloads
AI Service Mesh Patterns Overview Advanced service mesh configurations for AI workloads Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI() class Handler:
Deploy GGUF Models on LM Studio Desktop — No-code local AI GUI
Complete setup guide for running GGUF Models locally on LM Studio Desktop for no-code local AI GUI
Deploy GGUF Models on LM Studio Desktop Overview Run GGUF Models directly on LM Studio Desktop for no-code local AI GUI. Local inference offers privacy, zero latency, and no ongoing API costs. **Specs**: CPU/GPU · 8GB+ Installation ```bash Insta
DPO: Direct Preference Optimization: Hands-On Tutorial
Simplified alignment using Direct Preference Optimization — step-by-step implementation guide
DPO: Direct Preference Optimization Overview Simplified alignment using Direct Preference Optimization. This tutorial provides a complete, runnable implementation. Prerequisites ```bash Install required packages pip install transformers datasets
ML Metadata Management
Tracking ML artifacts, lineage, and provenance with MLMD
ML Metadata Management Overview Tracking ML artifacts, lineage, and provenance with MLMD. This guide covers practical implementation for production ML systems. Why This Matters in MLOps Modern ML systems require rigorous operations practices: - *
Embedding Spaces Visualized: Technical Deep Dive
Understanding high-dimensional embedding spaces intuitively
Embedding Spaces Visualized: Technical Deep Dive Overview Understanding high-dimensional embedding spaces intuitively. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Embedding Spaces Visu
Constitutional AI Principles: Technical Deep Dive
How Anthropic implemented Constitutional AI for Claude
Constitutional AI Principles: Technical Deep Dive Overview How Anthropic implemented Constitutional AI for Claude. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Constitutional AI Princip
Code Generation Agents: 2025 Guide
Fully autonomous software development AI agents
Code Generation Agents: 2025 Guide Overview Fully autonomous software development AI agents Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI() class Handle
Deploy MobileNet variants on Google Coral Edge TPU — IoT vision AI
Complete setup guide for running MobileNet variants locally on Google Coral Edge TPU for IoT vision AI
Deploy MobileNet variants on Google Coral Edge TPU Overview Run MobileNet variants directly on Google Coral Edge TPU for IoT vision AI. Local inference offers privacy, zero latency, and no ongoing API costs. **Specs**: Edge TPU · 1W power Install
GitOps for AI Deployments
Managing AI deployments with GitOps and ArgoCD
GitOps for AI Deployments Overview Managing AI deployments with GitOps and ArgoCD Implementation ```python from openai import OpenAI from pydantic import BaseModel from typing import Optional import json client = OpenAI() class Handler: """
Mixture of Experts Architecture: Technical Deep Dive
Understanding MoE models like Mixtral and GPT-4
Mixture of Experts Architecture: Technical Deep Dive Overview Understanding MoE models like Mixtral and GPT-4. This comprehensive guide covers everything you need to know for production implementation. Why It Matters Mixture of Experts Architectu
Research Agent with Citations: Complete Tutorial
Autonomous research agent that cites sources accurately
Research Agent with Citations Overview Autonomous research agent that cites sources accurately. This guide covers architecture, implementation, and production deployment of AI agents. Agent Architecture ``` User Input ↓ Agent Orchestrator
Deploy Llama 3.2 3B on NVIDIA Jetson Orin — Robotics and edge AI
Complete setup guide for running Llama 3.2 3B locally on NVIDIA Jetson Orin for robotics and edge AI
Deploy Llama 3.2 3B on NVIDIA Jetson Orin Overview Run Llama 3.2 3B directly on NVIDIA Jetson Orin for robotics and edge AI. Local inference offers privacy, zero latency, and no ongoing API costs. **Specs**: Ampere GPU · 8GB Installation ```bash
Model Routing Rules Engine: Production AI Architecture Guide 2026
How to implement intelligently routing requests to optimal models
Model Routing Rules Engine: Production Architecture 2026 Overview **Model Routing Rules Engine** solves the challenge of intelligently routing requests to optimal models. This guide covers the design decisions, implementation details, and trade-off