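Tutorial Center
AI Agents from beginner to hands-on practice: core concepts, using MCP, platform walkthroughs, workflow automation
1252
Total tutorials
234
Beginner tutorials
42
Hands-on tutorials
Browse by topic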
Deploy TinyLlama 1.1B on Raspberry Pi 5 — Home automation assistant
Complete setup guide for running TinyLlama 1.1B locally on a Raspberry Pi 5 as a home automation assistant
Overview: Run TinyLlama 1.1B directly on a Raspberry Pi 5 as a home automation assistant. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: ARM CPU · 4GB RAM
Deploy Llama 3.1 8B on Apple MacBook M3 — Offline productivity AI
Complete setup guide for running Llama 3.1 8B locally on an Apple MacBook M3 for offline productivity AI
Overview: Run Llama 3.1 8B directly on an Apple MacBook M3 for offline productivity AI. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: Apple Silicon · 16-96GB
Deploy Any GGUF Model on Ollama Local Server — Local development AI
Complete setup guide for running any GGUF model on a local Ollama server for local AI development
Overview: Run any GGUF model directly on a local Ollama server for local AI development. Local inference keeps data on the machine, avoids network round trips, and has no ongoing API costs. **Specs**: CPU/GPU auto · Variable
Deploy CF AI Models on Cloudflare Workers AI — Edge CDN inference
Complete setup guide for running CF AI Models on Cloudflare Workers AI for edge CDN inference
Overview: Run CF AI Models on Cloudflare Workers AI for edge CDN inference. Inference runs serverlessly on Cloudflare's network, close to end users, rather than on your own hardware. **Specs**: V8 isolates · Serverless
Deploy Mistral 7B on Intel Core Ultra Laptop — Laptop inference
Complete setup guide for running Mistral 7B locally on an Intel Core Ultra laptop for laptop inference
Overview: Run Mistral 7B directly on an Intel Core Ultra laptop. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: Intel NPU · 16-32GB
Deploy Mistral 7B Q4 on Fly.io Machines — Geo-distributed AI
Complete setup guide for running Mistral 7B Q4 on Fly.io Machines for geo-distributed AI
Overview: Run Mistral 7B Q4 on Fly.io Machines for geo-distributed AI. Self-hosting the model keeps prompts off third-party model APIs and avoids per-token API fees. **Specs**: Micro VMs · 8GB
Deploy Ollama + Open WebUI on Docker Compose Stack — Self-hosted AI stack
Complete setup guide for running Ollama + Open WebUI locally with Docker Compose as a self-hosted AI stack
Overview: Run Ollama + Open WebUI as a Docker Compose stack for a self-hosted AI setup. Local inference keeps data on your own hardware, avoids network round trips, and has no ongoing API costs. **Specs**: Container · 16GB
Deploy Any ONNX Model on ONNX Runtime CrossPlatform — Cross-platform deployment
Complete setup guide for running any ONNX model locally with ONNX Runtime for cross-platform deployment
Overview: Run any ONNX model with ONNX Runtime for cross-platform deployment. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: ONNX Runtime · …
Deploy Llama 3.1 8B on AWS Graviton3 — ARM cloud inference
Complete setup guide for running Llama 3.1 8B on AWS Graviton3 for ARM cloud inference
Overview: Run Llama 3.1 8B on AWS Graviton3 for ARM cloud inference. Self-hosting the model keeps prompts off third-party model APIs and avoids per-token API fees. **Specs**: ARM Neoverse · 32-256GB
Deploy Gemma 2B on Android Smartphone — On-device mobile AI
Complete setup guide for running Gemma 2B locally on an Android smartphone for on-device mobile AI
Overview: Run Gemma 2B directly on an Android smartphone for on-device mobile AI. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: Qualcomm NPU · 6-12GB
Deploy Phi-3 Mini on Web Browser WebGPU — Browser-native inference
Complete setup guide for running Phi-3 Mini locally in a web browser with WebGPU for browser-native inference
Overview: Run Phi-3 Mini directly in a web browser with WebGPU for browser-native inference. Local inference keeps data on the client device, avoids network round trips, and has no ongoing API costs. **Specs**: WebGPU · Client device
Deploy Llama 3.1 70B on vLLM Production Serving — High-throughput serving
Complete setup guide for serving Llama 3.1 70B with vLLM for high-throughput production serving
Overview: Serve Llama 3.1 70B with vLLM for high-throughput production serving. Self-hosted serving keeps data under your control and avoids per-token API fees. **Specs**: NVIDIA A100 · 80GB VRAM
Deploy GGUF Models on LM Studio Desktop — No-code local AI GUI
Complete setup guide for running GGUF models locally in LM Studio Desktop as a no-code local AI GUI
Overview: Run GGUF models directly in LM Studio Desktop, a no-code GUI for local AI. Local inference keeps data on the machine, avoids network round trips, and has no ongoing API costs. **Specs**: CPU/GPU · 8GB+
Deploy MobileNet variants on Google Coral Edge TPU — IoT vision AI
Complete setup guide for running MobileNet variants locally on a Google Coral Edge TPU for IoT vision AI
Overview: Run MobileNet variants directly on a Google Coral Edge TPU for IoT vision AI. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: Edge TPU · 1W power
Deploy Llama 3.2 3B on NVIDIA Jetson Orin — Robotics and edge AI
Complete setup guide for running Llama 3.2 3B locally on an NVIDIA Jetson Orin for robotics and edge AI
Overview: Run Llama 3.2 3B directly on an NVIDIA Jetson Orin for robotics and edge AI. Local inference keeps data on the device, avoids network round trips, and has no ongoing API costs. **Specs**: Ampere GPU · 8GB