AI Agent News
实时追踪 AI Agent 赛道的重大事件、融资动向、模型发布和技术突破
最新行业资讯
实时追踪 AI Agent 赛道的重大事件、融资动向、模型发布和技术突破
重大事件时间线
OpenClaw GitHub 爆发
OpenClaw 10 天冲上 GitHub 全球 Top 10,超越 Linux 内核 Star 增速
Meta 20亿收购 Manus
Meta 以 20 亿美元收购 Manus AI,通用 Agent 赛道正式被巨头锁定
DeepSeek-V3 开源
性价比之王,成本仅 GPT-4 的 5%
Manus 一夜爆火
全球首款通用 AI Agent 在国内社交平台引发空前关注
OpenAI Deep Research
OpenAI 推出深度研究 Agent,一键生成专业研究报告
MCP Server 破 500
MCP 生态爆发,3 个月构建 500+ Server
DeepSeek-R1 震惊全球
开源推理模型,成本仅 OpenAI 的 3%,引发全球 AI 格局震动
MCP 协议诞生
Anthropic 发布 Model Context Protocol,成为 Agent 接口事实标准
Claude Computer Use
Anthropic 让 AI 首次直接操控电脑屏幕,开创计算机使用新范式
Replit Agent 全栈自动化
自然语言到上线产品,面向非工程师
Cursor ARR 破亿
史上增长最快 SaaS,AI 编程工具新王者
Claude 3.5 登顶 SWE-bench
最强编程 AI,Bug 修复能力达到初级工程师水平
Devin 发布
全球首个自主 AI 软件工程师,能独立完成完整编程任务
OpenAI o3 Sets New Benchmarks: What Its Reasoning Capabilities Mean for AI Applications
Analysis of OpenAI's o3 reasoning model performance on frontier benchmarks. Covers ARC-AGI results, programming competitions, mathematical olympiads, and implications for enterprise AI applications requiring complex reasoning.
Anthropic Claude 4 Enterprise Features: Extended Thinking, Computer Use, and Agentic Workflows
Deep dive into Anthropic Claude 4's enterprise capabilities. Covers extended thinking mode for complex reasoning, computer use API for automated workflows, API improvements, and enterprise safety features for regulated industries.
Meta Llama 4 Scout and Maverick: Open Source AI Gets Multimodal at 10M Context Window
Meta releases Llama 4 family including Scout (native multimodal, 10M context) and Maverick (MoE architecture). Analysis of performance benchmarks, commercial licensing changes, and implications for the open-source AI ecosystem.
Mistral Codestral 2 Achieves Top Coding Benchmarks, Challenges GitHub Copilot Economics
Mistral AI releases Codestral 2, achieving state-of-the-art results on HumanEval and SWE-bench. Analysis of how this open model changes the economics of AI coding assistants and threatens closed-source incumbents.