AI Agent News

实时追踪 AI Agent 赛道的重大事件、融资动向、模型发布和技术突破

AI Agent 动态

最新行业资讯

实时追踪 AI Agent 赛道的重大事件、融资动向、模型发布和技术突破

重大事件时间线

2026-01

OpenClaw GitHub 爆发

OpenClaw 10 天冲上 GitHub 全球 Top 10,超越 Linux 内核 Star 增速

2025-12

Meta 20亿收购 Manus

Meta 以 20 亿美元收购 Manus AI,通用 Agent 赛道正式被巨头锁定

2025-04

DeepSeek-V3 开源

性价比之王,成本仅 GPT-4 的 5%

2025-03

Manus 一夜爆火

全球首款通用 AI Agent 在国内社交平台引发空前关注

2025-02

OpenAI Deep Research

OpenAI 推出深度研究 Agent,一键生成专业研究报告

2025-02

MCP Server 破 500

MCP 生态爆发,3 个月构建 500+ Server

2025-01

DeepSeek-R1 震惊全球

开源推理模型,成本仅 OpenAI 的 3%,引发全球 AI 格局震动

2024-11

MCP 协议诞生

Anthropic 发布 Model Context Protocol,成为 Agent 接口事实标准

2024-10

Claude Computer Use

Anthropic 让 AI 首次直接操控电脑屏幕,开创计算机使用新范式

2024-09

Replit Agent 全栈自动化

自然语言到上线产品,面向非工程师

2024-08

Cursor ARR 破亿

史上增长最快 SaaS,AI 编程工具新王者

2024-06

Claude 3.5 登顶 SWE-bench

最强编程 AI,Bug 修复能力达到初级工程师水平

2024-03

Devin 发布

全球首个自主 AI 软件工程师,能独立完成完整编程任务

AI Safety2025年8月3日

Nature and Science Retract 150 AI-Generated Papers With Fabricated Data

Major scientific journals retract 150 papers found to contain AI-generated fabricated data and images. Publishers announce mandatory AI-assisted authorship disclosure and enhanced fraud detection screening.

Nature
AI Safety2025年7月28日

AI Synthetic Media Detection in 2025 Elections: Global Challenges and Solutions

Election security agencies report detecting over 100,000 AI-generated synthetic media pieces targeting elections across 15 countries. Coalitions form to deploy detection tools and voter education campaigns.

OECD AI Policy Observatory
AI Safety2025年6月3日

Tech Giants Launch Coalition for AI-Generated Content Detection Standards

Microsoft, Google, Meta, and Adobe launch the Content Authenticity Initiative for AI, establishing watermarking and provenance standards for AI-generated images, video, and audio.

Content Authenticity Initiative
AI Safety2025年5月15日

Anthropic Publishes Updated Model Spec: New Guidelines for AI Behavior

Anthropic releases comprehensive update to Claude Model Spec, detailing new guidelines for handling sensitive topics, improved calibration for confidence expressions, and enhanced corrigibility principles.

Anthropic
AI Safety2025年5月1日

Anthropic's Mechanistic Interpretability Research Finds 'Features' in Claude's Reasoning

Anthropic published landmark interpretability research identifying thousands of 'features' — linear representations of concepts — inside Claude's neural network activations. Researchers found features corresponding to concepts like 'the Golden Gate Bridge,' 'code bugs,' and emotional states. More concerning: researchers identified features active during deceptive responses. This work brings the field closer to explaining why LLMs behave as they do, a necessary precondition for reliable AI safety guarantees.

Anthropic
AI Safety2025年4月28日

Anthropic Achieves Breakthrough in Mechanistic Interpretability Research

Anthropic researchers publish landmark paper on mechanistic interpretability, successfully mapping how Claude represents concepts internally and identifying circuits responsible for safety behaviors.

Anthropic Research
AI Safety2025年3月5日

OpenAI Establishes Safety and Security Committee, Releases Enhanced Model Security Guidelines

OpenAI has established a permanent Safety and Security Committee following months of internal and external pressure around AI safety practices. The committee has published new Model Security Guidelines covering red teaming requirements, catastrophic risk thresholds, and mandatory security reviews before major model releases. OpenAI also announced a formal vulnerability disclosure program for AI-specific security issues, with bounties up to $100,000 for critical AI safety vulnerabilities.

OpenAI
AI Safety2025年3月1日

Anthropic Publishes Constitutional AI Safety Update: Claude 3.7 Security and Jailbreak Resistance

Anthropic has released its most comprehensive AI safety update to date, detailing Constitutional AI improvements in Claude 3.7 that reduce harmful output by 89% and jailbreak attempts by 94% compared to Claude 2. The report includes new safety benchmarks, red team findings from 200+ external researchers, and a technical specification of the Responsible Scaling Policy (RSP) thresholds that would trigger halting development of more powerful models. Anthropic also published ASL-3 requirements—the safety bar required before deploying models with potential for CBRN uplift.

Anthropic