中文
← Back to news
ModelsJun 29, 2026

World's First Latent Space World Model MWA Released, Tops Embodied Intelligence Leaderboard

Boundless Dynamics has officially released MWA™, the world's first 'long-sequence bidirectional physical causal chain' latent space world model. It employs a 'bidirectional dynamics' architecture for reasoning within a unified shared latent space and introduces time-chunk-level inverse dynamics modeling, enabling stable planning of long-cycle continuous action sequences exceeding 10 seconds. On the RoboCasa GR1 TableTop leaderboard, jointly initiated by Stanford University and others, MWA achieved a 75.2% average task success rate, ranking first globally and surpassing mainstream models like NVIDIA's GR00T-N1.6.

Technical Approach: Latent Space World Model + Reinforcement Learning

Boundless Dynamics has chosen a technical path distinct from mainstream VLA (Vision-Language-Action) approaches. Traditional VLA models rely on imitation learning and are sensitive to environmental changes such as lighting and object positions, resulting in poor generalization. MWA, on the other hand, builds an understanding of physical causality through a latent space world model and then combines reinforcement learning to convert understanding into precise execution strategies.

  • Latent Action Self-Supervised Pretraining: MWA uses 'Latent Action' as the carrier of physical causality. Through an inverse dynamics encoder, it autonomously extracts general scene interaction representations from changes between consecutive frames, without requiring manual action labels. This enables training on vast amounts of unlabeled video data from the internet.
  • Bidirectional Dynamics Architecture: The model simultaneously runs two reasoning lines: inverse dynamics (from effect to cause) and forward dynamics (from cause to effect). It introduces a 'forward-inverse mutual review mechanism' for causal alignment, improving reasoning accuracy.
  • Long-Sequence Chunk-Level Modeling: Breaking the limitations of single-step reasoning, it directly outputs continuous multi-step latent action chunks from visual sequences spanning over 10 seconds, significantly reducing the 'snowball effect' of error accumulation.

Key Innovations: Triple Gradient Constraints and Negative Sample Data System

During inference, MWA constructs deterministic policy boundaries in the latent space through triple gradient constraints: forward dynamics predicts future environmental changes and corrects deviations; policy outputs align with a frozen inverse dynamics encoder; and latent actions in the latent space are mapped to hardware-executable control sequences.

Additionally, Boundless Dynamics pioneered the AnyPhys negative sample core data system. Addressing the industry's issue of datasets being 'heavy on positive samples and light on negative ones,' it interweaves deep negative samples, boundary instability samples, suboptimal samples, and positive samples, providing the complete sample dimensions needed for dense reward training in reinforcement learning.

Funding and Market Performance

Boundless Dynamics has previously completed over $200 million in angel round financing, with its Pre-A round of nearly $200 million nearing completion. Investors include Sequoia China, Linear Capital, and a JD-affiliated fund. The company's founder and CEO, Zhang Yufeng, stated that the ultimate goal of the embodied brain is to give robots human-like world cognitive modeling capabilities, rather than fully replicating the objective world.

Also available in 中文.