New Paradigms for Latent Space World Models: Boundless Dynamics' MWA and FaceMind's LoopWM Released
Two Chinese startups recently unveiled new paradigms for latent space world models in embodied AI. Boundless Dynamics launched MWA™, the world's first long-sequence bidirectional physical causal chain latent space world model, achieving 75.2% average task success rate on the RoboCasa GR1 TableTop benchmark, surpassing models like NVIDIA's GR00T-N1.6. FaceMind released Looped World Model (LoopWM), proposing "iterative latent depth" as a new scaling axis, achieving results surpassing large models on ScienceWorld and AlfWorld tasks with approximately 1B parameters, claiming up to 100× parameter efficiency.
Boundless Dynamics MWA: Long-Sequence Bidirectional Physical Causal Chain
MWA™ operates entirely in latent space, skipping pixel-level redundant computation, extracting "latent actions" as the underlying representation of scene interaction changes, eliminating reliance on manual action labels and enabling direct training on massive unlabeled internet videos.
Its core architecture is "bidirectional dynamics": an inverse dynamics model (IDM) for "effect-to-cause" and a forward dynamics model (FDM) for "cause-to-effect," coordinated through a "forward-inverse mutual review mechanism." MWA™ pioneers temporal chunk-level inverse dynamics modeling, breaking traditional single-step instantaneous inference limitations, enabling stable planning of continuous action sequences over 10 seconds, forming multi-step latent action chunks that significantly reduce error accumulation.
During inference, MWA™ introduces triple gradient constraints: FDM predicts future changes, policy output aligns with frozen IDM baseline, and latent actions map to hardware control sequences. Through positive-negative feedback loops, it defines action forbidden zones and recommended intervals in latent space.
Additionally, Boundless Dynamics pioneered the AnyPhys negative sample core data system, interweaving deep negative samples, boundary instability samples, etc., with positive samples to provide full-dimensional samples for dense reinforcement learning training. The company has completed over $200 million in angel round financing, with Pre-A round of nearly $200 million nearing completion, backed by investors including Sequoia China, Linear Capital, and JD-affiliated funds.
FaceMind LoopWM: Recurrent Architecture and Iterative Latent Depth
LoopWM uses parameter-shared recurrent Transformer blocks as the dynamics core, refining latent states through repeated iterations rather than stacking more parameters. The architecture consists of three parts: Prelude (input processing), Recurrent Block (multiple updates of latent state with shared parameters), and Coda (final output representation).
Key designs include:
- Spectral stability constraint: Special parameterization of the state transition matrix to constrain eigenvalues within a stable range, ensuring long-range rollout stability.
- Deferred Decoding: During multi-step rollout, continuous latent space inference is performed first, decoding only when output is needed, reducing inference cost.
- Early Exit: Lightweight gating dynamically determines if the state has "thought enough," with simple transitions exiting early and complex transitions iterating more, enabling "thinking on demand."
Experimental results show that LoopWM with approximately 1B parameters achieves 68.4% EM and 85.3% Token F1 on ScienceWorld, significantly surpassing Claude-opus-4-6-max (47.2% EM, 72.8% F1); on AlfWorld, it achieves 51.6% EM and 80.4% Token F1. The paper indicates that LoopWM achieves up to 100× parameter efficiency through iterative latent depth, offering a new path for resource-constrained scenarios.
Impact and Industry Significance
Both works point toward new directions for latent space world models: MWA™ emphasizes long-sequence causal chains combined with reinforcement learning, while LoopWM explores the balance between recurrent computation and parameter efficiency. Both aim to address error accumulation and computational redundancy in long-range rollout of traditional world models, providing different technical routes for embodied AI generalization and deployment. Boundless Dynamics focuses on industrial deployment and financing capability, while FaceMind advances academic exploration through open-source papers.
Also available in 中文.