Paper Archive

Latent Spatial Memory for Video World Models

0

5.0/10

Weijie Wang, Haoyu Zhao, Yifan Yang, Feng Chen, Zeyu Zhang, Yefei He, Zicheng Duan, Donny Y. Chen, Yuqing Yang, Bohan Zhuang 6/8/2026 arxiv

natural language processing

Video world models that maintain 3D spatial consistency across generated frames typically rely on explicit point cloud memory constructed in RGB space. This design is both computationally expensive, requiring repeated rendering and VAE encoding, and inherently lossy, as the round trip through pixel ...

Keywords: diffusion model

View Paper

MemoryVLA++: Temporal Modeling via Memory and Imagination in Vision-Language-Action Models

0

5.0/10

Hao Shi, Weiye Li, Bin Xie, Yulin Wang, Renping Zhou, Tiancai Wang, Xiangyu Zhang, Ping Luo, Gao Huang 6/8/2026 arxiv

computer vision

Temporal modeling is essential for robotic manipulation, as effective control requires both memory of past interactions and imagination of future states. However, most VLA models rely primarily on the current observation and therefore struggle with long-horizon, temporally dependent tasks. Cognitive...

View Paper

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

0

5.0/10

Mingxian Lin, Shengju Qian, Yuqi Liu, Yi-Hua Huang, Yiyu Wang, Wei Huang, Yitang Li, Fan Zhang, Zeyu Hu, Lingting Zhu, Xin Wang, Xiaojuan Qi 6/8/2026 arxiv

computer vision

Vision-language model (VLM) agents are increasingly deployed in interactive game environments. Yet game benchmarks for VLM agents typically report a single first-attempt score per (agent, game) pair, focus on single-agent Solo play, and lack unified protocols for evaluating heterogeneous agent class...

View Paper

An Agency-Transferring Model-Free Policy Enhancement Technique

0

5.0/10

Anton Bolychev, Georgiy Malaniya, Sinan Ibrahim, Pavel Osinenko 6/8/2026 arxiv

reinforcement learning

Training reinforcement learning (RL) policies from scratch is costly: it requires careful reward and environment design, extensive tuning, and substantial computation. Yet many control problems already have a functional but suboptimal policy available as a baseline. This paper proposes a m...

Keywords: neural network, reinforcement learning

View Paper

Causally Evaluating the Learnability of Formal Language Tasks

0

5.0/10

Vésteinn Snæbjarnarson, Anej Svete, Josef Valvoda, Reda Boumasmoud, Brian DuSell, Ryan Cotterell 6/8/2026 arxiv

natural language processing

Language models, as multi-task learners, acquire a wide range of abilities during training. A fundamental question is how much task-specific data is needed to learn a given task. Answering this for natural language is difficult: tasks are hard to delineate and can confound one another. To rigorously...

View Paper

Rethinking the Divergence Regularization in LLM RL

0

5.0/10

Jiarui Yao, Xiangxin Zhou, Penghui Qi, Wee Sun Lee, Liefeng Bo, Tianyu Pang 6/8/2026 arxiv

natural language processing

Reinforcement learning (RL) has become a key component of post-training large language models (LLMs). In practice, LLM RL is often off-policy because of training-inference mismatch and policy staleness, making trust-region control essential for stable optimization. Mainstream methods such as PPO and...

Keywords: reinforcement learning

View Paper

Weighted universal approximation of differentiable maps on infinite-dimensional manifolds

0

5.0/10

Philipp Schmocker, Josef Teichmann 6/8/2026 arxiv

machine learning

We generalize the universal approximation theorem for functional input neural networks (FNN) to differentiable maps by including the approximation of the derivatives. A FNN maps the input from a possibly infinite-dimensional weighted manifold to the real-valued hidden layer, on which a non-linear sc...

Keywords: neural network

View Paper

PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws

0

5.0/10

Danqi Zhuang, Jisui Huang, Xiaoyue Xi, Andrew Kiggins, Xiaojie Wang, Ke Chen, Yue Wu 6/8/2026 arxiv

machine learning

Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explicit structure for data concentrated near low-dimensional manifolds, whe...

Keywords: diffusion model

View Paper

iMaC: Translating Actions into Motion and Contact Images for Embodied World Models

0

5.0/10

Zhenyu Wu, Xiuwei Xu, Yukun Zhou, Yifan Li, Qiuping Deng, Xiaofeng Wang, Zheng Zhu, Bingyao Yu, Ziwei Wang, Jiwen Lu, Haibin Yan 6/8/2026 arxiv

computer vision

Embodied world models have emerged as a pivotal paradigm for visual robotic decision-making and interactive environment simulation. However, conventional embodied frameworks rely on low-dimensional structured action vectors (e.g., joint angles and end-effector poses), which suffer from limited expre...

View Paper

AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing

0

5.0/10

Jisong Cai, Long Ling, Shiwei Chu, Zhongshan Liu, Jiayue Kang, Zhixuan Liang, Wenjie Xu, Yinan Mao, Weinan Zhang, Xiaokang Yang, Ru Ying, Ran Zheng, Yao Mu 6/8/2026 arxiv

computer vision

World-action models have emerged as a promising paradigm for robot manipulation, jointly modeling visual scene dynamics and actions to inject physical priors into policy learning. However, existing world-action models couple world prediction and action execution at the same temporal resolution, forc...

Keywords: transformer, attention, pretraining

View Paper

Export Archive Data

Browse by Date

Papers for June 9, 2026

Latent Spatial Memory for Video World Models

MemoryVLA++: Temporal Modeling via Memory and Imagination in Vision-Language-Action Models

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

An Agency-Transferring Model-Free Policy Enhancement Technique

Causally Evaluating the Learnability of Formal Language Tasks

Rethinking the Divergence Regularization in LLM RL

Weighted universal approximation of differentiable maps on infinite-dimensional manifolds

PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws

iMaC: Translating Actions into Motion and Contact Images for Embodied World Models

AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing