Paper Archive

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

5.0/10

Jundong Xu, Qingchuan Li, Jiaying Wu, Yihuai Lan, Shuyue Stella Li, Huichi Zhou, Bowen Jiang, Lei Wang, Jun Wang, Anh Tuan Luu, Caiming Xiong, Hae Won Park, Bryan Hooi, Zhiyuan Hu 6/11/2026 arxiv

natural language processing

Large language model (LLM) agents have achieved strong performance on a wide range of benchmarks, yet most evaluations assume static environments. In contrast, real-world deployment is inherently dynamic, requiring agents to continually align their knowledge, skills, and behavior with changing envir...

Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

5.0/10

Zilin Xiao, Qi Ma, Chun-cheng Jason Chen, Xintao Chen, Avinash Atreya, Hanjie Chen, Vicente Ordonez 6/11/2026 arxiv

natural language processing

Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowledge, yet conventional retrieval based on lexical or semantic similarity is poorly suited for complex reasoning tasks: a semantically similar problem may demand an entirely different s...

Keywords: fine-tuning

InterleaveThinker: Reinforcing Agentic Interleaved Generation

5.0/10

Dian Zheng, Harry Lee, Manyuan Zhang, Kaituo Feng, Zoey Guo, Ray Zhang, Hongsheng Li 6/11/2026 arxiv

computer vision

Recent image generators have demonstrated impressive photorealism and instruction-following capabilities in single-image generation and editing. However, constrained by their architectures, they cannot achieve interleaved generation (text-image sequence), which has crucial applications in visual nar...

Keywords: gpt

Mana: Dexterous Manipulation of Articulated Tools

5.0/10

Zhao-Heng Yin, Guanya Shi, Pieter Abbeel, C. Karen Liu 6/11/2026 arxiv

reinforcement learning

Articulated tool manipulation remains a major challenge in dexterous robotics due to the need to coordinate internal degrees of freedom and contact-rich interactions. While prior work has largely focused on rigid objects, articulated tool use remains underexplored because of its physical complexity ...

Keywords: reinforcement learning

Improving Robotic Generalist Policies via Flow Reversal Steering

5.0/10

Andy Tang, William Chen, Andrew Wagenmaker, Chelsea Finn, Sergey Levine 6/11/2026 arxiv

computer vision

Generalist policies can learn a wide range of skills from diverse robot datasets. In order to solve or improve on challenging new tasks, we need a way to infer and invoke the appropriate actions from the policy's rich behavioral prior, especially when directly commanding the policy fails. We focus ...

Keywords: reinforcement learning

Modality Forcing for Scalable Spatial Generation

5.0/10

Bardienus Pieter Duisterhof, Deva Ramanan, Jeffrey Ichnowski, Justin Johnson, Keunhong Park 6/11/2026 arxiv

computer vision

Text-to-image (T2I) models contain rich spatial priors. Synthesizing photorealistic, cluttered scenes requires an understanding of geometry, including perspective and relative scale. Prior works adapt T2I models to leverage this prior for depth prediction, but they require dense depth data and invol...

RepWAM: World Action Modeling with Representation Visual-Action Tokenizers

5.0/10

Junke Wang, Qihang Zhang, Shuai Yang, Yiming Luo, Yujun Shen, Zuxuan Wu, Yu-Gang Jiang, Yinghao Xu 6/11/2026 arxiv

computer vision

This work presents RepWAM, a representation-centric world action model (WAM) built on representation visual-action tokenizers. Existing WAMs typically inherit reconstruction-oriented video tokenizers from pretrained video generation models. Although these tokenizers preserve visual fidelity, pixel r...

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

5.0/10

Seokju Cho, Ryo Hachiuma, Abhishek Badki, Hang Su, Byung-Kwan Lee, Chan Hee Song, Sifei Liu, Subhashree Radhakrishnan, Seungryong Kim, Yu-Chiang Frank Wang, Min-Hung Chen 6/11/2026 arxiv

computer vision

Spatial reasoning, the ability to determine where objects are, how they relate, and how they move in 3D, remains a fundamental challenge for vision-language models (VLMs). Tool-augmented agents attempt to address this by augmenting VLMs with specialist perception modules, yet their effectiveness is ...

$\texttt{WEAVER}$, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

5.0/10

Arnav Kumar Jain, Yilin Wu, Jesse Farebrother, Gokul Swamy, Andrea Bajcsy 6/11/2026 arxiv

natural language processing

The potential impacts of world models (WMs, i.e., learned simulators) on robotics are far-reaching -- policy evaluation, policy improvement, and test-time planning -- all with limited real-world interaction. To unlock these downstream capabilities, a WM needs to jointly satisfy three desiderata: $\t...

Understanding Truncated Positional Encodings for Graph Neural Networks

5.0/10

James Flora, Mitchell Black, Weng-Keen Wong, Amir Nayyeri 6/11/2026 arxiv

reinforcement learning

Positional encodings (PEs) enhance the power of graph neural networks (GNNs), both theoretically and empirically. Two of the most popular families of PEs - spectral (e.g., Laplacian eigenspaces, effective resistance) and walk-based (polynomials of the adjacency matrix) - are theoretically equivalent...

Keywords: neural network

Papers for June 12, 2026
