Junhwa Hur, Charles Herrmann, Songyou Peng, Philipp Henzler, Zeyu Ma, Todd Zickler, Deqing Sun · 2/27/2026 · arXiv
computer vision: Dense 4D reconstruction from unposed images remains a critical challenge, with current methods relying on slow test-time optimization or fragmented, task-specific feedforward models. We introduce UFO-4D, a unified feedforward framework to reconstruct a dense, explicit 4D representation from just a p...
Shengqu Cai, Weili Nie, Chao Liu, Julius Berner, Lvmin Zhang, Nanye Ma, Hansheng Chen, Maneesh Agrawala, Leonidas Guibas, Gordon Wetzstein, Arash Vahdat · 2/27/2026 · arXiv
machine learning: Scaling video generation from seconds to minutes faces a critical bottleneck: while short-video data is abundant and high-fidelity, coherent long-form data is scarce and limited to narrow domains. To address this, we propose a training paradigm where Mode Seeking meets Mean Seeking, decoupling local...
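A note on the terminology: "mode seeking" and "mean seeking" are commonly associated with the two directions of the KL divergence (reverse KL concentrates on a mode of the target; forward KL spreads mass to cover it). The sketch below only illustrates that standard distinction on a toy discrete distribution; it is not the paper's training objective, and linking the abstract's paradigm to these two KL directions is an assumption.

```python
import numpy as np

def kl(p, q):
    # KL(p || q) for discrete distributions
    return float(np.sum(p * np.log(p / q)))

target = np.array([0.49, 0.02, 0.49])  # bimodal target
model = np.array([0.80, 0.15, 0.05])   # unimodal model concentrated on one mode

forward_kl = kl(target, model)  # "mean seeking": heavily penalizes missing the second mode
reverse_kl = kl(model, target)  # "mode seeking": penalizes putting mass where the target has little
print(f"forward KL = {forward_kl:.3f}, reverse KL = {reverse_kl:.3f}")
```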
Fan Shu, Yite Wang, Ruofan Wu, Boyi Liu, Zhewei Yao, Yuxiong He, Feng Yan · 2/27/2026 · arXiv
machine learning: The fast-growing demand for using Large Language Models (LLMs) to tackle complex multi-step data science tasks creates an emerging need for accurate benchmarking. There are two major gaps in existing benchmarks: (i) the lack of standardized, process-aware evaluation that captures instruction adherenc...
Jenny Y. Huang, Leshem Choshen, Ramon Astudillo, Tamara Broderick, Jacob Andreas · 2/27/2026 · arXiv
natural language processing: Multi-turn interactions with large language models typically retain the assistant's own past responses in the conversation history. In this work, we revisit this design choice by asking whether large language models benefit from conditioning on their own prior responses. Using in-the-wild, multi-tur...
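To make the design choice concrete, here is a minimal sketch of the two conditioning setups the abstract contrasts: a history that keeps the assistant's own prior responses versus one that drops them. The message-dict layout mirrors common chat-completion APIs; the specific content and field names are illustrative, not taken from the paper.

```python
# Conventional multi-turn history: the assistant's own past replies stay in context.
full_history = [
    {"role": "user", "content": "How do I profile a Python script?"},
    {"role": "assistant", "content": "Use cProfile: python -m cProfile script.py"},
    {"role": "user", "content": "Can I sort the output by cumulative time?"},
]

# Ablated history: condition only on the user's turns, omitting prior assistant responses.
user_only_history = [m for m in full_history if m["role"] == "user"]
```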
Weinan Dai, Hanlin Wu, Qiying Yu, Huan-ang Gao, Jiahao Li, Chengquan Jiang, Weiqiang Lou, Yufan Song, Hongli Yu, Jiaze Chen, Wei-Ying Ma, Ya-Qin Zhang, Jingjing Liu, Mingxuan Wang, Xin Liu, Hao Zhou · 2/27/2026 · arXiv
machine learning: GPU kernel optimization is fundamental to modern deep learning but remains a highly specialized task requiring deep hardware expertise. Despite strong performance in general programming, large language models (LLMs) remain uncompetitive with compiler-based systems such as torch.compile for CUDA kern...
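For context on the compiler-based baseline named in the snippet, the sketch below shows the standard torch.compile entry point (PyTorch 2.x), which generates fused kernels without hand-written CUDA. The toy model is just an illustration and is unrelated to the paper's benchmark or method.

```python
import torch

# torch.compile traces the module and emits fused kernels via its default
# TorchInductor backend -- the compiler-based system the abstract compares
# LLM-generated CUDA kernels against.
model = torch.nn.Sequential(torch.nn.Linear(512, 512), torch.nn.GELU())
compiled = torch.compile(model)

x = torch.randn(8, 512)
out = compiled(x)  # first call triggers compilation; later calls reuse the kernels
```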
Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan · 2/27/2026 · arXiv
machine learning: Modern optimizers like Adam and Muon are central to training large language models, but their reliance on first- and second-order momenta introduces significant memory overhead, which constrains scalability and computational efficiency. In this work, we reframe the exponential moving average (EMA) u...
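To ground the memory-overhead claim: standard Adam keeps two EMA buffers (first and second moments) per parameter, roughly doubling optimizer state relative to the weights themselves. The sketch below is a plain single-tensor Adam update for illustration only; it is not the EMA reformulation the paper proposes.

```python
import torch

def adam_step(param, grad, state, lr=1e-3, betas=(0.9, 0.999), eps=1e-8):
    # Adam keeps two EMA buffers per parameter: first moment m and second moment v.
    # These extra tensors are the memory overhead the abstract refers to.
    if "m" not in state:
        state["m"] = torch.zeros_like(param)
        state["v"] = torch.zeros_like(param)
        state["t"] = 0
    state["t"] += 1
    b1, b2 = betas
    state["m"].mul_(b1).add_(grad, alpha=1 - b1)            # EMA of gradients
    state["v"].mul_(b2).addcmul_(grad, grad, value=1 - b2)  # EMA of squared gradients
    m_hat = state["m"] / (1 - b1 ** state["t"])             # bias correction
    v_hat = state["v"] / (1 - b2 ** state["t"])
    param.add_(-lr * m_hat / (v_hat.sqrt() + eps))
    return param
```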
Shruti Joshi, Théo Saulus, Wieland Brendel, Philippe Brouillard, Dhanya Sridhar, Patrik Reizinger · 2/27/2026 · arXiv
machine learning: Identifiability in representation learning is commonly evaluated using standard metrics (e.g., MCC, DCI, R^2) on synthetic benchmarks with known ground-truth factors. These metrics are assumed to reflect recovery up to the equivalence class guaranteed by identifiability theory. We show that this ass...
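For reference, one common way the MCC metric mentioned in the snippet is computed: correlate each learned dimension with each ground-truth factor, find the best one-to-one matching, and average the matched correlations. Conventions differ across papers (Pearson vs. Spearman, signed vs. absolute values), so treat this as a generic sketch rather than the exact protocol used in this work.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from scipy.stats import spearmanr

def mean_correlation_coefficient(z_true, z_pred):
    # z_true, z_pred: (n_samples, n_factors) arrays of ground-truth and learned factors.
    d = z_true.shape[1]
    corr_full, _ = spearmanr(z_true, z_pred)       # (2d, 2d) joint correlation matrix
    corr = np.abs(corr_full[:d, d:])               # cross-correlation block
    rows, cols = linear_sum_assignment(-corr)      # best one-to-one matching
    return corr[rows, cols].mean()

rng = np.random.default_rng(0)
z = rng.normal(size=(1000, 3))
print(mean_correlation_coefficient(z, z[:, ::-1]))  # permuted copy -> MCC = 1.0
```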
Amir Asiaee · 2/27/2026 · arXiv
machine learning: Neural networks are hypothesized to implement interpretable causal mechanisms, yet verifying this requires finding a causal abstraction -- a simpler, high-level Structural Causal Model (SCM) faithful to the network under interventions. Discovering such abstractions is hard: it typically demands brut...
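As background for the terminology: a structural causal model assigns each variable a function of its parents, and an intervention do(X = x) replaces X's mechanism with a constant. The toy sketch below only illustrates that notion; it is unrelated to the discovery procedure in the paper, and all variable names are made up for illustration.

```python
import random

def sample_scm(do_x=None):
    # Toy SCM: U -> X -> Y. An intervention do(X = x) cuts the U -> X edge.
    u = random.gauss(0.0, 1.0)
    x = u if do_x is None else do_x
    y = 2.0 * x + random.gauss(0.0, 0.1)
    return {"U": u, "X": x, "Y": y}

observational = [sample_scm() for _ in range(5)]           # samples from P(U, X, Y)
interventional = [sample_scm(do_x=1.0) for _ in range(5)]  # samples under do(X = 1)
```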
Arnas Uselis, Andrea Dittadi, Seong Joon Oh · 2/27/2026 · arXiv
machine learning: Compositional generalization, the ability to recognize familiar parts in novel contexts, is a defining property of intelligent systems. Although modern models are trained on massive datasets, they still cover only a tiny fraction of the combinatorial space of possible inputs, raising the question of...
Ali Behrouz, Zeman Li, Yuan Deng, Peilin Zhong, Meisam Razaviyayn, Vahab Mirrokni · 2/27/2026 · arXiv
natural language processing: Transformers have been established as the de facto backbones for most recent advances in sequence modeling, mainly due to their growing memory capacity that scales with the context length. While plausible for retrieval tasks, it causes quadratic complexity and so has motivated recent studies to expl...
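To illustrate the quadratic cost the snippet alludes to: the attention score matrix has one entry per pair of positions, so its memory and compute grow with the square of the context length. The minimal sketch below shows only that standard step, not the alternative architecture the paper studies.

```python
import torch

def attention_scores(q, k):
    # q, k: (seq_len, d). The score matrix is (seq_len, seq_len), so doubling the
    # context length quadruples this step's memory and compute.
    d = q.shape[-1]
    return torch.softmax(q @ k.transpose(-2, -1) / d ** 0.5, dim=-1)

q, k = torch.randn(1024, 64), torch.randn(1024, 64)
print(attention_scores(q, k).shape)  # torch.Size([1024, 1024])
```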