Paper Archive

Browse and export your curated research paper collection

Archived Days: 176
Total Papers: 1748
Avg Score: 7.8
Categories: 9

Export Archive Data

Download your archived papers in various formats

JSON: Complete data with analysis • CSV: Tabular data for analysis • Markdown: Human-readable reports • BibTeX: Academic citations
Browse by Date

Papers for March 2, 2026

10 papers found

Junhwa Hur, Charles Herrmann, Songyou Peng, Philipp Henzler, Zeyu Ma, Todd Zickler, Deqing Sun 2/27/2026 arxiv

computer vision

Dense 4D reconstruction from unposed images remains a critical challenge, with current methods relying on slow test-time optimization or fragmented, task-specific feedforward models. We introduce UFO-4D, a unified feedforward framework to reconstruct a dense, explicit 4D representation from just a p...

Keywords: 4D reconstruction, dynamic 3D Gaussian Splats, feedforward, self-supervised rendering, geometry-motion-pose joint estimation, novel view interpolation

Shengqu Cai, Weili Nie, Chao Liu, Julius Berner, Lvmin Zhang, Nanye Ma, Hansheng Chen, Maneesh Agrawala, Leonidas Guibas, Gordon Wetzstein, Arash Vahdat 2/27/2026 arxiv

machine learning

Scaling video generation from seconds to minutes faces a critical bottleneck: while short-video data is abundant and high-fidelity, coherent long-form data is scarce and limited to narrow domains. To address this, we propose a training paradigm where Mode Seeking meets Mean Seeking, decoupling local...

Keywords: long-video-generation, diffusion-models, decoupled-representation, flow-matching, mode-seeking, reverse-KL, short-video-teacher, fast-generation

Fan Shu, Yite Wang, Ruofan Wu, Boyi Liu, Zhewei Yao, Yuxiong He, Feng Yan 2/27/2026 arxiv

machine learning

The fast-growing demands in using Large Language Models (LLMs) to tackle complex multi-step data science tasks create an emergent need for accurate benchmarking. There are two major gaps in existing benchmarks: (i) the lack of standardized, process-aware evaluation that captures instruction adherenc...

Keywords: LLM, benchmark, data science, instruction following, process-aware evaluation, verifiable ground truth, Kaggle-derived, training data

Jenny Y. Huang, Leshem Choshen, Ramon Astudillo, Tamara Broderick, Jacob Andreas 2/27/2026 arxiv

natural language processing

Multi-turn interactions with large language models typically retain the assistant's own past responses in the conversation history. In this work, we revisit this design choice by asking whether large language models benefit from conditioning on their own prior responses. Using in-the-wild, multi-tur...

Keywords: large_language_models, multi-turn_conversation, context_filtering, context_pollution, efficiency, prompting

Weinan Dai, Hanlin Wu, Qiying Yu, Huan-ang Gao, Jiahao Li, Chengquan Jiang, Weiqiang Lou, Yufan Song, Hongli Yu, Jiaze Chen, Wei-Ying Ma, Ya-Qin Zhang, Jingjing Liu, Mingxuan Wang, Xin Liu, Hao Zhou 2/27/2026 arxiv

machine learning

GPU kernel optimization is fundamental to modern deep learning but remains a highly specialized task requiring deep hardware expertise. Despite strong performance in general programming, large language models (LLMs) remain uncompetitive with compiler-based systems such as torch.compile for CUDA kern...

Keywords: CUDA, reinforcement learning, kernel optimization, LLM, compiler, KernelBench, agentic RL, profiling

Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan 2/27/2026 arxiv

machine learning

Modern optimizers like Adam and Muon are central to training large language models, but their reliance on first- and second-order momenta introduces significant memory overhead, which constrains scalability and computational efficiency. In this work, we reframe the exponential moving average (EMA) u...

Keywords: LoRA-Pre, low-rank, optimizer, momentum, EMA, online linear regressor, Llama, pre-training

Shruti Joshi, Théo Saulus, Wieland Brendel, Philippe Brouillard, Dhanya Sridhar, Patrik Reizinger 2/27/2026 arxiv

machine learning

Identifiability in representation learning is commonly evaluated using standard metrics (e.g., MCC, DCI, R^2) on synthetic benchmarks with known ground-truth factors. These metrics are assumed to reflect recovery up to the equivalence class guaranteed by identifiability theory. We show that this ass...

Keywords: identifiability, representation learning, evaluation metrics, MCC, DCI, R^2, data-generating process, encoder geometry

Amir Asiaee 2/27/2026 arxiv

machine learning

Neural networks are hypothesized to implement interpretable causal mechanisms, yet verifying this requires finding a causal abstraction -- a simpler, high-level Structural Causal Model (SCM) faithful to the network under interventions. Discovering such abstractions is hard: it typically demands brut...

Keywords: causal abstraction, structural causal model, interventional risk, structured pruning, neural mechanism sparsification, activation variance, interchange interventions, model interpretability

Arnas Uselis, Andrea Dittadi, Seong Joon Oh 2/27/2026 arxiv

machine learning

Compositional generalization, the ability to recognize familiar parts in novel contexts, is a defining property of intelligent systems. Although modern models are trained on massive datasets, they still cover only a tiny fraction of the combinatorial space of possible inputs, raising the question of...

Keywords: compositional generalization, linear representation hypothesis, orthogonal representations, vision embeddings, divisibility, transferability, stability, embedding geometry

Ali Behrouz, Zeman Li, Yuan Deng, Peilin Zhong, Meisam Razaviyayn, Vahab Mirrokni 2/27/2026 arxiv

natural language processing

Transformers have been established as the de-facto backbones for most recent advances in sequence modeling, mainly due to their growing memory capacity that scales with the context length. While plausible for retrieval tasks, it causes quadratic complexity and so has motivated recent studies to expl...

Keywords: Memory Caching, RNN, Recurrent Neural Networks, Transformers, Long-context, Growing Memory, Checkpointing, Gated aggregation