Paper Archive

ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

0

5.0/10

Omar El Khalifi, Thomas Rossi, Oscar Fossey, Thibault Fouque, Ulysse Mizrahi, Philip Torr, Ivan Laptev, Fabio Pizzati, Baptiste Bellot-Gurlet 5/7/2026 arxiv

computer vision

For artistic applications, video generation requires fine-grained control over both performance and cinematography, i.e., the actor's motion and the camera trajectory. We present ActCam, a zero-shot method for video generation that jointly transfers character motion from a driving video into a new s...

Keywords: diffusion model

View Paper

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

0

5.0/10

Minbin Huang, Han Shi, Chuanyang Zheng, Yimeng Wu, Guoxuan Chen, Xintong Yu, Yichun Yin, Hong Cheng 5/7/2026 arxiv

reinforcement learning

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This convention couples depth scaling with linear expert-parameter growth and assumes that every layer needs isolated expert capacity. However, re...

Keywords: transformer

View Paper

BAMI: Training-Free Bias Mitigation in GUI Grounding

0

5.0/10

Borui Zhang, Bo Zhang, Bo Wang, Wenzhao Zheng, Yuhao Cheng, Liang Tang, Yiqiang Yan, Jie Zhou, Jiwen Lu 5/7/2026 arxiv

computer vision

GUI grounding is a critical capability for enabling GUI agents to execute tasks such as clicking and dragging. However, in complex scenarios like the ScreenSpot-Pro benchmark, existing models often suffer from suboptimal performance. Utilizing the proposed \textbf{Masked Prediction Distribution (MPD...

View Paper

EMO: Pretraining Mixture of Experts for Emergent Modularity

0

5.0/10

Ryan Wang, Akshita Bhagia, Sewon Min 5/7/2026 arxiv

natural language processing

Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e.g., code, math, or domain-specific knowledge. Mixture-of-Experts (MoEs) seemingly offer a potential alternative by activating only a subset...

Keywords: pretraining

View Paper

Multi-Robot Coordination in V2X Environments

0

5.0/10

John Pravin Arockiasamy, Alexey Vinel 5/7/2026 arxiv

reinforcement learning

This paper presents a Vehicle-to-Everything (V2X) communication framework that enables decentralized cooperation among social robots operating in complex urban traffic environments. Building on ETSI Cooperative Awareness and Maneuver Coordination services, the framework introduces two robot-centric ...

View Paper

Verifier-Backed Hard Problem Generation for Mathematical Reasoning

0

5.0/10

Yuhang Lai, Jiazhan Feng, Yee Whye Teh, Ning Miao 5/7/2026 arxiv

natural language processing

Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, and novel problems - an essential component for advancing LLM training and enabling autonomous scientific research. Existing problem generat...

View Paper

Relit-LiVE: Relight Video by Jointly Learning Environment Video

0

5.0/10

Weiqing Xiao, Hong Li, Xiuyu Yang, Houyuan Chen, Wenyi Li, Tianqi Liu, Shaocong Xu, Chongjie Ye, Hao Zhao, Beibei Wang 5/7/2026 arxiv

computer vision

Recent advances have shown that large-scale video diffusion models can be repurposed as neural renderers by first decomposing videos into intrinsic scene representations and then performing forward rendering under novel illumination. While promising, this paradigm fundamentally relies on accurate in...

Keywords: diffusion model

View Paper

Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

0

5.0/10

Jai Moondra, Ayela Chughtai, Bhargavi Lanka, Swati Gupta 5/7/2026 arxiv

natural language processing

Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comparisons in 116 languages from 52 LLMs from Arena, and show that the best-fit global Bradley-Terry (BT) ranking is misleading. Nearly 2/3 of ...

Keywords: machine learning, classification

View Paper

Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less

0

5.0/10

Yuxing Liu, Jianyu Wang, Tong Zhang 5/7/2026 arxiv

natural language processing

Optimizers play an important role in both pretraining and finetuning stages when training large language models (LLMs). In this paper, we present an observation that full finetuning with the same optimizer as in pretraining achieves a better learning-forgetting tradeoff, i.e., forgetting less while ...

Keywords: pretraining

View Paper

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

0

5.0/10

Sushant Gautam, Finn Schwall, Annika Willoch Olstad, Fernando Vallecillos Ruiz, Birk Torpmann-Hagen, Sunniva Maria Stordal Bjørklund, Leon Moonen, Klas Pettersen, Michael A. Riegler 5/7/2026 arxiv

natural language processing

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We formalize this setting as benchmarkless comparative safety scoring and specify the contract under which a scenario-based audit can be inter...

View Paper

Export Archive Data

Browse by Date

Papers for May 10, 2026

ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

BAMI: Training-Free Bias Mitigation in GUI Grounding

EMO: Pretraining Mixture of Experts for Emergent Modularity

Multi-Robot Coordination in V2X Environments

Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Relit-LiVE: Relight Video by Jointly Learning Environment Video

Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels