Paper Archive

Unique Lives, Shared World: Learning from Single-Life Videos

9.0/10

Tengda Han, Sayna Ebrahimi, Dilara Gokay, Li Yang Ku, Maks Ovsjanikov, Iva Babukova, Daniel Zoran, Viorica Patraucean, Joao Carreira, Andrew Zisserman, Dima Damen 12/3/2025 arxiv

computer vision

We introduce the "single-life" learning paradigm, where we train a distinct vision model exclusively on egocentric videos captured by one individual. We leverage the multiple viewpoints naturally captured within a single life to learn a visual encoder in a self-supervised manner. Our experiments dem...

Keywords: single-life learning, egocentric video, self-supervised learning, geometric representation, cross-attention metric, depth transfer, representation learning

SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows

9.0/10

Qinyu Zhao, Guangting Zheng, Tao Yang, Rui Zhu, Xingjian Leng, Stephen Gould, Liang Zheng 12/3/2025 arxiv

generative models

Normalizing Flows (NFs) learn invertible mappings between the data and a Gaussian distribution. Prior works usually suffer from two limitations. First, they add random noise to training samples or VAE latents as data augmentation, introducing complex pipelines including extra noising and denoising s...

Keywords: SimFlow, normalizing flows, VAE, fixed variance, end-to-end training, REPA-E, ImageNet, gFID

PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

9.0/10

Jiazhe Wei, Ken Li, Tianyu Lao, Haofan Wang, Liang Wang, Caifeng Shan, Chenyang Si 12/3/2025 arxiv

computer vision

Graphic design forms the cornerstone of modern visual communication, serving as a vital medium for promoting cultural and commercial events. Recent advances have explored automating this process using Large Multimodal Models (LMMs), yet existing methods often produce geometrically inaccurate layouts...

Keywords: PosterCopilot, layout reasoning, large multimodal models, reinforcement learning, graphic design, iterative editing, aesthetic feedback, generative models

Radiance Meshes for Volumetric Reconstruction

9.0/10

Alexander Mai, Trevor Hedstrom, George Kopanas, Janne Kontkanen, Falko Kuester, Jonathan T. Barron 12/3/2025 arxiv

machine learning

We introduce radiance meshes, a technique for representing radiance fields with constant density tetrahedral cells produced with a Delaunay tetrahedralization. Unlike a Voronoi diagram, a Delaunay tetrahedralization yields simple triangles that are natively supported by existing hardware. As such, o...

Keywords: radiance meshes, Delaunay tetrahedralization, tetrahedral mesh, volume rendering, Zip-NeRF, real-time view synthesis, rasterization, ray tracing

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

9.0/10

Siyi Chen, Mikaela Angelina Uy, Chan Hee Song, Faisal Ladhak, Adithyavairavan Murali, Qing Qu, Stan Birchfield, Valts Blukis, Jonathan Tremblay 12/3/2025 arxiv

robotics

Vision Language Models (VLMs) demonstrate strong qualitative visual understanding, but struggle with metrically precise spatial reasoning required for embodied applications. The agentic paradigm promises that VLMs can use a wide variety of tools that could augment these capabilities, such as depth e...

Keywords: Vision-Language Models, spatial reasoning, reinforcement learning, tool use, DIRL, SpaceTools, robotics

Learning Steerable Clarification Policies with Collaborative Self-play

9.0/10

Jonathan Berant, Maximillian Chen, Adam Fisch, Reza Aghajani, Fantine Huot, Mirella Lapata, Jacob Eisenstein 12/3/2025 arxiv

machine learning

To handle underspecified or ambiguous queries, AI assistants need a policy for managing their uncertainty to determine (a) when to guess the user intent and answer directly, (b) when to enumerate and answer multiple possible intents, and (c) when to ask a clarifying question. However, such policies ...

Keywords: steerable policies, clarification questions, self-play, reinforcement learning, ReST, dialogue systems, user simulation, cost-sensitive decision-making

Eval Factsheets: A Structured Framework for Documenting AI Evaluations

9.0/10

Florian Bordes, Candace Ross, Justine T Kao, Evangelia Spiliopoulou, Adina Williams 12/3/2025 arxiv

machine learning

The rapid proliferation of benchmarks has created significant challenges in reproducibility, transparency, and informed decision-making. However, unlike datasets and models -- which benefit from structured documentation frameworks like Datasheets and Model Cards -- evaluation methodologies lack syst...

Keywords: Eval Factsheets, benchmarks, evaluation, transparency, reproducibility, taxonomy, questionnaire, LLM-as-judge

Closing the problem of which causal structures of up to six total nodes have a classical-quantum gap

9.0/10

Shashaank Khanna, Matthew Pusey, Roger Colbeck 12/3/2025 arxiv

machine learning

The discovery of Bell that there exist quantum correlations that cannot be reproduced classically is one of the most important in the foundations of quantum mechanics, as well as having practical implications. Bell's result was originally proven in a simple bipartite causal structure, but analogous ...

Keywords: quantum correlations, causal structures, classical-quantum gap, Bell nonlocality, causal inference, quantum networks, foundations of quantum mechanics

Stable Signer: Hierarchical Sign Language Generative Model

9.0/10

Sen Fang, Yalin Feng, Hongbin Zhong, Yanxin Zhang, Dimitris N. Metaxas 12/3/2025 arxiv

computer vision

Sign Language Production (SLP) is the process of converting the complex input text into a real video. Most previous works focused on the Text2Gloss, Gloss2Pose, Pose2Vid stages, and some concentrated on Prompt2Gloss and Text2Avatar stages. However, this field has made slow progress due to the inaccu...

Keywords: Sign Language Production, Stable Signer, SLUL, SAGM Loss, SLP-MoE, hierarchical generation, Pose2Vid, Prompt2Gloss

Polarization by Design: How Elites Could Shape Mass Preferences as AI Reduces Persuasion Costs

9.0/10

Nadav Kunievsky 12/3/2025 arxiv

machine learning

In democracies, major policy decisions typically require some form of majority or consensus, so elites must secure mass support to govern. Historically, elites could shape support only through limited instruments like schooling and mass media; advances in AI-driven persuasion sharply reduce the cost...

Keywords: polarization, AI persuasion, elite influence, dynamic model, persuasion costs, democratic stability, preference design, political economy

Papers for December 4, 2025
