Paper Archive

Neu-PiG: Neural Preconditioned Grids for Fast Dynamic Surface Reconstruction on Long Sequences

0

9.0/10

Julian Kaltheuner, Hannah Dröge, Markus Plack, Patrick Stotko, Reinhard Klein 2/25/2026 arxiv

computer vision

Temporally consistent surface reconstruction of dynamic 3D objects from unstructured point cloud data remains challenging, especially for very long sequences. Existing methods either optimize deformations incrementally, risking drift and requiring long runtimes, or rely on complex learned models tha...

Keywords: Neu-PiG, neural preconditioned grid, latent grid, Sobolev preconditioning, dynamic surface reconstruction, point clouds, 6-DoF deformation, multi-resolution

View Paper

WHOLE: World-Grounded Hand-Object Lifted from Egocentric Videos

0

9.0/10

Yufei Ye, Jiaman Li, Ryan Rong, C. Karen Liu 2/25/2026 arxiv

computer vision

Egocentric manipulation videos are highly challenging due to severe occlusions during interactions and frequent object entries and exits from the camera view as the person moves. Current methods typically focus on recovering either hand or object pose in isolation, but both struggle during interacti...

Keywords: egocentric vision, hand-object interaction, generative prior, 6D object pose, world-space reconstruction, occlusion handling, first-person video, WHOLE

View Paper

Solaris: Building a Multiplayer Video World Model in Minecraft

0

9.0/10

Georgy Savva, Oscar Michel, Daohan Lu, Suppakit Waiwitlikhit, Timothy Meehan, Dhairya Mishra, Srivats Poddar, Jack Lu, Saining Xie 2/25/2026 arxiv

computer vision

Existing action-conditioned video generation models (video world models) are limited to single-agent perspectives, failing to capture the multi-agent interactions of real-world environments. We introduce Solaris, a multiplayer video world model that simulates consistent multi-view observations. To e...

Keywords: multi-agent, video world model, multi-view, Minecraft, data collection, Self Forcing, Checkpointed Self Forcing, bidirectional training

View Paper

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

0

9.0/10

Hanna Yukhymenko, Anton Alexandrov, Martin Vechev 2/25/2026 arxiv

machine learning

The reliability of multilingual Large Language Model (LLM) evaluation is currently compromised by the inconsistent quality of translated benchmarks. Existing resources often suffer from semantic drift and context loss, which can lead to misleading performance metrics. In this work, we present a full...

Keywords: benchmark translation, multilingual evaluation, LLM, Universal Self-Improvement, T-RANK, dataset localization, semantic drift, LLM-as-judge

View Paper

SumTablets: A Transliteration Dataset of Sumerian Tablets

0

9.0/10

Cole Simmons, Richard Diehl Martinez, Dan Jurafsky 2/25/2026 arxiv

machine learning

Sumerian transliteration is a conventional system for representing a scholar's interpretation of a tablet in the Latin script. Thanks to visionary digital Assyriology projects such as ETCSL, CDLI, and Oracc, a large number of Sumerian transliterations have been published online, and these data are w...

Keywords: Sumerian, cuneiform, transliteration, SumTablets, Oracc, dataset, HuggingFace, transformer

View Paper

Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes

0

9.0/10

Xavier Pleimling, Sifat Muhammad Abdullah, Gunjan Balde, Peng Gao, Mainack Mondal, Murtuza Jadliwala, Bimal Viswanath 2/25/2026 arxiv

computer vision

Advances in Generative AI (GenAI) have led to the development of various protection strategies to prevent the unauthorized use of images. These methods rely on adding imperceptible protective perturbations to images to thwart misuse such as style mimicry or deepfake manipulations. Although previous ...

Keywords: generative_AI, image-to-image, denoising, image_protection, adversarial_robustness, deepfakes, security

View Paper

Improving Parametric Knowledge Access in Reasoning Language Models

0

9.0/10

Melody Ma, John Hewitt 2/25/2026 arxiv

natural language processing

We study reasoning for accessing world knowledge stored in a language model's parameters. For example, recalling that Canberra is Australia's capital may benefit from thinking through major cities and the concept of purpose-built capitals. While reasoning language models are trained via reinforcemen...

Keywords: reasoning language models, parametric knowledge, reinforcement learning, TriviaQA, chain-of-thought, world-knowledge QA, Natural Questions, HotpotQA

View Paper

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

0

9.0/10

Rui Yang, Qianhui Wu, Zhaoyang Wang, Hanyang Chen, Ke Yang, Hao Cheng, Huaxiu Yao, Baoling Peng, Huan Zhang, Jianfeng Gao, Tong Zhang 2/25/2026 arxiv

machine learning

Open-source native GUI agents still lag behind closed-source systems on long-horizon navigation tasks. This gap stems from two limitations: a shortage of high-quality, action-aligned reasoning data, and the direct adoption of generic post-training pipelines that overlook the unique challenges of GUI...

Keywords: GUI agents, action-aware SFT, partial verifiability, KL trust region, success-adaptive scaling, GUI dataset, offline-to-online predictability

View Paper

Learning and Naming Subgroups with Exceptional Survival Characteristics

0

9.0/10

Mhd Jawad Al Rahwanji, Sascha Xu, Nils Philipp Walter, Jilles Vreeken 2/25/2026 arxiv

machine learning

In many applications, it is important to identify subpopulations that survive longer or shorter than the rest of the population. In medicine, for example, it allows determining which patients benefit from treatment, and in predictive maintenance, which components are more likely to fail. Existing me...

Keywords: Sysurv, random survival forests, survival analysis, subgroup discovery, interpretable rules, non-parametric, differentiable, individual survival curves

View Paper

Mixed Magnification Aggregation for Generalizable Region-Level Representations in Computational Pathology

0

9.0/10

Eric Zimmermann, Julian Viret, Michal Zelechowski, James Brian Hall, Neil Tenenholtz, Adam Casson, George Shaikovski, Eugene Vorontsov, Siqi Liu, Kristen A Severson 2/25/2026 arxiv

machine learning

In recent years, a standard computational pathology workflow has emerged where whole slide images are cropped into tiles, these tiles are processed using a foundation model, and task-specific models are built using the resulting representations. At least 15 different foundation models have been prop...

Keywords: mixed magnification, region-level aggregation, masked embedding modeling, computational pathology, whole slide images, tile representations, foundation model, biomarker prediction

View Paper

Export Archive Data

Browse by Date

Papers for February 26, 2026

Neu-PiG: Neural Preconditioned Grids for Fast Dynamic Surface Reconstruction on Long Sequences

WHOLE: World-Grounded Hand-Object Lifted from Egocentric Videos

Solaris: Building a Multiplayer Video World Model in Minecraft

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

SumTablets: A Transliteration Dataset of Sumerian Tablets

Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes

Improving Parametric Knowledge Access in Reasoning Language Models

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Learning and Naming Subgroups with Exceptional Survival Characteristics

Mixed Magnification Aggregation for Generalizable Region-Level Representations in Computational Pathology