Paper Archive

MediX-R1: Open Ended Medical Reinforcement Learning

9.0/10

Sahal Shaji Mullappilly, Mohammed Irfan Kurpath, Omair Mohamed, Mohamed Zidan, Fahad Khan, Salman Khan, Rao Anwer, Hisham Cholakkal 2/26/2026 arxiv

machine learning

We introduce MediX-R1, an open-ended Reinforcement Learning (RL) framework for medical multimodal large language models (MLLMs) that enables clinically grounded, free-form answers beyond multiple-choice formats. MediX-R1 fine-tunes a baseline vision-language backbone with Group Based RL and a compos...

Keywords: MediX-R1, medical RL, multimodal LLM, vision-language, Group Based RL, composite reward, LLM-as-judge, open-ended medical QA

VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale

9.0/10

Sven Elflein, Ruilong Li, Sérgio Agostinho, Zan Gojcic, Laura Leal-Taixé, Qunjie Zhou, Aljosa Osep 2/26/2026 arxiv

computer vision

We present a scalable 3D reconstruction model that addresses a critical limitation in offline feed-forward methods: their computational and memory requirements grow quadratically w.r.t. the number of input images. Our approach is built on the key insight that this bottleneck stems from the varying-l...

Keywords: VGG-T3, test-time training, MLP distillation, key-value representation, 3D reconstruction, linear scaling, visual localization, softmax attention

SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport

9.0/10

Simon Roschmann, Paul Krzakala, Sonia Mazelet, Quentin Bouniot, Zeynep Akata 2/26/2026 arxiv

machine learning

The Platonic Representation Hypothesis posits that neural networks trained on different modalities converge toward a shared statistical model of the world. Recent work exploits this convergence by aligning frozen pretrained vision and language models with lightweight alignment layers, but typically ...

Keywords: semi-supervised, optimal transport, vision-language, representation alignment, unimodal encoders, teacher-student, multimodal learning

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

9.0/10

Amita Kamath, Jack Hessel, Khyathi Chandu, Jena D. Hwang, Kai-Wei Chang, Ranjay Krishna 2/26/2026 arxiv

machine learning

The lack of reasoning capabilities in Vision-Language Models (VLMs) has remained at the forefront of research discourse. We posit that this behavior stems from a reporting bias in their training data. That is, how people communicate about visual content by default omits tacit information needed to s...

Keywords: reporting bias, vision-language, VLM, pragmatics, reasoning, spatial reasoning, temporal reasoning, negation

FlashOptim: Optimizers for Memory Efficient Training

9.0/10

Jose Javier Gonzalez Ortiz, Abhay Gupta, Chris Renard, Davis Blalock 2/26/2026 arxiv

optimization

Standard mixed-precision training of neural networks requires many bytes of accelerator memory for each model parameter. These bytes reflect not just the parameter itself, but also its gradient and one or more optimizer state variables. With each of these values typically requiring 4 bytes, training...

Keywords: FlashOptim, optimizer quantization, master weight splitting, companding, mixed-precision, AdamW, Lion, SGD
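
The per-parameter accounting in the FlashOptim summary above can be made concrete with a little arithmetic. The sketch below assumes, as the abstract states, 4 bytes each for the parameter, its gradient, and every optimizer state variable. The function names and the state counts per optimizer (two for AdamW, one each for Lion and SGD with momentum) are illustrative assumptions, not figures from the paper.

```python
BYTES_PER_VALUE = 4  # per the abstract: each value typically requires 4 bytes

# Assumed number of optimizer state variables kept per parameter (illustrative).
OPTIMIZER_STATES = {"adamw": 2, "lion": 1, "sgd_momentum": 1}


def bytes_per_parameter(optimizer: str) -> int:
    """Bytes for one parameter, its gradient, and its optimizer states."""
    n_states = OPTIMIZER_STATES[optimizer]
    return BYTES_PER_VALUE * (1 + 1 + n_states)


def training_memory_gib(n_params: int, optimizer: str) -> float:
    """Memory in GiB for weights, gradients, and optimizer states only."""
    return n_params * bytes_per_parameter(optimizer) / 2**30


if __name__ == "__main__":
    for name in OPTIMIZER_STATES:
        print(name, bytes_per_parameter(name), "bytes per parameter")
```

Under these assumptions AdamW comes to 16 bytes per parameter, before counting activations or any temporary buffers, which this sketch leaves out.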

Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

9.0/10

Alkis Kalavasis, Anay Mehrotra, Manolis Zampetakis, Felix Zhou, Ziyu Zhu 2/26/2026 arxiv

machine learning

Coarse data arise when learners observe only partial information about samples; namely, a set containing the sample rather than its exact value. This occurs naturally through measurement rounding, sensor limitations, and lag in economic systems. We study Gaussian mean estimation from coarse data, wh...

Keywords: coarse data, identifiability, Gaussian mean estimation, convex partitions, polynomial-time algorithm, computational-statistical tradeoff, NP-hardness

Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?

9.0/10

Tilemachos Aravanis, Vladan Stojnić, Bill Psomas, Nikos Komodakis, Giorgos Tolias 2/26/2026 arxiv

computer vision

Open-vocabulary segmentation (OVS) extends the zero-shot recognition capabilities of vision-language models (VLMs) to pixel-level prediction, enabling segmentation of arbitrary categories specified by text prompts. Despite recent progress, OVS lags behind fully supervised approaches due to two chall...

Keywords: open-vocabulary segmentation, few-shot, retrieval-augmented, vision-language models, test-time adapter, per-image classifier, personalized segmentation, support set

Differentiable Zero-One Loss via Hypersimplex Projections

9.0/10

Camilo Gomez, Pengyang Wang, Liansheng Tang 2/26/2026 arxiv

machine learning

Recent advances in machine learning have emphasized the integration of structured optimization components into end-to-end differentiable models, enabling richer inductive biases and tighter alignment with task-specific objectives. In this work, we introduce a novel differentiable approximation to th...

Keywords: zero-one loss, hypersimplex, Soft-Binary-Argmax, differentiable projection, Jacobian, geometric consistency, large-batch training, multiclass classification

Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

9.0/10

Dany Haddad, Dan Bareket, Joseph Chee Chang, Jay DeYoung, Jena D. Hwang, Uri Katz, Mark Polak, Sangho Suh, Harshit Surana, Aryeh Tiktinsky, Shriya Atmakuri, Jonathan Bragg, Mike D'Arcy, Sergey Feldman, Amal Hassan-Ali, Rubén Lozano, Bodhisattwa Prasad Majumder, Charles McGrady, Amanpreet Singh, Brooke Vlahos, Yoav Goldberg, Doug Downey 2/26/2026 arxiv

machine learning

AI-powered scientific research tools are rapidly being integrated into research workflows, yet the field lacks a clear lens into how researchers use these systems in real-world settings. We present and analyze the Asta Interaction Dataset, a large-scale resource comprising over 200,000 user queries ...

Keywords: Asta Interaction Dataset, interaction logs, retrieval-augmented generation, LLM, query taxonomy, user behavior, scientific QA, literature discovery

Model Agreement via Anchoring

8.0/10

Eric Eaton, Surbhi Goel, Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell 2/26/2026 arxiv

machine learning

Numerous lines of work aim to control $\textit{model disagreement}$ -- the extent to which two machine learning models disagree in their predictions. We adopt a simple and standard notion of model disagreement in real-valued prediction problems, namely the expected squared difference in predictions betwe...

Keywords: model disagreement, anchoring, stacking, gradient boosting, neural network architecture search, regression trees, theoretical bounds, strongly convex loss
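
The disagreement notion named in the summary above, the expected squared difference in predictions, can be estimated on a finite sample. This is a minimal sketch that replaces the expectation with an empirical mean. The function name `disagreement` and the two toy models are hypothetical, and the paper's exact setup may differ.

```python
from typing import Callable, Sequence


def disagreement(
    f: Callable[[float], float],
    g: Callable[[float], float],
    xs: Sequence[float],
) -> float:
    """Empirical mean of (f(x) - g(x))^2 over the sample xs."""
    if not xs:
        raise ValueError("xs must contain at least one point")
    return sum((f(x) - g(x)) ** 2 for x in xs) / len(xs)


if __name__ == "__main__":
    # Two toy real-valued predictors that differ by a constant offset of 1.
    model_a = lambda x: 2.0 * x
    model_b = lambda x: 2.0 * x + 1.0
    print(disagreement(model_a, model_b, [0.0, 1.0, 2.0]))
```

A value of zero means the two predictors agree on every sampled point. Larger values indicate larger average squared gaps between their predictions.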

Papers for March 1, 2026

MediX-R1: Open Ended Medical Reinforcement Learning

VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale

SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

FlashOptim: Optimizers for Memory Efficient Training

Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?

Differentiable Zero-One Loss via Hypersimplex Projections

Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

Model Agreement via Anchoring