Paper Archive

Browse and export your curated research paper collection

176
Archived Days
1748
Total Papers
7.8
Avg Score
9
Categories

Export Archive Data

Download your archived papers in various formats

JSON: Complete data with analysis • CSV: Tabular data for analysis • Markdown: Human-readable reports • BibTeX: Academic citations
Browse by Date

Papers for March 1, 2026

10 papers found

Sahal Shaji Mullappilly, Mohammed Irfan Kurpath, Omair Mohamed, Mohamed Zidan, Fahad Khan, Salman Khan, Rao Anwer, Hisham Cholakkal 2/26/2026 arxiv

machine learning

We introduce MediX-R1, an open-ended Reinforcement Learning (RL) framework for medical multimodal large language models (MLLMs) that enables clinically grounded, free-form answers beyond multiple-choice formats. MediX-R1 fine-tunes a baseline vision-language backbone with Group Based RL and a compos...

Keywords: MediX-R1, medical RL, multimodal LLM, vision-language, Group Based RL, composite reward, LLM-as-judge, open-ended medical QA

Sven Elflein, Ruilong Li, Sérgio Agostinho, Zan Gojcic, Laura Leal-Taixé, Qunjie Zhou, Aljosa Osep 2/26/2026 arxiv

computer vision

We present a scalable 3D reconstruction model that addresses a critical limitation in offline feed-forward methods: their computational and memory requirements grow quadratically w.r.t. the number of input images. Our approach is built on the key insight that this bottleneck stems from the varying-l...

Keywords: VGG-T3, test-time training, MLP distillation, key-value representation, 3D reconstruction, linear scaling, visual localization, softmax attention

Simon Roschmann, Paul Krzakala, Sonia Mazelet, Quentin Bouniot, Zeynep Akata 2/26/2026 arxiv

machine learning

The Platonic Representation Hypothesis posits that neural networks trained on different modalities converge toward a shared statistical model of the world. Recent work exploits this convergence by aligning frozen pretrained vision and language models with lightweight alignment layers, but typically ...

Keywords: semi-supervised, optimal transport, vision-language, representation alignment, unimodal encoders, teacher-student, multimodal learning

Amita Kamath, Jack Hessel, Khyathi Chandu, Jena D. Hwang, Kai-Wei Chang, Ranjay Krishna 2/26/2026 arxiv

machine learning

The lack of reasoning capabilities in Vision-Language Models (VLMs) has remained at the forefront of research discourse. We posit that this behavior stems from a reporting bias in their training data. That is, how people communicate about visual content by default omits tacit information needed to s...

Keywords: reporting bias, vision-language, VLM, pragmatics, reasoning, spatial reasoning, temporal reasoning, negation

Jose Javier Gonzalez Ortiz, Abhay Gupta, Chris Renard, Davis Blalock 2/26/2026 arxiv

optimization

Standard mixed-precision training of neural networks requires many bytes of accelerator memory for each model parameter. These bytes reflect not just the parameter itself, but also its gradient and one or more optimizer state variables. With each of these values typically requiring 4 bytes, training...

Keywords: FlashOptim, optimizer quantization, master weight splitting, companding, mixed-precision, AdamW, Lion, SGD

Alkis Kalavasis, Anay Mehrotra, Manolis Zampetakis, Felix Zhou, Ziyu Zhu 2/26/2026 arxiv

machine learning

Coarse data arise when learners observe only partial information about samples; namely, a set containing the sample rather than its exact value. This occurs naturally through measurement rounding, sensor limitations, and lag in economic systems. We study Gaussian mean estimation from coarse data, wh...

Keywords: coarse data, identifiability, Gaussian mean estimation, convex partitions, polynomial-time algorithm, computational-statistical tradeoff, NP-hardness

Tilemachos Aravanis, Vladan Stojnić, Bill Psomas, Nikos Komodakis, Giorgos Tolias 2/26/2026 arxiv

computer vision

Open-vocabulary segmentation (OVS) extends the zero-shot recognition capabilities of vision-language models (VLMs) to pixel-level prediction, enabling segmentation of arbitrary categories specified by text prompts. Despite recent progress, OVS lags behind fully supervised approaches due to two chall...

Keywords: open-vocabulary segmentation, few-shot, retrieval-augmented, vision-language models, test-time adapter, per-image classifier, personalized segmentation, support set

Camilo Gomez, Pengyang Wang, Liansheng Tang 2/26/2026 arxiv

machine learning

Recent advances in machine learning have emphasized the integration of structured optimization components into end-to-end differentiable models, enabling richer inductive biases and tighter alignment with task-specific objectives. In this work, we introduce a novel differentiable approximation to th...

Keywords: zero-one loss, hypersimplex, Soft-Binary-Argmax, differentiable projection, Jacobian, geometric consistency, large-batch training, multiclass classification

Dany Haddad, Dan Bareket, Joseph Chee Chang, Jay DeYoung, Jena D. Hwang, Uri Katz, Mark Polak, Sangho Suh, Harshit Surana, Aryeh Tiktinsky, Shriya Atmakuri, Jonathan Bragg, Mike D'Arcy, Sergey Feldman, Amal Hassan-Ali, Rubén Lozano, Bodhisattwa Prasad Majumder, Charles McGrady, Amanpreet Singh, Brooke Vlahos, Yoav Goldberg, Doug Downey 2/26/2026 arxiv

machine learning

AI-powered scientific research tools are rapidly being integrated into research workflows, yet the field lacks a clear lens into how researchers use these systems in real-world settings. We present and analyze the Asta Interaction Dataset, a large-scale resource comprising over 200,000 user queries ...

Keywords: Asta Interaction Dataset, interaction logs, retrieval-augmented generation, LLM, query taxonomy, user behavior, scientific QA, literature discovery

Eric Eaton, Surbhi Goel, Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell 2/26/2026 arxiv

machine learning

Numerous lines of aim to control $\textit{model disagreement}$ -- the extent to which two machine learning models disagree in their predictions. We adopt a simple and standard notion of model disagreement in real-valued prediction problems, namely the expected squared difference in predictions betwe...

Keywords: model disagreement, anchoring, stacking, gradient boosting, neural network architecture search, regression trees, theoretical bounds, strongly convex loss
Loading...

Preparing your export...