Paper Archive

Browse and export your curated research paper collection

79
Archived Days
788
Total Papers
8.4
Avg Score
9
Categories

Export Archive Data

Download your archived papers in various formats

JSON: Complete data with analysis • CSV: Tabular data for analysis • Markdown: Human-readable reports • BibTeX: Academic citations
Browse by Date

Papers for November 30, 2025

10 papers found

Yeganeh Kordi, Nihal V. Nayak, Max Zuo, Ilana Nguyen, Stephen H. Bach 11/26/2025 arxiv

machine learning

We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whether training on easier or harder data leads to better results, and whether those gains come on easie...

Keywords: LLM generalization, difficulty estimation, Item Response Theory, dataset curation, evaluation robustness, cross-difficulty generalization, fine-grained analysis

Yusuf Dalva, Guocheng Gordon Qian, Maya Goldenberg, Tsai-Shien Chen, Kfir Aberman, Sergey Tulyakov, Pinar Yanardag, Kuan-Chieh Jackson Wang 11/26/2025 arxiv

computer vision

While modern diffusion models excel at generating high-quality and diverse images, they still struggle with high-fidelity compositional and multimodal control, particularly when users simultaneously specify text prompts, subject references, spatial arrangements, pose constraints, and layout annotati...

Keywords: Canvas-to-Image, multimodal controls, diffusion model, composite canvas, Multi-Task Canvas Training, compositional generation, identity preservation, control adherence

Seungjae Lee, Yoonkyo Jung, Inkook Chun, Yao-Chih Lee, Zikui Cai, Hongjia Huang, Aayush Talreja, Tan Dat Dao, Yongyuan Liang, Jia-Bin Huang, Furong Huang 11/26/2025 arxiv

robotics

Learning new robot tasks on new platforms and in new scenes from only a handful of demonstrations remains challenging. While videos of other embodiments - humans and different robots - are abundant, differences in embodiment, camera, and environment hinder their direct use. We address the small-data...

Keywords: TraceGen, TraceForge, 3D trace-space, world model, cross-embodiment, few-shot learning, robot learning, video pretraining

Hongjin Su, Shizhe Diao, Ximing Lu, Mingjie Liu, Jiacheng Xu, Xin Dong, Yonggan Fu, Peter Belcak, Hanrong Ye, Hongxu Yin, Yi Dong, Evelina Bakhturina, Tao Yu, Yejin Choi, Jan Kautz, Pavlo Molchanov 11/26/2025 arxiv

machine learning

Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually challenging and computationally expensive. We show that small orchestrators managing other models and a variety of tools can both push the u...

Keywords: ToolOrchestra, Orchestrator (8B), orchestration, tool-use, reinforcement learning, HLE benchmark, GPT-5, efficiency

Wenbo Hu, Jingli Lin, Yilin Long, Yunlong Ran, Lihan Jiang, Yifan Wang, Chenming Zhu, Runsen Xu, Tai Wang, Jiangmiao Pang 11/26/2025 arxiv

machine learning

Vision-Language Models (VLMs) still lack robustness in spatial intelligence, demonstrating poor performance on spatial understanding and reasoning tasks. We attribute this gap to the absence of a visual geometry learning process capable of reconstructing 3D space from 2D images. We present G$^2$VLM,...

Keywords: geometry-grounded VLM, 3D reconstruction, spatial reasoning, multi-view, in-context learning, interleaved reasoning, 3D visual priors, vision-language models

Dong Wang, Yang Li, Ansong Ni, Ching-Feng Yeh, Youssef Emad, Xinjie Lei, Liam Robbins, Karthik Padthe, Hu Xu, Xian Li, Asli Celikyilmaz, Ramya Raghavendra, Lifei Huang, Carole-Jean Wu, Shang-Wen Li 11/26/2025 arxiv

machine learning

Synthetic data has become increasingly important for training large language models, especially when real data is scarce, expensive, or privacy-sensitive. Many such generation tasks require coordinated multi-agent workflows, where specialized agents collaborate to produce data that is higher quality...

Keywords: synthetic data, multi-agent systems, decentralized, peer-to-peer, Ray, scalable data generation, LLM inference, distributed queues

Zihui Xue, Kristen Grauman, Dima Damen, Andrew Zisserman, Tengda Han 11/26/2025 arxiv

computer vision

Can one perceive a video's content without seeing its pixels, just from the camera trajectory-the path it carves through space? This paper is the first to systematically investigate this seemingly implausible question. Towards this end, we propose a contrastive learning framework to train CamFormer,...

Keywords: camera trajectory, CamFormer, contrastive learning, pose embeddings, cross-modal alignment, egocentric, exocentric, RGB-only

Weihao Bo, Shan Zhang, Yanpeng Sun, Jingjing Wu, Qunyi Xie, Xiao Tan, Kunbin Chen, Wei He, Xiaofan Li, Na Zhao, Jingdong Wang, Zechao Li 11/26/2025 arxiv

machine learning

MLLMs exhibit strong reasoning on isolated queries, yet they operate de novo -- solving each problem independently and often repeating the same mistakes. Existing memory-augmented agents mainly store past trajectories for reuse. However, trajectory-based memory suffers from brevity bias, gradually l...

Keywords: ViLoMem, multimodal semantic memory, dual-stream memory, grow-and-refine, visual distraction, logical errors, MLLMs, trajectory-based memory

Sadegh Shirani, Mohsen Bayati 11/26/2025 arxiv

machine learning

Causal effect estimation in networked systems is central to data-driven decision making. In such settings, interventions on one unit can spill over to others, and in complex physical or social systems, the interaction pathways driving these interference structures remain largely unobserved. We argue...

Keywords: causal inference, interference, spillover effects, exposure mapping, evolution-based models, difference-in-differences, causal message passing, influencer networks

Pandiyaraju V, Sreya Mynampati, Abishek Karthik, Poovarasan L, D. Saraswathi 11/26/2025 arxiv

machine learning

Gliomas are brain tumor types that have a high mortality rate which means early and accurate diagnosis is important for therapeutic intervention for the tumors. To address this difficulty, the proposed research will develop a hybrid deep learning model which integrates U-Net based segmentation and a...

Keywords: glioma, 3D MRI, U‑Net, DenseNet, VGG, multi‑head attention, spatial‑channel attention, segmentation
Loading...

Preparing your export...