Paper Archive

How reliable are LLMs when it comes to playing dice?

0

5.0/10

Luca Avena, Gianmarco Bet, Bernardo Busoni 6/5/2026 arxiv

natural language processing

We investigate the probabilistic reasoning capabilities of large language models through a controlled benchmarking study on discrete probability problems. We constructed two datasets, respectively a set of standard exercises and a set of counterintuitive exercises, designed to trigger heuristic reas...

View Paper

UniSHARP: Universal Sharp Monocular View Synthesis

0

5.0/10

Meixi Song, Dizhe Zhang, Hao Ren, Ruiyang Zhang, Bo Du, Ming-Hsuan Yang, Lu Qi 6/5/2026 arxiv

computer vision

In this work, we focus on extending SHARP, the popular photorealistic view synthesis method, for universal monocular rendering across a continuum of camera systems, from conventional perspective cameras to wide-field-of-view, fisheye and omnidirectional panoramic settings. To overcome the pinhole-sp...

View Paper

Agentopia: Long-Term Life Simulation and Learning in Agent Societies

0

5.0/10

Xintao Wang, Sirui Zheng, Hongqiu Wu, Weiyuan Li, Jen-tse Huang, Minghao Zhu, Can Zu, Qi Deng, Jiawei Wang, Qianyu He, Heng Wang, Xiaojian Wu, Yunzhe Tao 6/5/2026 arxiv

reinforcement learning

Humans learn from social life. Simulating this process with LLM-powered agents represents a promising research direction, raising a natural question: whether LLMs can learn from such simulated social experience to better understand and replicate human behavior. However, prior agent society simulatio...

View Paper

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

0

5.0/10

Cong Chen, Guo Gan, Kaixiang Ji, ChaoYang Zhang, Zhen Yang, Guangming Yao, Hao Chen, Jingdong Chen, Yi Yuan, Chunhua Shen 6/5/2026 arxiv

computer vision

Current Vision-Language Models struggle with hours-long videos because processing full-length visual sequences induces prohibitive token explosion and attention dilution. To overcome this, we introduce MemDreamer to decouple perception and reasoning, shifting long-video understanding into an agentic...

Keywords: attention

View Paper

Streaming Video Generation with Streaming Force Control

0

5.0/10

Hanhui Wang, Yiming Xie, Haiwen Feng, Zhaoyang Lv, Shenlong Wang, Huaizu Jiang 6/5/2026 arxiv

machine learning

We introduce StreamForce, a streaming video generation framework that enables physically grounded control through continuous force inputs. Unlike prior video models that train separate models for different force types, assume fixed forces, or rely on non-causal processing, StreamForce is a causal an...

View Paper

Affordance-Based Hierarchical Reinforcement Learning for Quadruped Pedipulation

0

5.0/10

Tuba Girgin, Jose Castelblanco, Gabriel Rodriguez, Emre Girgin, Cagri Kilic 6/5/2026 arxiv

reinforcement learning

The object manipulation capabilities of quadruped robots is an open research challenge. While previous studies have focused on low-level policy learning, task execution still relies on expert-designed high-level trajectories. Autonomous selection of both an affordable interaction point on the target...

Keywords: reinforcement learning

View Paper

Differences in Detection: Explainability Where it Matters

0

5.0/10

Johannes Theodoridis, Johannes Maucher, Andreas Schilling 6/5/2026 arxiv

reinforcement learning

We propose Differences in Detection (DnD), an intuitive method to compare two object detection models. Based on the same matching algorithm, it complements the standard metrics of mean Average Precision ($mAP$) and TIDE error analysis with the ability to compare two models directly. More specificall...

Keywords: detection

View Paper

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

0

5.0/10

Songhao Wu, Zhongxin Chen, Yuxuan Liu, Heng Cui, Cong Li, Rui Yan 6/5/2026 arxiv

natural language processing

Large language models exhibit impressive zero-shot capabilities across a wide range of downstream tasks. However, they struggle to function as off-the-shelf embedding models, leading to suboptimal performance on massive text embedding benchmarks. In this paper, we identify a potential cause underlyi...

View Paper

Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

0

5.0/10

Fatema Siddika, Md Anwar Hossen, Tanwi Mallick, Ali Jannesari 6/5/2026 arxiv

natural language processing

Continual learning in Large Language Models (LLMs) is hindered by the plasticity-stability dilemma, where acquiring new capabilities often leads to catastrophic forgetting of previous knowledge. Existing methods typically treat parameters uniformly, failing to distinguish between specific task knowl...

View Paper

Implicit Data Synthesis for Contrastive Unsupervised Data Augmentation

0

5.0/10

Patrick Kage, Trevor Hedges, N. Siddharth, Pavlos Andreadis 6/5/2026 arxiv

computer vision

Scientific observations generate large quantities of unlabeled data which is laborious to hand-label, making unsupervised learning techniques valuable for processing datasets. Among these approaches, contrastive learning provides a convenient mechanism for extracting structural representations from ...

View Paper

Export Archive Data

Browse by Date

Papers for June 8, 2026

How reliable are LLMs when it comes to playing dice?

UniSHARP: Universal Sharp Monocular View Synthesis

Agentopia: Long-Term Life Simulation and Learning in Agent Societies

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Streaming Video Generation with Streaming Force Control

Affordance-Based Hierarchical Reinforcement Learning for Quadruped Pedipulation

Differences in Detection: Explainability Where it Matters

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning

Implicit Data Synthesis for Contrastive Unsupervised Data Augmentation