Paper Archive

Representation Forcing for Bottleneck-Free Unified Multimodal Models

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Unified multimodal models (UMMs) aim to handle perception and generation in a single model. Yet existing UMMs still rely on a frozen, separately pretrained VAE for image generation, imposing a structural bottleneck. Naively removing it introduces a quality gap, as the model must learn both high-leve...

View Paper

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Connector-based video unified models have demonstrated strong capability in instruction-grounded video synthesis, but integrating a large high-fidelity generator into the unified training loop is computationally prohibitive, limiting achievable visual quality. We therefore propose Lumos-Nexus, a tra...

View Paper

Linear Scaling Video VLMs for Long Video Understanding

0

5.0/10

[object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Video vision-language models (VLMs) are increasingly used in long-horizon and streaming settings, yet most video encoders still rely on spatiotemporal self-attention, causing compute and latency to grow quadratically with the number of frames. Existing efficiency methods improve scalability but ofte...

Keywords: attention, fine-tuning

View Paper

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Long-context reasoning remains a central challenge for large language models, which often fail to locate and integrate key information in extensive distracting content. Reinforcement learning with verifiable rewards (RLVR) has shown promise for this task, yet existing methods are limited by low-conf...

Keywords: reinforcement learning

View Paper

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

natural language processing

GPU kernels are the workhorse of modern deep learning, and optimizing them (via evolutionary search or coding agents) usually requires repeated measurement on target hardware. While these measurements provide the ground-truth signal necessary for kernel search, they are costly, because each evaluati...

Keywords: deep learning, reinforcement learning

View Paper

DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

natural language processing

Large language models are increasingly deployed in multi-turn interactive settings where users or environments can iteratively provide lightweight feedback. Unfortunately, optimizing such behavior presents a sharp dilemma in practice: online reinforcement learning is able to effectively address mult...

Keywords: reinforcement learning, fine-tuning

View Paper

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Self-play can train language models without external supervision. However, existing methods require rule-checkable answers, leaving open-ended tasks dependent on curated prompts or frontier-model judges. We introduce SCOPE, a data-free self-play framework for open-ended tasks that co-evolves two pol...

View Paper

Bundesrecht: An Open Library and Corpus for German Statutory Reference Processing

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

computer vision

Statutory references are central to legal language understanding, but are difficult to process automatically, as they appear in compact and variable surface forms, may combine multiple targets, use special abbreviations, and often point to lower-level units. Existing tools for German focus either on...

View Paper

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

reinforcement learning

Recent advances in video generative models have promoted rapid progress in controllable world models. However, maintaining fine-grained spatio-temporal consistency under long-horizon reasoning remains a key challenge. In this work, we move beyond explicit 3D memory and coarse frame-level implicit mo...

Keywords: attention

View Paper

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object] 5/29/2026 huggingface

natural language processing

LLM agents are increasingly expected not only to complete isolated tasks, but also to carry bounded representations of human expertise, judgment, and interaction style. Building such person-grounded agents remains difficult because actionable knowledge associated with a person or role is usually emb...

View Paper

Export Archive Data

Browse by Date

Papers for June 1, 2026

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models

Linear Scaling Video VLMs for Long Video Understanding

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Bundesrecht: An Open Library and Corpus for German Statutory Reference Processing

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation