Paper Archive

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

Modern Mixture-of-Experts (MoE) architectures allocate expert capacity through a rigid per-layer rule: each transformer layer owns a separate expert set. This convention couples depth scaling with linear expert-parameter growth and assumes that every layer needs isolated expert capacity. However, re...

Keywords: transformer

View Paper

EMO: Pretraining Mixture of Experts for Emergent Modularity

0

5.0/10

[object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e.g., code, math, or domain-specific knowledge. Mixture-of-Experts (MoEs) seemingly offer a potential alternative by activating only a subset...

Keywords: pretraining

View Paper

Relit-LiVE: Relight Video by Jointly Learning Environment Video

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

computer vision

Recent advances have shown that large-scale video diffusion models can be repurposed as neural renderers by first decomposing videos into intrinsic scene representations and then performing forward rendering under novel illumination. While promising, this paradigm fundamentally relies on accurate in...

Keywords: diffusion model

View Paper

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

Many deployments must compare candidate language models for safety before a labeled benchmark exists for the relevant language, sector, or regulatory regime. We formalize this setting as benchmarkless comparative safety scoring and specify the contract under which a scenario-based audit can be inter...

View Paper

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

reinforcement learning

We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician is optimized to provide holistic support for the exploratory and iterative reality of mathematical workflows, including ideation, literature...

View Paper

Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

reinforcement learning

Despite the growing popularity of Multimodal Domain Generalization (MMDG) for enhancing model robustness, it remains unclear whether reported performance gains reflect genuine algorithmic progress or are artifacts of inconsistent evaluation protocols. Current research is fragmented, with studies var...

Keywords: neural network, detection, classification

View Paper

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficulty has been hampered by the lack of controlled, scalable environments. We introduce ScaleLogic, a synthetic logical reasoning framework that...

Keywords: reinforcement learning

View Paper

SkillOS: Learning Skill Curation for Self-Evolving Agents

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

reinforcement learning

LLM-based agents are increasingly deployed to handle streaming tasks, yet they often remain one-off problem solvers that fail to learn from past interactions. Reusable skills distilled from experience provide a natural substrate for self-evolution, where high-quality skill curation serves as the key...

View Paper

Continuous Latent Diffusion Language Model

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve generation efficiency, scalable representation learning, and effective g...

View Paper

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/7/2026 huggingface

natural language processing

In recent years, open-source efforts like Senorita-2M have propelled video editing toward natural language instruction. However, current publicly available datasets predominantly focus on local editing or style transfer, which largely preserve the original scene structure and are easier to scale. In...

View Paper

Export Archive Data

Browse by Date

Papers for May 8, 2026

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

EMO: Pretraining Mixture of Experts for Emergent Modularity

Relit-LiVE: Relight Video by Jointly Learning Environment Video

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

SkillOS: Learning Skill Curation for Self-Evolving Agents

Continuous Latent Diffusion Language Model

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance