Paper Archive

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Currently, enhancing Unified Multimodal Models (UMMs) with image understanding, generation, and editing capabilities mainly relies on mixed multi-task training. Due to inherent task conflicts, such strategy requires complex multi-stage pipelines, massive data mixing, and balancing tricks, merely res...

View Paper

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Reinforcement learning with verifiable rewards (RLVR) has become a dominant paradigm for improving reasoning in large language models (LLMs), yet the underlying geometry of the resulting parameter trajectories remains underexplored. In this work, we demonstrate that RLVR weight trajectories are extr...

Keywords: reinforcement learning, regression

View Paper

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

0

5.0/10

[object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Reinforcement learning from verifiable rewards (RLVR) has emerged as a central technique for improving the reasoning capabilities of large language models. Despite its effectiveness, how response-level rewards translate into token-level probability changes remains poorly understood. We introduce a d...

Keywords: reinforcement learning

View Paper

Mem-π: Adaptive Memory through Learning When and What to Generate

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

We present Mem-π, a framework for adaptive memory in large language model (LLM) agents, where useful guidance is generated on demand rather than retrieved from external memory stores. Existing memory-augmented agents typically rely on similarity-based retrieval from episodic memory banks or skill li...

Keywords: reinforcement learning

View Paper

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Video Virtual Try-On (VVT) aims to seamlessly replace a garment on a person in a video with a new one. While existing methods have made significant strides in maintaining temporal consistency, they are predominantly confined to non-interactive scenarios where models merely showcase garments. This li...

Keywords: transformer

View Paper

SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

As long-horizon coding agents produce more code than any developer can review, oversight collapses onto a single surface: the automated test suite. Reward hacking naturally arises in this setup, as the agent optimizes for passing tests while deviating from the users true goal. We study this reward h...

View Paper

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

0

5.0/10

[object Object], [object Object] 5/20/2026 huggingface

computer vision

Recent layout-to-image models have achieved remarkable progress in spatial controllability. However, they still struggle with inter-object occlusion. When bounding boxes overlap, most existing methods lack explicit occlusion information, which makes the generation in intersection regions inherently ...

Keywords: transformer

View Paper

SR-Ground: Image Quality Grounding for Super-Resolved Content

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Super-Resolution (SR) has advanced rapidly in recent years, with diffusion-based models achieving unprecedented fidelity at the cost of introducing new types of visual artifacts. While existing Image Quality Assessment (IQA) methods provide holistic quality scores, they lack interpretability and fai...

Keywords: fine-tuning, segmentation

View Paper

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

0

5.0/10

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

The key-value (KV) cache dominates memory bandwidth and footprint in long-context autoregressive inference. Recent rotation-preconditioned codecs (TurboQuant, PolarQuant) show that a structured random rotation followed by a per-coordinate scalar quantizer matched to an analytically tractable margina...

Keywords: transformer

View Paper

Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding

0

5.0/10

[object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Manga is a culturally distinctive multimodal medium and one of the most influential forms of Japanese popular culture. As AI systems increasingly target manga understanding, OCR, and translation, Manga109 has become a foundational dataset for manga-related AI research. However, the current Manga109 ...

Keywords: detection

View Paper

Export Archive Data

Browse by Date

Papers for May 21, 2026

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Mem-π: Adaptive Memory through Learning When and What to Generate

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

SR-Ground: Image Quality Grounding for Super-Resolved Content

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding