Paper Archive

Browse and export your curated research paper collection

247
Archived Days
2458
Total Papers
7.8
Avg Score
9
Categories

Export Archive Data

Download your archived papers in various formats

JSON: Complete data with analysis • CSV: Tabular data for analysis • Markdown: Human-readable reports • BibTeX: Academic citations
Browse by Date

Papers for May 21, 2026

10 papers found

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Currently, enhancing Unified Multimodal Models (UMMs) with image understanding, generation, and editing capabilities mainly relies on mixed multi-task training. Due to inherent task conflicts, such strategy requires complex multi-stage pipelines, massive data mixing, and balancing tricks, merely res...

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Reinforcement learning with verifiable rewards (RLVR) has become a dominant paradigm for improving reasoning in large language models (LLMs), yet the underlying geometry of the resulting parameter trajectories remains underexplored. In this work, we demonstrate that RLVR weight trajectories are extr...

Keywords: reinforcement learning, regression

[object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Reinforcement learning from verifiable rewards (RLVR) has emerged as a central technique for improving the reasoning capabilities of large language models. Despite its effectiveness, how response-level rewards translate into token-level probability changes remains poorly understood. We introduce a d...

Keywords: reinforcement learning

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

We present Mem-π, a framework for adaptive memory in large language model (LLM) agents, where useful guidance is generated on demand rather than retrieved from external memory stores. Existing memory-augmented agents typically rely on similarity-based retrieval from episodic memory banks or skill li...

Keywords: reinforcement learning

[object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

Video Virtual Try-On (VVT) aims to seamlessly replace a garment on a person in a video with a new one. While existing methods have made significant strides in maintaining temporal consistency, they are predominantly confined to non-interactive scenarios where models merely showcase garments. This li...

Keywords: transformer

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

As long-horizon coding agents produce more code than any developer can review, oversight collapses onto a single surface: the automated test suite. Reward hacking naturally arises in this setup, as the agent optimizes for passing tests while deviating from the users true goal. We study this reward h...

[object Object], [object Object] 5/20/2026 huggingface

computer vision

Recent layout-to-image models have achieved remarkable progress in spatial controllability. However, they still struggle with inter-object occlusion. When bounding boxes overlap, most existing methods lack explicit occlusion information, which makes the generation in intersection regions inherently ...

Keywords: transformer

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Super-Resolution (SR) has advanced rapidly in recent years, with diffusion-based models achieving unprecedented fidelity at the cost of introducing new types of visual artifacts. While existing Image Quality Assessment (IQA) methods provide holistic quality scores, they lack interpretability and fai...

Keywords: fine-tuning, segmentation

[object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

natural language processing

The key-value (KV) cache dominates memory bandwidth and footprint in long-context autoregressive inference. Recent rotation-preconditioned codecs (TurboQuant, PolarQuant) show that a structured random rotation followed by a per-coordinate scalar quantizer matched to an analytically tractable margina...

Keywords: transformer

[object Object], [object Object], [object Object], [object Object], [object Object] 5/20/2026 huggingface

computer vision

Manga is a culturally distinctive multimodal medium and one of the most influential forms of Japanese popular culture. As AI systems increasingly target manga understanding, OCR, and translation, Manga109 has become a foundational dataset for manga-related AI research. However, the current Manga109 ...

Keywords: detection
Loading...

Preparing your export...