Paper Archive

Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training

9.0/10

Valentin Lacombe, Valentin Quesnel, Damien Sileo 3/2/2026 arxiv

machine learning

Training on verifiable symbolic data is a promising way to expand the reasoning frontier of language models beyond what standard pre-training corpora provide. Yet existing procedural generators often rely on fixed puzzles or templates and do not deliver the distributional breadth needed at scale. We...

Keywords: procedural generation, symbolic reasoning, pre-training, PDDL, first-order logic, context-free grammar, Bayesian networks, solver verification

Partial Causal Structure Learning for Valid Selective Conformal Inference under Interventions

9.0/10

Amir Asiaee, Kavey Aryan, James P. Long 3/2/2026 arxiv

machine learning

Selective conformal prediction can yield substantially tighter uncertainty sets when we can identify calibration examples that are exchangeable with the test example. In interventional settings, such as perturbation experiments in genomics, exchangeability often holds only within subsets of interven...

Keywords: Selective conformal prediction, Causal inference, Interventions, Invariant prediction, Calibration, Descendant discovery, CRISPRi, Structural equation models

Frontier Models Can Take Actions at Low Probabilities

9.0/10

Alex Serrano, Wen Xing, David Lindner, Erik Jenner 3/2/2026 arxiv

machine learning

Pre-deployment evaluations inspect only a limited sample of model actions. A malicious model seeking to evade oversight could exploit this by randomizing when to "defect": misbehaving so rarely that no malicious actions are observed during evaluation, but often enough that they occur eventually in d...

Keywords: calibration, frontier models, probabilistic prompting, chain-of-thought, model evaluation, AI safety, GPT-5, Claude-4.5

Conformal Policy Control

9.0/10

Drew Prinster, Clara Fannjiang, Ji Won Park, Kyunghyun Cho, Anqi Liu, Suchi Saria, Samuel Stanton 3/2/2026 arxiv

reinforcement learning

An agent must try new behaviors to explore and improve. In high-stakes environments, an agent that violates safety constraints may cause harm and must be taken offline, curtailing any future interaction. Imitating old behavior is safe, but excessive conservatism discourages exploration. How much beh...

Keywords: conformal calibration, safe exploration, policy control, finite-sample guarantees, reinforcement learning, risk control, biomolecular engineering, question answering

Symbol-Equivariant Recurrent Reasoning Models

9.0/10

Richard Freinschlag, Timo Bertram, Erich Kobler, Andreas Mayr, Günter Klambauer 3/2/2026 arxiv

machine learning

Reasoning problems such as Sudoku and ARC-AGI remain challenging for neural networks. The structured problem-solving architecture family of Recurrent Reasoning Models (RRMs), including Hierarchical Reasoning Model (HRM) and Tiny Recursive Model (TRM), offers a compact alternative to large language mo...

Keywords: symbol-equivariance, permutation equivariance, recurrent reasoning models, RRM, Sudoku, ARC-AGI, symmetry, data augmentation

Multi-Head Low-Rank Attention

9.0/10

Songtao Liu, Hongwu Peng, Zhiwei Zhang, Zhengyu Chen, Yue Guo 3/2/2026 arxiv

natural language processing

Long-context inference in large language models is bottlenecked by Key-Value (KV) cache loading during the decoding stage, where the sequential nature of generation requires repeatedly transferring the KV cache from off-chip High-Bandwidth Memory (HBM) to on-chip Static Random-Access Memory (SRAM) ...

Keywords: MLRA, MLA, low-rank attention, KV cache, tensor parallelism, long-context, decoding speedup, LLM serving

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

9.0/10

Jinqi Wu, Sishuo Chen, Zhangming Chan, Yong Bai, Lei Zhang, Sheng Chen, Chenghuan Hou, Xiang-Rong Sheng, Han Zhu, Jian Xu, Bo Zheng, Chaoyou Fu 3/2/2026 arxiv

machine learning

Multi-attribution learning (MAL), which enhances model performance by learning from conversion labels yielded by multiple attribution mechanisms, has emerged as a promising learning paradigm for conversion rate (CVR) prediction. However, the conversion labels in public CVR datasets are generated by ...

Keywords: multi-attribution learning, conversion rate prediction, benchmark dataset, MoAE, PyMAL, advertising

Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta

9.0/10

Quoc-Khang Tran, Minh-Thien Nguyen, Nguyen-Khang Pham 3/2/2026 arxiv

computer vision

The classification of Intangible Cultural Heritage (ICH) images in the Mekong Delta poses unique challenges due to limited annotated data, high visual similarity among classes, and domain heterogeneity. In such low-resource settings, conventional deep learning models often suffer from high variance ...

Keywords: CoAtNet, model soups, checkpoint averaging, ICH-17, bias-variance, MDS, cross-entropy distance, low-resource classification

Organizing, Orchestrating, and Benchmarking Agent Skills at Ecosystem Scale

9.0/10

Hao Li, Chunjiang Mu, Jianhao Chen, Siyue Ren, Zhiyao Cui, Yiqun Zhang, Lei Bai, Shuyue Hu 3/2/2026 arxiv

machine learning

The rapid proliferation of Claude agent skills has raised the central question of how to effectively leverage, manage, and scale the agent skill ecosystem. In this paper, we propose AgentSkillOS, the first principled framework for skill selection, orchestration, and ecosystem-level management. Agent...

Keywords: AgentSkillOS, skill selection, capability tree, DAG orchestration, skill ecosystem, benchmark, Bradley-Terry, LLM evaluation

Efficient Diffusion Models for High-Resolution Image Generation

9.0/10

AI Research Community 3/3/2026 huggingface

generative models

We present novel techniques for accelerating diffusion models while maintaining high-quality image generation. Our approach combines architectural improvements with advanced sampling strategies to achieve significant speedup in inference time without compromising output quality.

Keywords: diffusion models, high-resolution image generation, inference acceleration, sampling strategies, model architecture, efficiency

