Paper Archive

Scale Space Diffusion

0

9.0/10

Soumik Mukhopadhyay, Prateksha Udhayanan, Abhinav Shrivastava 3/9/2026 arxiv

generative models

Diffusion models degrade images through noise, and reversing this process reveals an information hierarchy across timesteps. Scale-space theory exhibits a similar hierarchy via low-pass filtering. We formalize this connection and show that highly noisy diffusion states contain no more information th...

Keywords: diffusion models, scale-space, downsampling, Flexi-UNet, generative models, image synthesis, CelebA, ImageNet

View Paper

Agentic Critical Training

0

9.0/10

Weize Liu, Minghui Liu, Sy-Tuyen Ho, Souradip Chakraborty, Xiyao Wang, Furong Huang 3/9/2026 arxiv

reinforcement learning

Training large language models (LLMs) as autonomous agents often begins with imitation learning, but it only teaches agents what to do without understanding why: agents never contrast successful actions against suboptimal alternatives and thus lack awareness of action quality. Recent approaches atte...

Keywords: Agentic Critical Training, ACT, self-reflection, reinforcement learning, imitation learning, LLM agents, out-of-distribution generalization

View Paper

Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines

0

9.0/10

Akshay Gulati, Kanha Singhania, Tushar Banga, Parth Arora, Anshul Verma, Vaibhav Kumar Singh, Agyapal Digra, Jayant Singh Bisht, Danish Sharma, Varun Singla, Shubh Garg 3/9/2026 arxiv

machine learning

Large language models are increasingly used for financial analysis and investment research, yet systematic evaluation of their financial reasoning capabilities remains limited. In this work, we introduce the AI Financial Intelligence Benchmark (AFIB), a multi-dimensional evaluation framework designe...

Keywords: AFIB, financial intelligence, LLMs, benchmark, SuperInvesting, Perplexity, factual accuracy, analytical completeness

View Paper

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

0

9.0/10

Kai Zou, Dian Zheng, Hongbo Liu, Tiankai Hang, Bin Liu, Nenghai Yu 3/9/2026 arxiv

generative models

Autoregressive (AR) diffusion offers a promising framework for generating videos of theoretically infinite length. However, a major challenge is maintaining temporal continuity while preventing the progressive quality degradation caused by error accumulation. To ensure continuity, existing methods t...

Keywords: autoregressive diffusion, hierarchical denoising, long video generation, temporal continuity, error accumulation, forward-KL, self-rollout distillation, pipelined inference

View Paper

A Multi-Objective Optimization Approach for Sustainable AI-Driven Entrepreneurship in Resilient Economies

0

9.0/10

Anas ALsobeh, Raneem Alkurdi 3/9/2026 arxiv

machine learning

The rapid advancement of artificial intelligence (AI) technologies presents both unprecedented opportunities and significant challenges for sustainable economic development. While AI offers transformative potential for addressing environmental challenges and enhancing economic resilience, its deploy...

Keywords: EcoAI-Resilience, multi-objective optimization, sustainability, economic resilience, renewable energy, AI deployment, energy consumption, entrepreneurship

View Paper

Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio

0

9.0/10

Phillip Long, Zachary Novack, Chris Donahue 3/9/2026 arxiv

machine learning

Autoregressive "language" models (LMs) trained on raw waveforms can be repurposed for lossless audio compression, but prior work is limited to 8-bit audio, leaving open whether such approaches work for practical settings (16/24-bit) and can compete with existing codecs. We benchmark LM-based compres...

Keywords: lossless compression, language models, audio, Trilobyte, tokenization, FLAC, 24-bit audio, byte-level

View Paper

ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation

0

9.0/10

Nanjun Li, Pinqi Cheng, Zean Liu, Minghe Tian, Xuanyin Wang 3/9/2026 arxiv

computer vision

Single-stage multi-person pose estimation aims to jointly perform human localization and keypoint prediction within a unified framework, offering advantages in inference efficiency and architectural simplicity. Consequently, multi-scale real-time detection architectures, such as YOLO-like models, ar...

Keywords: ER-Pose, keypoint-driven, single-stage, pose estimation, YOLO-Pose, OKS, dynamic sample assignment, NMS-free

View Paper

A New Lower Bound for the Random Offerer Mechanism in Bilateral Trade using AI-Guided Evolutionary Search

0

9.0/10

Yang Cai, Vineet Gupta, Zun Li, Aranyak Mehta 3/9/2026 arxiv

machine learning

The celebrated Myerson--Satterthwaite theorem shows that in bilateral trade, no mechanism can be simultaneously fully efficient, Bayesian incentive compatible (BIC), and budget balanced (BB). This naturally raises the question of how closely the gains from trade (GFT) achievable by a BIC and BB mech...

Keywords: Random-Offerer, bilateral trade, Myerson-Satterthwaite, gains from trade, AlphaEvolve, evolutionary search, lower bound, mechanism design

View Paper

Talking Together: Synthesizing Co-Located 3D Conversations from Audio

0

9.0/10

Mengyi Shan, Shouchieh Chang, Ziqian Bai, Shichen Liu, Yinda Zhang, Luchuan Song, Rohit Pandey, Sean Fanello, Zeng Huang 3/9/2026 arxiv

computer vision

We tackle the challenging task of generating complete 3D facial animations for two interacting, co-located participants from a mixed audio stream. While existing methods often produce disembodied "talking heads" akin to a video conference call, our work is the first to explicitly model the dynamic 3...

Keywords: 3D facial animation, dyadic conversations, mixed audio disentanglement, dual-stream architecture, speaker role embeddings, cross-attention, gaze loss, large-scale dataset

View Paper

Exp-Force: Experience-Conditioned Pre-Grasp Force Selection with Vision-Language Models

0

9.0/10

Siqi Shang, Minchao Huang, Bill Fan, Lillian Chin 3/9/2026 arxiv

robotics

Accurate pre-contact grasp force selection is critical for safe and reliable robotic manipulation. Adaptive controllers regulate force after contact but still require a reasonable initial estimate. Starting a grasp with too little force requires reactive adjustment, while starting a grasp with too h...

Keywords: pre-grasp force, vision-language model, in-context learning, experience-conditioned, robotic grasping, compliant grippers

View Paper

Export Archive Data

Browse by Date

Papers for March 10, 2026

Scale Space Diffusion

Agentic Critical Training

Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

A Multi-Objective Optimization Approach for Sustainable AI-Driven Entrepreneurship in Resilient Economies

Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio

ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation

A New Lower Bound for the Random Offerer Mechanism in Bilateral Trade using AI-Guided Evolutionary Search

Talking Together: Synthesizing Co-Located 3D Conversations from Audio

Exp-Force: Experience-Conditioned Pre-Grasp Force Selection with Vision-Language Models