Paper Archive

Latent Collaboration in Multi-Agent Systems

0

9.0/10

Unknown authors 11/27/2025 huggingface

machine learning

Multi-agent systems (MAS) extend large language models (LLMs) from independent single-model reasoning to coordinative system-level intelligence. While existing LLM agents depend on text-based mediation for reasoning and communication, we take a step forward by enabling models to collaborate directly...

Keywords: LatentMAS, multi-agent systems, LLM agents, latent collaboration, latent working memory, training-free, efficiency

View Paper

Monet: Reasoning in Latent Visual Space Beyond Images and Language

0

9.0/10

Unknown authors 11/27/2025 huggingface

machine learning

"Thinking with images" has emerged as an effective paradigm for advancing visual reasoning, extending beyond text-only chains of thought by injecting visual evidence into intermediate reasoning steps. However, existing methods fall short of human-like visual thinking, as their flexibility is fundame...

Keywords: Monet, latent visual reasoning, VLPO, Monet-SFT-125K, multimodal LLM, distillation SFT, visual chain-of-thought, GRPO limitation

View Paper

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

0

9.0/10

Unknown authors 11/27/2025 huggingface

machine learning

We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whether training on easier or harder data leads to better results, and whether those gains come on easie...

Keywords: LLMs, Item Response Theory, difficulty, generalization, data curation, evaluation, curriculum learning

View Paper

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

0

9.0/10

Unknown authors 11/27/2025 huggingface

computer vision

World models serve as core simulators for fields such as agentic AI, embodied AI, and gaming, capable of generating long, physically realistic, and interactive high-quality videos. Moreover, scaling these models could unlock emergent capabilities in visual perception, understanding, and reasoning, p...

Keywords: block-diffusion, semi-autoregressive decoding, world models, video generation, KV cache, Inferix, LV-Bench, interactive streaming

View Paper

Terminal Velocity Matching

0

9.0/10

Unknown authors 11/27/2025 huggingface

generative models

We propose Terminal Velocity Matching (TVM), a generalization of flow matching that enables high-fidelity one- and few-step generative modeling. TVM models the transition between any two diffusion timesteps and regularizes its behavior at its terminal time rather than at the initial time. We prove t...

Keywords: Terminal Velocity Matching, TVM, flow matching, diffusion, Diffusion Transformer, Lipschitz, 2-Wasserstein, fused attention

View Paper

Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization

0

9.0/10

Unknown authors 11/27/2025 huggingface

computer vision

Despite 3D Gaussian Splatting (3DGS) excelling in most configurations, it lacks generalization across novel viewpoints in a few-shot scenario because it overfits to the sparse observations. We revisit 3DGS optimization from a machine learning perspective, framing novel view synthesis as a generaliza...

Keywords: 3D Gaussian Splatting, FASR, Sharpness-Aware Minimization, frequency-adaptive, novel view synthesis, few-shot, generalization, neural rendering

View Paper

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots

0

9.0/10

Unknown authors 11/27/2025 huggingface

robotics

Grounding natural-language instructions into continuous control for quadruped robots remains a fundamental challenge in vision language action. Existing methods struggle to bridge high-level semantic reasoning and low-level actuation, leading to unstable grounding and weak generalization in the real...

Keywords: vision-language-action, MobileVLA-R1, MobileVLA-CoT, chain-of-thought, GRPO, quadruped, VLN, VLA

View Paper

NVIDIA Nemotron Parse 1.1

0

3.0/10

Unknown authors 11/27/2025 huggingface

machine learning

This paper presents research on nvidia, nemotron, parse. The full abstract is not available at this time. Please visit the paper's website for complete details about the methodology, results, and contributions.

Keywords: NVIDIA, Nemotron, Parse, missing_abstract, reproducibility

View Paper

Export Archive Data

Browse by Date

Papers for November 27, 2025

Latent Collaboration in Multi-Agent Systems

Monet: Reasoning in Latent Visual Space Beyond Images and Language

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Terminal Velocity Matching

Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots

NVIDIA Nemotron Parse 1.1