Paper Archive

HyCOP: Hybrid Composition Operators for Interpretable Learning of PDEs

0

9.0/10

Jinpai Zhao, Nishant Panda, Yen Ting Lin, Eirik Valseth, Diane Oyen, Clint Dawson 5/1/2026 arxiv

machine learning

We introduce HyCOP, a modular framework that learns parametric PDE solution operators by composing simple modules (advection, diffusion, learned closures, boundary handling) in a query-conditioned way. Rather than learning a monolithic map, HyCOP learns a policy over short programs - which module to...

Keywords: HyCOP, hybrid composition, neural operators, PDEs, modularity, interpretability, OOD generalization, scientific machine learning

View Paper

When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models

0

9.0/10

Sailesh Panda, Pritam Kadasi, Abhishek Upperwal, Mayank Singh 5/1/2026 arxiv

machine learning

Large language models (LLMs) often achieve strong performance on reasoning benchmarks, but final-answer accuracy alone does not show whether they faithfully execute the procedure specified in a prompt. We study this question through a controlled diagnostic benchmark for procedural execution, where m...

Keywords: LLMs, procedural execution, diagnostic benchmark, arithmetic algorithms, faithfulness, generation errors, model evaluation, hallucination

View Paper

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

0

9.0/10

Siyuan Huang, Xiaoye Qu, Yafu Li, Tong Zhu, Zefeng He, Muxin Fu, Daizong Liu, Wei-Long Zheng, Yu Cheng 5/1/2026 arxiv

machine learning

While autoregressive Large Vision-Language Models (LVLMs) demonstrate remarkable proficiency in multimodal tasks, they face a "Visual Signal Dilution" phenomenon, where the accumulation of textual history expands the attention partition function, causing visual attention to decay inversely with gene...

Keywords: Persistent Visual Memory, PVM, Visual Signal Dilution, LVLMs, Qwen3-VL, attention decay, feed-forward network

View Paper

Let ViT Speak: Generative Language-Image Pre-training

0

9.0/10

Yan Fang, Mengcheng Lan, Zilong Huang, Weixian Lei, Yunqing Zhao, Yujie Zhong, Yingchen Yu, Qi She, Yao Zhao, Yunchao Wei 5/1/2026 arxiv

machine learning

In this paper, we present \textbf{Gen}erative \textbf{L}anguage-\textbf{I}mage \textbf{P}re-training (GenLIP), a minimalist generative pretraining framework for Vision Transformers (ViTs) designed for multimodal large language models (MLLMs). To better align vision encoders with the autoregressive n...

Keywords: ViT, GenLIP, generative pretraining, vision encoder, MLLM, language modeling objective, Recap-DataComp-1B, multi-resolution

View Paper

Can Coding Agents Reproduce Findings in Computational Materials Science?

0

9.0/10

Ziyang Huang, Yi Cao, Ali K. Shargh, Jing Luo, Ruidong Mei, Mohd Zaki, Zhan Liu, Wyatt Bunstine, William Jurayj, Somdatta Goswami, Tyrel McQueen, Michael Shields, Jaafar El-Awady, Paulette Clancy, Benjamin Van Durme, Nicholas Andrews, William Walden, Daniel Khashabi 5/1/2026 arxiv

machine learning

Large language models are increasingly deployed as autonomous coding agents and have achieved remarkably strong performance on software engineering benchmarks. However, it is unclear whether such success transfers to computational scientific workflows, where tasks require not only strong coding abil...

Keywords: AutoMat, LLM-based agents, computational materials science, reproducibility, benchmark, coding agents, toolchains, execution fragility

View Paper

Generating Statistical Charts with Validation-Driven LLM Workflows

0

9.0/10

Pavlin G. Poličar, Andraž Pevcin, Blaž Zupan 5/1/2026 arxiv

machine learning

Generating diverse, readable statistical charts from tabular data remains challenging for LLMs, as many failures become apparent after rendering and are not detectable from data or code alone. Existing chart datasets also rarely provide fully aligned artifacts, such as executable code, dataset conte...

Keywords: LLM, chart generation, rendered-output validation, data visualization, multimodal LLMs, UCI datasets, question-answer pairs

View Paper

RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

0

9.0/10

Arunabh Srivastava, Mohammad A., Khojastepour, Srimat Chakradhar, Sennur Ulukus 5/1/2026 arxiv

natural language processing

Humans solve problems by executing targeted plans, yet large language models (LLMs) remain unreliable for structured workflow execution. We propose RunAgent, a multi-agent plan execution platform that interprets natural-language plans while enforcing stepwise execution through constraints and rubric...

Keywords: RunAgent, LLM, plan execution, constraint-guided, agentic language, code generation, tool use, rubrics

View Paper

When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI

0

9.0/10

Alfredo Madrid-García, Miguel Rujas 5/1/2026 arxiv

machine learning

Background: Patient-facing medical chatbots based on retrieval-augmented generation (RAG) are increasingly promoted to deliver accessible, grounded health information. AI-assisted development lowers the barrier to building them, but they still demand rigorous security, privacy, and governance contro...

Keywords: RAG, retrieval-augmented generation, medical chatbot, privacy, security, system prompt, LLM-assisted audit, Chrome DevTools

View Paper

Make Your LVLM KV Cache More Lightweight

0

9.0/10

Xihao Chen, Yangyang Guo, Roger Zimmermann 5/1/2026 arxiv

machine learning

Key-Value (KV) cache has become a de facto component of modern Large Vision-Language Models (LVLMs) for inference. While it enhances decoding efficiency in Large Language Models (LLMs), its direct adoption in LVLMs introduces substantial GPU memory overhead due to the large number of vision tokens p...

Keywords: LightKV, KV cache, LVLM, vision-token compression, cross-modality message passing, prompt-aware, prefill, GPU memory

View Paper

GeoContra: From Fluent GIS Code to Verifiable Spatial Analysis with Geography-Grounded Repair

0

9.0/10

Yinhao Xiao, Rongbo Xiao, Yihan Zhang 5/1/2026 arxiv

machine learning

Reliable spatial analysis in GIScience requires preserving coordinate semantics, topology, units, and geographic plausibility. Current LLM-based GIS systems generate fluent scripts but rarely enforce these geographic rules at scale. We present GeoContra, a verification and repair framework for LLM-d...

Keywords: GeoContra, geospatial contract, GIS, LLM, verification, repair, CRS, topology

View Paper

Export Archive Data

Browse by Date

Papers for May 4, 2026

HyCOP: Hybrid Composition Operators for Interpretable Learning of PDEs

When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

Let ViT Speak: Generative Language-Image Pre-training

Can Coding Agents Reproduce Findings in Computational Materials Science?

Generating Statistical Charts with Validation-Driven LLM Workflows

RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI

Make Your LVLM KV Cache More Lightweight

GeoContra: From Fluent GIS Code to Verifiable Spatial Analysis with Geography-Grounded Repair