ArXiV ML/AI/CV papers summary

Jun 29, 2026

Papers summarized today: 529

Theme 1: Mechanistic Interpretability & Model Forensics

The field is evolving from simple attribution toward a rigorous, causal understanding of internal model states. We are moving beyond “black-box” testing to peer into the very neurons of these systems. Research like The Curse of Multiple Mediators: Hidden Interaction Effects in Activation Patching and Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs highlights that latent representations are complex, causal structures that require axiomatic frameworks to decode.

This interpretability extends to specialized domains and safety: PairSAE: Mechanistic Interpretability from Pair Representations in Protein Co-Folding and VASAE: Naming SAE Dictionary Directions with Vocabulary-Aligned Anchoring bridge the gap between abstract latent directions and human-readable concepts. In safety, Model Forensics: Investigating Whether Concerning Behavior Reflects Misalignment, Robust Harmful Features Under Jailbreak Attacks: Mechanistic Evidence from Attention Head Specialization in Large Language Models, and The Alignment Target Problem: Divergent Moral Judgments of Humans, AI Systems, and Their Designers reveal that safety is a fragile balance of internal signals, and that “alignment” is as much a social construct as a technical one. Finally, efforts like Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning and Do Vision Models Truly Forget? New Findings from Representation-Level Certification of Visual Unlearning in Vertical Federated Learning emphasize that true “unlearning” requires rigorous auditing to remove deep representational traces.

Theme 2: Efficient Inference & Architectural Optimization

As models scale, the computational cost of inference—particularly the KV cache—has become a primary bottleneck. We are seeing a shift toward hardware-aware optimization and dynamic memory management. RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory, RedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttention, and ReFreeKV: Towards Threshold-Free KV Cache Compression treat the KV cache as a dynamic, model-aware object.

Architectural innovations are further driving efficiency: Simplified Sparse Attention via Gist Tokens, Prism Transformer: Progressive Head Schedules for Hierarchical Attention Processing, Accelerating Attention with Basis Decomposition, and Flexformer: Flexible Linear Transformer with Learnable Attention Kernel provide ways to maintain expressiveness while reducing complexity. Additionally, hardware-software co-design—exemplified by FP8 is All You Need (Part 2): Efficient Ozaki-Bailey Style FFT Through Tensor-core Garner Reformulation and Kulisch Escape Route—and pruning techniques like REFINE: Super-efficient 3D Gaussian Splatting Pruning via Rendering-Free Primitive Importance and IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models demonstrate that we can achieve massive speedups by mathematically identifying and removing redundant information.

Theme 3: Agentic Reasoning, Planning, and Ecosystem Governance

We are transitioning from isolated chatbots to autonomous agents that operate within complex software and physical ecosystems. This shift necessitates new paradigms for reliability and security. Agent-Native Immune System: Architecture, Taxonomy, and Engineering, Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning, Towards Reliable and Robust LLM Planning: Symbolic Feedback-Driven Iterative Self-Refinement Framework, and Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents move agents toward verifiable, grounded decision-making.

However, autonomy introduces systemic risks. Govern the Repository, Not the Agent: Measuring Ecosystem-Level Risk in AI-Native Software highlights that integration friction is an ecosystem property, while Agentic Hardware Design as Repository-Level Code Evolution and AgentX: Towards Agent-Driven Self-Iteration of Industrial Recommender Systems explore self-evolving systems. Finally, the security of these agents is paramount, as shown by Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs and AI Snitches Get Glitches: Towards Evading Agentic Surveillance, which underscore the dual-use risks of agentic autonomy.

Theme 4: Embodied AI and World Modeling

The next frontier is “Embodied AI”—models that understand the physical world. This requires world models that simulate physics and reason about 3D space. Perceptual 3D Simulation With Physical World Modeling and A Comprehensive Survey on World Models for Embodied AI define the state of the art in simulating physical dynamics.

Spatial intelligence is being pushed by SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation and AirGroundBench: Probing Spatial Intelligence in Multimodal Large Models under Heterogeneous Multi-View Embodied Collaboration, which test model robustness in real-world conditions. Furthermore, PhysChoreo: Physics-Controllable Video Generation with Part-Aware Semantic Grounding and Directing the World: Fast Autoregressive Video Generation with Compositional Human-Camera Control represent a shift toward controllable generation constrained by physical laws.

Theme 5: Scientific Machine Learning & Physics-Informed Models

Scientific machine learning is increasingly moving toward models that respect physical laws by construction. LieSolver: PDE-Constrained Learning for IBVPs via Lie Symmetries and Higher-Order Fourier Neural Operator: Explicit Mode Mixer for Nonlinear PDEs provide superior inductive biases for complex physical systems.

In nuclear physics, Bridging Ab Initio Symmetries and Global Nuclear Masses with Interpretable Neural Networks recovers fundamental symmetries from data, while Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte demonstrates how transfer learning adapts these models to new chemistries, maintaining electrochemical consistency while reducing training time.

Theme 6: Robustness, Fairness, & Alignment

Ensuring models are robust and fair requires integrating these goals into the training and inference processes. Productionized Fairness Measurement Under Privacy Constraints addresses fairness under privacy, while PEBS: Per-rater Empirical-Bayes Shrinkage for RLHF Reward-Model Calibration calibrates reward models to individual annotators.

Adversarial robustness is tackled by Halt Fast! Early Stopping for Certified Robustness and NormGuard: Reward-Preserving Norm Constraints in Flow-Matching Reinforcement Learning, which prevent performance degradation during fine-tuning. These works emphasize that robustness and alignment are not post-hoc fixes, but essential components of the model lifecycle.