Saved in:
| Main Authors: | Milano, Nicola, Nolfi, Stefano |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.22948 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sensory-Motor Control with Large Language Models via Iterative Policy Refinement
by: Carvalho, Jônata Tyska, et al.
Published: (2025)
by: Carvalho, Jônata Tyska, et al.
Published: (2025)
Large Language Models as Simulative Agents for Neurodivergent Adult Psychometric Profiles
by: Chiappone, Francesco, et al.
Published: (2026)
by: Chiappone, Francesco, et al.
Published: (2026)
Comparing Human Expertise and Large Language Models Embeddings in Content Validity Assessment of Personality Tests
by: Milano, Nicola, et al.
Published: (2025)
by: Milano, Nicola, et al.
Published: (2025)
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models
by: Kim, Dongyoung, et al.
Published: (2026)
by: Kim, Dongyoung, et al.
Published: (2026)
Information-Theoretic Constraints for Continual Vision-Language-Action Alignment
by: Zhao, Libang, et al.
Published: (2026)
by: Zhao, Libang, et al.
Published: (2026)
Panoptic Vision-Language Feature Fields
by: Chen, Haoran, et al.
Published: (2023)
by: Chen, Haoran, et al.
Published: (2023)
ROSA: Harnessing Robot States for Vision-Language and Action Alignment
by: Wen, Yuqing, et al.
Published: (2025)
by: Wen, Yuqing, et al.
Published: (2025)
FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies
by: Hu, Xintong, et al.
Published: (2026)
by: Hu, Xintong, et al.
Published: (2026)
VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing
by: Shi, Haoyuan, et al.
Published: (2026)
by: Shi, Haoyuan, et al.
Published: (2026)
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
by: Zhang, Borong, et al.
Published: (2025)
by: Zhang, Borong, et al.
Published: (2025)
When Alignment Fails: Multimodal Adversarial Attacks on Vision-Language-Action Models
by: Yan, Yuping, et al.
Published: (2025)
by: Yan, Yuping, et al.
Published: (2025)
Action Hallucination in Generative Vision-Language-Action Models
by: Soh, Harold, et al.
Published: (2026)
by: Soh, Harold, et al.
Published: (2026)
Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
by: Wu, Yilin, et al.
Published: (2025)
by: Wu, Yilin, et al.
Published: (2025)
Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces
by: Yashwante, Pratham, et al.
Published: (2026)
by: Yashwante, Pratham, et al.
Published: (2026)
Recursive Belief Vision Language Action Models
by: Bagaria, Vaidehi, et al.
Published: (2026)
by: Bagaria, Vaidehi, et al.
Published: (2026)
Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning
by: Qi, Xiuxiu, et al.
Published: (2025)
by: Qi, Xiuxiu, et al.
Published: (2025)
ROCKET: Residual-Oriented Multi-Layer Alignment for Spatially-Aware Vision-Language-Action Models
by: Sun, Guoheng, et al.
Published: (2026)
by: Sun, Guoheng, et al.
Published: (2026)
Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment
by: Kwok, Jacky, et al.
Published: (2026)
by: Kwok, Jacky, et al.
Published: (2026)
Action-aware Dynamic Pruning for Efficient Vision-Language-Action Manipulation
by: Pei, Xiaohuan, et al.
Published: (2025)
by: Pei, Xiaohuan, et al.
Published: (2025)
LVDrive: Latent Visual Representation Enhanced Vision-Language-Action Autonomous Driving Model
by: Mei, Xiaodong, et al.
Published: (2026)
by: Mei, Xiaodong, et al.
Published: (2026)
Enhancing Generalization in Vision-Language-Action Models by Preserving Pretrained Representations
by: Grover, Shresth, et al.
Published: (2025)
by: Grover, Shresth, et al.
Published: (2025)
Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models
by: Yang, Huoren, et al.
Published: (2026)
by: Yang, Huoren, et al.
Published: (2026)
AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models
by: Hu, Yutong, et al.
Published: (2026)
by: Hu, Yutong, et al.
Published: (2026)
Continuous Reasoning for Vision-Language-Action
by: Wu, Yueh-Hua, et al.
Published: (2026)
by: Wu, Yueh-Hua, et al.
Published: (2026)
ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge
by: Dai, Yuntao, et al.
Published: (2025)
by: Dai, Yuntao, et al.
Published: (2025)
VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models
by: Gao, Chongkai, et al.
Published: (2025)
by: Gao, Chongkai, et al.
Published: (2025)
Auditing Disability Representation in Vision-Language Models
by: Panda, Srikant, et al.
Published: (2026)
by: Panda, Srikant, et al.
Published: (2026)
SpatialFly: Geometry-Guided Representation Alignment for UAV Vision-and-Language Navigation in Urban Environments
by: Jiang, Wen, et al.
Published: (2026)
by: Jiang, Wen, et al.
Published: (2026)
Adversarial Attacks on Robotic Vision Language Action Models
by: Jones, Eliot Krzysztof, et al.
Published: (2025)
by: Jones, Eliot Krzysztof, et al.
Published: (2025)
Survey of Vision-Language-Action Models for Embodied Manipulation
by: Li, Haoran, et al.
Published: (2025)
by: Li, Haoran, et al.
Published: (2025)
Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning
by: Shen, Weijie, et al.
Published: (2025)
by: Shen, Weijie, et al.
Published: (2025)
Enhance Vision-Language Alignment with Noise
by: Huang, Sida, et al.
Published: (2024)
by: Huang, Sida, et al.
Published: (2024)
Safety Alignment for Vision Language Models
by: Liu, Zhendong, et al.
Published: (2024)
by: Liu, Zhendong, et al.
Published: (2024)
GaussianVision: Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting
by: Omri, Yasmine, et al.
Published: (2025)
by: Omri, Yasmine, et al.
Published: (2025)
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
by: Wang, Zehao, et al.
Published: (2026)
by: Wang, Zehao, et al.
Published: (2026)
Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces
by: Botteghi, Nicolò, et al.
Published: (2021)
by: Botteghi, Nicolò, et al.
Published: (2021)
ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training
by: Yang, Rushuai, et al.
Published: (2026)
by: Yang, Rushuai, et al.
Published: (2026)
Adaptive Capacity Allocation for Vision Language Action Fine-tuning
by: Kim, Donghoon, et al.
Published: (2026)
by: Kim, Donghoon, et al.
Published: (2026)
Event-Grounded Sparse Autoencoders for Vision-Language-Action Policies
by: Jin, Xinchen, et al.
Published: (2026)
by: Jin, Xinchen, et al.
Published: (2026)
Mean-Flow based One-Step Vision-Language-Action
by: Chen, Yang, et al.
Published: (2026)
by: Chen, Yang, et al.
Published: (2026)
Similar Items
-
Sensory-Motor Control with Large Language Models via Iterative Policy Refinement
by: Carvalho, Jônata Tyska, et al.
Published: (2025) -
Large Language Models as Simulative Agents for Neurodivergent Adult Psychometric Profiles
by: Chiappone, Francesco, et al.
Published: (2026) -
Comparing Human Expertise and Large Language Models Embeddings in Content Validity Assessment of Personality Tests
by: Milano, Nicola, et al.
Published: (2025) -
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models
by: Kim, Dongyoung, et al.
Published: (2026) -
Information-Theoretic Constraints for Continual Vision-Language-Action Alignment
by: Zhao, Libang, et al.
Published: (2026)