Saved in:
| Main Authors: | Tai, Yintao, Liao, Xiyang, Suglia, Alessandro, Vergari, Antonio |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.03321 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MIXAR: Scaling Autoregressive Pixel-based Language Models to Multiple Languages and Scripts
by: Hu, Chen, et al.
Published: (2026)
by: Hu, Chen, et al.
Published: (2026)
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models
by: Chiyah-Garcia, Javier, et al.
Published: (2024)
by: Chiyah-Garcia, Javier, et al.
Published: (2024)
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
by: McCallum, Sabrina, et al.
Published: (2025)
by: McCallum, Sabrina, et al.
Published: (2025)
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts
by: Nikandrou, Malvina, et al.
Published: (2024)
by: Nikandrou, Malvina, et al.
Published: (2024)
Towards Logically Consistent Language Models via Probabilistic Reasoning
by: Calanzone, Diego, et al.
Published: (2024)
by: Calanzone, Diego, et al.
Published: (2024)
Logically Consistent Language Models via Neuro-Symbolic Integration
by: Calanzone, Diego, et al.
Published: (2024)
by: Calanzone, Diego, et al.
Published: (2024)
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks
by: Pantazopoulos, Georgios, et al.
Published: (2024)
by: Pantazopoulos, Georgios, et al.
Published: (2024)
Language Shapes Mental Health Evaluations in Large Language Models
by: Xu, Jiayi, et al.
Published: (2026)
by: Xu, Jiayi, et al.
Published: (2026)
On the Power of Decision Trees in Auto-Regressive Language Modeling
by: Gan, Yulu, et al.
Published: (2024)
by: Gan, Yulu, et al.
Published: (2024)
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
by: Parekh, Amit, et al.
Published: (2024)
by: Parekh, Amit, et al.
Published: (2024)
What can Large Language Models Capture about Code Functional Equivalence?
by: Maveli, Nickil, et al.
Published: (2024)
by: Maveli, Nickil, et al.
Published: (2024)
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling
by: Ren, Siyu, et al.
Published: (2023)
by: Ren, Siyu, et al.
Published: (2023)
Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines
by: Hu, Xiyang
Published: (2025)
by: Hu, Xiyang
Published: (2025)
Correlation Dimension of Auto-Regressive Large Language Models
by: Du, Xin, et al.
Published: (2025)
by: Du, Xin, et al.
Published: (2025)
Lossless Vocabulary Reduction for Auto-Regressive Language Models
by: Chijiwa, Daiki, et al.
Published: (2025)
by: Chijiwa, Daiki, et al.
Published: (2025)
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
by: Wu, Xiyang, et al.
Published: (2024)
by: Wu, Xiyang, et al.
Published: (2024)
Efficient Context Propagating Perceiver Architectures for Auto-Regressive Language Modeling
by: Mahmood, Kaleel, et al.
Published: (2024)
by: Mahmood, Kaleel, et al.
Published: (2024)
Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning
by: Lovelace, Justin, et al.
Published: (2026)
by: Lovelace, Justin, et al.
Published: (2026)
APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
by: Liu, Mingdao, et al.
Published: (2024)
by: Liu, Mingdao, et al.
Published: (2024)
CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding
by: Huo, Jiahao, et al.
Published: (2026)
by: Huo, Jiahao, et al.
Published: (2026)
Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models
by: Liu, Xiyu, et al.
Published: (2024)
by: Liu, Xiyu, et al.
Published: (2024)
Value-Action Alignment in Large Language Models under Privacy-Prosocial Conflict
by: Chen, Guanyu, et al.
Published: (2026)
by: Chen, Guanyu, et al.
Published: (2026)
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers
by: Pantazopoulos, Georgios, et al.
Published: (2024)
by: Pantazopoulos, Georgios, et al.
Published: (2024)
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
by: Saxena, Rohit, et al.
Published: (2026)
by: Saxena, Rohit, et al.
Published: (2026)
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
by: Pantazopoulos, Georgios, et al.
Published: (2024)
by: Pantazopoulos, Georgios, et al.
Published: (2024)
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding
by: Suglia, Alessandro, et al.
Published: (2024)
by: Suglia, Alessandro, et al.
Published: (2024)
Multilingual Pretraining for Pixel Language Models
by: Kesen, Ilker, et al.
Published: (2025)
by: Kesen, Ilker, et al.
Published: (2025)
DPad: Efficient Diffusion Language Models with Suffix Dropout
by: Chen, Xinhua, et al.
Published: (2025)
by: Chen, Xinhua, et al.
Published: (2025)
Cross-modal Consistency Guidance for Robust Emotion Control in Auto-Regressive TTS Models
by: Peng, Yizhou, et al.
Published: (2025)
by: Peng, Yizhou, et al.
Published: (2025)
Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
by: Momentè, Filippo, et al.
Published: (2025)
by: Momentè, Filippo, et al.
Published: (2025)
SEMMA: A Semantic Aware Knowledge Graph Foundation Model
by: Arun, Arvindh, et al.
Published: (2025)
by: Arun, Arvindh, et al.
Published: (2025)
Auto-Regressive Next-Token Predictors are Universal Learners
by: Malach, Eran
Published: (2023)
by: Malach, Eran
Published: (2023)
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
by: Wang, Haozhe, et al.
Published: (2025)
by: Wang, Haozhe, et al.
Published: (2025)
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models
by: Tatariya, Kushal, et al.
Published: (2024)
by: Tatariya, Kushal, et al.
Published: (2024)
Pixels to Principles: Probing Intuitive Physics Understanding in Multimodal Language Models
by: Ballout, Mohamad, et al.
Published: (2025)
by: Ballout, Mohamad, et al.
Published: (2025)
Evaluating Pixel Language Models on Non-Standardized Languages
by: Muñoz-Ortiz, Alberto, et al.
Published: (2024)
by: Muñoz-Ortiz, Alberto, et al.
Published: (2024)
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
by: Luo, Feng, et al.
Published: (2025)
by: Luo, Feng, et al.
Published: (2025)
AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)
by: Aggarwal, Pranjal, et al.
Published: (2023)
Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation
by: Tan, Xingwei, et al.
Published: (2024)
by: Tan, Xingwei, et al.
Published: (2024)
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models
by: Yu, Tian, et al.
Published: (2024)
by: Yu, Tian, et al.
Published: (2024)
Similar Items
-
MIXAR: Scaling Autoregressive Pixel-based Language Models to Multiple Languages and Scripts
by: Hu, Chen, et al.
Published: (2026) -
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models
by: Chiyah-Garcia, Javier, et al.
Published: (2024) -
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
by: McCallum, Sabrina, et al.
Published: (2025) -
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts
by: Nikandrou, Malvina, et al.
Published: (2024) -
Towards Logically Consistent Language Models via Probabilistic Reasoning
by: Calanzone, Diego, et al.
Published: (2024)