Saved in:
| Main Authors: | Xiu, Zidi, Sun, David Q., Cheng, Kevin, Patel, Maitrik, Date, Josh, Zhang, Yizhe, Lu, Jiarui, Attia, Omar, Vemulapalli, Raviteja, Tuzel, Oncel, Cao, Meng, Bengio, Samy |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.01357 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2023)
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2023)
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining
by: Li, Jeffrey, et al.
Published: (2025)
by: Li, Jeffrey, et al.
Published: (2025)
Learning from Self Critique and Refinement for Faithful LLM Summarization
by: Hu, Ting-Yao, et al.
Published: (2025)
by: Hu, Ting-Yao, et al.
Published: (2025)
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
by: Vemulapalli, Raviteja, et al.
Published: (2023)
by: Vemulapalli, Raviteja, et al.
Published: (2023)
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
by: Mirzadeh, Iman, et al.
Published: (2024)
by: Mirzadeh, Iman, et al.
Published: (2024)
Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
by: Huang, Chen, et al.
Published: (2025)
by: Huang, Chen, et al.
Published: (2025)
TiC-CLIP: Continual Training of CLIP Models
by: Garg, Saurabh, et al.
Published: (2023)
by: Garg, Saurabh, et al.
Published: (2023)
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
by: Echterhoff, Jessica, et al.
Published: (2024)
by: Echterhoff, Jessica, et al.
Published: (2024)
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
by: Wang, Haoxiang, et al.
Published: (2023)
by: Wang, Haoxiang, et al.
Published: (2023)
FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations
by: Hsieh, Cheng-Yu, et al.
Published: (2025)
by: Hsieh, Cheng-Yu, et al.
Published: (2025)
AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding
by: Chowdhury, Sanjoy, et al.
Published: (2025)
by: Chowdhury, Sanjoy, et al.
Published: (2025)
Synth4Seg -- Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
by: Mou, Shancong, et al.
Published: (2024)
by: Mou, Shancong, et al.
Published: (2024)
Learning to Reason for Hallucination Span Detection
by: Su, Hsuan, et al.
Published: (2025)
by: Su, Hsuan, et al.
Published: (2025)
Your Interest, Your Summaries: Query-Focused Long Video Summarization
by: Patel, Nirav, et al.
Published: (2024)
by: Patel, Nirav, et al.
Published: (2024)
Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization
by: Lu, Yen-Ju, et al.
Published: (2025)
by: Lu, Yen-Ju, et al.
Published: (2025)
From green fields to green benches
by: Josh Loeb
Published: (2025)
by: Josh Loeb
Published: (2025)
CLIP with Quality Captions: A Strong Pretraining for Vision Tasks
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2024)
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2024)
LiTo: Surface Light Field Tokenization
by: Chang, Jen-Hao Rick, et al.
Published: (2026)
by: Chang, Jen-Hao Rick, et al.
Published: (2026)
Pretraining with hierarchical memories: separating long-tail and common knowledge
by: Pouransari, Hadi, et al.
Published: (2025)
by: Pouransari, Hadi, et al.
Published: (2025)
ASTRA: An Action Spotting TRAnsformer for Soccer Videos
by: Xarles, Artur, et al.
Published: (2024)
by: Xarles, Artur, et al.
Published: (2024)
LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation
by: Liu, Chang, et al.
Published: (2025)
by: Liu, Chang, et al.
Published: (2025)
3D Shape Tokenization via Latent Flow Matching
by: Chang, Jen-Hao Rick, et al.
Published: (2024)
by: Chang, Jen-Hao Rick, et al.
Published: (2024)
RayRoPE: Projective Ray Positional Encoding for Multi-view Attention
by: Wu, Yu, et al.
Published: (2026)
by: Wu, Yu, et al.
Published: (2026)
Boolformer: Symbolic Regression of Logic Functions with Transformers
by: d'Ascoli, Stéphane, et al.
Published: (2023)
by: d'Ascoli, Stéphane, et al.
Published: (2023)
Protected group bias and stereotypes in Large Language Models
by: Kotek, Hadas, et al.
Published: (2024)
by: Kotek, Hadas, et al.
Published: (2024)
ASTRA: A Negotiation Agent with Adaptive and Strategic Reasoning via Tool-integrated Action for Dynamic Offer Optimization
by: Kwon, Deuksin, et al.
Published: (2025)
by: Kwon, Deuksin, et al.
Published: (2025)
What Makes the Preferred Thinking Direction for LLMs in Multiple-choice Questions?
by: Zhang, Yizhe, et al.
Published: (2025)
by: Zhang, Yizhe, et al.
Published: (2025)
$τ$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
by: Yao, Shunyu, et al.
Published: (2024)
by: Yao, Shunyu, et al.
Published: (2024)
AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking
by: Gao, Silin, et al.
Published: (2025)
by: Gao, Silin, et al.
Published: (2025)
Generalization on the Unseen, Logic Reasoning and Degree Curriculum
by: Abbe, Emmanuel, et al.
Published: (2023)
by: Abbe, Emmanuel, et al.
Published: (2023)
Velox: Learning Representations of 4D Geometry and Appearance
by: Malik, Anagh, et al.
Published: (2026)
by: Malik, Anagh, et al.
Published: (2026)
El uso de las redes sociales y la cultura popular para una mejor comprensión intercultural
by: Sait Tuzel
Published: (2017)
by: Sait Tuzel
Published: (2017)
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
by: Lu, Jiarui, et al.
Published: (2024)
by: Lu, Jiarui, et al.
Published: (2024)
MobileCLIP2: Improving Multi-Modal Reinforced Training
by: Faghri, Fartash, et al.
Published: (2025)
by: Faghri, Fartash, et al.
Published: (2025)
Bounds on Successive Minima of Orders in Number Fields and Scrollar Invariants of Curves
by: Vemulapalli, Sameera
Published: (2022)
by: Vemulapalli, Sameera
Published: (2022)
The distribution of lattices arising from orders in low degree number fields
by: Vemulapalli, Sameera
Published: (2024)
by: Vemulapalli, Sameera
Published: (2024)
The Steinitz Realization Problem
by: Vemulapalli, Sameera
Published: (2024)
by: Vemulapalli, Sameera
Published: (2024)
The Role of GDPs in Oral Surgery Delivery
by: Vidwat Vemulapalli
Published: (2025)
by: Vidwat Vemulapalli
Published: (2025)
CL-bench: A Benchmark for Context Learning
by: Dou, Shihan, et al.
Published: (2026)
by: Dou, Shihan, et al.
Published: (2026)
On the Way to LLM Personalization: Learning to Remember User Conversations
by: Magister, Lucie Charlotte, et al.
Published: (2024)
by: Magister, Lucie Charlotte, et al.
Published: (2024)
Similar Items
-
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2023) -
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining
by: Li, Jeffrey, et al.
Published: (2025) -
Learning from Self Critique and Refinement for Faithful LLM Summarization
by: Hu, Ting-Yao, et al.
Published: (2025) -
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
by: Vemulapalli, Raviteja, et al.
Published: (2023) -
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
by: Mirzadeh, Iman, et al.
Published: (2024)