Saved in:
| Main Authors: | Roberts, Jonathan, Taesiri, Mohammad Reza, Sharma, Ansh, Gupta, Akash, Roberts, Samuel, Croitoru, Ioana, Bogolin, Simion-Vlad, Tang, Jialu, Langer, Florian, Raina, Vyas, Raina, Vatsal, Xiong, Hanyi, Udandarao, Vishaal, Lu, Jingyi, Chen, Shiyang, Purkis, Sam, Yan, Tianshuo, Lin, Wenye, Shin, Gyungin, Yang, Qiaochu, Nguyen, Anh Totti, Atkinson, David I., Baranwal, Aaditya, Coca, Alexandru, Dang, Mikah, Dziadzio, Sebastian, Kunz, Jakob D., Liang, Kaiqu, Lo, Alexander, Pulfer, Brian, Walton, Steven, Yang, Charig, Han, Kai, Albanie, Samuel |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.09696 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
by: Ghosh, Adhiraj, et al.
Published: (2024)
by: Ghosh, Adhiraj, et al.
Published: (2024)
How to Merge Your Multimodal Models Over Time?
by: Dziadzio, Sebastian, et al.
Published: (2024)
by: Dziadzio, Sebastian, et al.
Published: (2024)
A Practitioner's Guide to Continual Multimodal Pretraining
by: Roth, Karsten, et al.
Published: (2024)
by: Roth, Karsten, et al.
Published: (2024)
A Good CREPE needs more than just Sugar: Investigating Biases in Compositional Vision-Language Benchmarks
by: Udandarao, Vishaal, et al.
Published: (2025)
by: Udandarao, Vishaal, et al.
Published: (2025)
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
by: Hochlehnert, Andreas, et al.
Published: (2025)
by: Hochlehnert, Andreas, et al.
Published: (2025)
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
by: Prabhu, Ameya, et al.
Published: (2024)
by: Prabhu, Ameya, et al.
Published: (2024)
GAMEBoT: Transparent Assessment of LLM Reasoning in Games
by: Lin, Wenye, et al.
Published: (2024)
by: Lin, Wenye, et al.
Published: (2024)
How Long Is a Piece of String? A Brief Empirical Analysis of Tokenizers
by: Roberts, Jonathan, et al.
Published: (2026)
by: Roberts, Jonathan, et al.
Published: (2026)
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?
by: Roberts, Jonathan, et al.
Published: (2024)
by: Roberts, Jonathan, et al.
Published: (2024)
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
by: Roberts, Jonathan, et al.
Published: (2024)
by: Roberts, Jonathan, et al.
Published: (2024)
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
by: Udandarao, Vishaal, et al.
Published: (2024)
by: Udandarao, Vishaal, et al.
Published: (2024)
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
by: Roberts, Jonathan, et al.
Published: (2024)
by: Roberts, Jonathan, et al.
Published: (2024)
Active Data Curation Effectively Distills Large-Scale Multimodal Models
by: Udandarao, Vishaal, et al.
Published: (2024)
by: Udandarao, Vishaal, et al.
Published: (2024)
Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs
by: Roberts, Jonathan, et al.
Published: (2023)
by: Roberts, Jonathan, et al.
Published: (2023)
MEMORIAS E HISTORIOGRAFÍA EN TORNO AL DEBATE POR LA “VIOLENCIA POLÍTICA” EN LA ARGENTINA, 2003-2013
by: Andrea Raina
Published: (2016)
by: Andrea Raina
Published: (2016)
Chapter 12 Kashmir’s Crafts Women
by: Raina, Neelam
Published: (2024)
by: Raina, Neelam
Published: (2024)
Logit Disagreement: OoD Detection with Bayesian Neural Networks
by: Raina, Kevin
Published: (2025)
by: Raina, Kevin
Published: (2025)
El mito político de la RDA
by: Raina Zimmering
Published: (2001)
by: Raina Zimmering
Published: (2001)
B-score: Detecting biases in large language models using response history
by: Vo, An, et al.
Published: (2025)
by: Vo, An, et al.
Published: (2025)
anguyen8/vision-llms-are-blind: official
by: Pooyan R, et al.
Published: (2026)
by: Pooyan R, et al.
Published: (2026)
Vision language models are blind: Failing to translate detailed visual features into words
by: Rahmanzadehgervi, Pooyan, et al.
Published: (2024)
by: Rahmanzadehgervi, Pooyan, et al.
Published: (2024)
PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans
by: Giang, et al.
Published: (2023)
by: Giang, et al.
Published: (2023)
Implicación de los valores éticos y morales en el enfrentamiento de las adicciones en jóvenes
by: Daniel Gutierrez Raina
Published: (2008)
by: Daniel Gutierrez Raina
Published: (2008)
Environmental microbiology // Raina M. Maier, Ian L. Pepper, Charles P. Gerba
by: Maier, Raina M
Published: (2009)
by: Maier, Raina M
Published: (2009)
“Transgender Marginalization and Exclusion”: A Study of Arundhati Roy’s The Ministry of Utmost Happiness
by: Javeed Ahmad Raina
Published: (2017)
by: Javeed Ahmad Raina
Published: (2017)
Knock, Knock. Who's There? On the Security of LG's Knock Codes
by: Samuel, Raina, et al.
Published: (2020)
by: Samuel, Raina, et al.
Published: (2020)
Florence Nightingale, la vigencia de un legado
by: Daniel Ernesto Gutiérrez Raina
Published: (2021)
by: Daniel Ernesto Gutiérrez Raina
Published: (2021)
Question-Based Retrieval using Atomic Units for Enterprise RAG
by: Raina, Vatsal, et al.
Published: (2024)
by: Raina, Vatsal, et al.
Published: (2024)
Decoding Quantum LDPC Codes using Collaborative Check Node Removal
by: Bhattacharyya, Mainak, et al.
Published: (2025)
by: Bhattacharyya, Mainak, et al.
Published: (2025)
A Provably Secure Framework for Noise-Aware Delegated Quantum Computation and Storage
by: Gupta, Sanidhya, et al.
Published: (2024)
by: Gupta, Sanidhya, et al.
Published: (2024)
Question Difficulty Ranking for Multiple-Choice Reading Comprehension
by: Raina, Vatsal, et al.
Published: (2024)
by: Raina, Vatsal, et al.
Published: (2024)
Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
by: Raina, Kevin, et al.
Published: (2025)
by: Raina, Kevin, et al.
Published: (2025)
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models
by: Raina, Vyas, et al.
Published: (2024)
by: Raina, Vyas, et al.
Published: (2024)
Design Considerations Based on Stability for a Class of TCP Algorithms
by: Prabhakar, Sreekanth, et al.
Published: (2025)
by: Prabhakar, Sreekanth, et al.
Published: (2025)
Quantum Approximation Optimization Algorithm for the Trellis based Viterbi Decoding of Classical Error Correcting Codes
by: Bhattacharyya, Mainak, et al.
Published: (2023)
by: Bhattacharyya, Mainak, et al.
Published: (2023)
SyncNet: correlating objective for time delay estimation in audio signals
by: Raina, Akshay, et al.
Published: (2022)
by: Raina, Akshay, et al.
Published: (2022)
Pulse Shaping for Superconducting Qubits
by: Patra, Animesh, et al.
Published: (2026)
by: Patra, Animesh, et al.
Published: (2026)
Fault-tolerance of the [[8,1,3]] non-CSS code
by: Maheshwari, Pranav, et al.
Published: (2024)
by: Maheshwari, Pranav, et al.
Published: (2024)
Soft Contextualized Encoder For User Defined Text Classification
by: Maheshwari, Charu, et al.
Published: (2026)
by: Maheshwari, Charu, et al.
Published: (2026)
A Physics‐Informed Neural Network Framework for Tumor‐Immune Interactions, Metastatic Invasion, and Haptotaxis Systems
by: Aayushman Raina, et al.
Published: (2025)
by: Aayushman Raina, et al.
Published: (2025)
Similar Items
-
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
by: Ghosh, Adhiraj, et al.
Published: (2024) -
How to Merge Your Multimodal Models Over Time?
by: Dziadzio, Sebastian, et al.
Published: (2024) -
A Practitioner's Guide to Continual Multimodal Pretraining
by: Roth, Karsten, et al.
Published: (2024) -
A Good CREPE needs more than just Sugar: Investigating Biases in Compositional Vision-Language Benchmarks
by: Udandarao, Vishaal, et al.
Published: (2025) -
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
by: Hochlehnert, Andreas, et al.
Published: (2025)