Saved in:
| Main Authors: | Wu, Tsung-Han, Biamby, Giscard, Quenum, Jerome, Gupta, Ritwik, Gonzalez, Joseph E., Darrell, Trevor, Chan, David M. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.13766 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning
by: Niu, Dantong, et al.
Published: (2024)
by: Niu, Dantong, et al.
Published: (2024)
Needle In A Multimodal Haystack
by: Wang, Weiyun, et al.
Published: (2024)
by: Wang, Weiyun, et al.
Published: (2024)
Two Causally Related Needles in a Video Haystack
by: Li, Miaoyu, et al.
Published: (2025)
by: Li, Miaoyu, et al.
Published: (2025)
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
by: Quenum, Jerome, et al.
Published: (2025)
by: Quenum, Jerome, et al.
Published: (2025)
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
by: Zhao, Zijia, et al.
Published: (2024)
by: Zhao, Zijia, et al.
Published: (2024)
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
by: Wu, Tsung-Han, et al.
Published: (2025)
by: Wu, Tsung-Han, et al.
Published: (2025)
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
by: Sharma, Aditya, et al.
Published: (2024)
by: Sharma, Aditya, et al.
Published: (2024)
REOrdering Patches Improves Vision Models
by: Kutscher, Declan, et al.
Published: (2025)
by: Kutscher, Declan, et al.
Published: (2025)
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
by: Chowdhury, Sanjoy, et al.
Published: (2025)
by: Chowdhury, Sanjoy, et al.
Published: (2025)
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
by: Wang, Hengyi, et al.
Published: (2024)
by: Wang, Hengyi, et al.
Published: (2024)
Needle in a Haystack: One-Class Representation Learning for Detecting Rare Malignant Cells in Computational Cytology
by: Chatterjee, Swarnadip, et al.
Published: (2026)
by: Chatterjee, Swarnadip, et al.
Published: (2026)
Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark
by: Huybrechts, Goeric, et al.
Published: (2025)
by: Huybrechts, Goeric, et al.
Published: (2025)
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents
by: Chen, Jun, et al.
Published: (2024)
by: Chen, Jun, et al.
Published: (2024)
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint
by: Lee, Heekyung, et al.
Published: (2025)
by: Lee, Heekyung, et al.
Published: (2025)
MultiHaystack: Benchmarking Multimodal Retrieval and Reasoning over 40K Images, Videos, and Documents
by: Xu, Dannong, et al.
Published: (2026)
by: Xu, Dannong, et al.
Published: (2026)
Multi-axis Analysis of Image Manipulation Localization
by: Nichols, Keanu, et al.
Published: (2026)
by: Nichols, Keanu, et al.
Published: (2026)
Reasoning on Multiple Needles In A Haystack
by: Wang, Yidong
Published: (2025)
by: Wang, Yidong
Published: (2025)
Finding Outliers in a Haystack: Anomaly Detection for Large Pointcloud Scenes
by: Faulkner, Ryan, et al.
Published: (2025)
by: Faulkner, Ryan, et al.
Published: (2025)
xT: Nested Tokenization for Larger Context in Large Images
by: Gupta, Ritwik, et al.
Published: (2024)
by: Gupta, Ritwik, et al.
Published: (2024)
Looking for the Information Needle in the Internet Haystack.
by: Clausen, Helge
Published: (1996)
by: Clausen, Helge
Published: (1996)
Finding BSM Needles in Electromagnetic Haystacks at DUNE
by: Brdar, Vedran, et al.
Published: (2025)
by: Brdar, Vedran, et al.
Published: (2025)
Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts
by: Yu, Yifei, et al.
Published: (2025)
by: Yu, Yifei, et al.
Published: (2025)
ALOHa: A New Measure for Hallucination in Captioning Models
by: Petryk, Suzanne, et al.
Published: (2024)
by: Petryk, Suzanne, et al.
Published: (2024)
Needle in the Haystack for Memory Based Large Language Models
by: Nelson, Elliot, et al.
Published: (2024)
by: Nelson, Elliot, et al.
Published: (2024)
Finding Interest Needle in Popularity Haystack: Improving Retrieval by Modeling Item Exposure
by: Agarwal, Rahul, et al.
Published: (2025)
by: Agarwal, Rahul, et al.
Published: (2025)
When Do We Not Need Larger Vision Models?
by: Shi, Baifeng, et al.
Published: (2024)
by: Shi, Baifeng, et al.
Published: (2024)
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions
by: Wu, Tsung-Han, et al.
Published: (2024)
by: Wu, Tsung-Han, et al.
Published: (2024)
Vision-Language Models Create Cross-Modal Task Representations
by: Luo, Grace, et al.
Published: (2024)
by: Luo, Grace, et al.
Published: (2024)
Jailbreaking in the Haystack
by: Shah, Rishi Rajesh, et al.
Published: (2025)
by: Shah, Rishi Rajesh, et al.
Published: (2025)
VisionArena: 230K Real World User-VLM Conversations with Preference Labels
by: Chou, Christopher, et al.
Published: (2024)
by: Chou, Christopher, et al.
Published: (2024)
Analyzing The Language of Visual Tokens
by: Chan, David M., et al.
Published: (2024)
by: Chan, David M., et al.
Published: (2024)
DENIAHL: In-Context Features Influence LLM Needle-In-A-Haystack Abilities
by: Dai, Hui, et al.
Published: (2024)
by: Dai, Hui, et al.
Published: (2024)
Finding Visual Task Vectors
by: Hojel, Alberto, et al.
Published: (2024)
by: Hojel, Alberto, et al.
Published: (2024)
GOLD PANNING: Strategic Context Shuffling for Needle-in-Haystack Reasoning
by: Byerly, Adam, et al.
Published: (2025)
by: Byerly, Adam, et al.
Published: (2025)
Hidden in the Haystack: Smaller Needles are More Difficult for LLMs to Find
by: Bianchi, Owen, et al.
Published: (2025)
by: Bianchi, Owen, et al.
Published: (2025)
Needles in Haystacks: Exploring Selective Novel Adsorbents for PFAS Treatment
by: Bryan Sadowski, et al.
Published: (2026)
by: Bryan Sadowski, et al.
Published: (2026)
DAVE: A VLM Vision Encoder for Document Understanding and Web Agents
by: Huang, Brandon, et al.
Published: (2025)
by: Huang, Brandon, et al.
Published: (2025)
Visually Prompted Benchmarks Are Surprisingly Fragile
by: Feng, Haiwen, et al.
Published: (2025)
by: Feng, Haiwen, et al.
Published: (2025)
EPIC-Bench: A Perception-Centric Benchmark for Fine-Grained Embodied Visual Grounding in Vision-Language Models
by: Shan, Haozhe, et al.
Published: (2026)
by: Shan, Haozhe, et al.
Published: (2026)
Recursive Visual Programming
by: Ge, Jiaxin, et al.
Published: (2023)
by: Ge, Jiaxin, et al.
Published: (2023)
Similar Items
-
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning
by: Niu, Dantong, et al.
Published: (2024) -
Needle In A Multimodal Haystack
by: Wang, Weiyun, et al.
Published: (2024) -
Two Causally Related Needles in a Video Haystack
by: Li, Miaoyu, et al.
Published: (2025) -
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
by: Quenum, Jerome, et al.
Published: (2025) -
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
by: Zhao, Zijia, et al.
Published: (2024)