Saved in:
| Main Authors: | Song, Alan Z., Chen, Yinjie, Nan, Mu, Zhang, Rui, Cao, Jiahang, Mai, Weijian, Yu, Muquan, Adeli, Hossein, Ramanan, Deva, Tarr, Michael J., Luo, Andrew F. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.12491 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding
by: Nan, Mu, et al.
Published: (2026)
by: Nan, Mu, et al.
Published: (2026)
Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex
by: Yu, Muquan, et al.
Published: (2025)
by: Yu, Muquan, et al.
Published: (2025)
NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
by: Mai, Weijian, et al.
Published: (2026)
by: Mai, Weijian, et al.
Published: (2026)
Reanimating Images using Neural Representations of Dynamic Stimuli
by: Yeung, Jacob, et al.
Published: (2024)
by: Yeung, Jacob, et al.
Published: (2024)
Human-like Object Grouping in Self-supervised Vision Transformers
by: Adeli, Hossein, et al.
Published: (2026)
by: Adeli, Hossein, et al.
Published: (2026)
Predicting Long-horizon Futures by Conditioning on Geometry and Time
by: Khurana, Tarasha, et al.
Published: (2024)
by: Khurana, Tarasha, et al.
Published: (2024)
Revisiting Few-Shot Object Detection with Vision-Language Models
by: Madan, Anish, et al.
Published: (2023)
by: Madan, Anish, et al.
Published: (2023)
Vision Transformers with Self-Distilled Registers
by: Chen, Yinjie, et al.
Published: (2025)
by: Chen, Yinjie, et al.
Published: (2025)
Revisiting the Role of Language Priors in Vision-Language Models
by: Lin, Zhiqiu, et al.
Published: (2023)
by: Lin, Zhiqiu, et al.
Published: (2023)
Using Diffusion Priors for Video Amodal Segmentation
by: Chen, Kaihua, et al.
Published: (2024)
by: Chen, Kaihua, et al.
Published: (2024)
RefAV: Towards Planning-Centric Scenario Mining
by: Davidson, Cainan, et al.
Published: (2025)
by: Davidson, Cainan, et al.
Published: (2025)
Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos
by: Chen, Kaihua, et al.
Published: (2025)
by: Chen, Kaihua, et al.
Published: (2025)
Depth-supervised NeRF: Fewer Views and Faster Training for Free
by: Deng, Kangle, et al.
Published: (2021)
by: Deng, Kangle, et al.
Published: (2021)
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers
by: Robinson, Isaac, et al.
Published: (2025)
by: Robinson, Isaac, et al.
Published: (2025)
Transformer brain encoders explain human high-level visual responses
by: Adeli, Hossein, et al.
Published: (2025)
by: Adeli, Hossein, et al.
Published: (2025)
Explaining state constitutional changes
by: G. Alan Tarr
Published: (2016)
by: G. Alan Tarr
Published: (2016)
NAFTA and Federalism : Are they compatible? / G. Alan Tarr
by: Tarr, G. Alan
by: Tarr, G. Alan
Comprendiendo las constituciones estatales / G. Alan Tarr ; traducción Daniel A. Barceló Rojas
by: Tarr, G. Alan
Published: (2009)
by: Tarr, G. Alan
Published: (2009)
NAFTA and Federalism : Are they compatible? / G. Alan Tarr
by: Tarr, G. Alan
Published: (2007)
by: Tarr, G. Alan
Published: (2007)
Judicial federalism in the United States: structure, jurisdiction and operation
by: G. Alan Tarr
Published: (2015)
by: G. Alan Tarr
Published: (2015)
Less is More Tokens: Efficient Math Reasoning via Difficulty-Aware Chain-of-Thought Distillation
by: Waheed, Abdul, et al.
Published: (2025)
by: Waheed, Abdul, et al.
Published: (2025)
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition
by: Cao, Jiahang, et al.
Published: (2025)
by: Cao, Jiahang, et al.
Published: (2025)
Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection
by: Khurana, Mehar, et al.
Published: (2024)
by: Khurana, Mehar, et al.
Published: (2024)
SMORE: Simultaneous Map and Object REconstruction
by: Chodosh, Nathaniel, et al.
Published: (2024)
by: Chodosh, Nathaniel, et al.
Published: (2024)
Evaluating a VR System for Collecting Safety-Critical Vehicle-Pedestrian Interactions
by: Weng, Erica, et al.
Published: (2023)
by: Weng, Erica, et al.
Published: (2023)
Language Models as Black-Box Optimizers for Vision-Language Models
by: Liu, Shihong, et al.
Published: (2023)
by: Liu, Shihong, et al.
Published: (2023)
In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain
by: Hwang, Ethan, et al.
Published: (2025)
by: Hwang, Ethan, et al.
Published: (2025)
Planning with Adaptive World Models for Autonomous Driving
by: Vasudevan, Arun Balajee, et al.
Published: (2024)
by: Vasudevan, Arun Balajee, et al.
Published: (2024)
The Neglected Tails in Vision-Language Models
by: Parashar, Shubham, et al.
Published: (2024)
by: Parashar, Shubham, et al.
Published: (2024)
Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models
by: Robicheaux, Peter, et al.
Published: (2025)
by: Robicheaux, Peter, et al.
Published: (2025)
ZeroFlow: Scalable Scene Flow via Distillation
by: Vedder, Kyle, et al.
Published: (2023)
by: Vedder, Kyle, et al.
Published: (2023)
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
PAI-Bench: A Comprehensive Benchmark For Physical AI
by: Zhou, Fengzhe, et al.
Published: (2025)
by: Zhou, Fengzhe, et al.
Published: (2025)
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
by: Lin, Zhiqiu, et al.
Published: (2023)
by: Lin, Zhiqiu, et al.
Published: (2023)
I Can't Believe It's Not Scene Flow!
by: Khatri, Ishan, et al.
Published: (2024)
by: Khatri, Ishan, et al.
Published: (2024)
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
by: Tan, Jeff, et al.
Published: (2024)
by: Tan, Jeff, et al.
Published: (2024)
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
by: Vuong, Khiem, et al.
Published: (2025)
by: Vuong, Khiem, et al.
Published: (2025)
Novel View Synthesis as Video Completion
by: Wu, Qi, et al.
Published: (2026)
by: Wu, Qi, et al.
Published: (2026)
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
by: Luo, Andrew F., et al.
Published: (2024)
by: Luo, Andrew F., et al.
Published: (2024)
Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
by: Mitra, Chancharik, et al.
Published: (2025)
by: Mitra, Chancharik, et al.
Published: (2025)
Similar Items
-
Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding
by: Nan, Mu, et al.
Published: (2026) -
Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex
by: Yu, Muquan, et al.
Published: (2025) -
NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
by: Mai, Weijian, et al.
Published: (2026) -
Reanimating Images using Neural Representations of Dynamic Stimuli
by: Yeung, Jacob, et al.
Published: (2024) -
Human-like Object Grouping in Self-supervised Vision Transformers
by: Adeli, Hossein, et al.
Published: (2026)