:: Library Catalog

Bibliographic Details
Main Authors:	Song, Alan Z.; Chen, Yinjie; Nan, Mu; Zhang, Rui; Cao, Jiahang; Mai, Weijian; Yu, Muquan; Adeli, Hossein; Ramanan, Deva; Tarr, Michael J.; Luo, Andrew F.
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2605.12491

Similar Items

Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding
by: Nan, Mu, et al.
Published: (2026)

Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex
by: Yu, Muquan, et al.
Published: (2025)

NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
by: Mai, Weijian, et al.
Published: (2026)

Reanimating Images using Neural Representations of Dynamic Stimuli
by: Yeung, Jacob, et al.
Published: (2024)

Human-like Object Grouping in Self-supervised Vision Transformers
by: Adeli, Hossein, et al.
Published: (2026)

Predicting Long-horizon Futures by Conditioning on Geometry and Time
by: Khurana, Tarasha, et al.
Published: (2024)

Revisiting Few-Shot Object Detection with Vision-Language Models
by: Madan, Anish, et al.
Published: (2023)

Vision Transformers with Self-Distilled Registers
by: Chen, Yinjie, et al.
Published: (2025)

Revisiting the Role of Language Priors in Vision-Language Models
by: Lin, Zhiqiu, et al.
Published: (2023)

Using Diffusion Priors for Video Amodal Segmentation
by: Chen, Kaihua, et al.
Published: (2024)

RefAV: Towards Planning-Centric Scenario Mining
by: Davidson, Cainan, et al.
Published: (2025)

Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos
by: Chen, Kaihua, et al.
Published: (2025)

Depth-supervised NeRF: Fewer Views and Faster Training for Free
by: Deng, Kangle, et al.
Published: (2021)

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers
by: Robinson, Isaac, et al.
Published: (2025)

Transformer brain encoders explain human high-level visual responses
by: Adeli, Hossein, et al.
Published: (2025)

Explaining state constitutional changes
by: G. Alan Tarr
Published: (2016)

Comprendiendo las constituciones estatales / G. Alan Tarr ; traducción Daniel A. Barceló Rojas
by: Tarr, G. Alan
Published: (2009)

NAFTA and Federalism : Are they compatible? / G. Alan Tarr
by: Tarr, G. Alan
Published: (2007)

Judicial federalism in the United States: structure, jurisdiction and operation
by: G. Alan Tarr
Published: (2015)

Less is More Tokens: Efficient Math Reasoning via Difficulty-Aware Chain-of-Thought Distillation
by: Waheed, Abdul, et al.
Published: (2025)

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition
by: Cao, Jiahang, et al.
Published: (2025)

Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection
by: Khurana, Mehar, et al.
Published: (2024)

SMORE: Simultaneous Map and Object REconstruction
by: Chodosh, Nathaniel, et al.
Published: (2024)

Evaluating a VR System for Collecting Safety-Critical Vehicle-Pedestrian Interactions
by: Weng, Erica, et al.
Published: (2023)

Language Models as Black-Box Optimizers for Vision-Language Models
by: Liu, Shihong, et al.
Published: (2023)

In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain
by: Hwang, Ethan, et al.
Published: (2025)

Planning with Adaptive World Models for Autonomous Driving
by: Vasudevan, Arun Balajee, et al.
Published: (2024)

The Neglected Tails in Vision-Language Models
by: Parashar, Shubham, et al.
Published: (2024)

Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models
by: Robicheaux, Peter, et al.
Published: (2025)

ZeroFlow: Scalable Scene Flow via Distillation
by: Vedder, Kyle, et al.
Published: (2023)

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
by: Wang, Zihan, et al.
Published: (2025)

PAI-Bench: A Comprehensive Benchmark For Physical AI
by: Zhou, Fengzhe, et al.
Published: (2025)

Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
by: Lin, Zhiqiu, et al.
Published: (2023)

I Can't Believe It's Not Scene Flow!
by: Khatri, Ishan, et al.
Published: (2024)

DressRecon: Freeform 4D Human Reconstruction from Monocular Video
by: Tan, Jeff, et al.
Published: (2024)

AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
by: Vuong, Khiem, et al.
Published: (2025)

Novel View Synthesis as Video Completion
by: Wu, Qi, et al.
Published: (2026)

Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
by: Luo, Andrew F., et al.
Published: (2024)

Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
by: Mitra, Chancharik, et al.
Published: (2025)