:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Du, Zhou, Yuan, Zhaoquan, Wu, Xiao, Xu, Changsheng
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2606.02168
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Disentangled Representation Learning via Modular Compositional Bias
by: Jung, Whie, et al.
Published: (2025)

SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks
by: Kahl, Kim-Celine, et al.
Published: (2024)

Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA
by: Safwan, Itbaan, et al.
Published: (2025)

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
by: Zhou, Runjie, et al.
Published: (2026)

WildFireVQA: A Large-Scale Radiometric Thermal VQA Benchmark for Aerial Wildfire Monitoring
by: Habibpour, Mobin, et al.
Published: (2026)

Learning Color Equivariant Representations
by: Yang, Yulong, et al.
Published: (2024)

HAMMR: HierArchical MultiModal React agents for generic VQA
by: Castrejon, Lluis, et al.
Published: (2024)

Cluster-Aware Similarity Diffusion for Instance Retrieval
by: Luo, Jifei, et al.
Published: (2024)

Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations
by: Nguyen, Khoi Anh, et al.
Published: (2025)

VQA-Levels: A Hierarchical Approach for Classifying Questions in VQA
by: Madaka, Madhuri Latha, et al.
Published: (2025)

Unexplored flaws in multiple-choice VQA evaluations
by: Rosenthal, Fabio, et al.
Published: (2025)

BERT-VQA: Visual Question Answering on Plots
by: Vu, Tai, et al.
Published: (2025)

Equivariant neural networks and equivarification
by: Bao, Erkao, et al.
Published: (2019)

Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
by: Hsu, Kyle, et al.
Published: (2024)

Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization
by: Tasyurek, Sumeyye Meryem, et al.
Published: (2025)

USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
by: Wu, Shaojin, et al.
Published: (2025)

Improving Medical VQA through Trajectory-Aware Process Supervision
by: Gulluk, Halil Ibrahim, et al.
Published: (2026)

Soft Equivariance Regularization for Invariant Self-Supervised Learning
by: Lee, Joohyung, et al.
Published: (2026)

RotaTouille: Rotation Equivariant Deep Learning for Contours
by: Gardaa, Odin Hoff, et al.
Published: (2025)

EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning
by: Chen, Yue, et al.
Published: (2024)

Attribute Diversity Determines the Systematicity Gap in VQA
by: Berlot-Attwell, Ian, et al.
Published: (2023)

DRESS: Disentangled Representation-based Self-Supervised Meta-Learning for Diverse Tasks
by: Cui, Wei, et al.
Published: (2025)

Overconfidence and Calibration in Medical VQA: Empirical Findings and Hallucination-Aware Mitigation
by: Byun, Ji Young, et al.
Published: (2026)

Current Symmetry Group Equivariant Convolution Frameworks for Representation Learning
by: Basheer, Ramzan, et al.
Published: (2024)

Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)

Disentangled Representation Learning with the Gromov-Monge Gap
by: Uscidda, Théo, et al.
Published: (2024)

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders
by: Li, Sheng-Wei, et al.
Published: (2024)

ProtoVQA: An Adaptable Prototypical Framework for Explainable Fine-Grained Visual Question Answering
by: Diao, Xingjian, et al.
Published: (2025)

BloomVQA: Assessing Hierarchical Multi-modal Comprehension
by: Gong, Yunye, et al.
Published: (2023)

Tunable Soft Equivariance with Guarantees
by: Rahman, Md Ashiqur, et al.
Published: (2026)

Diffeomorphism-Equivariant Neural Networks
by: Oettinger, Josephine Elisabeth, et al.
Published: (2026)

Multi-Task Model Merging via Adaptive Weight Disentanglement
by: Xiong, Feng, et al.
Published: (2024)

Disentangled 3D Scene Generation with Layout Learning
by: Epstein, Dave, et al.
Published: (2024)

Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
by: Dunion, Mhairi, et al.
Published: (2024)

LatentGAN Autoencoder: Learning Disentangled Latent Distribution
by: Kalwar, Sanket, et al.
Published: (2022)

Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets
by: Ma, Yongpei, et al.
Published: (2025)

NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
by: Tobaben, Marlon, et al.
Published: (2024)

Bases of Steerable Kernels for Equivariant CNNs: From 2D Rotations to the Lorentz Group
by: Garbarz, Alan
Published: (2026)

The Lie Derivative for Measuring Learned Equivariance
by: Gruver, Nate, et al.
Published: (2022)

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)