| Main Authors: | Du, Zhou, Yuan, Zhaoquan, Wu, Xiao, Xu, Changsheng |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2606.02168 |
Similar Items
Disentangled Representation Learning via Modular Compositional Bias
by: Jung, Whie, et al.
Published: (2025)
SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks
by: Kahl, Kim-Celine, et al.
Published: (2024)
Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA
by: Safwan, Itbaan, et al.
Published: (2025)
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
by: Zhou, Runjie, et al.
Published: (2026)
WildFireVQA: A Large-Scale Radiometric Thermal VQA Benchmark for Aerial Wildfire Monitoring
by: Habibpour, Mobin, et al.
Published: (2026)
Learning Color Equivariant Representations
by: Yang, Yulong, et al.
Published: (2024)
HAMMR: HierArchical MultiModal React agents for generic VQA
by: Castrejon, Lluis, et al.
Published: (2024)
Cluster-Aware Similarity Diffusion for Instance Retrieval
by: Luo, Jifei, et al.
Published: (2024)
Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations
by: Nguyen, Khoi Anh, et al.
Published: (2025)
VQA-Levels: A Hierarchical Approach for Classifying Questions in VQA
by: Madaka, Madhuri Latha, et al.
Published: (2025)
Unexplored flaws in multiple-choice VQA evaluations
by: Rosenthal, Fabio, et al.
Published: (2025)
BERT-VQA: Visual Question Answering on Plots
by: Vu, Tai, et al.
Published: (2025)
Equivariant neural networks and equivarification
by: Bao, Erkao, et al.
Published: (2019)
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
by: Hsu, Kyle, et al.
Published: (2024)
Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization
by: Tasyurek, Sumeyye Meryem, et al.
Published: (2025)
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
by: Wu, Shaojin, et al.
Published: (2025)
Improving Medical VQA through Trajectory-Aware Process Supervision
by: Gulluk, Halil Ibrahim, et al.
Published: (2026)
Soft Equivariance Regularization for Invariant Self-Supervised Learning
by: Lee, Joohyung, et al.
Published: (2026)
RotaTouille: Rotation Equivariant Deep Learning for Contours
by: Gardaa, Odin Hoff, et al.
Published: (2025)
EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning
by: Chen, Yue, et al.
Published: (2024)
Attribute Diversity Determines the Systematicity Gap in VQA
by: Berlot-Attwell, Ian, et al.
Published: (2023)
DRESS: Disentangled Representation-based Self-Supervised Meta-Learning for Diverse Tasks
by: Cui, Wei, et al.
Published: (2025)
Overconfidence and Calibration in Medical VQA: Empirical Findings and Hallucination-Aware Mitigation
by: Byun, Ji Young, et al.
Published: (2026)
Current Symmetry Group Equivariant Convolution Frameworks for Representation Learning
by: Basheer, Ramzan, et al.
Published: (2024)
Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)
Disentangled Representation Learning with the Gromov-Monge Gap
by: Uscidda, Théo, et al.
Published: (2024)
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders
by: Li, Sheng-Wei, et al.
Published: (2024)
ProtoVQA: An Adaptable Prototypical Framework for Explainable Fine-Grained Visual Question Answering
by: Diao, Xingjian, et al.
Published: (2025)
BloomVQA: Assessing Hierarchical Multi-modal Comprehension
by: Gong, Yunye, et al.
Published: (2023)
Tunable Soft Equivariance with Guarantees
by: Rahman, Md Ashiqur, et al.
Published: (2026)
Diffeomorphism-Equivariant Neural Networks
by: Oettinger, Josephine Elisabeth, et al.
Published: (2026)
Multi-Task Model Merging via Adaptive Weight Disentanglement
by: Xiong, Feng, et al.
Published: (2024)
Disentangled 3D Scene Generation with Layout Learning
by: Epstein, Dave, et al.
Published: (2024)
Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
by: Dunion, Mhairi, et al.
Published: (2024)
LatentGAN Autoencoder: Learning Disentangled Latent Distribution
by: Kalwar, Sanket, et al.
Published: (2022)
Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets
by: Ma, Yongpei, et al.
Published: (2025)
NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
by: Tobaben, Marlon, et al.
Published: (2024)
Bases of Steerable Kernels for Equivariant CNNs: From 2D Rotations to the Lorentz Group
by: Garbarz, Alan
Published: (2026)
The Lie Derivative for Measuring Learned Equivariance
by: Gruver, Nate, et al.
Published: (2022)
Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)