Saved in:
| Main Authors: | Hamid, Nur Amirah Abd, Shapiai, Mohd Ibrahim, Lai, Daphne Teck Ching |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.17613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improving Interpretability in Alzheimer's Prediction via Joint Learning of ADAS-Cog Scores
by: Hamid, Nur Amirah Abd, et al.
Published: (2025)
by: Hamid, Nur Amirah Abd, et al.
Published: (2025)
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
by: Zhong, Hanwen, et al.
Published: (2025)
by: Zhong, Hanwen, et al.
Published: (2025)
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
by: Zhu, Jiageng, et al.
Published: (2024)
by: Zhu, Jiageng, et al.
Published: (2024)
Road Segmentation for ADAS/AD Applications
by: Ramasamy, Mathanesh Vellingiri, et al.
Published: (2025)
by: Ramasamy, Mathanesh Vellingiri, et al.
Published: (2025)
Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps
by: Gwak, Chanyoung, et al.
Published: (2026)
by: Gwak, Chanyoung, et al.
Published: (2026)
SatelliteCalculator: A Multi-Task Vision Foundation Model for Quantitative Remote Sensing Inversion
by: Yu, Zhenyu, et al.
Published: (2025)
by: Yu, Zhenyu, et al.
Published: (2025)
HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Task
by: Tian, Yu, et al.
Published: (2024)
by: Tian, Yu, et al.
Published: (2024)
ADAS-TO: A Large-Scale Multimodal Naturalistic Dataset and Empirical Characterization of Human Takeovers during ADAS Engagement
by: Wang, Yuhang, et al.
Published: (2026)
by: Wang, Yuhang, et al.
Published: (2026)
Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies
by: Sarkar, Ayushman, et al.
Published: (2025)
by: Sarkar, Ayushman, et al.
Published: (2025)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
by: Yang, Zhuoyi, et al.
Published: (2024)
by: Yang, Zhuoyi, et al.
Published: (2024)
Inadequate contrast ratio of road markings as an indicator for ADAS failure
by: Certad, Novel, et al.
Published: (2024)
by: Certad, Novel, et al.
Published: (2024)
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
by: Chen, Wei, et al.
Published: (2024)
by: Chen, Wei, et al.
Published: (2024)
Analytical Uncertainty-Based Loss Weighting in Multi-Task Learning
by: Kirchdorfer, Lukas, et al.
Published: (2024)
by: Kirchdorfer, Lukas, et al.
Published: (2024)
Progressive Pretext Task Learning for Human Trajectory Prediction
by: Lin, Xiaotong, et al.
Published: (2024)
by: Lin, Xiaotong, et al.
Published: (2024)
SGW-based Multi-Task Learning in Vision Tasks
by: Zhang, Ruiyuan, et al.
Published: (2024)
by: Zhang, Ruiyuan, et al.
Published: (2024)
Robust ADAS: Enhancing Robustness of Machine Learning-based Advanced Driver Assistance Systems for Adverse Weather
by: Shahzad, Muhammad Zaeem, et al.
Published: (2024)
by: Shahzad, Muhammad Zaeem, et al.
Published: (2024)
Adaptive Multi Scale Document Binarisation Using Vision Mamba
by: Azfar, Mohd., et al.
Published: (2024)
by: Azfar, Mohd., et al.
Published: (2024)
Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer
by: Zhao, Guibin, et al.
Published: (2024)
by: Zhao, Guibin, et al.
Published: (2024)
Vision Transformers for Preoperative CT-Based Prediction of Histopathologic Chemotherapy Response Score in High-Grade Serous Ovarian Carcinoma
by: Fati, Francesca, et al.
Published: (2026)
by: Fati, Francesca, et al.
Published: (2026)
Generative Pre-training for Subjective Tasks: A Diffusion Transformer-Based Framework for Facial Beauty Prediction
by: Boukhari, Djamel Eddine, et al.
Published: (2025)
by: Boukhari, Djamel Eddine, et al.
Published: (2025)
TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
by: Baek, Seungmin, et al.
Published: (2025)
by: Baek, Seungmin, et al.
Published: (2025)
Deep Learning Based Wildfire Detection for Peatland Fires Using Transfer Learning
by: Hamdan, Emadeldeen, et al.
Published: (2026)
by: Hamdan, Emadeldeen, et al.
Published: (2026)
VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion
by: Hicsonmez, Samet, et al.
Published: (2025)
by: Hicsonmez, Samet, et al.
Published: (2025)
Proximal Vision Transformer: Enhancing Feature Representation through Two-Stage Manifold Geometry
by: Yun, Haoyu, et al.
Published: (2025)
by: Yun, Haoyu, et al.
Published: (2025)
EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurable Approximate ADAS Acceleration
by: Lokhande, Mukul, et al.
Published: (2026)
by: Lokhande, Mukul, et al.
Published: (2026)
Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks
by: Singh, Raghavendra
Published: (2024)
by: Singh, Raghavendra
Published: (2024)
Efficient Cross-Country Data Acquisition Strategy for ADAS via Street-View Imagery
by: Wu, Yin, et al.
Published: (2026)
by: Wu, Yin, et al.
Published: (2026)
Efficient Domain-Adaptive Multi-Task Dense Prediction with Vision Foundation Models
by: Kang, Beomseok, et al.
Published: (2025)
by: Kang, Beomseok, et al.
Published: (2025)
Task Indicating Transformer for Task-conditional Dense Predictions
by: Lu, Yuxiang, et al.
Published: (2024)
by: Lu, Yuxiang, et al.
Published: (2024)
CogDoc: Towards Unified thinking in Documents
by: Xu, Qixin, et al.
Published: (2025)
by: Xu, Qixin, et al.
Published: (2025)
ViT-DD: Multi-Task Vision Transformer for Semi-Supervised Driver Distraction Detection
by: Ma, Yunsheng, et al.
Published: (2022)
by: Ma, Yunsheng, et al.
Published: (2022)
Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers
by: Igaue, Yuki, et al.
Published: (2025)
by: Igaue, Yuki, et al.
Published: (2025)
Multidimensional Task Learning: A Unified Tensor Framework for Computer Vision Tasks
by: Ichi, Alaa El, et al.
Published: (2026)
by: Ichi, Alaa El, et al.
Published: (2026)
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
by: Li, Yongkang, et al.
Published: (2025)
by: Li, Yongkang, et al.
Published: (2025)
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
by: Li, Wei, et al.
Published: (2025)
by: Li, Wei, et al.
Published: (2025)
Safety-Critical Camera Reliability Monitoring for ADAS via Degradation-Aware Uncertainty Pattern Analysis
by: Aher, Shiva
Published: (2026)
by: Aher, Shiva
Published: (2026)
Feature Learning with Multi-Stage Vision Transformers on Inter-Modality HER2 Status Scoring and Tumor Classification on Whole Slides
by: Oyelade, Olaide N., et al.
Published: (2025)
by: Oyelade, Olaide N., et al.
Published: (2025)
CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting
by: Hossain, Md Tanvir, et al.
Published: (2025)
by: Hossain, Md Tanvir, et al.
Published: (2025)
CogVLM: Visual Expert for Pretrained Language Models
by: Wang, Weihan, et al.
Published: (2023)
by: Wang, Weihan, et al.
Published: (2023)
On Convolutional Vision Transformers for Yield Prediction
by: Inderka, Alvin, et al.
Published: (2024)
by: Inderka, Alvin, et al.
Published: (2024)
Similar Items
-
Improving Interpretability in Alzheimer's Prediction via Joint Learning of ADAS-Cog Scores
by: Hamid, Nur Amirah Abd, et al.
Published: (2025) -
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
by: Zhong, Hanwen, et al.
Published: (2025) -
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
by: Zhu, Jiageng, et al.
Published: (2024) -
Road Segmentation for ADAS/AD Applications
by: Ramasamy, Mathanesh Vellingiri, et al.
Published: (2025) -
Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps
by: Gwak, Chanyoung, et al.
Published: (2026)