Saved in:
| Main Authors: | Ung, Huy Quang, Niu, Hao, Dao, Minh-Son, Wada, Shinya, Minamikawa, Atsunori |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.04841 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
by: Ung, Huy Quang, et al.
Published: (2025)
by: Ung, Huy Quang, et al.
Published: (2025)
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
by: Dao, Alan, et al.
Published: (2024)
by: Dao, Alan, et al.
Published: (2024)
SAMSA: Efficient Transformer for Many Data Modalities
by: Lenhat, Minh, et al.
Published: (2024)
by: Lenhat, Minh, et al.
Published: (2024)
A model free approach for continuous-time optimal tracking control with unknown user-define cost and constrained control input via advantage function
by: Nguyen, Duc Cuong, et al.
Published: (2025)
by: Nguyen, Duc Cuong, et al.
Published: (2025)
Co-NAML-LSTUR: A Combined Model with Attentive Multi-View Learning and Long- and Short-term User Representations for News Recommendation
by: Nguyen, Minh Hoang, et al.
Published: (2025)
by: Nguyen, Minh Hoang, et al.
Published: (2025)
Deep-Wide Learning Assistance for Insect Pest Classification
by: Nguyen, Toan, et al.
Published: (2024)
by: Nguyen, Toan, et al.
Published: (2024)
Label-Efficient Cross-Modality Generalization for Liver Segmentation in Multi-Phase MRI
by: Bui-Tran, Quang-Khai, et al.
Published: (2025)
by: Bui-Tran, Quang-Khai, et al.
Published: (2025)
E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs
by: Phan, Van-Hoang, et al.
Published: (2025)
by: Phan, Van-Hoang, et al.
Published: (2025)
DuA: Dual Attentive Transformer in Long-Term Continuous EEG Emotion Analysis
by: Pan, Yue, et al.
Published: (2024)
by: Pan, Yue, et al.
Published: (2024)
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning
by: Dao, Alan, et al.
Published: (2025)
by: Dao, Alan, et al.
Published: (2025)
FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction
by: Zhang, Haoyi, et al.
Published: (2026)
by: Zhang, Haoyi, et al.
Published: (2026)
Cross-Attentive Multiview Fusion of Vision-Language Embeddings
by: Martins, Tomas Berriel, et al.
Published: (2026)
by: Martins, Tomas Berriel, et al.
Published: (2026)
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)
by: Dinh, Quang Minh, et al.
Published: (2024)
A Knowledge-Guided Cross-Modal Feature Fusion Model for Local Traffic Demand Prediction
by: Zhang, Lingyu, et al.
Published: (2025)
by: Zhang, Lingyu, et al.
Published: (2025)
SearchLLM: Detecting LLM Paraphrased Text by Measuring the Similarity with Regeneration of the Candidate Source via Search Engine
by: Nguyen-Son, Hoang-Quoc, et al.
Published: (2026)
by: Nguyen-Son, Hoang-Quoc, et al.
Published: (2026)
KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)
by: La, Tuan-Vinh, et al.
Published: (2025)
Analyzing the Burden of Midface Fractures Due to Road Traffic Accidents in Vietnam: An Epidemiological Approach
by: Chon Thanh Ho Nguyen, et al.
Published: (2024)
by: Chon Thanh Ho Nguyen, et al.
Published: (2024)
Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)
Expandable Microspheres Transform 2D Electrospun Mats Into 3D Composite Scaffolds
by: Huy Quang Tran, et al.
Published: (2024)
by: Huy Quang Tran, et al.
Published: (2024)
Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios
by: Gan, Wenbin, et al.
Published: (2025)
by: Gan, Wenbin, et al.
Published: (2025)
Cross-Modal Temporal Fusion for Financial Market Forecasting
by: Pei, Yunhua, et al.
Published: (2025)
by: Pei, Yunhua, et al.
Published: (2025)
RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction
by: Nguyen, Huy-Son, et al.
Published: (2025)
by: Nguyen, Huy-Son, et al.
Published: (2025)
Enhancing Alzheimer's Detection through Late Fusion of Multi-Modal EEG Features
by: Vinh, Nguyen Thanh, et al.
Published: (2025)
by: Vinh, Nguyen Thanh, et al.
Published: (2025)
Hybrid Transformer and Spatial-Temporal Self-Supervised Learning for Long-term Traffic Prediction
by: Zhu, Wang, et al.
Published: (2024)
by: Zhu, Wang, et al.
Published: (2024)
3D Dynamic Radio Map Prediction Using Vision Transformers for Low-Altitude Wireless Networks
by: Quang, Nguyen Duc Minh, et al.
Published: (2025)
by: Quang, Nguyen Duc Minh, et al.
Published: (2025)
A Cross‐Attentive Gated Fusion Model for Steel Billet Tapping Temperature Prediction in Heating Furnace
by: Junyu Zhao, et al.
Published: (2026)
by: Junyu Zhao, et al.
Published: (2026)
AdaCM$^2$: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction
by: Man, Yuanbin, et al.
Published: (2024)
by: Man, Yuanbin, et al.
Published: (2024)
PROMISE: Prompt-Attentive Hierarchical Contrastive Learning for Robust Cross-Modal Representation with Missing Modalities
by: Chen, Jiajun, et al.
Published: (2025)
by: Chen, Jiajun, et al.
Published: (2025)
Brauer-Manin obstruction for Wehler K3 surfaces of Markoff type
by: Dao, Quang-Duc
Published: (2023)
by: Dao, Quang-Duc
Published: (2023)
Rational and integral points on Markoff-type K3 surfaces
by: Dao, Quang-Duc
Published: (2025)
by: Dao, Quang-Duc
Published: (2025)
Gender and Its Representation in Vietnamese Online Newspapers: A Discourse Analysis Through a Post‐Colonial Lens
by: Trung Quang Dao
Published: (2025)
by: Trung Quang Dao
Published: (2025)
Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
by: Tao, Xinhao, et al.
Published: (2023)
by: Tao, Xinhao, et al.
Published: (2023)
CAST: Cross-Attentive Spatio-Temporal feature fusion for deepfake detection
by: Thakre, Aryan, et al.
Published: (2025)
by: Thakre, Aryan, et al.
Published: (2025)
Cyberscurity Threats and Defense Mechanisms in IoT network
by: Dao, Trung, et al.
Published: (2026)
by: Dao, Trung, et al.
Published: (2026)
Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
THE STRATEGIC FIT’S EFFECTIVENESS IN THE COMPETITIVE MARKET: FOCUS ON SMALL BUSINESSES IN AN EMERGING COUNTRY
by: Quang-Huy Ngo
Published: (2022)
by: Quang-Huy Ngo
Published: (2022)
The Impact of Green Market Orientation and Ambidextrous Green Innovation on Organizational Performance: Empirical Study on Small Restaurants in Vietnam
by: Quang‐Huy Ngo
Published: (2024)
by: Quang‐Huy Ngo
Published: (2024)
Green Human Resource Management in Vietnamese Small Restaurants: Integrating Institutional and Natural‐Resource‐Based Perspectives
by: Quang‐Huy Ngo
Published: (2025)
by: Quang‐Huy Ngo
Published: (2025)
CXR-TFT: Multi-Modal Temporal Fusion Transformer for Predicting Chest X-ray Trajectories
by: Arora, Mehak, et al.
Published: (2025)
by: Arora, Mehak, et al.
Published: (2025)
HUTFormer: Hierarchical U-Net Transformer for Long-Term Traffic Forecasting
by: Shao, Zezhi, et al.
Published: (2023)
by: Shao, Zezhi, et al.
Published: (2023)
Similar Items
-
CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
by: Ung, Huy Quang, et al.
Published: (2025) -
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
by: Dao, Alan, et al.
Published: (2024) -
SAMSA: Efficient Transformer for Many Data Modalities
by: Lenhat, Minh, et al.
Published: (2024) -
A model free approach for continuous-time optimal tracking control with unknown user-define cost and constrained control input via advantage function
by: Nguyen, Duc Cuong, et al.
Published: (2025) -
Co-NAML-LSTUR: A Combined Model with Attentive Multi-View Learning and Long- and Short-term User Representations for News Recommendation
by: Nguyen, Minh Hoang, et al.
Published: (2025)