:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ung, Huy Quang, Niu, Hao, Dao, Minh-Son, Wada, Shinya, Minamikawa, Atsunori
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2405.04841
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
by: Ung, Huy Quang, et al.
Published: (2025)

Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
by: Dao, Alan, et al.
Published: (2024)

SAMSA: Efficient Transformer for Many Data Modalities
by: Lenhat, Minh, et al.
Published: (2024)

A model free approach for continuous-time optimal tracking control with unknown user-define cost and constrained control input via advantage function
by: Nguyen, Duc Cuong, et al.
Published: (2025)

Co-NAML-LSTUR: A Combined Model with Attentive Multi-View Learning and Long- and Short-term User Representations for News Recommendation
by: Nguyen, Minh Hoang, et al.
Published: (2025)

Deep-Wide Learning Assistance for Insect Pest Classification
by: Nguyen, Toan, et al.
Published: (2024)

Label-Efficient Cross-Modality Generalization for Liver Segmentation in Multi-Phase MRI
by: Bui-Tran, Quang-Khai, et al.
Published: (2025)

E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs
by: Phan, Van-Hoang, et al.
Published: (2025)

DuA: Dual Attentive Transformer in Long-Term Continuous EEG Emotion Analysis
by: Pan, Yue, et al.
Published: (2024)

AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning
by: Dao, Alan, et al.
Published: (2025)

FusionCell: Cross-Attentive Fusion of Layout Geometry and Netlist Topology for Standard-Cell Performance Prediction
by: Zhang, Haoyi, et al.
Published: (2026)

Cross-Attentive Multiview Fusion of Vision-Language Embeddings
by: Martins, Tomas Berriel, et al.
Published: (2026)

TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)

A Knowledge-Guided Cross-Modal Feature Fusion Model for Local Traffic Demand Prediction
by: Zhang, Lingyu, et al.
Published: (2025)

SearchLLM: Detecting LLM Paraphrased Text by Measuring the Similarity with Regeneration of the Candidate Source via Search Engine
by: Nguyen-Son, Hoang-Quoc, et al.
Published: (2026)

KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)

Analyzing the Burden of Midface Fractures Due to Road Traffic Accidents in Vietnam: An Epidemiological Approach
by: Chon Thanh Ho Nguyen, et al.
Published: (2024)

Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)

Expandable Microspheres Transform 2D Electrospun Mats Into 3D Composite Scaffolds
by: Huy Quang Tran, et al.
Published: (2024)

Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios
by: Gan, Wenbin, et al.
Published: (2025)

Cross-Modal Temporal Fusion for Financial Market Forecasting
by: Pei, Yunhua, et al.
Published: (2025)

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction
by: Nguyen, Huy-Son, et al.
Published: (2025)

Enhancing Alzheimer's Detection through Late Fusion of Multi-Modal EEG Features
by: Vinh, Nguyen Thanh, et al.
Published: (2025)

Hybrid Transformer and Spatial-Temporal Self-Supervised Learning for Long-term Traffic Prediction
by: Zhu, Wang, et al.
Published: (2024)

3D Dynamic Radio Map Prediction Using Vision Transformers for Low-Altitude Wireless Networks
by: Quang, Nguyen Duc Minh, et al.
Published: (2025)

A Cross‐Attentive Gated Fusion Model for Steel Billet Tapping Temperature Prediction in Heating Furnace
by: Junyu Zhao, et al.
Published: (2026)

AdaCM$^2$: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction
by: Man, Yuanbin, et al.
Published: (2024)

PROMISE: Prompt-Attentive Hierarchical Contrastive Learning for Robust Cross-Modal Representation with Missing Modalities
by: Chen, Jiajun, et al.
Published: (2025)

Brauer-Manin obstruction for Wehler K3 surfaces of Markoff type
by: Dao, Quang-Duc
Published: (2023)

Rational and integral points on Markoff-type K3 surfaces
by: Dao, Quang-Duc
Published: (2025)

Gender and Its Representation in Vietnamese Online Newspapers: A Discourse Analysis Through a Post‐Colonial Lens
by: Trung Quang Dao
Published: (2025)

Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
by: Tao, Xinhao, et al.
Published: (2023)

CAST: Cross-Attentive Spatio-Temporal feature fusion for deepfake detection
by: Thakre, Aryan, et al.
Published: (2025)

Cyberscurity Threats and Defense Mechanisms in IoT network
by: Dao, Trung, et al.
Published: (2026)

Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)

THE STRATEGIC FIT’S EFFECTIVENESS IN THE COMPETITIVE MARKET: FOCUS ON SMALL BUSINESSES IN AN EMERGING COUNTRY
by: Quang-Huy Ngo
Published: (2022)

The Impact of Green Market Orientation and Ambidextrous Green Innovation on Organizational Performance: Empirical Study on Small Restaurants in Vietnam
by: Quang‐Huy Ngo
Published: (2024)

Green Human Resource Management in Vietnamese Small Restaurants: Integrating Institutional and Natural‐Resource‐Based Perspectives
by: Quang‐Huy Ngo
Published: (2025)

CXR-TFT: Multi-Modal Temporal Fusion Transformer for Predicting Chest X-ray Trajectories
by: Arora, Mehak, et al.
Published: (2025)

HUTFormer: Hierarchical U-Net Transformer for Long-Term Traffic Forecasting
by: Shao, Zezhi, et al.
Published: (2023)