Library Catalog

Bibliographic Details
Main Authors:	Nguyen, Tan M.; Nguyen, Tam; Ho, Nhat; Bertozzi, Andrea L.; Baraniuk, Richard G.; Osher, Stanley J.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.13781

Similar Items

Sliced Wasserstein with Random-Path Projecting Directions
by: Nguyen, Khai, et al.
Published: (2024)

A Review of Mechanistic Models of Event Comprehension
by: Nguyen, Tan T.
Published: (2024)

Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)

BiMa: Towards Biases Mitigation for Text-Video Retrieval via Scene Element Guidance
by: Le, Huy, et al.
Published: (2025)

Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions
by: Nguyen, Khai, et al.
Published: (2024)

Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024)

Improved GUI Grounding via Iterative Narrowing
by: Nguyen, Anthony
Published: (2024)

Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion
by: Vu, Tuan-Anh, et al.
Published: (2023)

A Novel Framework for Automated Explain Vision Model Using Vision-Language Models
by: Nguyen, Phu-Vinh, et al.
Published: (2025)

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
by: Teo, Rachel S. Y., et al.
Published: (2025)

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
by: Teo, Rachel S. Y., et al.
Published: (2024)

Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
by: Teo, Rachel S. Y., et al.
Published: (2024)

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
by: Nguyen, Phuc Minh, et al.
Published: (2025)

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning
by: Le, Tung, et al.
Published: (2024)

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration
by: Nguyen, Ngoc Son, et al.
Published: (2024)

AHAN: Asymmetric Hierarchical Attention Network for Identical Twin Face Verification
by: Nguyen, Hoang-Nhat
Published: (2026)

Linguistically Informed Multimodal Fusion for Vietnamese Scene-Text Image Captioning: Dataset, Graph Framework, and Phonological Attention
by: Nguyen, Nhi Ngoc-Yen, et al.
Published: (2026)

An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation
by: Nguyen, Giang Son, et al.
Published: (2026)

SimGraph: A Unified Framework for Scene Graph-Based Image Generation and Editing
by: Vo, Thanh-Nhan, et al.
Published: (2026)

NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
by: Cao, Tue M., et al.
Published: (2025)

PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
by: Pham, Thang M., et al.
Published: (2024)

Wavelet Burst Accumulation for turbulence mitigation
by: Gilles, Jerome, et al.
Published: (2024)

Fried deconvolution
by: Gilles, Jerome, et al.
Published: (2024)

Figuring out Figures: Using Textual References to Caption Scientific Figures
by: Cao, Stanley, et al.
Published: (2024)

Deep Networks Always Grok and Here is Why
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)

Learning Transferable Features for Implicit Neural Representations
by: Vyas, Kushal, et al.
Published: (2024)

Energy-Based Sliced Wasserstein Distance
by: Nguyen, Khai, et al.
Published: (2023)

Sliced Wasserstein Estimation with Control Variates
by: Nguyen, Khai, et al.
Published: (2023)

ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
by: Nguyen, Nghia Hieu, et al.
Published: (2024)

DWE+: Dual-Way Matching Enhanced Framework for Multimodal Entity Linking
by: Song, Shezheng, et al.
Published: (2024)

KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)

GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
by: Nguyen, Duy, et al.
Published: (2025)

Federated Document Visual Question Answering: A Pilot Study
by: Nguyen, Khanh, et al.
Published: (2024)

IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme
by: Tran, Dinh Dai Quan, et al.
Published: (2025)

Elliptical Attention
by: Nielsen, Stefan K., et al.
Published: (2024)

InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models
by: Nguyen, Pha, et al.
Published: (2025)

GlitchBench: Can large multimodal models detect video game glitches?
by: Taesiri, Mohammad Reza, et al.
Published: (2023)

OSCaR: Object State Captioning and State Change Representation
by: Nguyen, Nguyen, et al.
Published: (2024)

Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection
by: Pham, Duc Thanh, et al.
Published: (2025)

DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
by: Nguyen, Thong, et al.
Published: (2023)