Saved in:
| Main Authors: | Nguyen, Tan M., Nguyen, Tam, Ho, Nhat, Bertozzi, Andrea L., Baraniuk, Richard G., Osher, Stanley J. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.13781 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sliced Wasserstein with Random-Path Projecting Directions
by: Nguyen, Khai, et al.
Published: (2024)
by: Nguyen, Khai, et al.
Published: (2024)
A Review of Mechanistic Models of Event Comprehension
by: Nguyen, Tan T.
Published: (2024)
by: Nguyen, Tan T.
Published: (2024)
Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
BiMa: Towards Biases Mitigation for Text-Video Retrieval via Scene Element Guidance
by: Le, Huy, et al.
Published: (2025)
by: Le, Huy, et al.
Published: (2025)
Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions
by: Nguyen, Khai, et al.
Published: (2024)
by: Nguyen, Khai, et al.
Published: (2024)
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024)
by: Liao, Zhu, et al.
Published: (2024)
Improved GUI Grounding via Iterative Narrowing
by: Nguyen, Anthony
Published: (2024)
by: Nguyen, Anthony
Published: (2024)
Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion
by: Vu, Tuan-Anh, et al.
Published: (2023)
by: Vu, Tuan-Anh, et al.
Published: (2023)
A Novel Framework for Automated Explain Vision Model Using Vision-Language Models
by: Nguyen, Phu-Vinh, et al.
Published: (2025)
by: Nguyen, Phu-Vinh, et al.
Published: (2025)
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
by: Teo, Rachel S. Y., et al.
Published: (2025)
by: Teo, Rachel S. Y., et al.
Published: (2025)
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
by: Teo, Rachel S. Y., et al.
Published: (2024)
by: Teo, Rachel S. Y., et al.
Published: (2024)
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
by: Teo, Rachel S. Y., et al.
Published: (2024)
by: Teo, Rachel S. Y., et al.
Published: (2024)
The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
by: Nguyen, Phuc Minh, et al.
Published: (2025)
by: Nguyen, Phuc Minh, et al.
Published: (2025)
Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning
by: Le, Tung, et al.
Published: (2024)
by: Le, Tung, et al.
Published: (2024)
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration
by: Nguyen, Ngoc Son, et al.
Published: (2024)
by: Nguyen, Ngoc Son, et al.
Published: (2024)
AHAN: Asymmetric Hierarchical Attention Network for Identical Twin Face Verification
by: Nguyen, Hoang-Nhat
Published: (2026)
by: Nguyen, Hoang-Nhat
Published: (2026)
Linguistically Informed Multimodal Fusion for Vietnamese Scene-Text Image Captioning: Dataset, Graph Framework, and Phonological Attention
by: Nguyen, Nhi Ngoc-Yen, et al.
Published: (2026)
by: Nguyen, Nhi Ngoc-Yen, et al.
Published: (2026)
An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation
by: Nguyen, Giang Son, et al.
Published: (2026)
by: Nguyen, Giang Son, et al.
Published: (2026)
SimGraph: A Unified Framework for Scene Graph-Based Image Generation and Editing
by: Vo, Thanh-Nhan, et al.
Published: (2026)
by: Vo, Thanh-Nhan, et al.
Published: (2026)
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
by: Cao, Tue M., et al.
Published: (2025)
by: Cao, Tue M., et al.
Published: (2025)
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
by: Pham, Thang M., et al.
Published: (2024)
by: Pham, Thang M., et al.
Published: (2024)
Wavelet Burst Accumulation for turbulence mitigation
by: Gilles, Jerome, et al.
Published: (2024)
by: Gilles, Jerome, et al.
Published: (2024)
Fried deconvolution
by: Gilles, Jerome, et al.
Published: (2024)
by: Gilles, Jerome, et al.
Published: (2024)
Figuring out Figures: Using Textual References to Caption Scientific Figures
by: Cao, Stanley, et al.
Published: (2024)
by: Cao, Stanley, et al.
Published: (2024)
Deep Networks Always Grok and Here is Why
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)
Learning Transferable Features for Implicit Neural Representations
by: Vyas, Kushal, et al.
Published: (2024)
by: Vyas, Kushal, et al.
Published: (2024)
Energy-Based Sliced Wasserstein Distance
by: Nguyen, Khai, et al.
Published: (2023)
by: Nguyen, Khai, et al.
Published: (2023)
Sliced Wasserstein Estimation with Control Variates
by: Nguyen, Khai, et al.
Published: (2023)
by: Nguyen, Khai, et al.
Published: (2023)
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
by: Nguyen, Nghia Hieu, et al.
Published: (2024)
by: Nguyen, Nghia Hieu, et al.
Published: (2024)
DWE+: Dual-Way Matching Enhanced Framework for Multimodal Entity Linking
by: Song, Shezheng, et al.
Published: (2024)
by: Song, Shezheng, et al.
Published: (2024)
KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)
by: La, Tuan-Vinh, et al.
Published: (2025)
GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
by: Nguyen, Duy, et al.
Published: (2025)
by: Nguyen, Duy, et al.
Published: (2025)
Federated Document Visual Question Answering: A Pilot Study
by: Nguyen, Khanh, et al.
Published: (2024)
by: Nguyen, Khanh, et al.
Published: (2024)
IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme
by: Tran, Dinh Dai Quan, et al.
Published: (2025)
by: Tran, Dinh Dai Quan, et al.
Published: (2025)
Elliptical Attention
by: Nielsen, Stefan K., et al.
Published: (2024)
by: Nielsen, Stefan K., et al.
Published: (2024)
InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models
by: Nguyen, Pha, et al.
Published: (2025)
by: Nguyen, Pha, et al.
Published: (2025)
GlitchBench: Can large multimodal models detect video game glitches?
by: Taesiri, Mohammad Reza, et al.
Published: (2023)
by: Taesiri, Mohammad Reza, et al.
Published: (2023)
OSCaR: Object State Captioning and State Change Representation
by: Nguyen, Nguyen, et al.
Published: (2024)
by: Nguyen, Nguyen, et al.
Published: (2024)
Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection
by: Pham, Duc Thanh, et al.
Published: (2025)
by: Pham, Duc Thanh, et al.
Published: (2025)
DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
by: Nguyen, Thong, et al.
Published: (2023)
by: Nguyen, Thong, et al.
Published: (2023)
Similar Items
-
Sliced Wasserstein with Random-Path Projecting Directions
by: Nguyen, Khai, et al.
Published: (2024) -
A Review of Mechanistic Models of Event Comprehension
by: Nguyen, Tan T.
Published: (2024) -
Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025) -
BiMa: Towards Biases Mitigation for Text-Video Retrieval via Scene Element Guidance
by: Le, Huy, et al.
Published: (2025) -
Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions
by: Nguyen, Khai, et al.
Published: (2024)