:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Hou-I, Galindo, Marco, Xie, Hongxia, Wong, Lai-Kuan, Shuai, Hong-Han, Li, Yung-Hui, Cheng, Wen-Huang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2404.07236
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

RecipeGen: A Benchmark for Real-World Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)

AesCrop: Aesthetic-driven Cropping Guided by Composition
by: Wong, Yen-Hong, et al.
Published: (2025)

RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)

Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network
by: Pan, Lu, et al.
Published: (2025)

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
by: Huang, Yi-Xin, et al.
Published: (2024)

Perspective-Aware Teaching: Adapting Knowledge for Heterogeneous Distillation
by: Lin, Jhe-Hao, et al.
Published: (2025)

One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation
by: Tseng, Yu-Wen, et al.
Published: (2026)

Deep Active Audio Feature Learning in Resource-Constrained Environments
by: Mohaimenuzzaman, Md, et al.
Published: (2023)

EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
by: Xie, Hongxia, et al.
Published: (2024)

CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)

A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
by: Liu, Hou-I, et al.
Published: (2024)

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
by: Yao, Yi, et al.
Published: (2024)

Attribute-Grounded Selective Reasoning for Artwork Emotion Understanding with Multimodal Large Language Models
by: Zhang, Cheng, et al.
Published: (2026)

COACH: Collaborative Agents for Contextual Highlighting -- A Multi-Agent Framework for Sports Video Analysis
by: Wong, Tsz-To, et al.
Published: (2025)

LiPS: Lightweight Panoptic Segmentation for Resource-Constrained Robotics
by: Galagain, Calvin, et al.
Published: (2026)

A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting
by: Chaudhuri, Yashwardhan, et al.
Published: (2024)

An All Deep System for Badminton Game Analysis
by: Chou, Po-Yung, et al.
Published: (2023)

TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
by: Jiang-Lin, Jian-Yu, et al.
Published: (2025)

Future Sight and Tough Fights: Revolutionizing Sequential Recommendation with FENRec
by: Huang, Yu-Hsuan, et al.
Published: (2024)

Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers
by: Huang, Tzuhsuan, et al.
Published: (2025)

FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices
by: Han, Shizhong, et al.
Published: (2025)

LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices
by: Kühne, Jonas, et al.
Published: (2026)

FireLite: Leveraging Transfer Learning for Efficient Fire Detection in Resource-Constrained Environments
by: Hasan, Mahamudul, et al.
Published: (2024)

Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices
by: Shahriar, Tasnim
Published: (2025)

Monocular Lane Detection Based on Deep Learning: A Survey
by: He, Xin, et al.
Published: (2024)

PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos
by: Zhou, Zhiyu, et al.
Published: (2026)

$L^2$FMamba: Lightweight Light Field Image Super-Resolution with State Space Model
by: Wei, Zeqiang, et al.
Published: (2025)

Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
by: Lin, Yung-Hui, et al.
Published: (2024)

Survey on Fundamental Deep Learning 3D Reconstruction Techniques
by: Bai, Yonge, et al.
Published: (2024)

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
by: Huang, Chi-Pin, et al.
Published: (2023)

Towards Calibrated Deep Clustering Network
by: Jia, Yuheng, et al.
Published: (2024)

CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network
by: Yu, Xiaotong, et al.
Published: (2024)

Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms
by: Shen, Gefei, et al.
Published: (2025)

WildFit: Autonomous In-situ Model Adaptation for Resource-Constrained IoT Systems
by: Rastikerdar, Mohammad Mehdi, et al.
Published: (2024)

SA-MLP: A Low-Power Multiplication-Free Deep Network for 3D Point Cloud Classification in Resource-Constrained Environments
by: Zheng, Qiang, et al.
Published: (2024)

A Lightweight Dual-Branch System for Weakly-Supervised Video Anomaly Detection on Consumer Edge Devices
by: Jiang, Wen-Dong, et al.
Published: (2024)

EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation
by: Zhang, Cheng, et al.
Published: (2025)

Deep Learning for Visual Speech Analysis: A Survey
by: Sheng, Changchong, et al.
Published: (2022)

A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning
by: Cong, Xiaofeng, et al.
Published: (2024)

Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
by: Shihab, Ibne Farabi, et al.
Published: (2025)