Saved in:
| Main Authors: | Liu, Hou-I, Galindo, Marco, Xie, Hongxia, Wong, Lai-Kuan, Shuai, Hong-Han, Li, Yung-Hui, Cheng, Wen-Huang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.07236 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RecipeGen: A Benchmark for Real-World Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)
by: Zhang, Ruoxuan, et al.
Published: (2025)
AesCrop: Aesthetic-driven Cropping Guided by Composition
by: Wong, Yen-Hong, et al.
Published: (2025)
by: Wong, Yen-Hong, et al.
Published: (2025)
RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)
by: Zhang, Ruoxuan, et al.
Published: (2025)
Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network
by: Pan, Lu, et al.
Published: (2025)
by: Pan, Lu, et al.
Published: (2025)
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
by: Huang, Yi-Xin, et al.
Published: (2024)
by: Huang, Yi-Xin, et al.
Published: (2024)
Perspective-Aware Teaching: Adapting Knowledge for Heterogeneous Distillation
by: Lin, Jhe-Hao, et al.
Published: (2025)
by: Lin, Jhe-Hao, et al.
Published: (2025)
One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation
by: Tseng, Yu-Wen, et al.
Published: (2026)
by: Tseng, Yu-Wen, et al.
Published: (2026)
Deep Active Audio Feature Learning in Resource-Constrained Environments
by: Mohaimenuzzaman, Md, et al.
Published: (2023)
by: Mohaimenuzzaman, Md, et al.
Published: (2023)
EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
by: Xie, Hongxia, et al.
Published: (2024)
by: Xie, Hongxia, et al.
Published: (2024)
CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)
by: Zhang, Ruoxuan, et al.
Published: (2025)
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
by: Liu, Hou-I, et al.
Published: (2024)
by: Liu, Hou-I, et al.
Published: (2024)
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
by: Yao, Yi, et al.
Published: (2024)
by: Yao, Yi, et al.
Published: (2024)
Attribute-Grounded Selective Reasoning for Artwork Emotion Understanding with Multimodal Large Language Models
by: Zhang, Cheng, et al.
Published: (2026)
by: Zhang, Cheng, et al.
Published: (2026)
COACH: Collaborative Agents for Contextual Highlighting -- A Multi-Agent Framework for Sports Video Analysis
by: Wong, Tsz-To, et al.
Published: (2025)
by: Wong, Tsz-To, et al.
Published: (2025)
LiPS: Lightweight Panoptic Segmentation for Resource-Constrained Robotics
by: Galagain, Calvin, et al.
Published: (2026)
by: Galagain, Calvin, et al.
Published: (2026)
A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting
by: Chaudhuri, Yashwardhan, et al.
Published: (2024)
by: Chaudhuri, Yashwardhan, et al.
Published: (2024)
An All Deep System for Badminton Game Analysis
by: Chou, Po-Yung, et al.
Published: (2023)
by: Chou, Po-Yung, et al.
Published: (2023)
TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
by: Jiang-Lin, Jian-Yu, et al.
Published: (2025)
by: Jiang-Lin, Jian-Yu, et al.
Published: (2025)
Future Sight and Tough Fights: Revolutionizing Sequential Recommendation with FENRec
by: Huang, Yu-Hsuan, et al.
Published: (2024)
by: Huang, Yu-Hsuan, et al.
Published: (2024)
Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers
by: Huang, Tzuhsuan, et al.
Published: (2025)
by: Huang, Tzuhsuan, et al.
Published: (2025)
FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices
by: Han, Shizhong, et al.
Published: (2025)
by: Han, Shizhong, et al.
Published: (2025)
LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices
by: Kühne, Jonas, et al.
Published: (2026)
by: Kühne, Jonas, et al.
Published: (2026)
FireLite: Leveraging Transfer Learning for Efficient Fire Detection in Resource-Constrained Environments
by: Hasan, Mahamudul, et al.
Published: (2024)
by: Hasan, Mahamudul, et al.
Published: (2024)
Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices
by: Shahriar, Tasnim
Published: (2025)
by: Shahriar, Tasnim
Published: (2025)
Monocular Lane Detection Based on Deep Learning: A Survey
by: He, Xin, et al.
Published: (2024)
by: He, Xin, et al.
Published: (2024)
PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos
by: Zhou, Zhiyu, et al.
Published: (2026)
by: Zhou, Zhiyu, et al.
Published: (2026)
$L^2$FMamba: Lightweight Light Field Image Super-Resolution with State Space Model
by: Wei, Zeqiang, et al.
Published: (2025)
by: Wei, Zeqiang, et al.
Published: (2025)
Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
by: Lin, Yung-Hui, et al.
Published: (2024)
by: Lin, Yung-Hui, et al.
Published: (2024)
Survey on Fundamental Deep Learning 3D Reconstruction Techniques
by: Bai, Yonge, et al.
Published: (2024)
by: Bai, Yonge, et al.
Published: (2024)
Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
by: Huang, Chi-Pin, et al.
Published: (2023)
by: Huang, Chi-Pin, et al.
Published: (2023)
Towards Calibrated Deep Clustering Network
by: Jia, Yuheng, et al.
Published: (2024)
by: Jia, Yuheng, et al.
Published: (2024)
CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network
by: Yu, Xiaotong, et al.
Published: (2024)
by: Yu, Xiaotong, et al.
Published: (2024)
Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms
by: Shen, Gefei, et al.
Published: (2025)
by: Shen, Gefei, et al.
Published: (2025)
WildFit: Autonomous In-situ Model Adaptation for Resource-Constrained IoT Systems
by: Rastikerdar, Mohammad Mehdi, et al.
Published: (2024)
by: Rastikerdar, Mohammad Mehdi, et al.
Published: (2024)
SA-MLP: A Low-Power Multiplication-Free Deep Network for 3D Point Cloud Classification in Resource-Constrained Environments
by: Zheng, Qiang, et al.
Published: (2024)
by: Zheng, Qiang, et al.
Published: (2024)
A Lightweight Dual-Branch System for Weakly-Supervised Video Anomaly Detection on Consumer Edge Devices
by: Jiang, Wen-Dong, et al.
Published: (2024)
by: Jiang, Wen-Dong, et al.
Published: (2024)
EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation
by: Zhang, Cheng, et al.
Published: (2025)
by: Zhang, Cheng, et al.
Published: (2025)
Deep Learning for Visual Speech Analysis: A Survey
by: Sheng, Changchong, et al.
Published: (2022)
by: Sheng, Changchong, et al.
Published: (2022)
A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning
by: Cong, Xiaofeng, et al.
Published: (2024)
by: Cong, Xiaofeng, et al.
Published: (2024)
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
by: Shihab, Ibne Farabi, et al.
Published: (2025)
by: Shihab, Ibne Farabi, et al.
Published: (2025)
Similar Items
-
RecipeGen: A Benchmark for Real-World Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025) -
AesCrop: Aesthetic-driven Cropping Guided by Composition
by: Wong, Yen-Hong, et al.
Published: (2025) -
RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation
by: Zhang, Ruoxuan, et al.
Published: (2025) -
Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network
by: Pan, Lu, et al.
Published: (2025) -
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
by: Huang, Yi-Xin, et al.
Published: (2024)