| Main Authors: | Sun, Jilei, Wu, Dianhong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.23655 |
Similar Items
Tilewise Domain-Separated Selective Encryption for Remote Sensing Imagery under Chosen-Plaintext Attacks
by: Sun, Jilei, et al.
Published: (2026)
Security Analysis of Thumbnail-Preserving Image Encryption and a New Framework
by: Xie, Dong, et al.
Published: (2025)
QMedShield: A Novel Quantum Chaos-based Image Encryption Scheme for Secure Medical Image Storage in the Cloud
by: Rajan, Arun Amaithi, et al.
Published: (2024)
A Visual Perception-Based Tunable Framework and Evaluation Benchmark for H.265/HEVC ROI Encryption
by: Zhang, Xiang, et al.
Published: (2025)
CDI-DTI: A Strong Cross-domain Interpretable Drug-Target Interaction Prediction Framework Based on Multi-Strategy Fusion
by: Li, Xiangyu, et al.
Published: (2025)
Music2Palette: Emotion-aligned Color Palette Generation via Cross-Modal Representation Learning
by: Hu, Jiayun, et al.
Published: (2025)
A 3D Framework for Improving Low-Latency Multi-Channel Live Streaming
by: Aiersilan, Aizierjiang, et al.
Published: (2024)
Realistic Virtual Flood Experience System Using 360° Videos and 3D City Models Constructed from Building Footprints
by: Banno, Tatsuro, et al.
Published: (2026)
Efficient and Accurate Image Provenance Analysis: A Scalable Pipeline for Large-scale Images
by: Lai, Jiewei, et al.
Published: (2025)
Structure-Aware Residual-Center Representation for Self-Supervised Open-Set 3D Cross-Modal Retrieval
by: Xu, Yang, et al.
Published: (2024)
MultiColor: Image Colorization by Learning from Multiple Color Spaces
by: Du, Xiangcheng, et al.
Published: (2024)
Magic3DSketch: Create Colorful 3D Models From Sketch-Based 3D Modeling Guided by Text and Language-Image Pre-Training
by: Zang, Ying, et al.
Published: (2024)
GeoLink: A 3D-Aware Framework Towards Better Generalization in Cross-View Geo-Localization
by: Zhang, Hongyang, et al.
Published: (2026)
Generating Digital Models Using Text-to-3D and Image-to-3D Prompts: Critical Case Study
by: Ziatdinov, Rushan, et al.
Published: (2025)
An Emotion Recognition Framework via Cross-modal Alignment of EEG and Eye Movement Data
by: Wang, Jianlu, et al.
Published: (2025)
AVID: A Benchmark for Omni-Modal Audio-Visual Inconsistency Understanding via Agent-Driven Construction
by: Chen, Zixuan, et al.
Published: (2026)
Ges-QA: A Multidimensional Quality Assessment Dataset for Audio-to-3D Gesture Generation
by: Gao, Zhilin, et al.
Published: (2025)
Bridging the Pose-Semantic Gap: A Cascade Framework for Text-Based Person Anomaly Search
by: Xie, Zequn, et al.
Published: (2026)
MG-Former: A Transformer-Based Framework for Music-Driven 3D Conducting Gesture Generation
by: Qiu, Ke, et al.
Published: (2026)
Video Quality Assessment for Resolution Cross-Over in Live Sports
by: Zhu, Jingwen, et al.
Published: (2025)
AsCL: An Asymmetry-sensitive Contrastive Learning Method for Image-Text Retrieval with Cross-Modal Fusion
by: Gong, Ziyu, et al.
Published: (2024)
The Sketchfab 3D Creative Commons Collection (S3D3C)
by: Spiess, Florian, et al.
Published: (2024)
Cross-Space Synergy: A Unified Framework for Multimodal Emotion Recognition in Conversation
by: Lyu, Xiaosen, et al.
Published: (2025)
FoodLogAthl-218: Constructing a Real-World Food Image Dataset Using Dietary Management Applications
by: Watanabe, Mitsuki, et al.
Published: (2025)
DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations
by: Song, Qiya, et al.
Published: (2025)
Design of 3D Environment Combining Digital Image Processing Technology and Convolutional Neural Network
by: Xiaofei Lu, et al.
Published: (2024)
Intelligent Carrier Allocation: A Cross-Modal Reasoning Framework for Adaptive Multimodal Steganography
by: Das, Abhirup, et al.
Published: (2025)
GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation
by: Yang, Quanwei, et al.
Published: (2025)
3D Gaussian Editing with A Single Image
by: Luo, Guan, et al.
Published: (2024)
MCPNS: A Macropixel Collocated Position and Its Neighbors Search for Plenoptic 2.0 Video Coding
by: Van Duong, Vinh, et al.
Published: (2023)
CEM-Net: Cross-Emotion Memory Network for Emotional Talking Face Generation
by: Wu, Kangyi, et al.
Published: (2025)
Smart Fitting Room: A One-stop Framework for Matching-aware Virtual Try-on
by: Yu, Mingzhe, et al.
Published: (2024)
SPP-SCL: Semi-Push-Pull Supervised Contrastive Learning for Image-Text Sentiment Analysis and Beyond
by: Wu, Jiesheng, et al.
Published: (2026)
Cross-Platform Neural Video Coding: A Case Study
by: Conceição, Ruhan, et al.
Published: (2024)
EEmo-Bench: A Benchmark for Multi-modal Large Language Models on Image Evoked Emotion Assessment
by: Gao, Lancheng, et al.
Published: (2025)
Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology
by: Mohammadi, Shima, et al.
Published: (2024)
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
by: Wang, Zixuan, et al.
Published: (2024)
StreamOptix: A Cross-layer Adaptive Video Delivery Scheme
by: Liu, Mufan, et al.
Published: (2024)
Image Referenced Sketch Colorization Based on Animation Creation Workflow
by: Yan, Dingkun, et al.
Published: (2025)
A H.265/HEVC Fine-Grained ROI Video Encryption Algorithm Based on Coding Unit and Prompt Segmentation
by: Zhang, Xiang, et al.
Published: (2026)