| Main Authors: | Saha, Pratim, Zhang, Chengcui |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2404.04283 |
Similar Items
Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning
by: Saha, Ripon Kumar, et al.
Published: (2024)
Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
by: Nguyen, Tu
Published: (2019)
Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse
by: Ma, Wenzhuo, et al.
Published: (2025)
Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach
by: Mareddy, Sai Krishna Reddy, et al.
Published: (2025)
Video Denoising in Fluorescence Guided Surgery
by: Seets, Trevor, et al.
Published: (2024)
SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)
Towards Controllable Video Synthesis of Routine and Rare OR Events
by: Schneider, Dominik, et al.
Published: (2026)
Insights from Generative Modeling for Neural Video Compression
by: Yang, Ruihan, et al.
Published: (2021)
A Survey of Deep Learning Video Super-Resolution
by: Baniya, Arbind Agrahari, et al.
Published: (2025)
Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning
by: Paul, Somdyuti, et al.
Published: (2022)
Neonatal Face and Facial Landmark Detection from Video Recordings
by: Grooby, Ethan, et al.
Published: (2023)
Nuclear Diffusion Models for Low-Rank Background Suppression in Videos
by: Stevens, Tristan S. W., et al.
Published: (2025)
Unsupervised Multi-Clustering and Decision-Making Strategies for 4D-STEM Orientation Mapping
by: Cao, Junhao, et al.
Published: (2025)
HPC: Hierarchical Progressive Coding Framework for Volumetric Video
by: Zheng, Zihan, et al.
Published: (2024)
Efficient Video-Based ALPR System Using YOLO and Visual Rhythm
by: Ribeiro, Victor Nascimento, et al.
Published: (2025)
Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation
by: Buzovkin, Alexey, et al.
Published: (2025)
Remote Blood Oxygen Estimation From Videos Using Neural Networks
by: Mathew, Joshua, et al.
Published: (2021)
Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models
by: Pang, Wang, et al.
Published: (2025)
HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN
by: Sagar, Abhinav
Published: (2020)
Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet
by: Ishrak, Gazi Hasin, et al.
Published: (2024)
Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation
by: Wang, Zihao, et al.
Published: (2024)
Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy
by: Yu, Zihao, et al.
Published: (2024)
Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields
by: Yilmaz, Rüveyda, et al.
Published: (2024)
FAKER: Full-body Anonymization with Human Keypoint Extraction for Real-time Video Deidentification
by: Ban, Byunghyun, et al.
Published: (2024)
Rethinking Video Super-Resolution: Towards Diffusion-Based Methods without Motion Alignment
by: Zhan, Zhihao, et al.
Published: (2025)
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
by: Gadot, Uri, et al.
Published: (2025)
STROKEVISION-BENCH: A Multimodal Video And 2D Pose Benchmark For Tracking Stroke Recovery
by: Robinson, David, et al.
Published: (2025)
Deep Video Codec Control for Vision Models
by: Reich, Christoph, et al.
Published: (2023)
Asymmetric GANs for Image-to-Image Translation
by: Tang, Hao, et al.
Published: (2019)
Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation
by: Wang, Song, et al.
Published: (2024)
Exploring Video-Based Driver Activity Recognition under Noisy Labels
by: Fan, Linjuan, et al.
Published: (2025)
Rethinking Perceptual Metrics for Medical Image Translation
by: Konz, Nicholas, et al.
Published: (2024)
MRI Scan Synthesis Methods based on Clustering and Pix2Pix
by: Baldini, Giulia, et al.
Published: (2023)
FineVQ: Fine-Grained User Generated Content Video Quality Assessment
by: Duan, Huiyu, et al.
Published: (2024)
Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos
by: Ramesh, Jayroop, et al.
Published: (2024)
High-Resolution Daytime Translation Without Domain Labels
by: Anokhin, Ivan, et al.
Published: (2020)
ContourDiff: Unpaired Medical Image Translation with Structural Consistency
by: Chen, Yuwen, et al.
Published: (2024)
Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs
by: Perevozchikov, Georgy, et al.
Published: (2024)
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
by: Min, Dongchan, et al.
Published: (2022)
Deterministic Medical Image Translation via High-fidelity Brownian Bridges
by: He, Qisheng, et al.
Published: (2025)