:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Saha, Pratim, Zhang, Chengcui
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning Image and Video Processing
Online Access:	https://arxiv.org/abs/2404.04283
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning
by: Saha, Ripon Kumar, et al.
Published: (2024)

Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
by: Nguyen, Tu
Published: (2019)

Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse
by: Ma, Wenzhuo, et al.
Published: (2025)

Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach
by: Mareddy, Sai Krishna Reddy, et al.
Published: (2025)

Video Denoising in Fluorescence Guided Surgery
by: Seets, Trevor, et al.
Published: (2024)

SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)

Towards Controllable Video Synthesis of Routine and Rare OR Events
by: Schneider, Dominik, et al.
Published: (2026)

Insights from Generative Modeling for Neural Video Compression
by: Yang, Ruihan, et al.
Published: (2021)

A Survey of Deep Learning Video Super-Resolution
by: Baniya, Arbind Agrahari, et al.
Published: (2025)

Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning
by: Paul, Somdyuti, et al.
Published: (2022)

Neonatal Face and Facial Landmark Detection from Video Recordings
by: Grooby, Ethan, et al.
Published: (2023)

Nuclear Diffusion Models for Low-Rank Background Suppression in Videos
by: Stevens, Tristan S. W., et al.
Published: (2025)

Unsupervised Multi-Clustering and Decision-Making Strategies for 4D-STEM Orientation Mapping
by: Cao, Junhao, et al.
Published: (2025)

HPC: Hierarchical Progressive Coding Framework for Volumetric Video
by: Zheng, Zihan, et al.
Published: (2024)

Efficient Video-Based ALPR System Using YOLO and Visual Rhythm
by: Ribeiro, Victor Nascimento, et al.
Published: (2025)

Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation
by: Buzovkin, Alexey, et al.
Published: (2025)

Remote Blood Oxygen Estimation From Videos Using Neural Networks
by: Mathew, Joshua, et al.
Published: (2021)

Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models
by: Pang, Wang, et al.
Published: (2025)

HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN
by: Sagar, Abhinav
Published: (2020)

Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet
by: Ishrak, Gazi Hasin, et al.
Published: (2024)

Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation
by: Wang, Zihao, et al.
Published: (2024)

Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy
by: Yu, Zihao, et al.
Published: (2024)

Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields
by: Yilmaz, Rüveyda, et al.
Published: (2024)

FAKER: Full-body Anonymization with Human Keypoint Extraction for Real-time Video Deidentification
by: Ban, Byunghyun, et al.
Published: (2024)

Rethinking Video Super-Resolution: Towards Diffusion-Based Methods without Motion Alignment
by: Zhan, Zhihao, et al.
Published: (2025)

RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
by: Gadot, Uri, et al.
Published: (2025)

STROKEVISION-BENCH: A Multimodal Video And 2D Pose Benchmark For Tracking Stroke Recovery
by: Robinson, David, et al.
Published: (2025)

Deep Video Codec Control for Vision Models
by: Reich, Christoph, et al.
Published: (2023)

Asymmetric GANs for Image-to-Image Translation
by: Tang, Hao, et al.
Published: (2019)

Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation
by: Wang, Song, et al.
Published: (2024)

Exploring Video-Based Driver Activity Recognition under Noisy Labels
by: Fan, Linjuan, et al.
Published: (2025)

Rethinking Perceptual Metrics for Medical Image Translation
by: Konz, Nicholas, et al.
Published: (2024)

MRI Scan Synthesis Methods based on Clustering and Pix2Pix
by: Baldini, Giulia, et al.
Published: (2023)

FineVQ: Fine-Grained User Generated Content Video Quality Assessment
by: Duan, Huiyu, et al.
Published: (2024)

Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos
by: Ramesh, Jayroop, et al.
Published: (2024)

High-Resolution Daytime Translation Without Domain Labels
by: Anokhin, Ivan, et al.
Published: (2020)

ContourDiff: Unpaired Medical Image Translation with Structural Consistency
by: Chen, Yuwen, et al.
Published: (2024)

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs
by: Perevozchikov, Georgy, et al.
Published: (2024)

StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
by: Min, Dongchan, et al.
Published: (2022)

Deterministic Medical Image Translation via High-fidelity Brownian Bridges
by: He, Qisheng, et al.
Published: (2025)