Library Catalog

Bibliographic Details
Main Authors:	Niu, Yanan; Sarkis, Roy; Psaltis, Demetri; Paolone, Mario; Moser, Christophe; Lambertini, Luisa
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.00250

Similar Items

Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies
by: Niu, Yanan, et al.
Published: (2025)

Optical Diffusion Models for Image Generation
by: Oguz, Ilker, et al.
Published: (2024)

Computational Imaging for Long-Term Prediction of Solar Irradiance
by: Julian, Leron, et al.
Published: (2024)

Nonlinear Processing with Linear Optics
by: Yildirim, Mustafa, et al.
Published: (2023)

Multicasting Optical Reconfigurable Switch
by: Dinc, Niyazi Ulas, et al.
Published: (2024)

S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
by: Tran, Minh, et al.
Published: (2024)

Cross-Camera Human Motion Transfer by Time Series Analysis
by: Zhao, Yaping, et al.
Published: (2021)

Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
by: Guo, Danfeng, et al.
Published: (2024)

MaxwellNet: Physics-driven deep neural network training based on Maxwell's equations
by: Lim, Joowon, et al.
Published: (2021)

CFSum: A Transformer-Based Multi-Modal Video Summarization Framework With Coarse-Fine Fusion
by: Guo, Yaowei, et al.
Published: (2025)

CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers
by: Marmon, Andrew, et al.
Published: (2024)

HybridSolarNet: A Lightweight and Explainable EfficientNet-CBAM Architecture for Real-Time Solar Panel Fault Detection
by: Hossain, Md. Asif, et al.
Published: (2026)

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
by: Liao, Kang, et al.
Published: (2025)

Camera clustering for scalable stream-based active distillation
by: Manjah, Dani, et al.
Published: (2024)

Benchmarking CNN and Transformer-Based Object Detectors for UAV Solar Panel Inspection
by: Rodrigo, Ashen, et al.
Published: (2025)

Loss Functions for Predictor-based Neural Architecture Search
by: Ji, Han, et al.
Published: (2025)

MCTR: Multi Camera Tracking Transformer
by: Niculescu-Mizil, Alexandru, et al.
Published: (2024)

Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery
by: Joshi, Deepak, et al.
Published: (2025)

Question Aware Vision Transformer for Multimodal Reasoning
by: Ganz, Roy, et al.
Published: (2024)

Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins
by: Oguz, Ilker, et al.
Published: (2025)

PDC-ViT : Source Camera Identification using Pixel Difference Convolution and Vision Transformer
by: Elharrouss, Omar, et al.
Published: (2025)

EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting
by: Lin, Min-Hui, et al.
Published: (2024)

Plots Unlock Time-Series Understanding in Multimodal Models
by: Daswani, Mayank, et al.
Published: (2024)

Subwavelength Imaging using a Solid-Immersion Diffractive Optical Processor
by: Hu, Jingtian, et al.
Published: (2024)

SolarFCD: A Large-Scale Dataset and Benchmark for Solar Fault Classification in Photovoltaic Systems
by: Ijaz, Misbah, et al.
Published: (2026)

On the use of Graphs for Satellite Image Time Series
by: Dufourg, Corentin, et al.
Published: (2025)

Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups
by: Van Holland, Leif, et al.
Published: (2026)

HGSLoc: 3DGS-based Heuristic Camera Pose Refinement
by: Niu, Zhongyan, et al.
Published: (2024)

SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time
by: Frolov, Stanislav, et al.
Published: (2024)

Real Time Offside Detection using a Single Camera in Soccer
by: Desai, Shounak
Published: (2025)

Facade Segmentation for Solar Photovoltaic Suitability
by: Duran, Ayca, et al.
Published: (2025)

TSP-OCS: A Time-Series Prediction for Optimal Camera Selection in Multi-Viewpoint Surgical Video Analysis
by: Liu, Xinyu, et al.
Published: (2025)

SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms
by: Turki, Haithem, et al.
Published: (2025)

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting
by: Zhong, Siru, et al.
Published: (2025)

VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation
by: MacDonald, Ezra, et al.
Published: (2024)

Multimodal Biometric Authentication Using Camera-Based PPG and Fingerprint Fusion
by: Zheng, Xue Xian, et al.
Published: (2024)

A Unified Formula for Affine Transformations between Calibrated Cameras
by: Hajder, Levente
Published: (2026)

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
by: Wang, Yuelei, et al.
Published: (2024)

Spatio-temporal Transformers for Action Unit Classification with Event Cameras
by: Cultrera, Luca, et al.
Published: (2024)

Improving Position Encoding of Transformers for Multivariate Time Series Classification
by: Foumani, Navid Mohammadi, et al.
Published: (2023)