Saved in:
| Main Authors: | Niu, Yanan, Sarkis, Roy, Psaltis, Demetri, Paolone, Mario, Moser, Christophe, Lambertini, Luisa |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.00250 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies
by: Niu, Yanan, et al.
Published: (2025)
by: Niu, Yanan, et al.
Published: (2025)
Optical Diffusion Models for Image Generation
by: Oguz, Ilker, et al.
Published: (2024)
by: Oguz, Ilker, et al.
Published: (2024)
Computational Imaging for Long-Term Prediction of Solar Irradiance
by: Julian, Leron, et al.
Published: (2024)
by: Julian, Leron, et al.
Published: (2024)
Nonlinear Processing with Linear Optics
by: Yildirim, Mustafa, et al.
Published: (2023)
by: Yildirim, Mustafa, et al.
Published: (2023)
Multicasting Optical Reconfigurable Switch
by: Dinc, Niyazi Ulas, et al.
Published: (2024)
by: Dinc, Niyazi Ulas, et al.
Published: (2024)
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
by: Tran, Minh, et al.
Published: (2024)
by: Tran, Minh, et al.
Published: (2024)
Cross-Camera Human Motion Transfer by Time Series Analysis
by: Zhao, Yaping, et al.
Published: (2021)
by: Zhao, Yaping, et al.
Published: (2021)
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
by: Guo, Danfeng, et al.
Published: (2024)
by: Guo, Danfeng, et al.
Published: (2024)
MaxwellNet: Physics-driven deep neural network training based on Maxwell's equations
by: Lim, Joowon, et al.
Published: (2021)
by: Lim, Joowon, et al.
Published: (2021)
CFSum: A Transformer-Based Multi-Modal Video Summarization Framework With Coarse-Fine Fusion
by: Guo, Yaowei, et al.
Published: (2025)
by: Guo, Yaowei, et al.
Published: (2025)
CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers
by: Marmon, Andrew, et al.
Published: (2024)
by: Marmon, Andrew, et al.
Published: (2024)
HybridSolarNet: A Lightweight and Explainable EfficientNet-CBAM Architecture for Real-Time Solar Panel Fault Detection
by: Hossain, Md. Asif, et al.
Published: (2026)
by: Hossain, Md. Asif, et al.
Published: (2026)
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
by: Liao, Kang, et al.
Published: (2025)
by: Liao, Kang, et al.
Published: (2025)
Camera clustering for scalable stream-based active distillation
by: Manjah, Dani, et al.
Published: (2024)
by: Manjah, Dani, et al.
Published: (2024)
Benchmarking CNN and Transformer-Based Object Detectors for UAV Solar Panel Inspection
by: Rodrigo, Ashen, et al.
Published: (2025)
by: Rodrigo, Ashen, et al.
Published: (2025)
Loss Functions for Predictor-based Neural Architecture Search
by: Ji, Han, et al.
Published: (2025)
by: Ji, Han, et al.
Published: (2025)
MCTR: Multi Camera Tracking Transformer
by: Niculescu-Mizil, Alexandru, et al.
Published: (2024)
by: Niculescu-Mizil, Alexandru, et al.
Published: (2024)
Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery
by: Joshi, Deepak, et al.
Published: (2025)
by: Joshi, Deepak, et al.
Published: (2025)
Question Aware Vision Transformer for Multimodal Reasoning
by: Ganz, Roy, et al.
Published: (2024)
by: Ganz, Roy, et al.
Published: (2024)
Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins
by: Oguz, Ilker, et al.
Published: (2025)
by: Oguz, Ilker, et al.
Published: (2025)
PDC-ViT : Source Camera Identification using Pixel Difference Convolution and Vision Transformer
by: Elharrouss, Omar, et al.
Published: (2025)
by: Elharrouss, Omar, et al.
Published: (2025)
EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting
by: Lin, Min-Hui, et al.
Published: (2024)
by: Lin, Min-Hui, et al.
Published: (2024)
Plots Unlock Time-Series Understanding in Multimodal Models
by: Daswani, Mayank, et al.
Published: (2024)
by: Daswani, Mayank, et al.
Published: (2024)
Subwavelength Imaging using a Solid-Immersion Diffractive Optical Processor
by: Hu, Jingtian, et al.
Published: (2024)
by: Hu, Jingtian, et al.
Published: (2024)
SolarFCD: A Large-Scale Dataset and Benchmark for Solar Fault Classification in Photovoltaic Systems
by: Ijaz, Misbah, et al.
Published: (2026)
by: Ijaz, Misbah, et al.
Published: (2026)
On the use of Graphs for Satellite Image Time Series
by: Dufourg, Corentin, et al.
Published: (2025)
by: Dufourg, Corentin, et al.
Published: (2025)
Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups
by: Van Holland, Leif, et al.
Published: (2026)
by: Van Holland, Leif, et al.
Published: (2026)
HGSLoc: 3DGS-based Heuristic Camera Pose Refinement
by: Niu, Zhongyan, et al.
Published: (2024)
by: Niu, Zhongyan, et al.
Published: (2024)
SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time
by: Frolov, Stanislav, et al.
Published: (2024)
by: Frolov, Stanislav, et al.
Published: (2024)
Real Time Offside Detection using a Single Camera in Soccer
by: Desai, Shounak
Published: (2025)
by: Desai, Shounak
Published: (2025)
Facade Segmentation for Solar Photovoltaic Suitability
by: Duran, Ayca, et al.
Published: (2025)
by: Duran, Ayca, et al.
Published: (2025)
TSP-OCS: A Time-Series Prediction for Optimal Camera Selection in Multi-Viewpoint Surgical Video Analysis
by: Liu, Xinyu, et al.
Published: (2025)
by: Liu, Xinyu, et al.
Published: (2025)
SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms
by: Turki, Haithem, et al.
Published: (2025)
by: Turki, Haithem, et al.
Published: (2025)
Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting
by: Zhong, Siru, et al.
Published: (2025)
by: Zhong, Siru, et al.
Published: (2025)
VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation
by: MacDonald, Ezra, et al.
Published: (2024)
by: MacDonald, Ezra, et al.
Published: (2024)
Multimodal Biometric Authentication Using Camera-Based PPG and Fingerprint Fusion
by: Zheng, Xue Xian, et al.
Published: (2024)
by: Zheng, Xue Xian, et al.
Published: (2024)
A Unified Formula for Affine Transformations between Calibrated Cameras
by: Hajder, Levente
Published: (2026)
by: Hajder, Levente
Published: (2026)
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
by: Wang, Yuelei, et al.
Published: (2024)
by: Wang, Yuelei, et al.
Published: (2024)
Spatio-temporal Transformers for Action Unit Classification with Event Cameras
by: Cultrera, Luca, et al.
Published: (2024)
by: Cultrera, Luca, et al.
Published: (2024)
Improving Position Encoding of Transformers for Multivariate Time Series Classification
by: Foumani, Navid Mohammadi, et al.
Published: (2023)
by: Foumani, Navid Mohammadi, et al.
Published: (2023)
Similar Items
-
Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies
by: Niu, Yanan, et al.
Published: (2025) -
Optical Diffusion Models for Image Generation
by: Oguz, Ilker, et al.
Published: (2024) -
Computational Imaging for Long-Term Prediction of Solar Irradiance
by: Julian, Leron, et al.
Published: (2024) -
Nonlinear Processing with Linear Optics
by: Yildirim, Mustafa, et al.
Published: (2023) -
Multicasting Optical Reconfigurable Switch
by: Dinc, Niyazi Ulas, et al.
Published: (2024)