:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Aitrouga, Abdelilah, Hmamouche, Youssef, Seghrouchni, Amal El Fallah
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computer Vision and Pattern Recognition Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2509.25998
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Strategic Deflection: Defending LLMs from Logit Manipulation
von: Rachidy, Yassine, et al.
Veröffentlicht: (2025)

A multimodal LLM for the non-invasive decoding of spoken text from brain recordings
von: Hmamouche, Youssef, et al.
Veröffentlicht: (2024)

A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images
von: Li, Xin, et al.
Veröffentlicht: (2025)

DiTraj: training-free trajectory control for video diffusion transformer
von: Lei, Cheng, et al.
Veröffentlicht: (2025)

Two-Stream temporal transformer for video action classification
von: Kurpukdee, Nattapong, et al.
Veröffentlicht: (2026)

Transfer Learning-based Real-time Handgun Detection
von: Elmir, Youssef
Veröffentlicht: (2023)

EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition
von: Doulfoukar, Youssef, et al.
Veröffentlicht: (2024)

Reducing self-supervised learning complexity improves weakly-supervised classification performance in computational pathology
von: Lenz, Tim, et al.
Veröffentlicht: (2024)

FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything
von: Ghazouali, Safouane El, et al.
Veröffentlicht: (2024)

D-STGCNT: A Dense Spatio-Temporal Graph Conv-GRU Network based on transformer for assessment of patient physical rehabilitation
von: Mourchid, Youssef, et al.
Veröffentlicht: (2023)

Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation
von: Berka, Anas, et al.
Veröffentlicht: (2025)

SPDGAN: A Generative Adversarial Network based on SPD Manifold Learning for Automatic Image Colorization
von: Mourchid, Youssef, et al.
Veröffentlicht: (2023)

Classification of systolic murmurs in heart sounds using multiresolution complex Gabor dictionary and vision transformer
von: Fakhry, Mahmoud, et al.
Veröffentlicht: (2026)

Behavioral Cloning Models Reality Check for Autonomous Driving
von: Yildirim, Mustafa, et al.
Veröffentlicht: (2024)

Birds of a Feather Flock Together: Background-Invariant Representations via Linear Structure in VLMs
von: Zaazou, Youssef, et al.
Veröffentlicht: (2026)

Normalization Equivariance for Arbitrary Backbones, with Application to Image Denoising
von: Saied, Youssef, et al.
Veröffentlicht: (2026)

Eating Smart: Advancing Health Informatics with the Grounding DINO based Dietary Assistant App
von: Nossair, Abdelilah, et al.
Veröffentlicht: (2024)

Redefining cystoscopy with ai: bladder cancer diagnosis using an efficient hybrid cnn-transformer model
von: Amaouche, Meryem, et al.
Veröffentlicht: (2024)

Unified Attention Modeling for Efficient Free-Viewing and Visual Search via Shared Representations
von: Mohammed, Fatma Youssef, et al.
Veröffentlicht: (2025)

CFE-PPAR: Compression-friendly encryption for privacy-preserving action recognition leveraging video transformers
von: Lin, Haiwei, et al.
Veröffentlicht: (2026)

Can video generation replace cinematographers? Research on the cinematic language of generated video
von: Li, Xiaozhe, et al.
Veröffentlicht: (2024)

Breast tumor classification based on self-supervised contrastive learning from ultrasound videos
von: Tang, Yunxin, et al.
Veröffentlicht: (2024)

Improving Pain Classification using Spatio-Temporal Deep Learning Approaches with Facial Expressions
von: Ridouan, Aafaf, et al.
Veröffentlicht: (2025)

Rethinking Deep Clustering Paradigms: Self-Supervision Is All You Need
von: Shaheena, Amal, et al.
Veröffentlicht: (2025)

Flow caching for autoregressive video generation
von: Ma, Yuexiao, et al.
Veröffentlicht: (2026)

Enhancing Deep Learning Model Robustness through Metamorphic Re-Training
von: Togru, Said, et al.
Veröffentlicht: (2024)

From Editor to Dense Geometry Estimator
von: Wang, JiYuan, et al.
Veröffentlicht: (2025)

Multi-Modal interpretable automatic video captioning
von: Hanna-Asaad, Antoine, et al.
Veröffentlicht: (2024)

EAGLE: Egocentric AGgregated Language-video Engine
von: Bi, Jing, et al.
Veröffentlicht: (2024)

A multi-purpose automatic editing system based on lecture semantics for remote education
von: Hu, Panwen, et al.
Veröffentlicht: (2024)

DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET
von: Li, Yitong, et al.
Veröffentlicht: (2024)

RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation
von: You, Siyuan, et al.
Veröffentlicht: (2025)

Neural USD: An object-centric framework for iterative editing and control
von: Escontrela, Alejandro, et al.
Veröffentlicht: (2025)

Generative deep learning for foundational video translation in ultrasound
von: Tomic, Nikolina, et al.
Veröffentlicht: (2025)

Hallucination-aware intermediate representation edit in large vision-language models
von: Suo, Wei, et al.
Veröffentlicht: (2026)

Auto-regressive transformation for image alignment
von: Lee, Kanggeon, et al.
Veröffentlicht: (2025)

video-SALMONN S: Memory-Enhanced Streaming Audio-Visual LLM
von: Sun, Guangzhi, et al.
Veröffentlicht: (2025)

Study of detecting behavioral signatures within DeepFake videos
von: Miao, Qiaomu, et al.
Veröffentlicht: (2022)

FFA Sora, video generation as fundus fluorescein angiography simulator
von: Wu, Xinyuan, et al.
Veröffentlicht: (2024)

A multi-scale vision transformer-based multimodal GeoAI model for mapping Arctic permafrost thaw
von: Li, Wenwen, et al.
Veröffentlicht: (2025)