:: Library Catalog

Bibliographic Details
Main Authors:	Nazir, Danish; Hanna-Asaad, Antoine; Görnhardt, Lucas; Piewek, Jan; Bagdonat, Thorsten; Fingscheidt, Tim
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.13586

Similar Items

Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
by: Nazir, Danish, et al.
Published: (2024)

An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications
by: Nazir, Danish, et al.
Published: (2025)

A Lightweight Image Super-Resolution Transformer Trained on Low-Resolution Images Only
by: Möller, Björn, et al.
Published: (2025)

Memory-Efficient Fine-Tuning of Transformers via Token Selection
by: Simoulin, Antoine, et al.
Published: (2025)

Multi-Modal interpretable automatic video captioning
by: Hanna-Asaad, Antoine, et al.
Published: (2024)

FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering
by: Zhong, Liangyu, et al.
Published: (2025)

Explainable Knowledge Distillation for Efficient Medical Image Classification
by: Mir, Aqib Nazir, et al.
Published: (2025)

Efficient High-Performance Bark-Scale Neural Network for Residual Echo and Noise Suppression
by: Seidel, Ernst, et al.
Published: (2024)

DisContSE: Single-Step Diffusion Speech Enhancement Based on Joint Discrete and Continuous Embeddings
by: Fu, Yihui, et al.
Published: (2026)

SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
by: Papais, Sandro, et al.
Published: (2026)

OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data
by: Möller, Björn, et al.
Published: (2025)

EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
by: Lee, Yongjin, et al.
Published: (2024)

Taming CATS: Controllable Automatic Text Simplification through Instruction Fine-Tuning with Control Tokens
by: Hubarava, Hanna, et al.
Published: (2026)

Multi-View Attentive Contextualization for Multi-View 3D Object Detection
by: Liu, Xianpeng, et al.
Published: (2024)

Engineering of Hallucination in Generative AI: It's not a Bug, it's a Feature
by: Fingscheidt, Tim, et al.
Published: (2026)

Improving Block-Wise LLM Quantization by 4-bit Block-Wise Optimal Float (BOF4): Analysis and Variations
by: Blumenberg, Patrick, et al.
Published: (2025)

MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation
by: Shah, Ovais Iqbal, et al.
Published: (2024)

A Scalable Multi-Task Model for Virtual Sensors
by: Götz, Leon, et al.
Published: (2026)

MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
by: Xu, Yating, et al.
Published: (2024)

Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
by: Pang, Jinlong, et al.
Published: (2025)

Software Process Modeled With Objects: Static View
by: Oktaba, Hanna
Published: (1998)

DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
by: Wang, Jiasen, et al.
Published: (2024)

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection
by: Zhang, Qi, et al.
Published: (2024)

PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving
by: Rezaei, Abdolazim, et al.
Published: (2025)

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
by: Elamon, Nirmal, et al.
Published: (2025)

Object Instance Retrieval in Assistive Robotics: Leveraging Fine-Tuned SimSiam with Multi-View Images Based on 3D Semantic Map
by: Sakaguchi, Taichi, et al.
Published: (2024)

Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
by: Ji, Mingqian, et al.
Published: (2026)

Foundation Models for Amodal Video Instance Segmentation in Automated Driving
by: Breitenstein, Jasmin, et al.
Published: (2024)

Neural Kalman Filters for Acoustic Echo Cancellation
by: Seidel, Ernst, et al.
Published: (2025)

A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning
by: Chen, Hao, et al.
Published: (2025)

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching
by: Zeng, Runjia, et al.
Published: (2026)

Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting
by: Barad, Kuldeep R, et al.
Published: (2024)

Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection
by: Nozarian, Farzad, et al.
Published: (2024)

Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
by: Chang, Gyusam, et al.
Published: (2024)

Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning
by: Ghanizadeh, Mohammad Amin, et al.
Published: (2025)

SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery
by: Xu, Jialang, et al.
Published: (2024)

Parkinson's Disease Diagnosis Through Deep Learning: A Novel LSTM-Based Approach for Freezing of Gait Detection
by: Mir, Aqib Nazir, et al.
Published: (2024)

Chirpy3D: Part-Aware Multi-View Diffusion for Creative Fine-Grained Object Generation
by: Ng, Kam Woh, et al.
Published: (2025)

SemAttNet: Towards Attention-based Semantic Aware Guided Depth Completion
by: Nazir, Danish, et al.
Published: (2022)

GateRA: Token-Aware Modulation for Parameter-Efficient Fine-Tuning
by: Ou, Jie, et al.
Published: (2025)