| Main Authors: | Nazir, Danish, Hanna-Asaad, Antoine, Görnhardt, Lucas, Piewek, Jan, Bagdonat, Thorsten, Fingscheidt, Tim |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.13586 |
Similar Items
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
by: Nazir, Danish, et al.
Published: (2024)
An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications
by: Nazir, Danish, et al.
Published: (2025)
A Lightweight Image Super-Resolution Transformer Trained on Low-Resolution Images Only
by: Möller, Björn, et al.
Published: (2025)
Memory-Efficient Fine-Tuning of Transformers via Token Selection
by: Simoulin, Antoine, et al.
Published: (2025)
Multi-Modal interpretable automatic video captioning
by: Hanna-Asaad, Antoine, et al.
Published: (2024)
FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering
by: Zhong, Liangyu, et al.
Published: (2025)
Explainable Knowledge Distillation for Efficient Medical Image Classification
by: Mir, Aqib Nazir, et al.
Published: (2025)
Efficient High-Performance Bark-Scale Neural Network for Residual Echo and Noise Suppression
by: Seidel, Ernst, et al.
Published: (2024)
DisContSE: Single-Step Diffusion Speech Enhancement Based on Joint Discrete and Continuous Embeddings
by: Fu, Yihui, et al.
Published: (2026)
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
by: Papais, Sandro, et al.
Published: (2026)
OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data
by: Möller, Björn, et al.
Published: (2025)
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
by: Lee, Yongjin, et al.
Published: (2024)
Taming CATS: Controllable Automatic Text Simplification through Instruction Fine-Tuning with Control Tokens
by: Hubarava, Hanna, et al.
Published: (2026)
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
by: Liu, Xianpeng, et al.
Published: (2024)
Engineering of Hallucination in Generative AI: It's not a Bug, it's a Feature
by: Fingscheidt, Tim, et al.
Published: (2026)
Improving Block-Wise LLM Quantization by 4-bit Block-Wise Optimal Float (BOF4): Analysis and Variations
by: Blumenberg, Patrick, et al.
Published: (2025)
MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation
by: Shah, Ovais Iqbal, et al.
Published: (2024)
A Scalable Multi-Task Model for Virtual Sensors
by: Götz, Leon, et al.
Published: (2026)
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
by: Xu, Yating, et al.
Published: (2024)
Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
by: Pang, Jinlong, et al.
Published: (2025)
Software Process Modeled With Objects: Static View
by: Hanna Oktaba
Published: (1998)
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
by: Wang, Jiasen, et al.
Published: (2024)
View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection
by: Zhang, Qi, et al.
Published: (2024)
PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving
by: Rezaei, Abdolazim, et al.
Published: (2025)
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
by: Elamon, Nirmal, et al.
Published: (2025)
Object Instance Retrieval in Assistive Robotics: Leveraging Fine-Tuned SimSiam with Multi-View Images Based on 3D Semantic Map
by: Sakaguchi, Taichi, et al.
Published: (2024)
Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
by: Ji, Mingqian, et al.
Published: (2026)
Foundation Models for Amodal Video Instance Segmentation in Automated Driving
by: Breitenstein, Jasmin, et al.
Published: (2024)
Neural Kalman Filters for Acoustic Echo Cancellation
by: Seidel, Ernst, et al.
Published: (2025)
A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning
by: Chen, Hao, et al.
Published: (2025)
TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching
by: Zeng, Runjia, et al.
Published: (2026)
Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting
by: Barad, Kuldeep R, et al.
Published: (2024)
Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection
by: Nozarian, Farzad, et al.
Published: (2024)
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
by: Chang, Gyusam, et al.
Published: (2024)
Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning
by: Ghanizadeh, Mohammad Amin, et al.
Published: (2025)
SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery
by: Xu, Jialang, et al.
Published: (2024)
Parkinson's Disease Diagnosis Through Deep Learning: A Novel LSTM-Based Approach for Freezing of Gait Detection
by: Mir, Aqib Nazir, et al.
Published: (2024)
Chirpy3D: Part-Aware Multi-View Diffusion for Creative Fine-Grained Object Generation
by: Ng, Kam Woh, et al.
Published: (2025)
SemAttNet: Towards Attention-based Semantic Aware Guided Depth Completion
by: Nazir, Danish, et al.
Published: (2022)
GateRA: Token-Aware Modulation for Parameter-Efficient Fine-Tuning
by: Ou, Jie, et al.
Published: (2025)