Library Catalog

Bibliographic Details
Main Authors:	Wang, Haicheng; Yu, Zhemeng; Spadaro, Gabriele; Ju, Chen; Quétu, Victor; Xiao, Shuai; Tartaglione, Enzo
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2501.02430

Similar Items

Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024)

Memory-Optimized Once-For-All Network
by: Girard, Maxime, et al.
Published: (2024)

Domain Adaptation for Learned Image Compression with Supervised Adapters
by: Presta, Alberto, et al.
Published: (2024)

RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting
by: Tran, Hoang-Nhat, et al.
Published: (2025)

WiGNet: Windowed Vision Graph Neural Network
by: Spadaro, Gabriele, et al.
Published: (2024)

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
by: Ju, Chen, et al.
Published: (2024)

Nix and Fix: Targeting 1000x Compression of 3D Gaussian Splatting with Diffusion Models
by: Eteke, Cem, et al.
Published: (2026)

GABIC: Graph-based Attention Block for Image Compression
by: Spadaro, Gabriele, et al.
Published: (2024)

DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
by: Quétu, Victor, et al.
Published: (2023)

Denoising Diffusion Probabilistic Model for Point Cloud Compression at Low Bit-Rates
by: Spadaro, Gabriele, et al.
Published: (2025)

Capsule Networks Do Not Need to Model Everything
by: Renzulli, Riccardo, et al.
Published: (2022)

ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting
by: Ali, Muhammad Salman, et al.
Published: (2024)

Enhancing Plasticity for First Session Adaptation Continual Learning
by: Marouf, Imad Eddine, et al.
Published: (2023)

Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering
by: Di Sario, Francesco, et al.
Published: (2024)

Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications
by: Olivi, Leonardo, et al.
Published: (2025)

STanH : Parametric Quantization for Variable Rate Learned Image Compression
by: Presta, Alberto, et al.
Published: (2024)

Unified Generative and Discriminative Training for Multi-modal Large Language Models
by: Chow, Wei, et al.
Published: (2024)

The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
by: Quétu, Victor, et al.
Published: (2024)

Unsupervised Learning of Unbiased Visual Representations
by: Barbano, Carlo Alberto, et al.
Published: (2022)

Contrast-Unity for Partially-Supervised Temporal Sentence Grounding
by: Wang, Haicheng, et al.
Published: (2025)

Efficient Progressive Image Compression with Variance-aware Masking
by: Presta, Alberto, et al.
Published: (2024)

Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
by: Ali, Muhammad Salman, et al.
Published: (2024)

Weighted Ensemble Models Are Strong Continual Learners
by: Marouf, Imad Eddine, et al.
Published: (2023)

POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
by: Wang, Haicheng, et al.
Published: (2026)

DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark
by: Li, Haodong, et al.
Published: (2024)

Efficient Large Multi-modal Models via Visual Context Compression
by: Chen, Jieneng, et al.
Published: (2024)

How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
by: Roos, Nathan, et al.
Published: (2025)

GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression
by: Di Sario, Francesco, et al.
Published: (2025)

Empowering Segmentation Ability to Multi-modal Large Language Models
by: Yang, Yuqi, et al.
Published: (2024)

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
by: Wang, Haicheng, et al.
Published: (2024)

Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering
by: Marouf, Imad Eddine, et al.
Published: (2025)

Efficient Multi-modal Large Language Models via Visual Token Grouping
by: Huang, Minbin, et al.
Published: (2024)

Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards
by: Yang, Xiaoyu, et al.
Published: (2024)

Point Cloud as a Foreign Language for Multi-modal Large Language Model
by: Paul, Sneha, et al.
Published: (2026)

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models
by: Liao, Pan, et al.
Published: (2026)

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
by: Wen, Zichen, et al.
Published: (2025)

Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding
by: Huang, Wencan, et al.
Published: (2025)

LLMRA: Multi-modal Large Language Model based Restoration Assistant
by: Jin, Xiaoyu, et al.
Published: (2024)

Vision-aligned Latent Reasoning for Multi-modal Large Language Model
by: Jeon, Byungwoo, et al.
Published: (2026)

Understanding Information Storage and Transfer in Multi-modal Large Language Models
by: Basu, Samyadeep, et al.
Published: (2024)