Saved in:
| Main Authors: | Wang, Haicheng, Yu, Zhemeng, Spadaro, Gabriele, Ju, Chen, Quétu, Victor, Xiao, Shuai, Tartaglione, Enzo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.02430 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024)
by: Liao, Zhu, et al.
Published: (2024)
Memory-Optimized Once-For-All Network
by: Girard, Maxime, et al.
Published: (2024)
by: Girard, Maxime, et al.
Published: (2024)
Domain Adaptation for Learned Image Compression with Supervised Adapters
by: Presta, Alberto, et al.
Published: (2024)
by: Presta, Alberto, et al.
Published: (2024)
RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting
by: Tran, Hoang-Nhat, et al.
Published: (2025)
by: Tran, Hoang-Nhat, et al.
Published: (2025)
WiGNet: Windowed Vision Graph Neural Network
by: Spadaro, Gabriele, et al.
Published: (2024)
by: Spadaro, Gabriele, et al.
Published: (2024)
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
by: Ju, Chen, et al.
Published: (2024)
by: Ju, Chen, et al.
Published: (2024)
Nix and Fix: Targeting 1000x Compression of 3D Gaussian Splatting with Diffusion Models
by: Eteke, Cem, et al.
Published: (2026)
by: Eteke, Cem, et al.
Published: (2026)
GABIC: Graph-based Attention Block for Image Compression
by: Spadaro, Gabriele, et al.
Published: (2024)
by: Spadaro, Gabriele, et al.
Published: (2024)
DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
by: Quétu, Victor, et al.
Published: (2023)
by: Quétu, Victor, et al.
Published: (2023)
Denoising Diffusion Probabilistic Model for Point Cloud Compression at Low Bit-Rates
by: Spadaro, Gabriele, et al.
Published: (2025)
by: Spadaro, Gabriele, et al.
Published: (2025)
Capsule Networks Do Not Need to Model Everything
by: Renzulli, Riccardo, et al.
Published: (2022)
by: Renzulli, Riccardo, et al.
Published: (2022)
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting
by: Ali, Muhammad Salman, et al.
Published: (2024)
by: Ali, Muhammad Salman, et al.
Published: (2024)
Enhancing Plasticity for First Session Adaptation Continual Learning
by: Marouf, Imad Eddine, et al.
Published: (2023)
by: Marouf, Imad Eddine, et al.
Published: (2023)
Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering
by: Di Sario, Francesco, et al.
Published: (2024)
by: Di Sario, Francesco, et al.
Published: (2024)
Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications
by: Olivi, Leonardo, et al.
Published: (2025)
by: Olivi, Leonardo, et al.
Published: (2025)
STanH : Parametric Quantization for Variable Rate Learned Image Compression
by: Presta, Alberto, et al.
Published: (2024)
by: Presta, Alberto, et al.
Published: (2024)
Unified Generative and Discriminative Training for Multi-modal Large Language Models
by: Chow, Wei, et al.
Published: (2024)
by: Chow, Wei, et al.
Published: (2024)
The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
by: Quétu, Victor, et al.
Published: (2024)
by: Quétu, Victor, et al.
Published: (2024)
Unsupervised Learning of Unbiased Visual Representations
by: Barbano, Carlo Alberto, et al.
Published: (2022)
by: Barbano, Carlo Alberto, et al.
Published: (2022)
Contrast-Unity for Partially-Supervised Temporal Sentence Grounding
by: Wang, Haicheng, et al.
Published: (2025)
by: Wang, Haicheng, et al.
Published: (2025)
Efficient Progressive Image Compression with Variance-aware Masking
by: Presta, Alberto, et al.
Published: (2024)
by: Presta, Alberto, et al.
Published: (2024)
Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
by: Ali, Muhammad Salman, et al.
Published: (2024)
by: Ali, Muhammad Salman, et al.
Published: (2024)
Weighted Ensemble Models Are Strong Continual Learners
by: Marouf, Imad Eddine, et al.
Published: (2023)
by: Marouf, Imad Eddine, et al.
Published: (2023)
POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
by: Wang, Haicheng, et al.
Published: (2026)
by: Wang, Haicheng, et al.
Published: (2026)
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark
by: Li, Haodong, et al.
Published: (2024)
by: Li, Haodong, et al.
Published: (2024)
Efficient Large Multi-modal Models via Visual Context Compression
by: Chen, Jieneng, et al.
Published: (2024)
by: Chen, Jieneng, et al.
Published: (2024)
How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
by: Roos, Nathan, et al.
Published: (2025)
by: Roos, Nathan, et al.
Published: (2025)
GoDe: Gaussians on Demand for Progressive Level of Detail and Scalable Compression
by: Di Sario, Francesco, et al.
Published: (2025)
by: Di Sario, Francesco, et al.
Published: (2025)
Empowering Segmentation Ability to Multi-modal Large Language Models
by: Yang, Yuqi, et al.
Published: (2024)
by: Yang, Yuqi, et al.
Published: (2024)
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
by: Wang, Haicheng, et al.
Published: (2024)
by: Wang, Haicheng, et al.
Published: (2024)
Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering
by: Marouf, Imad Eddine, et al.
Published: (2025)
by: Marouf, Imad Eddine, et al.
Published: (2025)
Efficient Multi-modal Large Language Models via Visual Token Grouping
by: Huang, Minbin, et al.
Published: (2024)
by: Huang, Minbin, et al.
Published: (2024)
Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards
by: Yang, Xiaoyu, et al.
Published: (2024)
by: Yang, Xiaoyu, et al.
Published: (2024)
Point Cloud as a Foreign Language for Multi-modal Large Language Model
by: Paul, Sneha, et al.
Published: (2026)
by: Paul, Sneha, et al.
Published: (2026)
LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models
by: Liao, Pan, et al.
Published: (2026)
by: Liao, Pan, et al.
Published: (2026)
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
by: Wen, Zichen, et al.
Published: (2025)
by: Wen, Zichen, et al.
Published: (2025)
Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding
by: Huang, Wencan, et al.
Published: (2025)
by: Huang, Wencan, et al.
Published: (2025)
LLMRA: Multi-modal Large Language Model based Restoration Assistant
by: Jin, Xiaoyu, et al.
Published: (2024)
by: Jin, Xiaoyu, et al.
Published: (2024)
Vision-aligned Latent Reasoning for Multi-modal Large Language Model
by: Jeon, Byungwoo, et al.
Published: (2026)
by: Jeon, Byungwoo, et al.
Published: (2026)
Understanding Information Storage and Transfer in Multi-modal Large Language Models
by: Basu, Samyadeep, et al.
Published: (2024)
by: Basu, Samyadeep, et al.
Published: (2024)
Similar Items
-
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers
by: Liao, Zhu, et al.
Published: (2024) -
Memory-Optimized Once-For-All Network
by: Girard, Maxime, et al.
Published: (2024) -
Domain Adaptation for Learned Image Compression with Supervised Adapters
by: Presta, Alberto, et al.
Published: (2024) -
RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting
by: Tran, Hoang-Nhat, et al.
Published: (2025) -
WiGNet: Windowed Vision Graph Neural Network
by: Spadaro, Gabriele, et al.
Published: (2024)