Saved in:
| Main Authors: | Ai, Mengting, Wei, Tianxin, Chen, Sirui, He, Jingrui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14230 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MLP Fusion: Towards Efficient Fine-tuning of Dense and Mixture-of-Experts Language Models
by: Ai, Mengting, et al.
Published: (2023)
by: Ai, Mengting, et al.
Published: (2023)
Heterogeneous Scientific Foundation Model Collaboration
by: Li, Zihao, et al.
Published: (2026)
by: Li, Zihao, et al.
Published: (2026)
Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning
by: Chen, Sirui, et al.
Published: (2026)
by: Chen, Sirui, et al.
Published: (2026)
Graph4MM: Weaving Multimodal Learning with Structural Information
by: Ning, Xuying, et al.
Published: (2025)
by: Ning, Xuying, et al.
Published: (2025)
Panda: Test-Time Adaptation with Negative Data Augmentation
by: Deng, Ruxi, et al.
Published: (2025)
by: Deng, Ruxi, et al.
Published: (2025)
AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning
by: Zou, Jiaru, et al.
Published: (2025)
by: Zou, Jiaru, et al.
Published: (2025)
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration
by: Ai, Mengting, et al.
Published: (2025)
by: Ai, Mengting, et al.
Published: (2025)
APEX$^2$: Adaptive and Extreme Summarization for Personalized Knowledge Graphs
by: Li, Zihao, et al.
Published: (2024)
by: Li, Zihao, et al.
Published: (2024)
Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization
by: Wei, Tianxin, et al.
Published: (2025)
by: Wei, Tianxin, et al.
Published: (2025)
Meta Clustering of Neural Bandits
by: Ban, Yikun, et al.
Published: (2024)
by: Ban, Yikun, et al.
Published: (2024)
PyG-SSL: A Graph Self-Supervised Learning Toolkit
by: Zheng, Lecheng, et al.
Published: (2024)
by: Zheng, Lecheng, et al.
Published: (2024)
Scalable iterative pruning of large language and vision models using block coordinate descent
by: Rosenberg, Gili, et al.
Published: (2024)
by: Rosenberg, Gili, et al.
Published: (2024)
Saffron-1: Safety Inference Scaling
by: Qiu, Ruizhong, et al.
Published: (2025)
by: Qiu, Ruizhong, et al.
Published: (2025)
Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning
by: Bao, Wenxuan, et al.
Published: (2025)
by: Bao, Wenxuan, et al.
Published: (2025)
Layer-wise dynamic rank for compressing large language models
by: Mi, Zhendong, et al.
Published: (2025)
by: Mi, Zhendong, et al.
Published: (2025)
Visual prompting reimagined: The power of the Activation Prompts
by: Zhang, Yihua, et al.
Published: (2026)
by: Zhang, Yihua, et al.
Published: (2026)
LLM-Forest: Ensemble Learning of LLMs with Graph-Augmented Prompts for Data Imputation
by: He, Xinrui, et al.
Published: (2024)
by: He, Xinrui, et al.
Published: (2024)
Residual vector quantization for KV cache compression in large language model
by: Kumar, Ankur
Published: (2024)
by: Kumar, Ankur
Published: (2024)
Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction
by: Yang, Guofeng, et al.
Published: (2025)
by: Yang, Guofeng, et al.
Published: (2025)
Learning effective pruning at initialization from iterative pruning
by: Liu, Shengkai, et al.
Published: (2024)
by: Liu, Shengkai, et al.
Published: (2024)
Multi-modal Causal Structure Learning and Root Cause Analysis
by: Zheng, Lecheng, et al.
Published: (2024)
by: Zheng, Lecheng, et al.
Published: (2024)
A general tensor-structured compression scheme for efficient large language models
by: Lu, Ying, et al.
Published: (2026)
by: Lu, Ying, et al.
Published: (2026)
Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting
by: Liu, Zhining, et al.
Published: (2025)
by: Liu, Zhining, et al.
Published: (2025)
CLIMB: Class-imbalanced Learning Benchmark on Tabular Data
by: Liu, Zhining, et al.
Published: (2025)
by: Liu, Zhining, et al.
Published: (2025)
Efficient training for large-scale optical neural network using an evolutionary strategy and attention pruning
by: Yang, Zhiwei, et al.
Published: (2025)
by: Yang, Zhiwei, et al.
Published: (2025)
Benchmarking large language models for biomedical natural language processing applications and recommendations
by: Chen, Qingyu, et al.
Published: (2023)
by: Chen, Qingyu, et al.
Published: (2023)
Trustworthy Transfer Learning: A Survey
by: Wu, Jun, et al.
Published: (2024)
by: Wu, Jun, et al.
Published: (2024)
Quantifying perturbation impacts for large language models
by: Rauba, Paulius, et al.
Published: (2024)
by: Rauba, Paulius, et al.
Published: (2024)
RAG over Tables: Hierarchical Memory Index, Multi-Stage Retrieval, and Benchmarking
by: Zou, Jiaru, et al.
Published: (2025)
by: Zou, Jiaru, et al.
Published: (2025)
Visual cognition in multimodal large language models
by: Buschoff, Luca M. Schulze, et al.
Published: (2023)
by: Buschoff, Luca M. Schulze, et al.
Published: (2023)
Hypothesis generation and updating in large language models
by: Xiong, Hua-Dong
Published: (2026)
by: Xiong, Hua-Dong
Published: (2026)
Representation in large language models
by: Yetman, Cameron
Published: (2025)
by: Yetman, Cameron
Published: (2025)
Compressing CNN models for resource-constrained systems by channel and layer pruning
by: Sadaqa, Ahmed, et al.
Published: (2025)
by: Sadaqa, Ahmed, et al.
Published: (2025)
Alignment faking in large language models
by: Greenblatt, Ryan, et al.
Published: (2024)
by: Greenblatt, Ryan, et al.
Published: (2024)
PUMA: margin-based data pruning
by: Maroto, Javier, et al.
Published: (2024)
by: Maroto, Javier, et al.
Published: (2024)
Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation
by: Bao, Wenxuan, et al.
Published: (2024)
by: Bao, Wenxuan, et al.
Published: (2024)
Adaptive pruning-based Newton's method for distributed learning
by: Chen, Shuzhen, et al.
Published: (2023)
by: Chen, Shuzhen, et al.
Published: (2023)
Long-form factuality in large language models
by: Wei, Jerry, et al.
Published: (2024)
by: Wei, Jerry, et al.
Published: (2024)
Physical models realizing the transformer architecture of large language models
by: Chen, Zeqian
Published: (2025)
by: Chen, Zeqian
Published: (2025)
Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023)
by: Hu, Edward J., et al.
Published: (2023)
Similar Items
-
MLP Fusion: Towards Efficient Fine-tuning of Dense and Mixture-of-Experts Language Models
by: Ai, Mengting, et al.
Published: (2023) -
Heterogeneous Scientific Foundation Model Collaboration
by: Li, Zihao, et al.
Published: (2026) -
Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning
by: Chen, Sirui, et al.
Published: (2026) -
Graph4MM: Weaving Multimodal Learning with Structural Information
by: Ning, Xuying, et al.
Published: (2025) -
Panda: Test-Time Adaptation with Negative Data Augmentation
by: Deng, Ruxi, et al.
Published: (2025)