:: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Yan, Minghao, Peng, Bo, Coleman, Benjamin, Chen, Ziqi, Xie, Zhouhang, Chen, Shuo, He, Zhankui, Sachdeva, Noveen, Wang, Weili, Chi, Ed H., Venkataraman, Shivaram, Kang, Wang-Cheng, Cheng, Derek Zhiyuan, Wang, Beidou
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Machine Learning
Accesso online:	https://arxiv.org/abs/2605.07039
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
di: Yan, Minghao, et al.
Pubblicazione: (2026)

AgenticTagger: Structured Item Representation for Recommendation with LLM Agents
di: Xie, Zhouhang, et al.
Pubblicazione: (2026)

ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation
di: Hou, Yupeng, et al.
Pubblicazione: (2025)

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
di: Wei, Tianxin, et al.
Pubblicazione: (2025)

How to Train Data-Efficient LLMs
di: Sachdeva, Noveen, et al.
Pubblicazione: (2024)

PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices
di: Yan, Minghao, et al.
Pubblicazione: (2023)

Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems
di: Coleman, Benjamin, et al.
Pubblicazione: (2023)

Scaling Inference-Efficient Language Models
di: Bian, Song, et al.
Pubblicazione: (2025)

Decoding Speculative Decoding
di: Yan, Minghao, et al.
Pubblicazione: (2024)

PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models
di: Yan, Minghao, et al.
Pubblicazione: (2025)

What Limits Agentic Systems Efficiency?
di: Bian, Song, et al.
Pubblicazione: (2025)

From Good to Great: Improving Memory Tiering Performance Through Parameter Tuning
di: Kanellis, Konstantinos, et al.
Pubblicazione: (2025)

LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models
di: Chang, Tzu-Tao, et al.
Pubblicazione: (2025)

Eva: Cost-Efficient Cloud-Based Cluster Scheduling
di: Chang, Tzu-Tao, et al.
Pubblicazione: (2025)

SYMPHONY: Improving Memory Management for LLM Inference Workloads
di: Agarwal, Saurabh, et al.
Pubblicazione: (2024)

Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation
di: He, Zhankui, et al.
Pubblicazione: (2024)

GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New Insights
di: Gong, Shengbo, et al.
Pubblicazione: (2024)

ReMem: Mutual Information-Aware Fine-tuning of Pretrained Vision Transformers for Effective Knowledge Distillation
di: Dong, Chengyu, et al.
Pubblicazione: (2025)

From FASTER to F2: Evolving Concurrent Key-Value Store Designs for Large Skewed Workloads
di: Kanellis, Konstantinos, et al.
Pubblicazione: (2023)

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
di: Bian, Song, et al.
Pubblicazione: (2025)

GraphSnapShot: Caching Local Structure for Fast Graph Learning
di: Liu, Dong, et al.
Pubblicazione: (2024)

ARMS: Adaptive and Robust Memory Tiering System
di: Yadalam, Sujay, et al.
Pubblicazione: (2025)

TUNA: Tuning Unstable and Noisy Cloud Applications
di: Freischuetz, Johannes, et al.
Pubblicazione: (2025)

The Magic Correlations: Understanding Knowledge Transfer from Pretraining to Supervised Fine-Tuning
di: Fan, Simin, et al.
Pubblicazione: (2026)

Dual partially harmonic tensors and quantized Schur--Weyl duality
di: Wang, Pei, et al.
Pubblicazione: (2026)

PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters
di: Jain, Rutwik, et al.
Pubblicazione: (2024)

Parallel Quantum Local Search via Evolutionary Mechanism
di: Liu, Chen-Yu, et al.
Pubblicazione: (2024)

Tesserae: Scalable Placement Policies for Deep Learning Workloads
di: Bian, Song, et al.
Pubblicazione: (2025)

Minos: Systematically Classifying Performance and Power Characteristics of GPU Workloads on HPC Clusters
di: Jain, Rutwik, et al.
Pubblicazione: (2026)

UniSTPA: A Safety Analysis Framework for End-to-End Autonomous Driving
di: Kou, Hongrui, et al.
Pubblicazione: (2025)

Improving Stability and Heat Transfer Performance of Nanofluids by Surface‐Modifying Nanoparticles With β‐Cyclodextrin
di: Huazhu Chen, et al.
Pubblicazione: (2025)

RecWizard: A Toolkit for Conversational Recommendation with Modular, Portable Models and Interactive User Interface
di: Zhang, Zeyuan, et al.
Pubblicazione: (2024)

Quake: Adaptive Indexing for Vector Search
di: Mohoney, Jason, et al.
Pubblicazione: (2025)

CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation
di: Wu, Junda, et al.
Pubblicazione: (2024)

PPTNet: A Hybrid Periodic Pattern-Transformer Architecture for Traffic Flow Prediction and Congestion Identification
di: Kou, Hongrui, et al.
Pubblicazione: (2025)

Wattchmen: Watching the Wattchers -- High Fidelity, Flexible GPU Energy Modeling
di: Tran, Brandon, et al.
Pubblicazione: (2026)

Around Segal conjecture in p-adic geometry
di: Mao, Zhouhang
Pubblicazione: (2025)

Revisiting derived crystalline cohomology
di: Mao, Zhouhang
Pubblicazione: (2021)

Noncommutative relative de Rham--Witt complex via the norm
di: Mao, Zhouhang
Pubblicazione: (2024)

Equivariant aspects of de-completing cyclic homology
di: Mao, Zhouhang
Pubblicazione: (2024)