Guardado en:
| Autores principales: | Cha, JooHyoung, Lee, Munyoung, Kwon, Jinse, Lee, Jubin, Lee, Jemin, Kwon, Yongin |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2411.10764 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
por: Oh, Sehyeon, et al.
Publicado: (2026)
por: Oh, Sehyeon, et al.
Publicado: (2026)
Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant
por: Lee, Jemin, et al.
Publicado: (2024)
por: Lee, Jemin, et al.
Publicado: (2024)
Correction to “NEST‐C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators”
por: Jeman Park, et al.
Publicado: (2024)
por: Jeman Park, et al.
Publicado: (2024)
QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
por: Kim, Jeongseok, et al.
Publicado: (2025)
por: Kim, Jeongseok, et al.
Publicado: (2025)
Wafer-Level Etch Spatial Profiling for Process Monitoring from Time-Series with Time-LLM
por: Kim, Hyunwoo, et al.
Publicado: (2026)
por: Kim, Hyunwoo, et al.
Publicado: (2026)
Mixed Non-linear Quantization for Vision Transformers
por: Kim, Gihwan, et al.
Publicado: (2024)
por: Kim, Gihwan, et al.
Publicado: (2024)
AnyBCQ: Hardware Efficient Flexible Binary-Coded Quantization for Multi-Precision LLMs
por: Park, Gunho, et al.
Publicado: (2025)
por: Park, Gunho, et al.
Publicado: (2025)
Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
por: Lee, Joonhyung, et al.
Publicado: (2024)
por: Lee, Joonhyung, et al.
Publicado: (2024)
Mind the Missing: Variable-Aware Representation Learning for Irregular EHR Time Series using Large Language Models
por: Kwon, Jeong Eul, et al.
Publicado: (2025)
por: Kwon, Jeong Eul, et al.
Publicado: (2025)
Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning
por: Lee, Dongsu, et al.
Publicado: (2024)
por: Lee, Dongsu, et al.
Publicado: (2024)
Easy to Learn, Yet Hard to Forget: Towards Robust Unlearning Under Bias
por: Kwon, JuneHyoung, et al.
Publicado: (2026)
por: Kwon, JuneHyoung, et al.
Publicado: (2026)
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
por: Lee, Dongsu, et al.
Publicado: (2025)
por: Lee, Dongsu, et al.
Publicado: (2025)
Quantum End-to-End Learning for Contextual Combinatorial Optimization
por: Lee, Jaehwan, et al.
Publicado: (2026)
por: Lee, Jaehwan, et al.
Publicado: (2026)
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
por: Lee, Jemin, et al.
Publicado: (2023)
por: Lee, Jemin, et al.
Publicado: (2023)
LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
por: Kim, Taeho, et al.
Publicado: (2024)
por: Kim, Taeho, et al.
Publicado: (2024)
Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
por: Sohn, Jy-yong, et al.
Publicado: (2024)
por: Sohn, Jy-yong, et al.
Publicado: (2024)
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
por: Lee, Changwoo, et al.
Publicado: (2024)
por: Lee, Changwoo, et al.
Publicado: (2024)
Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning
por: Lee, Joonho, et al.
Publicado: (2019)
por: Lee, Joonho, et al.
Publicado: (2019)
Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation
por: Park, Jaehyun, et al.
Publicado: (2024)
por: Park, Jaehyun, et al.
Publicado: (2024)
FlowBind: Efficient Any-to-Any Generation with Bidirectional Flows
por: Cha, Yeonwoo, et al.
Publicado: (2025)
por: Cha, Yeonwoo, et al.
Publicado: (2025)
Understanding Impact of Human Feedback via Influence Functions
por: Min, Taywon, et al.
Publicado: (2025)
por: Min, Taywon, et al.
Publicado: (2025)
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
por: Lee, Dongsu, et al.
Publicado: (2024)
por: Lee, Dongsu, et al.
Publicado: (2024)
LLMs Are Already Good Tutors: Training-Free Prompt Optimization for Pedagogical Math Tutoring
por: Lee, Unggi, et al.
Publicado: (2026)
por: Lee, Unggi, et al.
Publicado: (2026)
A Predictive Model Based on Transformer with Statistical Feature Embedding in Manufacturing Sensor Dataset
por: Lee, Gyeong Taek, et al.
Publicado: (2024)
por: Lee, Gyeong Taek, et al.
Publicado: (2024)
$α$-divergence Improves the Entropy Production Estimation via Machine Learning
por: Kwon, Euijoon, et al.
Publicado: (2023)
por: Kwon, Euijoon, et al.
Publicado: (2023)
RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning
por: Lee, Cheol-Hui, et al.
Publicado: (2026)
por: Lee, Cheol-Hui, et al.
Publicado: (2026)
Group Shapley Value and Counterfactual Simulations in a Structural Model
por: Kwon, Yongchan, et al.
Publicado: (2024)
por: Kwon, Yongchan, et al.
Publicado: (2024)
DIVER-1: Scaling Intracranial EEG Foundation Models for Transferable Representations
por: Han, Danny Dongyeop, et al.
Publicado: (2025)
por: Han, Danny Dongyeop, et al.
Publicado: (2025)
Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning
por: Willemsen, Floris-Jan, et al.
Publicado: (2025)
por: Willemsen, Floris-Jan, et al.
Publicado: (2025)
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models
por: Heo, Jung Hwan, et al.
Publicado: (2023)
por: Heo, Jung Hwan, et al.
Publicado: (2023)
Quantifying Qualitative Insights: Leveraging LLMs to Market Predict
por: Lee, Hoyoung, et al.
Publicado: (2024)
por: Lee, Hoyoung, et al.
Publicado: (2024)
Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability
por: Yoo, Seungju, et al.
Publicado: (2025)
por: Yoo, Seungju, et al.
Publicado: (2025)
SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving
por: Woo, Sunghyeon, et al.
Publicado: (2026)
por: Woo, Sunghyeon, et al.
Publicado: (2026)
ICaRus: Identical Cache Reuse for Efficient Multi Model Inference
por: Woo, Sunghyeon, et al.
Publicado: (2026)
por: Woo, Sunghyeon, et al.
Publicado: (2026)
CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs
por: Park, Gunho, et al.
Publicado: (2025)
por: Park, Gunho, et al.
Publicado: (2025)
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
por: Woo, Sunghyeon, et al.
Publicado: (2024)
por: Woo, Sunghyeon, et al.
Publicado: (2024)
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
por: Lee, Jung Hyun, et al.
Publicado: (2023)
por: Lee, Jung Hyun, et al.
Publicado: (2023)
Rethinking Open-World Semi-Supervised Learning: Distribution Mismatch and Inductive Inference
por: Park, Seongheon, et al.
Publicado: (2024)
por: Park, Seongheon, et al.
Publicado: (2024)
TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models
por: Choo, Jinho, et al.
Publicado: (2026)
por: Choo, Jinho, et al.
Publicado: (2026)
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
por: Lee, Sang-Hyun, et al.
Publicado: (2024)
por: Lee, Sang-Hyun, et al.
Publicado: (2024)
Ejemplares similares
-
QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
por: Oh, Sehyeon, et al.
Publicado: (2026) -
Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant
por: Lee, Jemin, et al.
Publicado: (2024) -
Correction to “NEST‐C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators”
por: Jeman Park, et al.
Publicado: (2024) -
QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
por: Kim, Jeongseok, et al.
Publicado: (2025) -
Wafer-Level Etch Spatial Profiling for Process Monitoring from Time-Series with Time-LLM
por: Kim, Hyunwoo, et al.
Publicado: (2026)