:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Cha, JooHyoung, Lee, Munyoung, Kwon, Jinse, Lee, Jubin, Lee, Jemin, Kwon, Yongin
Formato:	Preprint
Publicado:	2024
Materias:	Machine Learning
Acceso en línea:	https://arxiv.org/abs/2411.10764
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
por: Oh, Sehyeon, et al.
Publicado: (2026)

Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant
por: Lee, Jemin, et al.
Publicado: (2024)

Correction to “NEST‐C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators”
por: Jeman Park, et al.
Publicado: (2024)

QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
por: Kim, Jeongseok, et al.
Publicado: (2025)

Wafer-Level Etch Spatial Profiling for Process Monitoring from Time-Series with Time-LLM
por: Kim, Hyunwoo, et al.
Publicado: (2026)

Mixed Non-linear Quantization for Vision Transformers
por: Kim, Gihwan, et al.
Publicado: (2024)

AnyBCQ: Hardware Efficient Flexible Binary-Coded Quantization for Multi-Precision LLMs
por: Park, Gunho, et al.
Publicado: (2025)

Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
por: Lee, Joonhyung, et al.
Publicado: (2024)

Mind the Missing: Variable-Aware Representation Learning for Irregular EHR Time Series using Large Language Models
por: Kwon, Jeong Eul, et al.
Publicado: (2025)

Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning
por: Lee, Dongsu, et al.
Publicado: (2024)

Easy to Learn, Yet Hard to Forget: Towards Robust Unlearning Under Bias
por: Kwon, JuneHyoung, et al.
Publicado: (2026)

Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
por: Lee, Dongsu, et al.
Publicado: (2025)

Quantum End-to-End Learning for Contextual Combinatorial Optimization
por: Lee, Jaehwan, et al.
Publicado: (2026)

Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
por: Lee, Jemin, et al.
Publicado: (2023)

LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
por: Kim, Taeho, et al.
Publicado: (2024)

Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
por: Sohn, Jy-yong, et al.
Publicado: (2024)

BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
por: Lee, Changwoo, et al.
Publicado: (2024)

Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning
por: Lee, Joonho, et al.
Publicado: (2019)

Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation
por: Park, Jaehyun, et al.
Publicado: (2024)

FlowBind: Efficient Any-to-Any Generation with Bidirectional Flows
por: Cha, Yeonwoo, et al.
Publicado: (2025)

Understanding Impact of Human Feedback via Influence Functions
por: Min, Taywon, et al.
Publicado: (2025)

AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
por: Lee, Dongsu, et al.
Publicado: (2024)

LLMs Are Already Good Tutors: Training-Free Prompt Optimization for Pedagogical Math Tutoring
por: Lee, Unggi, et al.
Publicado: (2026)

A Predictive Model Based on Transformer with Statistical Feature Embedding in Manufacturing Sensor Dataset
por: Lee, Gyeong Taek, et al.
Publicado: (2024)

$α$-divergence Improves the Entropy Production Estimation via Machine Learning
por: Kwon, Euijoon, et al.
Publicado: (2023)

RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning
por: Lee, Cheol-Hui, et al.
Publicado: (2026)

Group Shapley Value and Counterfactual Simulations in a Structural Model
por: Kwon, Yongchan, et al.
Publicado: (2024)

DIVER-1: Scaling Intracranial EEG Foundation Models for Transferable Representations
por: Han, Danny Dongyeop, et al.
Publicado: (2025)

Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning
por: Willemsen, Floris-Jan, et al.
Publicado: (2025)

Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models
por: Heo, Jung Hwan, et al.
Publicado: (2023)

Quantifying Qualitative Insights: Leveraging LLMs to Market Predict
por: Lee, Hoyoung, et al.
Publicado: (2024)

Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability
por: Yoo, Seungju, et al.
Publicado: (2025)

SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving
por: Woo, Sunghyeon, et al.
Publicado: (2026)

ICaRus: Identical Cache Reuse for Efficient Multi Model Inference
por: Woo, Sunghyeon, et al.
Publicado: (2026)

CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs
por: Park, Gunho, et al.
Publicado: (2025)

DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
por: Woo, Sunghyeon, et al.
Publicado: (2024)

FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
por: Lee, Jung Hyun, et al.
Publicado: (2023)

Rethinking Open-World Semi-Supervised Learning: Distribution Mismatch and Inductive Inference
por: Park, Seongheon, et al.
Publicado: (2024)

TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models
por: Choo, Jinho, et al.
Publicado: (2026)

Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
por: Lee, Sang-Hyun, et al.
Publicado: (2024)