:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sundaram, Jainaveen, Iyer, Ravi
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2408.13402
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
by: Wang, Jinheng, et al.
Published: (2025)

2 OLMo 2 Furious
by: OLMo, Team, et al.
Published: (2024)

OLMoE: Open Mixture-of-Experts Language Models
by: Muennighoff, Niklas, et al.
Published: (2024)

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
by: Ngo, Huong, et al.
Published: (2025)

Generation of Human Comprehensible Access Control Policies from Audit Logs
by: Kumar, Gautam, et al.
Published: (2026)

Graph Persistence goes Spectral
by: Ji, Mattie, et al.
Published: (2025)

TernaryLLM: Ternarized Large Language Model
by: Chen, Tianqi, et al.
Published: (2024)

Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought
by: Havrilla, Alex, et al.
Published: (2024)

SMA: Submodular Modality Aligner For Data Efficient Multimodal Learning
by: Pham, Truong, et al.
Published: (2026)

TinyLLaVA: A Framework of Small-scale Large Multimodal Models
by: Zhou, Baichuan, et al.
Published: (2024)

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
by: Jia, Junlong, et al.
Published: (2024)

State Contamination in Memory-Augmented LLM Agents
by: Wang, Yian, et al.
Published: (2026)

The Fourth State: Signed-Zero Ternary for Stable LLM Quantization (and More)
by: Uhlmann, Jeffrey
Published: (2025)

LLaPipe: LLM-Guided Reinforcement Learning for Automated Data Preparation Pipeline Construction
by: Chang, Jing, et al.
Published: (2025)

LogLLaMA: Transformer-based log anomaly detection with LLaMA
by: Yang, Zhuoyi, et al.
Published: (2025)

ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
by: Dialameh, Maryam, et al.
Published: (2025)

KaVa: Latent Reasoning via Compressed KV-Cache Distillation
by: Kuzina, Anna, et al.
Published: (2025)

Single-Stage Huffman Encoder for ML Compression
by: Agrawal, Aditya, et al.
Published: (2026)

VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition
by: Khisa, Soham, et al.
Published: (2025)

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets
by: Perkowski, Ernest, et al.
Published: (2024)

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
by: Zuo, Fei, et al.
Published: (2026)

MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning
by: Zhang, Jianyi, et al.
Published: (2024)

HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model
by: Guo, Haiyang, et al.
Published: (2025)

DeRDaVa: Deletion-Robust Data Valuation for Machine Learning
by: Tian, Xiao, et al.
Published: (2023)

Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models
by: Gupta, Sharut, et al.
Published: (2025)

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving
by: Dai, Yinwei, et al.
Published: (2023)

Transformer-based CoVaR: Systemic Risk in Textual Information
by: Chen, Junyu, et al.
Published: (2026)

Classification with a Network of Partially Informative Agents: Enabling Wise Crowds from Individually Myopic Classifiers
by: Yao, Tong, et al.
Published: (2024)

Quad Length Codes for Lossless Compression of e4m3
by: Agrawal, Aditya, et al.
Published: (2026)

FiSH: Fair Spatial Hotspots
by: P, Deepak, et al.
Published: (2021)

BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference
by: Gulhan, Ahmed Burak, et al.
Published: (2025)

The Uniqueness of LLaMA3-70B Series with Per-Channel Quantization
by: Qin, Minghai
Published: (2024)

VaPR -- Vision-language Preference alignment for Reasoning
by: Wadhawan, Rohan, et al.
Published: (2025)

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models
by: Wang, Hongyu, et al.
Published: (2025)

TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs
by: Qiao, Ye, et al.
Published: (2025)

LLaGA: Large Language and Graph Assistant
by: Chen, Runjin, et al.
Published: (2024)

Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study
by: Ma, Chi, et al.
Published: (2024)

STENCIL: Submodular Mutual Information Based Weak Supervision for Cold-Start Active Learning
by: Beck, Nathan, et al.
Published: (2024)

Learning State-Space Models of Dynamic Systems from Arbitrary Data using Joint Embedding Predictive Architectures
by: Ulmen, Jonas, et al.
Published: (2025)

DIVEBATCH: Accelerating Model Training Through Gradient-Diversity Aware Batch Size Adaptation
by: Chen, Yuen, et al.
Published: (2025)