| Main Authors: | Sundaram, Jainaveen; Iyer, Ravi |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.13402 |
Similar Items
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
by: Wang, Jinheng, et al.
Published: (2025)
2 OLMo 2 Furious
by: OLMo, Team, et al.
Published: (2024)
OLMoE: Open Mixture-of-Experts Language Models
by: Muennighoff, Niklas, et al.
Published: (2024)
OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
by: Ngo, Huong, et al.
Published: (2025)
Generation of Human Comprehensible Access Control Policies from Audit Logs
by: Kumar, Gautam, et al.
Published: (2026)
Graph Persistence goes Spectral
by: Ji, Mattie, et al.
Published: (2025)
TernaryLLM: Ternarized Large Language Model
by: Chen, Tianqi, et al.
Published: (2024)
Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought
by: Havrilla, Alex, et al.
Published: (2024)
SMA: Submodular Modality Aligner For Data Efficient Multimodal Learning
by: Pham, Truong, et al.
Published: (2026)
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
by: Zhou, Baichuan, et al.
Published: (2024)
TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
by: Jia, Junlong, et al.
Published: (2024)
State Contamination in Memory-Augmented LLM Agents
by: Wang, Yian, et al.
Published: (2026)
The Fourth State: Signed-Zero Ternary for Stable LLM Quantization (and More)
by: Uhlmann, Jeffrey
Published: (2025)
LLaPipe: LLM-Guided Reinforcement Learning for Automated Data Preparation Pipeline Construction
by: Chang, Jing, et al.
Published: (2025)
LogLLaMA: Transformer-based log anomaly detection with LLaMA
by: Yang, Zhuoyi, et al.
Published: (2025)
ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
by: Dialameh, Maryam, et al.
Published: (2025)
KaVa: Latent Reasoning via Compressed KV-Cache Distillation
by: Kuzina, Anna, et al.
Published: (2025)
Single-Stage Huffman Encoder for ML Compression
by: Agrawal, Aditya, et al.
Published: (2026)
VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition
by: Khisa, Soham, et al.
Published: (2025)
AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets
by: Perkowski, Ernest, et al.
Published: (2024)
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
by: Zuo, Fei, et al.
Published: (2026)
MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning
by: Zhang, Jianyi, et al.
Published: (2024)
HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model
by: Guo, Haiyang, et al.
Published: (2025)
DeRDaVa: Deletion-Robust Data Valuation for Machine Learning
by: Tian, Xiao, et al.
Published: (2023)
Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models
by: Gupta, Sharut, et al.
Published: (2025)
Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving
by: Dai, Yinwei, et al.
Published: (2023)
Transformer-based CoVaR: Systemic Risk in Textual Information
by: Chen, Junyu, et al.
Published: (2026)
Classification with a Network of Partially Informative Agents: Enabling Wise Crowds from Individually Myopic Classifiers
by: Yao, Tong, et al.
Published: (2024)
Quad Length Codes for Lossless Compression of e4m3
by: Agrawal, Aditya, et al.
Published: (2026)
FiSH: Fair Spatial Hotspots
by: P, Deepak, et al.
Published: (2021)
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference
by: Gulhan, Ahmed Burak, et al.
Published: (2025)
The Uniqueness of LLaMA3-70B Series with Per-Channel Quantization
by: Qin, Minghai
Published: (2024)
VaPR -- Vision-language Preference alignment for Reasoning
by: Wadhawan, Rohan, et al.
Published: (2025)
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models
by: Wang, Hongyu, et al.
Published: (2025)
TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs
by: Qiao, Ye, et al.
Published: (2025)
LLaGA: Large Language and Graph Assistant
by: Chen, Runjin, et al.
Published: (2024)
Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study
by: Ma, Chi, et al.
Published: (2024)
STENCIL: Submodular Mutual Information Based Weak Supervision for Cold-Start Active Learning
by: Beck, Nathan, et al.
Published: (2024)
Learning State-Space Models of Dynamic Systems from Arbitrary Data using Joint Embedding Predictive Architectures
by: Ulmen, Jonas, et al.
Published: (2025)
DIVEBATCH: Accelerating Model Training Through Gradient-Diversity Aware Batch Size Adaptation
by: Chen, Yuen, et al.
Published: (2025)