Saved in:
| Main Authors: | Arora, Simran, Yang, Brandon, Eyuboglu, Sabri, Narayan, Avanika, Hojel, Andrew, Trummer, Immanuel, Ré, Christopher |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2304.09433 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models
by: Narayan, Avanika, et al.
Published: (2025)
by: Narayan, Avanika, et al.
Published: (2025)
Simple linear attention language models balance the recall-throughput tradeoff
by: Arora, Simran, et al.
Published: (2024)
by: Arora, Simran, et al.
Published: (2024)
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads
by: Lao, Jiale, et al.
Published: (2025)
by: Lao, Jiale, et al.
Published: (2025)
Just read twice: closing the recall gap for recurrent language models
by: Arora, Simran, et al.
Published: (2024)
by: Arora, Simran, et al.
Published: (2024)
Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
by: Narayan, Avanika, et al.
Published: (2024)
by: Narayan, Avanika, et al.
Published: (2024)
SMART: Automatically Scaling Down Language Models with Accuracy Guarantees for Reduced Processing Fees
by: Jo, Saehan, et al.
Published: (2024)
by: Jo, Saehan, et al.
Published: (2024)
GenDB: The Next Generation of Query Processing -- Synthesized, Not Engineered
by: Lao, Jiale, et al.
Published: (2026)
by: Lao, Jiale, et al.
Published: (2026)
Cartridges: Lightweight and general-purpose long context representations via self-study
by: Eyuboglu, Sabri, et al.
Published: (2025)
by: Eyuboglu, Sabri, et al.
Published: (2025)
An Information Theoretic Perspective on Agentic System Design
by: He, Shizhe, et al.
Published: (2025)
by: He, Shizhe, et al.
Published: (2025)
λ-Tune: Harnessing Large Language Models for Automated Database System Tuning
by: Giannankouris, Victor, et al.
Published: (2024)
by: Giannankouris, Victor, et al.
Published: (2024)
Implementing Semantic Join Operators Efficiently
by: Trummer, Immanuel
Published: (2025)
by: Trummer, Immanuel
Published: (2025)
LoLCATs: On Low-Rank Linearizing of Large Language Models
by: Zhang, Michael, et al.
Published: (2024)
by: Zhang, Michael, et al.
Published: (2024)
ThunderKittens: Simple, Fast, and Adorable AI Kernels
by: Spector, Benjamin F., et al.
Published: (2024)
by: Spector, Benjamin F., et al.
Published: (2024)
OpenJarvis: Personal AI, On Personal Devices
by: Saad-Falcon, Jon, et al.
Published: (2026)
by: Saad-Falcon, Jon, et al.
Published: (2026)
Hybrid Mixed Integer Linear Programming for Large-Scale Join Order Optimisation
by: Schönberger, Manuel, et al.
Published: (2025)
by: Schönberger, Manuel, et al.
Published: (2025)
Automating the Enterprise with Foundation Models
by: Wornow, Michael, et al.
Published: (2024)
by: Wornow, Michael, et al.
Published: (2024)
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
by: Kaur, Simran, et al.
Published: (2024)
by: Kaur, Simran, et al.
Published: (2024)
ParallelKittens: Systematic and Practical Simplification of Multi-GPU AI Kernels
by: Sul, Stuart H., et al.
Published: (2025)
by: Sul, Stuart H., et al.
Published: (2025)
RELIC: Investigating Large Language Model Responses using Self-Consistency
by: Cheng, Furui, et al.
Published: (2023)
by: Cheng, Furui, et al.
Published: (2023)
Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis
by: Boughorbel, Sabri, et al.
Published: (2024)
by: Boughorbel, Sabri, et al.
Published: (2024)
Can Models Learn Skill Composition from Examples?
by: Zhao, Haoyu, et al.
Published: (2024)
by: Zhao, Haoyu, et al.
Published: (2024)
Aioli: A Unified Optimization Framework for Language Model Data Mixing
by: Chen, Mayee F., et al.
Published: (2024)
by: Chen, Mayee F., et al.
Published: (2024)
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
by: Saad-Falcon, Jon, et al.
Published: (2024)
by: Saad-Falcon, Jon, et al.
Published: (2024)
Counting Clinical Trials: New Evidence on Pharmaceutical Sector Productivity
by: Durvasula, Maya M., et al.
Published: (2024)
by: Durvasula, Maya M., et al.
Published: (2024)
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters
by: Garcia, Roberto, et al.
Published: (2025)
by: Garcia, Roberto, et al.
Published: (2025)
Late Time Acceleration with Observational Constraints in Modified Theories of Gravity
by: Arora, Simran
Published: (2023)
by: Arora, Simran
Published: (2023)
Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)
by: Wu, Zhengxuan, et al.
Published: (2025)
Constructing Efficient Fact-Storing MLPs for Transformers
by: Dugan, Owen, et al.
Published: (2025)
by: Dugan, Owen, et al.
Published: (2025)
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers
by: Clavié, Benjamin, et al.
Published: (2025)
by: Clavié, Benjamin, et al.
Published: (2025)
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
by: Wu, Zhengxuan, et al.
Published: (2025)
by: Wu, Zhengxuan, et al.
Published: (2025)
Masked Language Models are Good Heterogeneous Graph Generalizers
by: Yang, Jinyu, et al.
Published: (2025)
by: Yang, Jinyu, et al.
Published: (2025)
Enabling Communication via APIs for Mainframe Applications
by: Kanvar, Vini, et al.
Published: (2024)
by: Kanvar, Vini, et al.
Published: (2024)
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
by: Agarwal, Mayank, et al.
Published: (2024)
by: Agarwal, Mayank, et al.
Published: (2024)
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
by: Boughorbel, Sabri, et al.
Published: (2025)
by: Boughorbel, Sabri, et al.
Published: (2025)
The unregulated plant‐based ‘milk’ industry: A threat to nutrition, health and safety?
by: Simran Kaur Arora
Published: (2024)
by: Simran Kaur Arora
Published: (2024)
Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision
by: He, Yinghui, et al.
Published: (2026)
by: He, Yinghui, et al.
Published: (2026)
Token-Level Privacy in Large Language Models
by: Harel, Re'em, et al.
Published: (2025)
by: Harel, Re'em, et al.
Published: (2025)
Towards Learning High-Precision Least Squares Algorithms with Sequence Models
by: Liu, Jerry, et al.
Published: (2025)
by: Liu, Jerry, et al.
Published: (2025)
Asterisk*: Keep it Simple
by: Semenov, Andrew
Published: (2024)
by: Semenov, Andrew
Published: (2024)
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
by: Zhang, Michael, et al.
Published: (2024)
by: Zhang, Michael, et al.
Published: (2024)
Similar Items
-
Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models
by: Narayan, Avanika, et al.
Published: (2025) -
Simple linear attention language models balance the recall-throughput tradeoff
by: Arora, Simran, et al.
Published: (2024) -
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads
by: Lao, Jiale, et al.
Published: (2025) -
Just read twice: closing the recall gap for recurrent language models
by: Arora, Simran, et al.
Published: (2024) -
Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
by: Narayan, Avanika, et al.
Published: (2024)