Library Catalog

Bibliographic Details
Main Authors:	Arora, Simran; Yang, Brandon; Eyuboglu, Sabri; Narayan, Avanika; Hojel, Andrew; Trummer, Immanuel; Ré, Christopher
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2304.09433

Similar Items

Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models
by: Narayan, Avanika, et al.
Published: (2025)

Simple linear attention language models balance the recall-throughput tradeoff
by: Arora, Simran, et al.
Published: (2024)

SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads
by: Lao, Jiale, et al.
Published: (2025)

Just read twice: closing the recall gap for recurrent language models
by: Arora, Simran, et al.
Published: (2024)

Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
by: Narayan, Avanika, et al.
Published: (2024)

SMART: Automatically Scaling Down Language Models with Accuracy Guarantees for Reduced Processing Fees
by: Jo, Saehan, et al.
Published: (2024)

GenDB: The Next Generation of Query Processing -- Synthesized, Not Engineered
by: Lao, Jiale, et al.
Published: (2026)

Cartridges: Lightweight and general-purpose long context representations via self-study
by: Eyuboglu, Sabri, et al.
Published: (2025)

An Information Theoretic Perspective on Agentic System Design
by: He, Shizhe, et al.
Published: (2025)

λ-Tune: Harnessing Large Language Models for Automated Database System Tuning
by: Giannankouris, Victor, et al.
Published: (2024)

Implementing Semantic Join Operators Efficiently
by: Trummer, Immanuel
Published: (2025)

LoLCATs: On Low-Rank Linearizing of Large Language Models
by: Zhang, Michael, et al.
Published: (2024)

ThunderKittens: Simple, Fast, and Adorable AI Kernels
by: Spector, Benjamin F., et al.
Published: (2024)

OpenJarvis: Personal AI, On Personal Devices
by: Saad-Falcon, Jon, et al.
Published: (2026)

Hybrid Mixed Integer Linear Programming for Large-Scale Join Order Optimisation
by: Schönberger, Manuel, et al.
Published: (2025)

Automating the Enterprise with Foundation Models
by: Wornow, Michael, et al.
Published: (2024)

Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
by: Kaur, Simran, et al.
Published: (2024)

ParallelKittens: Systematic and Practical Simplification of Multi-GPU AI Kernels
by: Sul, Stuart H., et al.
Published: (2025)

RELIC: Investigating Large Language Model Responses using Self-Consistency
by: Cheng, Furui, et al.
Published: (2023)

Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis
by: Boughorbel, Sabri, et al.
Published: (2024)

Can Models Learn Skill Composition from Examples?
by: Zhao, Haoyu, et al.
Published: (2024)

Aioli: A Unified Optimization Framework for Language Model Data Mixing
by: Chen, Mayee F., et al.
Published: (2024)

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
by: Saad-Falcon, Jon, et al.
Published: (2024)

Counting Clinical Trials: New Evidence on Pharmaceutical Sector Productivity
by: Durvasula, Maya M., et al.
Published: (2024)

Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters
by: Garcia, Roberto, et al.
Published: (2025)

Late Time Acceleration with Observational Constraints in Modified Theories of Gravity
by: Arora, Simran
Published: (2023)

Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)

Constructing Efficient Fact-Storing MLPs for Transformers
by: Dugan, Owen, et al.
Published: (2025)

It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers
by: Clavié, Benjamin, et al.
Published: (2025)

AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
by: Wu, Zhengxuan, et al.
Published: (2025)

Masked Language Models are Good Heterogeneous Graph Generalizers
by: Yang, Jinyu, et al.
Published: (2025)

Enabling Communication via APIs for Mainframe Applications
by: Kanvar, Vini, et al.
Published: (2024)

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
by: Agarwal, Mayank, et al.
Published: (2024)

Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
by: Boughorbel, Sabri, et al.
Published: (2025)

The unregulated plant‐based ‘milk’ industry: A threat to nutrition, health and safety?
by: Simran Kaur Arora
Published: (2024)

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision
by: He, Yinghui, et al.
Published: (2026)

Token-Level Privacy in Large Language Models
by: Harel, Re'em, et al.
Published: (2025)

Towards Learning High-Precision Least Squares Algorithms with Sequence Models
by: Liu, Jerry, et al.
Published: (2025)

Asterisk*: Keep it Simple
by: Semenov, Andrew
Published: (2024)

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
by: Zhang, Michael, et al.
Published: (2024)