Saved in:
| Main Authors: | Wang, Hongyi, Polo, Felipe Maia, Sun, Yuekai, Kundu, Souvik, Xing, Eric, Yurochkin, Mikhail |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.01542 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families
by: Polo, Felipe Maia, et al.
Published: (2024)
by: Polo, Felipe Maia, et al.
Published: (2024)
Weak Supervision Performance Evaluation via Partial Identification
by: Polo, Felipe Maia, et al.
Published: (2023)
by: Polo, Felipe Maia, et al.
Published: (2023)
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap
by: Polo, Felipe Maia, et al.
Published: (2025)
by: Polo, Felipe Maia, et al.
Published: (2025)
A transfer learning framework for weak-to-strong generalization
by: Somerstep, Seamus, et al.
Published: (2024)
by: Somerstep, Seamus, et al.
Published: (2024)
tinyBenchmarks: evaluating LLMs with fewer examples
by: Polo, Felipe Maia, et al.
Published: (2024)
by: Polo, Felipe Maia, et al.
Published: (2024)
Prompt Exploration with Prompt Regression
by: Feffer, Michael, et al.
Published: (2024)
by: Feffer, Michael, et al.
Published: (2024)
A Latent Variable Framework for Scaling Laws in Large Language Models
by: Cai, Peiyao, et al.
Published: (2025)
by: Cai, Peiyao, et al.
Published: (2025)
Efficient multi-prompt evaluation of LLMs
by: Polo, Felipe Maia, et al.
Published: (2024)
by: Polo, Felipe Maia, et al.
Published: (2024)
Limitations of refinement methods for weak to strong generalization
by: Somerstep, Seamus, et al.
Published: (2025)
by: Somerstep, Seamus, et al.
Published: (2025)
Aligners: Decoupling LLMs and Alignment
by: Ngweta, Lilian, et al.
Published: (2024)
by: Ngweta, Lilian, et al.
Published: (2024)
Microfoundation Inference for Strategic Prediction
by: Bracale, Daniele, et al.
Published: (2024)
by: Bracale, Daniele, et al.
Published: (2024)
PRISM: Enhancing Protein Inverse Folding through Fine-Grained Retrieval on Structure-Sequence Multimodal Representations
by: Mahbub, Sazan, et al.
Published: (2025)
by: Mahbub, Sazan, et al.
Published: (2025)
CARROT: A Cost Aware Rate Optimal Router
by: Somerstep, Seamus, et al.
Published: (2025)
by: Somerstep, Seamus, et al.
Published: (2025)
Synapse: Adaptive Arbitration of Complementary Expertise in Time Series Foundational Models
by: Das, Sarkar Snigdha Sarathi, et al.
Published: (2025)
by: Das, Sarkar Snigdha Sarathi, et al.
Published: (2025)
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
by: Zheng, Wenhao, et al.
Published: (2025)
by: Zheng, Wenhao, et al.
Published: (2025)
Distributionally Robust Performative Prediction
by: Xue, Songkai, et al.
Published: (2024)
by: Xue, Songkai, et al.
Published: (2024)
Out-of-Distribution Detection using Synthetic Data Generation
by: Abbas, Momin, et al.
Published: (2025)
by: Abbas, Momin, et al.
Published: (2025)
MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization
by: Ramachandran, Akshat, et al.
Published: (2024)
by: Ramachandran, Akshat, et al.
Published: (2024)
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation
by: Azizi, Seyedarmin, et al.
Published: (2024)
by: Azizi, Seyedarmin, et al.
Published: (2024)
Personalized Image Generation for Recommendations Beyond Catalogs
by: Patron, Gabriel, et al.
Published: (2025)
by: Patron, Gabriel, et al.
Published: (2025)
Linearizing Models for Efficient yet Robust Private Inference
by: Sarkar, Sreetama, et al.
Published: (2024)
by: Sarkar, Sreetama, et al.
Published: (2024)
CharED: Character-wise Ensemble Decoding for Large Language Models
by: Gu, Kevin, et al.
Published: (2024)
by: Gu, Kevin, et al.
Published: (2024)
COBRA: Catastrophic Bit-flip Reliability Analysis of State-Space Models
by: Das, Sanjay, et al.
Published: (2025)
by: Das, Sanjay, et al.
Published: (2025)
Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization
by: Polo, Felipe Maia, et al.
Published: (2026)
by: Polo, Felipe Maia, et al.
Published: (2026)
Likelihood-Free Estimation for Spatiotemporal Hawkes processes with missing data and application to predictive policing
by: Das, Pramit, et al.
Published: (2025)
by: Das, Pramit, et al.
Published: (2025)
Memory-adaptive Depth-wise Heterogeneous Federated Learning
by: Zhang, Kai, et al.
Published: (2023)
by: Zhang, Kai, et al.
Published: (2023)
Dynamic Pricing in the Linear Valuation Model using Shape Constraints
by: Bracale, Daniele, et al.
Published: (2025)
by: Bracale, Daniele, et al.
Published: (2025)
Uncertainty Quantification via Stable Distribution Propagation
by: Petersen, Felix, et al.
Published: (2024)
by: Petersen, Felix, et al.
Published: (2024)
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
by: Ro, Yeonju, et al.
Published: (2025)
by: Ro, Yeonju, et al.
Published: (2025)
Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
by: Somerstep, Seamus, et al.
Published: (2025)
by: Somerstep, Seamus, et al.
Published: (2025)
Learning the Distribution Map in Reverse Causal Performative Prediction
by: Bracale, Daniele, et al.
Published: (2024)
by: Bracale, Daniele, et al.
Published: (2024)
Maximin Relative Improvement: Fair Learning as a Bargaining Problem
by: Han, Jiwoo, et al.
Published: (2026)
by: Han, Jiwoo, et al.
Published: (2026)
Algorithmic Fairness in Performative Policy Learning: Escaping the Impossibility of Group Fairness
by: Somerstep, Seamus, et al.
Published: (2024)
by: Somerstep, Seamus, et al.
Published: (2024)
Junk DNA Hypothesis: Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMs
by: Yin, Lu, et al.
Published: (2023)
by: Yin, Lu, et al.
Published: (2023)
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
by: Shabtay, Nimrod, et al.
Published: (2024)
by: Shabtay, Nimrod, et al.
Published: (2024)
Learning In Reverse Causal Strategic Environments With Ramifications on Two Sided Markets
by: Somerstep, Seamus, et al.
Published: (2024)
by: Somerstep, Seamus, et al.
Published: (2024)
Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration
by: Shen, Jucheng, et al.
Published: (2025)
by: Shen, Jucheng, et al.
Published: (2025)
Revenue Maximization Under Sequential Price Competition Via The Estimation Of s-Concave Demand Functions
by: Bracale, Daniele, et al.
Published: (2025)
by: Bracale, Daniele, et al.
Published: (2025)
Elucidating Subspace Perturbation in Zeroth-Order Optimization: Theory and Practice at Scale
by: Park, Sihwan, et al.
Published: (2025)
by: Park, Sihwan, et al.
Published: (2025)
Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning
by: Azizi, Seyedarmin, et al.
Published: (2026)
by: Azizi, Seyedarmin, et al.
Published: (2026)
Similar Items
-
Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families
by: Polo, Felipe Maia, et al.
Published: (2024) -
Weak Supervision Performance Evaluation via Partial Identification
by: Polo, Felipe Maia, et al.
Published: (2023) -
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap
by: Polo, Felipe Maia, et al.
Published: (2025) -
A transfer learning framework for weak-to-strong generalization
by: Somerstep, Seamus, et al.
Published: (2024) -
tinyBenchmarks: evaluating LLMs with fewer examples
by: Polo, Felipe Maia, et al.
Published: (2024)