Saved in:
| Main Authors: | Phan, Buu, Amos, Brandon, Gat, Itai, Havasi, Marton, Muckley, Matthew, Ullrich, Karen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.09303 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Understanding and Mitigating Tokenization Bias in Language Models
by: Phan, Buu, et al.
Published: (2024)
by: Phan, Buu, et al.
Published: (2024)
Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
by: Phan, Buu, et al.
Published: (2025)
by: Phan, Buu, et al.
Published: (2025)
Edit Flows: Flow Matching with Edit Operations
by: Havasi, Marton, et al.
Published: (2025)
by: Havasi, Marton, et al.
Published: (2025)
Corrector Sampling in Language Models
by: Gat, Itai, et al.
Published: (2025)
by: Gat, Itai, et al.
Published: (2025)
Set Block Decoding is a Language Model Inference Accelerator
by: Gat, Itai, et al.
Published: (2025)
by: Gat, Itai, et al.
Published: (2025)
Channel Simulation and Distributed Compression with Ensemble Rejection Sampling
by: Phan, Buu, et al.
Published: (2025)
by: Phan, Buu, et al.
Published: (2025)
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer
by: Deng, Chunyuan, et al.
Published: (2026)
by: Deng, Chunyuan, et al.
Published: (2026)
Diverse Concept Proposals for Concept Bottleneck Models
by: Brown, Katrina, et al.
Published: (2024)
by: Brown, Katrina, et al.
Published: (2024)
Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective
by: Shaul, Neta, et al.
Published: (2024)
by: Shaul, Neta, et al.
Published: (2024)
Generator Matching: Generative modeling with arbitrary Markov processes
by: Holderrieth, Peter, et al.
Published: (2024)
by: Holderrieth, Peter, et al.
Published: (2024)
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search
by: Severo, Daniel, et al.
Published: (2025)
by: Severo, Daniel, et al.
Published: (2025)
Kronecker Embeddings: Byte-Level Structured Token Representations for Parameter-Efficient Language Models
by: Shravan, Rohan
Published: (2026)
by: Shravan, Rohan
Published: (2026)
List-Level Distribution Coupling with Applications to Speculative Decoding and Lossy Compression
by: Rowan, Joseph, et al.
Published: (2025)
by: Rowan, Joseph, et al.
Published: (2025)
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
by: Slagle, Kevin
Published: (2024)
by: Slagle, Kevin
Published: (2024)
MambaByte: Token-free Selective State Space Model
by: Wang, Junxiong, et al.
Published: (2024)
by: Wang, Junxiong, et al.
Published: (2024)
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
by: Kallini, Julie, et al.
Published: (2024)
by: Kallini, Julie, et al.
Published: (2024)
Scratchpad Patching: Decoupling Compute from Patch Size in Byte-Level Language Models
by: Zheng, Lin, et al.
Published: (2026)
by: Zheng, Lin, et al.
Published: (2026)
Flow Matching Guide and Code
by: Lipman, Yaron, et al.
Published: (2024)
by: Lipman, Yaron, et al.
Published: (2024)
GPUTOK: GPU Accelerated Byte Level BPE Tokenization
by: Kadamba, Venu Gopal, et al.
Published: (2026)
by: Kadamba, Venu Gopal, et al.
Published: (2026)
Guarantee Regions for Local Explanations
by: Havasi, Marton, et al.
Published: (2024)
by: Havasi, Marton, et al.
Published: (2024)
Textually Pretrained Speech Language Models
by: Hassid, Michael, et al.
Published: (2023)
by: Hassid, Michael, et al.
Published: (2023)
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
by: Zheng, Qinqing, et al.
Published: (2024)
by: Zheng, Qinqing, et al.
Published: (2024)
Just on Time: Token-Level Early Stopping for Diffusion Language Models
by: Kohut, Zahar, et al.
Published: (2026)
by: Kohut, Zahar, et al.
Published: (2026)
Transition Matching: Scalable and Flexible Generative Modeling
by: Shaul, Neta, et al.
Published: (2025)
by: Shaul, Neta, et al.
Published: (2025)
Specialising and Analysing Instruction-Tuned and Byte-Level Language Models for Organic Reaction Prediction
by: Pang, Jiayun, et al.
Published: (2024)
by: Pang, Jiayun, et al.
Published: (2024)
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models
by: Neitemeier, Pit, et al.
Published: (2025)
by: Neitemeier, Pit, et al.
Published: (2025)
Language Models over Canonical Byte-Pair Encodings
by: Vieira, Tim, et al.
Published: (2025)
by: Vieira, Tim, et al.
Published: (2025)
Scalable Token-Level Hallucination Detection in Large Language Models
by: Min, Rui, et al.
Published: (2026)
by: Min, Rui, et al.
Published: (2026)
Byte-token Enhanced Language Models for Temporal Point Processes Analysis
by: Kong, Quyu, et al.
Published: (2025)
by: Kong, Quyu, et al.
Published: (2025)
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
by: Quevedo, Ernesto, et al.
Published: (2024)
by: Quevedo, Ernesto, et al.
Published: (2024)
Sampling from Your Language Model One Byte at a Time
by: Hayase, Jonathan, et al.
Published: (2025)
by: Hayase, Jonathan, et al.
Published: (2025)
TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models
by: Choo, Jinho, et al.
Published: (2026)
by: Choo, Jinho, et al.
Published: (2026)
AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation
by: Zhang, Songming, et al.
Published: (2025)
by: Zhang, Songming, et al.
Published: (2025)
Large Language Models Struggle in Token-Level Clinical Named Entity Recognition
by: Lu, Qiuhao, et al.
Published: (2024)
by: Lu, Qiuhao, et al.
Published: (2024)
DP-Fusion: Token-Level Differentially Private Inference for Large Language Models
by: Thareja, Rushil, et al.
Published: (2025)
by: Thareja, Rushil, et al.
Published: (2025)
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
by: Shao, Chenze, et al.
Published: (2024)
by: Shao, Chenze, et al.
Published: (2024)
TokUR: Token-Level Uncertainty Estimation for Large Language Model Reasoning
by: Zhang, Tunyu, et al.
Published: (2025)
by: Zhang, Tunyu, et al.
Published: (2025)
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
by: Su, Jingtong, et al.
Published: (2025)
by: Su, Jingtong, et al.
Published: (2025)
Mission Impossible: A Statistical Perspective on Jailbreaking LLMs
by: Su, Jingtong, et al.
Published: (2024)
by: Su, Jingtong, et al.
Published: (2024)
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
by: Fu, Deqing, et al.
Published: (2024)
by: Fu, Deqing, et al.
Published: (2024)
Similar Items
-
Understanding and Mitigating Tokenization Bias in Language Models
by: Phan, Buu, et al.
Published: (2024) -
Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
by: Phan, Buu, et al.
Published: (2025) -
Edit Flows: Flow Matching with Edit Operations
by: Havasi, Marton, et al.
Published: (2025) -
Corrector Sampling in Language Models
by: Gat, Itai, et al.
Published: (2025) -
Set Block Decoding is a Language Model Inference Accelerator
by: Gat, Itai, et al.
Published: (2025)