Saved in:
| Main Authors: | Alarcia, Ramon Maria Garcia, Golkar, Alessandro |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.00749 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
What's In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models
by: Das, Abhipsha, et al.
Published: (2025)
by: Das, Abhipsha, et al.
Published: (2025)
Is There a Case for Conversation Optimized Tokenizers in Large Language Models?
by: Ferrando, Raquel, et al.
Published: (2025)
by: Ferrando, Raquel, et al.
Published: (2025)
Using a Large Language Model to generate a Design Structure Matrix
by: Koh, Edwin C. Y.
Published: (2023)
by: Koh, Edwin C. Y.
Published: (2023)
Large Language Models for Combinatorial Optimization of Design Structure Matrix
by: Jiang, Shuo, et al.
Published: (2024)
by: Jiang, Shuo, et al.
Published: (2024)
STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning
by: Bharadwaj, Aryasomayajula Ram
Published: (2025)
by: Bharadwaj, Aryasomayajula Ram
Published: (2025)
Chaplains' Reflections on the Design and Usage of AI for Conversational Care
by: Wester, Joel, et al.
Published: (2026)
by: Wester, Joel, et al.
Published: (2026)
Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage
by: Hoque, Shahinul, et al.
Published: (2026)
by: Hoque, Shahinul, et al.
Published: (2026)
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
by: Wu, Minghao, et al.
Published: (2024)
by: Wu, Minghao, et al.
Published: (2024)
Contextually Structured Token Dependency Encoding for Large Language Models
by: Blades, James, et al.
Published: (2025)
by: Blades, James, et al.
Published: (2025)
Affect Recognition in Conversations Using Large Language Models
by: Feng, Shutong, et al.
Published: (2023)
by: Feng, Shutong, et al.
Published: (2023)
Speed and Conversational Large Language Models: Not All Is About Tokens per Second
by: Conde, Javier, et al.
Published: (2025)
by: Conde, Javier, et al.
Published: (2025)
Incorporating Token Usage into Prompting Strategy Evaluation
by: Sypherd, Chris, et al.
Published: (2025)
by: Sypherd, Chris, et al.
Published: (2025)
Structured Token Retention and Computational Memory Paths in Large Language Models
by: Delena, Jonathan, et al.
Published: (2025)
by: Delena, Jonathan, et al.
Published: (2025)
Problematic Tokens: Tokenizer Bias in Large Language Models
by: Yang, Jin, et al.
Published: (2024)
by: Yang, Jin, et al.
Published: (2024)
SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models
by: Liu, Shicheng, et al.
Published: (2023)
by: Liu, Shicheng, et al.
Published: (2023)
xVal: A Continuous Numerical Tokenization for Scientific Language Models
by: Golkar, Siavash, et al.
Published: (2023)
by: Golkar, Siavash, et al.
Published: (2023)
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
by: Yang, Jinbiao
Published: (2024)
by: Yang, Jinbiao
Published: (2024)
Evaluating the Usage of African-American Vernacular English in Large Language Models
by: Dunlap, Deja, et al.
Published: (2026)
by: Dunlap, Deja, et al.
Published: (2026)
Retrofitting Large Language Models with Dynamic Tokenization
by: Feher, Darius, et al.
Published: (2024)
by: Feher, Darius, et al.
Published: (2024)
Large Language Model as Token Compressor and Decompressor
by: Li, Wenbing, et al.
Published: (2026)
by: Li, Wenbing, et al.
Published: (2026)
Basic Category Usage in Vision Language Models
by: Sawyer, Hunter, et al.
Published: (2025)
by: Sawyer, Hunter, et al.
Published: (2025)
Probing for the Usage of Grammatical Number
by: Lasri, Karim, et al.
Published: (2022)
by: Lasri, Karim, et al.
Published: (2022)
Performance Evaluation of Tokenizers in Large Language Models for the Assamese Language
by: Tamang, Sagar, et al.
Published: (2024)
by: Tamang, Sagar, et al.
Published: (2024)
Steering Conversational Large Language Models for Long Emotional Support Conversations
by: Madani, Navid, et al.
Published: (2024)
by: Madani, Navid, et al.
Published: (2024)
Rethinking Personalization in Large Language Models at the Token Level
by: Zhang, Chenheng, et al.
Published: (2026)
by: Zhang, Chenheng, et al.
Published: (2026)
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
by: Wang, Dixuan, et al.
Published: (2024)
by: Wang, Dixuan, et al.
Published: (2024)
TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models
by: Choo, Jinho, et al.
Published: (2026)
by: Choo, Jinho, et al.
Published: (2026)
CROP: Token-Efficient Reasoning in Large Language Models via Regularized Prompt Optimization
by: Shah, Deep, et al.
Published: (2026)
by: Shah, Deep, et al.
Published: (2026)
Sparse Matrix in Large Language Model Fine-tuning
by: He, Haoze, et al.
Published: (2024)
by: He, Haoze, et al.
Published: (2024)
Stop Taking Tokenizers for Granted: They Are Core Design Decisions in Large Language Models
by: Alqahtani, Sawsan, et al.
Published: (2026)
by: Alqahtani, Sawsan, et al.
Published: (2026)
Token-Level Privacy in Large Language Models
by: Harel, Re'em, et al.
Published: (2025)
by: Harel, Re'em, et al.
Published: (2025)
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
by: Zhao, Chenzhuo, et al.
Published: (2025)
by: Zhao, Chenzhuo, et al.
Published: (2025)
Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems
by: Roy, Soham, et al.
Published: (2025)
by: Roy, Soham, et al.
Published: (2025)
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
by: Ning, Kangyun, et al.
Published: (2024)
by: Ning, Kangyun, et al.
Published: (2024)
Identifying and Analyzing Performance-Critical Tokens in Large Language Models
by: Bai, Yu, et al.
Published: (2024)
by: Bai, Yu, et al.
Published: (2024)
Tokenization Falling Short: On Subword Robustness in Large Language Models
by: Chai, Yekun, et al.
Published: (2024)
by: Chai, Yekun, et al.
Published: (2024)
MorphPiece : A Linguistic Tokenizer for Large Language Models
by: Jabbar, Haris
Published: (2023)
by: Jabbar, Haris
Published: (2023)
Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
by: Gui, Anchun, et al.
Published: (2024)
by: Gui, Anchun, et al.
Published: (2024)
Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via Watermarking
by: Zhang, Jingqi, et al.
Published: (2025)
by: Zhang, Jingqi, et al.
Published: (2025)
Vision-centric Token Compression in Large Language Model
by: Xing, Ling, et al.
Published: (2025)
by: Xing, Ling, et al.
Published: (2025)
Similar Items
-
What's In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models
by: Das, Abhipsha, et al.
Published: (2025) -
Is There a Case for Conversation Optimized Tokenizers in Large Language Models?
by: Ferrando, Raquel, et al.
Published: (2025) -
Using a Large Language Model to generate a Design Structure Matrix
by: Koh, Edwin C. Y.
Published: (2023) -
Large Language Models for Combinatorial Optimization of Design Structure Matrix
by: Jiang, Shuo, et al.
Published: (2024) -
STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning
by: Bharadwaj, Aryasomayajula Ram
Published: (2025)