Saved in:
| Main Authors: | An, Hongjun, Chen, Yifan, Sun, Zhe, Li, Xuelong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.00655 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Physics in Next-token Prediction
by: An, Hongjun, et al.
Published: (2024)
by: An, Hongjun, et al.
Published: (2024)
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
by: Warner, Benjamin, et al.
Published: (2024)
by: Warner, Benjamin, et al.
Published: (2024)
Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models
by: Appicharla, Ramakrishna, et al.
Published: (2025)
by: Appicharla, Ramakrishna, et al.
Published: (2025)
Predicting Sentence Acceptability Judgments in Multimodal Contexts
by: Jang, Hyewon, et al.
Published: (2026)
by: Jang, Hyewon, et al.
Published: (2026)
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
by: Wijesiriwardene, Thilini, et al.
Published: (2023)
by: Wijesiriwardene, Thilini, et al.
Published: (2023)
SpecFuse: Ensembling Large Language Models via Next-Segment Prediction
by: Lv, Bo, et al.
Published: (2024)
by: Lv, Bo, et al.
Published: (2024)
Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
by: Luo, Yifan, et al.
Published: (2024)
by: Luo, Yifan, et al.
Published: (2024)
Single-Pixel Vision-Language Model for Intrinsic Privacy-Preserving Behavioral Intelligence
by: An, Hongjun, et al.
Published: (2026)
by: An, Hongjun, et al.
Published: (2026)
Think in Sentences: Explicit Sentence Boundaries Enhance Language Model's Capabilities
by: Liu, Zhichen, et al.
Published: (2026)
by: Liu, Zhichen, et al.
Published: (2026)
EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts
by: Chaudhury, Subhajit, et al.
Published: (2025)
by: Chaudhury, Subhajit, et al.
Published: (2025)
Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators
by: Rhee, Phill Kyu
Published: (2025)
by: Rhee, Phill Kyu
Published: (2025)
A Law of Next-Token Prediction in Large Language Models
by: He, Hangfeng, et al.
Published: (2024)
by: He, Hangfeng, et al.
Published: (2024)
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression
by: Yuan, Cheng, et al.
Published: (2025)
by: Yuan, Cheng, et al.
Published: (2025)
Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks
by: Song, Yiliang, et al.
Published: (2026)
by: Song, Yiliang, et al.
Published: (2026)
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding
by: Yang, Xinyu, et al.
Published: (2025)
by: Yang, Xinyu, et al.
Published: (2025)
SegNSP: Revisiting Next Sentence Prediction for Linear Text Segmentation
by: Isidro, José, et al.
Published: (2026)
by: Isidro, José, et al.
Published: (2026)
Do Influence Functions Work on Large Language Models?
by: Li, Zhe, et al.
Published: (2024)
by: Li, Zhe, et al.
Published: (2024)
Context-level Language Modeling by Learning Predictive Context Embeddings
by: Dai, Beiya, et al.
Published: (2025)
by: Dai, Beiya, et al.
Published: (2025)
You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models
by: He, Wenchong, et al.
Published: (2025)
by: He, Wenchong, et al.
Published: (2025)
On Speeding Up Language Model Evaluation
by: Zhou, Jin Peng, et al.
Published: (2024)
by: Zhou, Jin Peng, et al.
Published: (2024)
An In-depth Evaluation of Large Language Models in Sentence Simplification with Error-based Human Assessment
by: Wu, Xuanxin, et al.
Published: (2024)
by: Wu, Xuanxin, et al.
Published: (2024)
Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing
by: Koneru, Sai, et al.
Published: (2023)
by: Koneru, Sai, et al.
Published: (2023)
Theoretical Foundations of Scaling Law in Familial Models
by: Song, Huan, et al.
Published: (2025)
by: Song, Huan, et al.
Published: (2025)
Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction
by: Chen, Jianhao, et al.
Published: (2024)
by: Chen, Jianhao, et al.
Published: (2024)
Contextual Clarity: Generating Sentences with Transformer Models using Context-Reverso Data
by: Musaev, Ruslan
Published: (2024)
by: Musaev, Ruslan
Published: (2024)
LangVAE and LangSpace: Building and Probing for Language Model VAEs
by: Carvalho, Danilo S., et al.
Published: (2025)
by: Carvalho, Danilo S., et al.
Published: (2025)
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
by: Shao, Chenze, et al.
Published: (2024)
by: Shao, Chenze, et al.
Published: (2024)
Unveiling Trust in Multimodal Large Language Models: Evaluation, Analysis, and Mitigation
by: Zhang, Yichi, et al.
Published: (2025)
by: Zhang, Yichi, et al.
Published: (2025)
A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction
by: Nishida, Ryo, et al.
Published: (2026)
by: Nishida, Ryo, et al.
Published: (2026)
SQLong: Enhanced NL2SQL for Longer Contexts with LLMs
by: Nguyen, Dai Quoc, et al.
Published: (2025)
by: Nguyen, Dai Quoc, et al.
Published: (2025)
CreditAudit: 2$^\text{nd}$ Dimension for LLM Evaluation and Selection
by: Song, Yiliang, et al.
Published: (2026)
by: Song, Yiliang, et al.
Published: (2026)
AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning
by: Jin, Qiao, et al.
Published: (2024)
by: Jin, Qiao, et al.
Published: (2024)
Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models
by: Liu, Yuliang, et al.
Published: (2026)
by: Liu, Yuliang, et al.
Published: (2026)
Memory Tokens: Large Language Models Can Generate Reversible Sentence Embeddings
by: Sastre, Ignacio, et al.
Published: (2025)
by: Sastre, Ignacio, et al.
Published: (2025)
Latent Reasoning via Sentence Embedding Prediction
by: Hwang, Hyeonbin, et al.
Published: (2025)
by: Hwang, Hyeonbin, et al.
Published: (2025)
VisionZip: Longer is Better but Not Necessary in Vision Language Models
by: Yang, Senqiao, et al.
Published: (2024)
by: Yang, Senqiao, et al.
Published: (2024)
Faster-GCG: Efficient Discrete Optimization Jailbreak Attacks against Aligned Large Language Models
by: Li, Xiao, et al.
Published: (2024)
by: Li, Xiao, et al.
Published: (2024)
MEXMA: Token-level objectives improve sentence representations
by: Janeiro, João Maria, et al.
Published: (2024)
by: Janeiro, João Maria, et al.
Published: (2024)
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models
by: Chen, Yanjun, et al.
Published: (2024)
by: Chen, Yanjun, et al.
Published: (2024)
Similar Items
-
Physics in Next-token Prediction
by: An, Hongjun, et al.
Published: (2024) -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
by: Warner, Benjamin, et al.
Published: (2024) -
Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models
by: Appicharla, Ramakrishna, et al.
Published: (2025) -
Predicting Sentence Acceptability Judgments in Multimodal Contexts
by: Jang, Hyewon, et al.
Published: (2026) -
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
by: Wijesiriwardene, Thilini, et al.
Published: (2023)