Saved in:
| Main Author: | Pan, Wuming |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.11624 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multiple Token Divergence: Measuring and Steering In-Context Computation Density
by: Herrmann, Vincent, et al.
Published: (2025)
by: Herrmann, Vincent, et al.
Published: (2025)
Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training
by: McEntire, Jeremy
Published: (2026)
by: McEntire, Jeremy
Published: (2026)
Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)
by: Khorasani, Sadegh, et al.
Published: (2025)
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)
by: Yousaf, Iqra
Published: (2024)
Before the Last Token: Diagnosing Final-Token Safety Probe Failures
by: Doda, Shravan
Published: (2026)
by: Doda, Shravan
Published: (2026)
Dual VC Dimension Obstructs Sample Compression by Embeddings
by: Chase, Zachary, et al.
Published: (2024)
by: Chase, Zachary, et al.
Published: (2024)
Spherical dimension
by: Chornomaz, Bogdan, et al.
Published: (2025)
by: Chornomaz, Bogdan, et al.
Published: (2025)
Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
by: Walker, Nicholas
Published: (2024)
by: Walker, Nicholas
Published: (2024)
ArrowFlow: Hierarchical Machine Learning in the Space of Permutations
by: Yilmaz, Ozgur
Published: (2026)
by: Yilmaz, Ozgur
Published: (2026)
Loss-Complexity Landscape and Model Structure Functions
by: Kolpakov, Alexander
Published: (2025)
by: Kolpakov, Alexander
Published: (2025)
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
by: Li, Bangzheng, et al.
Published: (2024)
by: Li, Bangzheng, et al.
Published: (2024)
Measuring In-Context Computation Complexity via Hidden State Prediction
by: Herrmann, Vincent, et al.
Published: (2025)
by: Herrmann, Vincent, et al.
Published: (2025)
Explanations Based on Item Response Theory (eXirt): A Model-Specific Method to Explain Tree-Ensemble Model in Trust Perspective
by: Ribeiro, José, et al.
Published: (2022)
by: Ribeiro, José, et al.
Published: (2022)
Beyond Random Sampling: Instance Quality-Based Data Partitioning via Item Response Theory
by: Cardoso, Lucas, et al.
Published: (2025)
by: Cardoso, Lucas, et al.
Published: (2025)
Versatile Ordering Network: An Attention-based Neural Network for Ordering Across Scales and Quality Metrics
by: Yu, Zehua, et al.
Published: (2024)
by: Yu, Zehua, et al.
Published: (2024)
TimeCatcher: A Variational Framework for Volatility-Aware Forecasting of Non-Stationary Time Series
by: Chen, Zhiyu, et al.
Published: (2026)
by: Chen, Zhiyu, et al.
Published: (2026)
A Dynamic Model of Performative Human-ML Collaboration: Theory and Empirical Evidence
by: Sühr, Tom, et al.
Published: (2024)
by: Sühr, Tom, et al.
Published: (2024)
Why LoRA Resists Label Noise: A Theoretical Framework for Noise-Robust Parameter-Efficient Fine-Tuning
by: Steele, Brady
Published: (2026)
by: Steele, Brady
Published: (2026)
Rethinking Thinking Tokens: Understanding Why They Underperform in Practice
by: Vennam, Sreeram, et al.
Published: (2024)
by: Vennam, Sreeram, et al.
Published: (2024)
Pre-trained Models Perform the Best When Token Distributions Follow Zipf's Law
by: He, Yanjin, et al.
Published: (2025)
by: He, Yanjin, et al.
Published: (2025)
Kronecker Embeddings: Byte-Level Structured Token Representations for Parameter-Efficient Language Models
by: Shravan, Rohan
Published: (2026)
by: Shravan, Rohan
Published: (2026)
ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space
by: Yong, Shim Soon
Published: (2025)
by: Yong, Shim Soon
Published: (2025)
Forecasting Company Fundamentals
by: Divo, Felix, et al.
Published: (2024)
by: Divo, Felix, et al.
Published: (2024)
2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
by: Mongaras, Gabriel, et al.
Published: (2026)
by: Mongaras, Gabriel, et al.
Published: (2026)
FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models
by: Polly, Fabien
Published: (2026)
by: Polly, Fabien
Published: (2026)
I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation
by: Thil, Lucas, et al.
Published: (2025)
by: Thil, Lucas, et al.
Published: (2025)
Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
by: Shen, Jingran, et al.
Published: (2023)
by: Shen, Jingran, et al.
Published: (2023)
LLM Vocabulary Compression for Low-Compute Environments
by: Vennam, Sreeram, et al.
Published: (2024)
by: Vennam, Sreeram, et al.
Published: (2024)
A Survey of Reinforcement Learning from Human Feedback
by: Kaufmann, Timo, et al.
Published: (2023)
by: Kaufmann, Timo, et al.
Published: (2023)
A Confidence-Diversity Framework for Calibrating AI Judgement in Accessible Qualitative Coding Tasks
by: Zhao, Zhilong, et al.
Published: (2025)
by: Zhao, Zhilong, et al.
Published: (2025)
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
by: Huang, Yunpeng, et al.
Published: (2024)
by: Huang, Yunpeng, et al.
Published: (2024)
Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
Enhancing Classifier Evaluation: A Fairer Benchmarking Strategy Based on Ability and Robustness
by: Cardoso, Lucas, et al.
Published: (2025)
by: Cardoso, Lucas, et al.
Published: (2025)
A Comparative Analysis of Reinforcement Learning and Conventional Deep Learning Approaches for Bearing Fault Diagnosis
by: Çakır, Efe, et al.
Published: (2025)
by: Çakır, Efe, et al.
Published: (2025)
HGCN(O): A Self-Tuning GCN HyperModel Toolkit for Outcome Prediction in Event-Sequence Data
by: Wang, Fang, et al.
Published: (2025)
by: Wang, Fang, et al.
Published: (2025)
Kolmogorov Arnold Networks and Multi-Layer Perceptrons: A Paradigm Shift in Neural Modelling
by: Gaonkar, Aradhya, et al.
Published: (2026)
by: Gaonkar, Aradhya, et al.
Published: (2026)
Synergy over Discrepancy: A Partition-Based Approach to Multi-Domain LLM Fine-Tuning
by: Ye, Hua, et al.
Published: (2025)
by: Ye, Hua, et al.
Published: (2025)
A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length
by: Xian, Zeqiang, et al.
Published: (2026)
by: Xian, Zeqiang, et al.
Published: (2026)
SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
by: Carballo, Víctor, et al.
Published: (2026)
by: Carballo, Víctor, et al.
Published: (2026)
Similar Items
-
Multiple Token Divergence: Measuring and Steering In-Context Computation Density
by: Herrmann, Vincent, et al.
Published: (2025) -
Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training
by: McEntire, Jeremy
Published: (2026) -
Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025) -
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024) -
Before the Last Token: Diagnosing Final-Token Safety Probe Failures
by: Doda, Shravan
Published: (2026)