:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Pan, Wuming
Format:	Preprint
Published:	2024
Subjects:	General Mathematics Machine Learning I.2.6
Online Access:	https://arxiv.org/abs/2404.11624
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multiple Token Divergence: Measuring and Steering In-Context Computation Density
by: Herrmann, Vincent, et al.
Published: (2025)

Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training
by: McEntire, Jeremy
Published: (2026)

Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)

AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)

Before the Last Token: Diagnosing Final-Token Safety Probe Failures
by: Doda, Shravan
Published: (2026)

Dual VC Dimension Obstructs Sample Compression by Embeddings
by: Chase, Zachary, et al.
Published: (2024)

Spherical dimension
by: Chornomaz, Bogdan, et al.
Published: (2025)

Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)

Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
by: Walker, Nicholas
Published: (2024)

ArrowFlow: Hierarchical Machine Learning in the Space of Permutations
by: Yilmaz, Ozgur
Published: (2026)

Loss-Complexity Landscape and Model Structure Functions
by: Kolpakov, Alexander
Published: (2025)

Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
by: Li, Bangzheng, et al.
Published: (2024)

Measuring In-Context Computation Complexity via Hidden State Prediction
by: Herrmann, Vincent, et al.
Published: (2025)

Explanations Based on Item Response Theory (eXirt): A Model-Specific Method to Explain Tree-Ensemble Model in Trust Perspective
by: Ribeiro, José, et al.
Published: (2022)

Beyond Random Sampling: Instance Quality-Based Data Partitioning via Item Response Theory
by: Cardoso, Lucas, et al.
Published: (2025)

Versatile Ordering Network: An Attention-based Neural Network for Ordering Across Scales and Quality Metrics
by: Yu, Zehua, et al.
Published: (2024)

TimeCatcher: A Variational Framework for Volatility-Aware Forecasting of Non-Stationary Time Series
by: Chen, Zhiyu, et al.
Published: (2026)

A Dynamic Model of Performative Human-ML Collaboration: Theory and Empirical Evidence
by: Sühr, Tom, et al.
Published: (2024)

Why LoRA Resists Label Noise: A Theoretical Framework for Noise-Robust Parameter-Efficient Fine-Tuning
by: Steele, Brady
Published: (2026)

Rethinking Thinking Tokens: Understanding Why They Underperform in Practice
by: Vennam, Sreeram, et al.
Published: (2024)

Pre-trained Models Perform the Best When Token Distributions Follow Zipf's Law
by: He, Yanjin, et al.
Published: (2025)

Kronecker Embeddings: Byte-Level Structured Token Representations for Parameter-Efficient Language Models
by: Shravan, Rohan
Published: (2026)

ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space
by: Yong, Shim Soon
Published: (2025)

Forecasting Company Fundamentals
by: Divo, Felix, et al.
Published: (2024)

2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
by: Mongaras, Gabriel, et al.
Published: (2026)

FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models
by: Polly, Fabien
Published: (2026)

I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation
by: Thil, Lucas, et al.
Published: (2025)

Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
by: Shen, Jingran, et al.
Published: (2023)

LLM Vocabulary Compression for Low-Compute Environments
by: Vennam, Sreeram, et al.
Published: (2024)

A Survey of Reinforcement Learning from Human Feedback
by: Kaufmann, Timo, et al.
Published: (2023)

A Confidence-Diversity Framework for Calibrating AI Judgement in Accessible Qualitative Coding Tasks
by: Zhao, Zhilong, et al.
Published: (2025)

Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
by: Huang, Yunpeng, et al.
Published: (2024)

Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)

Enhancing Classifier Evaluation: A Fairer Benchmarking Strategy Based on Ability and Robustness
by: Cardoso, Lucas, et al.
Published: (2025)

A Comparative Analysis of Reinforcement Learning and Conventional Deep Learning Approaches for Bearing Fault Diagnosis
by: Çakır, Efe, et al.
Published: (2025)

HGCN(O): A Self-Tuning GCN HyperModel Toolkit for Outcome Prediction in Event-Sequence Data
by: Wang, Fang, et al.
Published: (2025)

Kolmogorov Arnold Networks and Multi-Layer Perceptrons: A Paradigm Shift in Neural Modelling
by: Gaonkar, Aradhya, et al.
Published: (2026)

Synergy over Discrepancy: A Partition-Based Approach to Multi-Domain LLM Fine-Tuning
by: Ye, Hua, et al.
Published: (2025)

A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length
by: Xian, Zeqiang, et al.
Published: (2026)

SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
by: Carballo, Víctor, et al.
Published: (2026)