| Main Authors: | Sediqin, Mohammadreza; Argamon, Shlomo Engelson |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.07723 |
Similar Items
Simple and Effective Input Reformulations for Translation
by: Yu, Brian, et al.
Published: (2023)
Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material
by: Tannor, Shlomo, et al.
Published: (2022)
FoldGPT: Simple and Effective Large Language Model Compression Scheme
by: Liu, Songwei, et al.
Published: (2024)
Simple and Effective Masked Diffusion Language Models
by: Sahoo, Subham Sekhar, et al.
Published: (2024)
MAPLE: A Framework for Active Preference Learning Guided by Large Language Models
by: Mahmud, Saaduddin, et al.
Published: (2024)
A Simple and Effective Pruning Approach for Large Language Models
by: Sun, Mingjie, et al.
Published: (2023)
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model
by: Chu, Jiqun, et al.
Published: (2024)
Simple Yet Effective: An Information-Theoretic Approach to Multi-LLM Uncertainty Quantification
by: Kruse, Maya, et al.
Published: (2025)
SOI Matters: Analyzing Multi-Setting Training Dynamics in Pretrained Language Models via Subsets of Interest
by: Vassef, Shayan, et al.
Published: (2025)
InnerQ: Hardware-Aware Tuning-Free Quantization of KV Cache for Large Language Models
by: Hosseini, Sayed Mohammadreza Tayaranian, et al.
Published: (2026)
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
by: Guo, Yiran, et al.
Published: (2025)
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
by: Wang, Guoqing, et al.
Published: (2025)
Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
by: Li, Mufei, et al.
Published: (2024)
It's Not That Simple. An Analysis of Simple Test-Time Scaling
by: Wu, Guojun
Published: (2025)
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
by: Kim, Dahyun, et al.
Published: (2023)
A Simple but Effective Closed-form Solution for Extreme Multi-label Learning
by: Onishi, Kazuma, et al.
Published: (2025)
Hyperparameter Loss Surfaces Are Simple Near their Optima
by: Lourie, Nicholas, et al.
Published: (2025)
Simple Mechanistic Explanations for Out-Of-Context Reasoning
by: Wang, Atticus, et al.
Published: (2025)
Simple Mechanisms for Representing, Indexing and Manipulating Concepts
by: Li, Yuanzhi, et al.
Published: (2023)
Compressing LLMs: The Truth is Rarely Pure and Never Simple
by: Jaiswal, Ajay, et al.
Published: (2023)
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters
by: Bałazy, Klaudia, et al.
Published: (2024)
Four Quadrants of Difficulty: A Simple Categorisation and its Limits
by: Toborek, Vanessa, et al.
Published: (2026)
AAVGen: Precision Engineering of Adeno-associated Viral Capsids for Renal Selective Targeting
by: Ghaffarzadeh-Esfahani, Mohammadreza, et al.
Published: (2026)
Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models
by: Tayaranian, Mohammadreza, et al.
Published: (2024)
SimpleGPT: Improving GPT via A Simple Normalization Strategy
by: Chen, Marco, et al.
Published: (2026)
SimPO: Simple Preference Optimization with a Reference-Free Reward
by: Meng, Yu, et al.
Published: (2024)
Simple linear attention language models balance the recall-throughput tradeoff
by: Arora, Simran, et al.
Published: (2024)
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
by: Cai, Tianle, et al.
Published: (2024)
J1: Exploring Simple Test-Time Scaling for LLM-as-a-Judge
by: Chan, Chi-Min, et al.
Published: (2025)
Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings
by: Xu, Liyan, et al.
Published: (2025)
Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
by: Jain, Neel, et al.
Published: (2024)
Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)
Factual Knowledge in Language Models: Robustness and Anomalies under Simple Temporal Context Variations
by: Khodja, Hichem Ammar, et al.
Published: (2025)
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
by: Ren, Liliang, et al.
Published: (2024)
Re-Examine Distantly Supervised NER: A New Benchmark and a Simple Approach
by: Li, Yuepei, et al.
Published: (2024)
s1: Simple test-time scaling
by: Muennighoff, Niklas, et al.
Published: (2025)
Can LLMs Follow Simple Rules?
by: Mu, Norman, et al.
Published: (2023)
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
by: Ben-Zaken, Elad, et al.
Published: (2021)
Beyond Simple Averaging: Improving NLP Ensemble Performance with Topological-Data-Analysis-Based Weighting
by: Proskura, Polina, et al.
Published: (2024)
How Ambiguous Are the Rationales for Natural Language Reasoning? A Simple Approach to Handling Rationale Uncertainty
by: Kim, Hazel H.
Published: (2024)