| Main Authors: | Lamott, Marcel; Shakir, Muhammad Armaghan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.11282 |
Similar Items
LAPDoc: Layout-Aware Prompting for Documents
by: Lamott, Marcel, et al.
Published: (2024)
Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models
by: Wang, Xindi, et al.
Published: (2024)
DocDjinn: Controllable Synthetic Document Generation with VLMs and Handwriting Diffusion
by: Lamott, Marcel, et al.
Published: (2026)
Memory-Augmented Transformers: A Systematic Review from Neuroscience Principles to Enhanced Model Architectures
by: Omidi, Parsa, et al.
Published: (2025)
BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)
Understanding Structured Financial Data with LLMs: A Case Study on Fraud Detection
by: Tan, Xuwei, et al.
Published: (2025)
Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi
by: Deshmukh, Pranita, et al.
Published: (2024)
Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization
by: Nath, Swaroop, et al.
Published: (2024)
Leveraging Zero-Shot Prompting for Efficient Language Model Distillation
by: Vöge, Lukas, et al.
Published: (2024)
Advancing Exchange Rate Forecasting: Leveraging Machine Learning and AI for Enhanced Accuracy in Global Financial Markets
by: Rahat, Md. Yeasin, et al.
Published: (2025)
Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding
by: Guo, Ruohao, et al.
Published: (2023)
ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
by: Ni, Xinyi, et al.
Published: (2025)
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU?
by: Hościłowicz, Jakub, et al.
Published: (2023)
Document Understanding, Measurement, and Manipulation Using Category Theory
by: Claypoole, Jared, et al.
Published: (2025)
AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
by: Ammar, Muhammad, et al.
Published: (2025)
Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques
by: Kasa, Siva Rajesh, et al.
Published: (2024)
Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks
by: Datta, Joyeeta, et al.
Published: (2025)
Leveraging Large Language Models for Automated Causal Loop Diagram Generation: Enhancing System Dynamics Modeling through Curated Prompting Techniques
by: Liu, Ning-Yuan Georgia, et al.
Published: (2025)
Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning
by: Paul, Bidyarthi, et al.
Published: (2025)
Neural Sequence-to-Sequence Modeling with Attention by Leveraging Deep Learning Architectures for Enhanced Contextual Understanding in Abstractive Text Summarization
by: Challagundla, Bhavith Chandra, et al.
Published: (2024)
DRAG: Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation
by: Chen, Jennifer, et al.
Published: (2025)
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)
BitNet Distillation
by: Wu, Xun, et al.
Published: (2025)
Self-Distilled RLVR
by: Yang, Chenxu, et al.
Published: (2026)
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
by: Maekawa, Aru, et al.
Published: (2024)
A Comprehensive Study on Quantization Techniques for Large Language Models
by: Lang, Jiedong, et al.
Published: (2024)
Understanding Post-hoc Explainers: The Case of Anchors
by: Lopardo, Gianluigi, et al.
Published: (2023)
Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top
by: Cheng, Keyuan, et al.
Published: (2024)
Enhancing Clinical Documentation with Synthetic Data: Leveraging Generative Models for Improved Accuracy
by: Biswas, Anjanava, et al.
Published: (2024)
Breaking MLPerf Training: A Case Study on Optimizing BERT
by: Kim, Yongdeok, et al.
Published: (2024)
Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random Features
by: Bombari, Simone, et al.
Published: (2024)
Knowledge Distillation with Training Wheels
by: Liu, Guanlin, et al.
Published: (2025)
Trust Region On-Policy Distillation
by: Xing, Xingrun, et al.
Published: (2026)
A Survey of On-Policy Distillation for Large Language Models
by: Song, Mingyang, et al.
Published: (2026)
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
by: Fang, Luyang, et al.
Published: (2025)
Sociolinguistically Informed Interpretability: A Case Study on Hinglish Emotion Classification
by: Tatariya, Kushal, et al.
Published: (2024)
LLM Library Learning Fails: A LEGO-Prover Case Study
by: Berlot-Attwell, Ian, et al.
Published: (2025)
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
by: Yoo, YoungJoon, et al.
Published: (2023)
Leverage Unlearning to Sanitize LLMs
by: Boutet, Antoine, et al.
Published: (2025)
Leveraging the true depth of LLMs
by: González, Ramón Calvo, et al.
Published: (2025)