Saved in:
| Main Authors: | Wu, Haolun, Yuan, Ye, Mikaelyan, Liana, Meulemans, Alexander, Liu, Xue, Hensman, James, Mitra, Bhaskar |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.04437 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
KBLaM: Knowledge Base augmented Language Model
by: Wang, Xi, et al.
Published: (2024)
by: Wang, Xi, et al.
Published: (2024)
DiSK: A Diffusion Model for Structured Knowledge
by: Kitouni, Ouail, et al.
Published: (2023)
by: Kitouni, Ouail, et al.
Published: (2023)
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
by: Ashkboos, Saleh, et al.
Published: (2024)
by: Ashkboos, Saleh, et al.
Published: (2024)
OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization
by: Gadhikar, Advait, et al.
Published: (2025)
by: Gadhikar, Advait, et al.
Published: (2025)
Extracting Rule-based Descriptions of Attention Features in Transformers
by: Friedman, Dan, et al.
Published: (2025)
by: Friedman, Dan, et al.
Published: (2025)
Understanding 6G through Language Models: A Case Study on LLM-aided Structured Entity Extraction in Telecom Domain
by: Yuan, Ye, et al.
Published: (2025)
by: Yuan, Ye, et al.
Published: (2025)
Logits are All We Need to Adapt Closed Models
by: Hiranandani, Gaurush, et al.
Published: (2025)
by: Hiranandani, Gaurush, et al.
Published: (2025)
The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models
by: Bhaskar, Adithya, et al.
Published: (2024)
by: Bhaskar, Adithya, et al.
Published: (2024)
Entity Matching using Large Language Models
by: Peeters, Ralph, et al.
Published: (2023)
by: Peeters, Ralph, et al.
Published: (2023)
Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks
by: Loureiro, Manuel V., et al.
Published: (2023)
by: Loureiro, Manuel V., et al.
Published: (2023)
Confidence Calibration in Large Language Model-Based Entity Matching
by: Kamsteeg, Iris, et al.
Published: (2025)
by: Kamsteeg, Iris, et al.
Published: (2025)
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures
by: Chang, Fu-Chieh, et al.
Published: (2024)
by: Chang, Fu-Chieh, et al.
Published: (2024)
Large language models can accurately predict searcher preferences
by: Thomas, Paul, et al.
Published: (2023)
by: Thomas, Paul, et al.
Published: (2023)
Supply Chain Network Extraction and Entity Classification Leveraging Large Language Models
by: Liu, Tong, et al.
Published: (2024)
by: Liu, Tong, et al.
Published: (2024)
H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models
by: Dawes, Cutter, et al.
Published: (2026)
by: Dawes, Cutter, et al.
Published: (2026)
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model
by: Yan, Yongyu, et al.
Published: (2022)
by: Yan, Yongyu, et al.
Published: (2022)
Empirical Study of Named Entity Recognition Performance Using Distribution-aware Word Embedding
by: Chen, Xin, et al.
Published: (2021)
by: Chen, Xin, et al.
Published: (2021)
How do Language Models Bind Entities in Context?
by: Feng, Jiahai, et al.
Published: (2023)
by: Feng, Jiahai, et al.
Published: (2023)
Large Language Models Struggle in Token-Level Clinical Named Entity Recognition
by: Lu, Qiuhao, et al.
Published: (2024)
by: Lu, Qiuhao, et al.
Published: (2024)
Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
by: Wang, Fei, et al.
Published: (2024)
by: Wang, Fei, et al.
Published: (2024)
Structured Agent Distillation for Large Language Model
by: Liu, Jun, et al.
Published: (2025)
by: Liu, Jun, et al.
Published: (2025)
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
by: Manvi, Rohin, et al.
Published: (2023)
by: Manvi, Rohin, et al.
Published: (2023)
Democratizing Tool Learning with Environments Fully Simulated by a Free 8B Language Model
by: Tang, Chenming, et al.
Published: (2026)
by: Tang, Chenming, et al.
Published: (2026)
Multi-View Empowered Structural Graph Wordification for Language Models
by: Liu, Zipeng, et al.
Published: (2024)
by: Liu, Zipeng, et al.
Published: (2024)
Continual Learning for Large Language Models: A Survey
by: Wu, Tongtong, et al.
Published: (2024)
by: Wu, Tongtong, et al.
Published: (2024)
SUMIE: A Synthetic Benchmark for Incremental Entity Summarization
by: Hwang, Eunjeong, et al.
Published: (2024)
by: Hwang, Eunjeong, et al.
Published: (2024)
Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)
by: Shan, Alexander, et al.
Published: (2024)
Comparative Study of Pre-Trained BERT and Large Language Models for Code-Mixed Named Entity Recognition
by: Shirke, Mayur, et al.
Published: (2025)
by: Shirke, Mayur, et al.
Published: (2025)
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data
by: Wu, Xue, et al.
Published: (2024)
by: Wu, Xue, et al.
Published: (2024)
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
by: Ferrando, Javier, et al.
Published: (2024)
by: Ferrando, Javier, et al.
Published: (2024)
AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling
by: Capstick, Alexander, et al.
Published: (2024)
by: Capstick, Alexander, et al.
Published: (2024)
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient
by: Gao, Yuan, et al.
Published: (2024)
by: Gao, Yuan, et al.
Published: (2024)
Design Editing for Offline Model-based Optimization
by: Yuan, Ye, et al.
Published: (2024)
by: Yuan, Ye, et al.
Published: (2024)
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
by: Heng, Yuzhao, et al.
Published: (2024)
by: Heng, Yuzhao, et al.
Published: (2024)
On the Thinking-Language Modeling Gap in Large Language Models
by: Liu, Chenxi, et al.
Published: (2025)
by: Liu, Chenxi, et al.
Published: (2025)
Comparative Performance Evaluation of Large Language Models for Extracting Molecular Interactions and Pathway Knowledge
by: Park, Gilchan, et al.
Published: (2023)
by: Park, Gilchan, et al.
Published: (2023)
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
by: Li, Menglin, et al.
Published: (2024)
by: Li, Menglin, et al.
Published: (2024)
Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models
by: Yuan, Yu, et al.
Published: (2024)
by: Yuan, Yu, et al.
Published: (2024)
Rethinking Token Prediction: Tree-Structured Diffusion Language Model
by: Wu, Zihao, et al.
Published: (2026)
by: Wu, Zihao, et al.
Published: (2026)
MOFI: Learning Image Representations from Noisy Entity Annotated Images
by: Wu, Wentao, et al.
Published: (2023)
by: Wu, Wentao, et al.
Published: (2023)
Similar Items
-
KBLaM: Knowledge Base augmented Language Model
by: Wang, Xi, et al.
Published: (2024) -
DiSK: A Diffusion Model for Structured Knowledge
by: Kitouni, Ouail, et al.
Published: (2023) -
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
by: Ashkboos, Saleh, et al.
Published: (2024) -
OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization
by: Gadhikar, Advait, et al.
Published: (2025) -
Extracting Rule-based Descriptions of Attention Features in Transformers
by: Friedman, Dan, et al.
Published: (2025)