| Main Authors: | Nikolich, Aleksandr; Korolev, Konstantin; Bratchikov, Sergei; Kiselev, Igor; Shelmanov, Artem |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2405.13929 |
Similar Items
Exploring Large Language Models for Detecting Mental Disorders
by: Kuzmin, Gleb, et al.
Published: (2024)
GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture
by: GigaChat team, et al.
Published: (2025)
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs
by: Shelmanov, Artem, et al.
Published: (2025)
Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models
by: Kuzmin, Gleb, et al.
Published: (2024)
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
by: Fadeeva, Ekaterina, et al.
Published: (2024)
Prompting and Fine-Tuning Open-Sourced Large Language Models for Stance Classification
by: Cruickshank, Iain J., et al.
Published: (2023)
Phased Instruction Fine-Tuning for Large Language Models
by: Pang, Wei, et al.
Published: (2024)
Investigating Instruction Tuning Large Language Models on Graphs
by: Zhu, Kerui, et al.
Published: (2024)
ReProbe: Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models
by: Ni, Jingwei, et al.
Published: (2025)
Towards Robust Instruction Tuning on Multimodal Large Language Models
by: Han, Wei, et al.
Published: (2024)
Optimizing Psychological Counseling with Instruction-Tuned Large Language Models
by: Li, Wenjie, et al.
Published: (2024)
GraphGPT: Graph Instruction Tuning for Large Language Models
by: Tang, Jiabin, et al.
Published: (2023)
OctoPack: Instruction Tuning Code Large Language Models
by: Muennighoff, Niklas, et al.
Published: (2023)
Fine-Tuning and Evaluating Open-Source Large Language Models for the Army Domain
by: Ruiz, Daniel C., et al.
Published: (2024)
Instruction Mining: Instruction Data Selection for Tuning Large Language Models
by: Cao, Yihan, et al.
Published: (2023)
Instruction Tuning for Large Language Models: A Survey
by: Zhang, Shengyu, et al.
Published: (2023)
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
by: Wang, Jun, et al.
Published: (2024)
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing
by: Tran, Hieu, et al.
Published: (2023)
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design
by: Snegirev, Artem, et al.
Published: (2024)
An Open Source Data Contamination Report for Large Language Models
by: Li, Yucheng, et al.
Published: (2023)
Noise Augmented Fine Tuning for Mitigating Hallucinations in Large Language Models
by: Khadangi, Afshin, et al.
Published: (2025)
Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining
by: Tan, Yanchao, et al.
Published: (2024)
Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning
by: Zhang, Zhongjian, et al.
Published: (2026)
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data
by: Toshniwal, Shubham, et al.
Published: (2024)
Federated Data-Efficient Instruction Tuning for Large Language Models
by: Qin, Zhen, et al.
Published: (2024)
AuditWen:An Open-Source Large Language Model for Audit
by: Huang, Jiajia, et al.
Published: (2024)
Classifying Cancer Stage with Open-Source Clinical Large Language Models
by: Chang, Chia-Hsuan, et al.
Published: (2024)
CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions
by: Rao, Jun, et al.
Published: (2024)
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
by: Zhang, Xinlu, et al.
Published: (2024)
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models
by: Xu, Jiashu, et al.
Published: (2023)
KodeXv0.1: A Family of State-of-the-Art Financial Large Language Models
by: Rajani, Neel, et al.
Published: (2024)
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models
by: Pipatanakul, Kunat, et al.
Published: (2024)
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
by: Zhuo, Terry Yue, et al.
Published: (2024)
Benchmarking Open-Source Large Language Models on Healthcare Text Classification Tasks
by: Guo, Yuting, et al.
Published: (2025)
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
by: Chen, Pinzhen, et al.
Published: (2024)
Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models
by: Peeperkorn, Max, et al.
Published: (2025)
A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning
by: Peng, Cheng, et al.
Published: (2025)
LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language
by: Toraman, Cagri
Published: (2024)
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
by: Toshniwal, Shubham, et al.
Published: (2024)
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
by: Singh, Shivalika, et al.
Published: (2024)