Saved in:
| Main Authors: | Dou, Longxu, Liu, Qian, Zeng, Guangtao, Guo, Jia, Zhou, Jiahui, Lu, Wei, Lin, Min |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03608 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
by: Dou, Longxu, et al.
Published: (2025)
by: Dou, Longxu, et al.
Published: (2025)
RegMix: Data Mixture as Regression for Language Model Pre-training
by: Liu, Qian, et al.
Published: (2024)
by: Liu, Qian, et al.
Published: (2024)
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
by: Guo, Jia, et al.
Published: (2024)
by: Guo, Jia, et al.
Published: (2024)
TinyLlama: An Open-Source Small Language Model
by: Zhang, Peiyuan, et al.
Published: (2024)
by: Zhang, Peiyuan, et al.
Published: (2024)
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
by: Tao, Chaofan, et al.
Published: (2024)
by: Tao, Chaofan, et al.
Published: (2024)
Reasoning Does Not Necessarily Improve Role-Playing Ability
by: Feng, Xiachong, et al.
Published: (2025)
by: Feng, Xiachong, et al.
Published: (2025)
Training Optimal Large Diffusion Language Models
by: Ni, Jinjie, et al.
Published: (2025)
by: Ni, Jinjie, et al.
Published: (2025)
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios
by: Feng, Xiachong, et al.
Published: (2024)
by: Feng, Xiachong, et al.
Published: (2024)
WebSailor: Navigating Super-human Reasoning for Web Agent
by: Li, Kuan, et al.
Published: (2025)
by: Li, Kuan, et al.
Published: (2025)
Scaling up Masked Diffusion Models on Text
by: Nie, Shen, et al.
Published: (2024)
by: Nie, Shen, et al.
Published: (2024)
Unnatural Languages Are Not Bugs but Features for LLMs
by: Duan, Keyu, et al.
Published: (2025)
by: Duan, Keyu, et al.
Published: (2025)
MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis
by: Zhou, Yingjie, et al.
Published: (2024)
by: Zhou, Yingjie, et al.
Published: (2024)
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench
by: Liu, Zheyuan, et al.
Published: (2024)
by: Liu, Zheyuan, et al.
Published: (2024)
Grounding Language Model with Chunking-Free In-Context Retrieval
by: Qian, Hongjin, et al.
Published: (2024)
by: Qian, Hongjin, et al.
Published: (2024)
Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Models
by: Zhao, Lulu, et al.
Published: (2024)
by: Zhao, Lulu, et al.
Published: (2024)
Beyond Memorization: The Challenge of Random Memory Access in Language Models
by: Zhu, Tongyao, et al.
Published: (2024)
by: Zhu, Tongyao, et al.
Published: (2024)
PsychCounsel-Bench: Evaluating the Psychology Intelligence of Large Language Models
by: Zeng, Min
Published: (2025)
by: Zeng, Min
Published: (2025)
Evaluating from Benign to Dynamic Adversarial: A Squid Game for Large Language Models
by: Chen, Zijian, et al.
Published: (2025)
by: Chen, Zijian, et al.
Published: (2025)
CareBot: A Pioneering Full-Process Open-Source Medical Language Model
by: Zhao, Lulu, et al.
Published: (2024)
by: Zhao, Lulu, et al.
Published: (2024)
YuLan: An Open-source Large Language Model
by: Zhu, Yutao, et al.
Published: (2024)
by: Zhu, Yutao, et al.
Published: (2024)
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant
by: Zhou, Yujia, et al.
Published: (2024)
by: Zhou, Yujia, et al.
Published: (2024)
Purifying Large Language Models by Ensembling a Small Language Model
by: Li, Tianlin, et al.
Published: (2024)
by: Li, Tianlin, et al.
Published: (2024)
TasTe: Teaching Large Language Models to Translate through Self-Reflection
by: Wang, Yutong, et al.
Published: (2024)
by: Wang, Yutong, et al.
Published: (2024)
Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models
by: Chen, Xiaojun, et al.
Published: (2024)
by: Chen, Xiaojun, et al.
Published: (2024)
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
by: Zeng, Weihao, et al.
Published: (2025)
by: Zeng, Weihao, et al.
Published: (2025)
Natural Language Fine-Tuning
by: Liu, Jia, et al.
Published: (2024)
by: Liu, Jia, et al.
Published: (2024)
LKD-KGC: Domain-Specific KG Construction via LLM-driven Knowledge Dependency Parsing
by: Sun, Jiaqi, et al.
Published: (2025)
by: Sun, Jiaqi, et al.
Published: (2025)
SeaLLMs-Audio: Large Audio-Language Models for Southeast Asia
by: Liu, Chaoqun, et al.
Published: (2025)
by: Liu, Chaoqun, et al.
Published: (2025)
An Open Source Data Contamination Report for Large Language Models
by: Li, Yucheng, et al.
Published: (2023)
by: Li, Yucheng, et al.
Published: (2023)
Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering
by: Liu, Jiaxiang, et al.
Published: (2024)
by: Liu, Jiaxiang, et al.
Published: (2024)
Benchmarking Open-Source Large Language Models on Healthcare Text Classification Tasks
by: Guo, Yuting, et al.
Published: (2025)
by: Guo, Yuting, et al.
Published: (2025)
Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements
by: Qian, Yushan, et al.
Published: (2023)
by: Qian, Yushan, et al.
Published: (2023)
A Multi-To-One Interview Paradigm for Efficient MLLM Evaluation
by: Shen, Ye, et al.
Published: (2025)
by: Shen, Ye, et al.
Published: (2025)
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
by: Luo, Renjie, et al.
Published: (2025)
by: Luo, Renjie, et al.
Published: (2025)
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models
by: Shao, Hang, et al.
Published: (2023)
by: Shao, Hang, et al.
Published: (2023)
FlexOlmo: Open Language Models for Flexible Data Use
by: Shi, Weijia, et al.
Published: (2025)
by: Shi, Weijia, et al.
Published: (2025)
Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models
by: Cao, Jiaqi, et al.
Published: (2025)
by: Cao, Jiaqi, et al.
Published: (2025)
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
by: Shen, Maohao, et al.
Published: (2025)
by: Shen, Maohao, et al.
Published: (2025)
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
by: Qian, Cheng, et al.
Published: (2023)
by: Qian, Cheng, et al.
Published: (2023)
Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model
by: Zhou, Peng, et al.
Published: (2024)
by: Zhou, Peng, et al.
Published: (2024)
Similar Items
-
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
by: Dou, Longxu, et al.
Published: (2025) -
RegMix: Data Mixture as Regression for Language Model Pre-training
by: Liu, Qian, et al.
Published: (2024) -
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
by: Guo, Jia, et al.
Published: (2024) -
TinyLlama: An Open-Source Small Language Model
by: Zhang, Peiyuan, et al.
Published: (2024) -
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
by: Tao, Chaofan, et al.
Published: (2024)