Saved in:
| Main Authors: | Wang, Xiaohua, Huang, Zisu, Zhang, Feiran, Xu, Zhibo, Zhang, Cenyuan, Qian, Qi, Zheng, Xiaoqing, Huang, Xuanjing |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.01461 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023)
by: Liu, Wenhao, et al.
Published: (2023)
SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation
by: Lv, Changze, et al.
Published: (2023)
by: Lv, Changze, et al.
Published: (2023)
Enhancing Model Privacy in Federated Learning with Random Masking and Quantization
by: Xu, Zhibo, et al.
Published: (2025)
by: Xu, Zhibo, et al.
Published: (2025)
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
by: Zhu, JianHao, et al.
Published: (2024)
by: Zhu, JianHao, et al.
Published: (2024)
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
by: Wu, Muling, et al.
Published: (2025)
by: Wu, Muling, et al.
Published: (2025)
IntentionReasoner: Facilitating Adaptive LLM Safeguards through Intent Reasoning and Selective Query Refinement
by: Shen, Yuanzhe, et al.
Published: (2025)
by: Shen, Yuanzhe, et al.
Published: (2025)
Advancing Parameter Efficiency in Fine-tuning via Representation Editing
by: Wu, Muling, et al.
Published: (2024)
by: Wu, Muling, et al.
Published: (2024)
Searching for Best Practices in Retrieval-Augmented Generation
by: Wang, Xiaohua, et al.
Published: (2024)
by: Wang, Xiaohua, et al.
Published: (2024)
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
by: Lv, Changze, et al.
Published: (2026)
by: Lv, Changze, et al.
Published: (2026)
SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network
by: Lv, Changze, et al.
Published: (2023)
by: Lv, Changze, et al.
Published: (2023)
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling
by: Wu, Yixin, et al.
Published: (2025)
by: Wu, Yixin, et al.
Published: (2025)
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
by: Zhang, Yuansen, et al.
Published: (2024)
by: Zhang, Yuansen, et al.
Published: (2024)
Decoding Continuous Character-based Language from Non-invasive Brain Recordings
by: Zhang, Cenyuan, et al.
Published: (2024)
by: Zhang, Cenyuan, et al.
Published: (2024)
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
by: Xi, Zhiheng, et al.
Published: (2023)
by: Xi, Zhiheng, et al.
Published: (2023)
On the Tip of the Tongue: Analyzing Conceptual Representation in Large Language Models with Reverse-Dictionary Probe
by: Xu, Ningyu, et al.
Published: (2024)
by: Xu, Ningyu, et al.
Published: (2024)
CSSG: Measuring Code Similarity with Semantic Graphs
by: Lu, Yiyang, et al.
Published: (2026)
by: Lu, Yiyang, et al.
Published: (2026)
Benchmark^2: Systematic Evaluation of LLM Benchmarks
by: Qian, Qi, et al.
Published: (2026)
by: Qian, Qi, et al.
Published: (2026)
Improving Continual Pre-training Through Seamless Data Packing
by: Yin, Ruicheng, et al.
Published: (2025)
by: Yin, Ruicheng, et al.
Published: (2025)
VIB-Probe: Detecting and Mitigating Hallucinations in Vision-Language Models via Variational Information Bottleneck
by: Zhang, Feiran, et al.
Published: (2026)
by: Zhang, Feiran, et al.
Published: (2026)
Emergent Structured Representations Support Flexible In-Context Inference in Large Language Models
by: Xu, Ningyu, et al.
Published: (2026)
by: Xu, Ningyu, et al.
Published: (2026)
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data
by: Xia, Han, et al.
Published: (2024)
by: Xia, Han, et al.
Published: (2024)
UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation
by: Li, Tianlong, et al.
Published: (2023)
by: Li, Tianlong, et al.
Published: (2023)
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
by: Li, Tianlong, et al.
Published: (2024)
by: Li, Tianlong, et al.
Published: (2024)
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
by: Zhao, Jun, et al.
Published: (2024)
by: Zhao, Jun, et al.
Published: (2024)
Unveiling Linguistic Regions in Large Language Models
by: Zhang, Zhihao, et al.
Published: (2024)
by: Zhang, Zhihao, et al.
Published: (2024)
BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation
by: Xu, Jingwen, et al.
Published: (2026)
by: Xu, Jingwen, et al.
Published: (2026)
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling
by: Wang, Zhenghua, et al.
Published: (2025)
by: Wang, Zhenghua, et al.
Published: (2025)
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
by: Ye, Junjie, et al.
Published: (2025)
by: Ye, Junjie, et al.
Published: (2025)
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
by: Ye, Junjie, et al.
Published: (2024)
by: Ye, Junjie, et al.
Published: (2024)
SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading
by: Shen, Yuanzhe, et al.
Published: (2025)
by: Shen, Yuanzhe, et al.
Published: (2025)
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
by: Ye, Junjie, et al.
Published: (2024)
by: Ye, Junjie, et al.
Published: (2024)
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning
by: Zhao, Jun, et al.
Published: (2024)
by: Zhao, Jun, et al.
Published: (2024)
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2024)
by: Xi, Zhiheng, et al.
Published: (2024)
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
by: Yu, Song, et al.
Published: (2024)
by: Yu, Song, et al.
Published: (2024)
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM
by: Hong, Zijin, et al.
Published: (2024)
by: Hong, Zijin, et al.
Published: (2024)
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
by: Xi, Zhiheng, et al.
Published: (2025)
Improving RL Exploration for LLM Reasoning through Retrospective Replay
by: Dou, Shihan, et al.
Published: (2025)
by: Dou, Shihan, et al.
Published: (2025)
LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition
by: Ye, Junjie, et al.
Published: (2024)
by: Ye, Junjie, et al.
Published: (2024)
Structure Guided Large Language Model for SQL Generation
by: Zhang, Qinggang, et al.
Published: (2024)
by: Zhang, Qinggang, et al.
Published: (2024)
ErrorLLM: Modeling SQL Errors for Text-to-SQL Refinement
by: Hong, Zijin, et al.
Published: (2026)
by: Hong, Zijin, et al.
Published: (2026)
Similar Items
-
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023) -
SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation
by: Lv, Changze, et al.
Published: (2023) -
Enhancing Model Privacy in Federated Learning with Random Masking and Quantization
by: Xu, Zhibo, et al.
Published: (2025) -
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
by: Zhu, JianHao, et al.
Published: (2024) -
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
by: Wu, Muling, et al.
Published: (2025)