Saved in:
| Main Authors: | Liu, Wenhao, An, Siyu, Lu, Junru, Wu, Muling, Li, Tianlong, Wang, Xiaohua, lv, Changze, Zheng, Xiaoqing, Yin, Di, Sun, Xing, Huang, Xuanjing |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.16913 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Advancing Parameter Efficiency in Fine-tuning via Representation Editing
by: Wu, Muling, et al.
Published: (2024)
by: Wu, Muling, et al.
Published: (2024)
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
by: Li, Tianlong, et al.
Published: (2024)
by: Li, Tianlong, et al.
Published: (2024)
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023)
by: Liu, Wenhao, et al.
Published: (2023)
UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation
by: Li, Tianlong, et al.
Published: (2023)
by: Li, Tianlong, et al.
Published: (2023)
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
by: Zhu, JianHao, et al.
Published: (2024)
by: Zhu, JianHao, et al.
Published: (2024)
NeIn: Telling What You Don't Want
by: Bui, Nhat-Tan, et al.
Published: (2024)
by: Bui, Nhat-Tan, et al.
Published: (2024)
SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network
by: Lv, Changze, et al.
Published: (2023)
by: Lv, Changze, et al.
Published: (2023)
Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs
by: Johnson, Daniel D., et al.
Published: (2024)
by: Johnson, Daniel D., et al.
Published: (2024)
Visually Dehallucinative Instruction Generation: Know What You Don't Know
by: Cha, Sungguk, et al.
Published: (2024)
by: Cha, Sungguk, et al.
Published: (2024)
Show Me What You Don't Know: Efficient Sampling from Invariant Sets for Model Validation
by: Rousselot, Armand, et al.
Published: (2026)
by: Rousselot, Armand, et al.
Published: (2026)
Know What You Don't Know: Uncertainty Calibration of Process Reward Models
by: Park, Young-Jin, et al.
Published: (2025)
by: Park, Young-Jin, et al.
Published: (2025)
Know What You Don't Know: Selective Prediction for Early Exit DNNs
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
What LLMs Think When You Don't Tell Them What to Think About?
by: Kwon, Yongchan, et al.
Published: (2026)
by: Kwon, Yongchan, et al.
Published: (2026)
Imagining What We Don't Know
by: Samuels, Lisa
Published: (2026)
by: Samuels, Lisa
Published: (2026)
Imagining What We Don't Know
by: Samuels, Lisa
Published: (2026)
by: Samuels, Lisa
Published: (2026)
Can AI Assistants Know What They Don't Know?
by: Cheng, Qinyuan, et al.
Published: (2024)
by: Cheng, Qinyuan, et al.
Published: (2024)
Knowing What You Know Is Not Enough: Large Language Model Confidences Don't Align With Their Actions
by: Pal, Arka, et al.
Published: (2025)
by: Pal, Arka, et al.
Published: (2025)
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs
by: Yeom, Jewon, et al.
Published: (2026)
by: Yeom, Jewon, et al.
Published: (2026)
Enhancing Model Privacy in Federated Learning with Random Masking and Quantization
by: Xu, Zhibo, et al.
Published: (2025)
by: Xu, Zhibo, et al.
Published: (2025)
What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction
by: Li, Lehui, et al.
Published: (2026)
by: Li, Lehui, et al.
Published: (2026)
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
by: Wu, Muling, et al.
Published: (2025)
by: Wu, Muling, et al.
Published: (2025)
Hatevolution: What Static Benchmarks Don't Tell Us
by: Di Bonaventura, Chiara, et al.
Published: (2025)
by: Di Bonaventura, Chiara, et al.
Published: (2025)
Euphemisms: Tell Me What You Do and I'll Tell You What You Are!
by: Perez, Cenel Augusto
Published: (2025)
by: Perez, Cenel Augusto
Published: (2025)
When You Don't Know the Answer, Say So
Published: (2024)
Published: (2024)
Improving Continual Pre-training Through Seamless Data Packing
by: Yin, Ruicheng, et al.
Published: (2025)
by: Yin, Ruicheng, et al.
Published: (2025)
Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell
by: Lu, Taiming, et al.
Published: (2024)
by: Lu, Taiming, et al.
Published: (2024)
Large Language Models Must Be Taught to Know What They Don't Know
by: Kapoor, Sanyam, et al.
Published: (2024)
by: Kapoor, Sanyam, et al.
Published: (2024)
"ChatGPT, Don't Tell Me What to Do": Designing AI for Context Analysis in Humanitarian Frontline Negotiations
by: Ma, ZIlin, et al.
Published: (2024)
by: Ma, ZIlin, et al.
Published: (2024)
First, Learn What You Don't Know: Active Information Gathering for Driving at the Limits of Handling
by: Davydov, Alexander, et al.
Published: (2024)
by: Davydov, Alexander, et al.
Published: (2024)
What We Know and What We Don't Know About the Function of γδ T Cells
by: Immo Prinz, et al.
Published: (2025)
by: Immo Prinz, et al.
Published: (2025)
Don't Say No: Jailbreaking LLM by Suppressing Refusal
by: Zhou, Yukai, et al.
Published: (2024)
by: Zhou, Yukai, et al.
Published: (2024)
Pulsed‐Field Ablation: What We Know and What We Don't
by: Sanghamitra Mohanty, et al.
Published: (2025)
by: Sanghamitra Mohanty, et al.
Published: (2025)
Bayesian Mixture-of-Experts: Towards Making LLMs Know What They Don't Know
by: Li, Albus Yizhuo
Published: (2025)
by: Li, Albus Yizhuo
Published: (2025)
But Don't Play With Me `Cause You're Playing With a Farmer – Electoral Consequences of Proposed Animal Welfare Reforms in Rural Poland
by: Karol Degórski, et al.
Published: (2026)
by: Karol Degórski, et al.
Published: (2026)
What Librarians Still Don't Know about Free Software
by: Chudnov, Daniel
Published: (2009)
by: Chudnov, Daniel
Published: (2009)
VIB-Probe: Detecting and Mitigating Hallucinations in Vision-Language Models via Variational Information Bottleneck
by: Zhang, Feiran, et al.
Published: (2026)
by: Zhang, Feiran, et al.
Published: (2026)
What You Don't Know Won't Hurt You: Self-Consistent Hierarchical Inference with Unknown Follow-up Selection Strategies
by: Essick, Reed, et al.
Published: (2026)
by: Essick, Reed, et al.
Published: (2026)
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
by: de Carvalho, Gonçalo Hora, et al.
Published: (2024)
by: de Carvalho, Gonçalo Hora, et al.
Published: (2024)
Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users
by: Balepur, Nishant, et al.
Published: (2026)
by: Balepur, Nishant, et al.
Published: (2026)
How to Computerize Your Serials and Periodicals When You Don't Know How
by: Matthews, Mary, et al.
Published: (1970)
by: Matthews, Mary, et al.
Published: (1970)
Similar Items
-
Advancing Parameter Efficiency in Fine-tuning via Representation Editing
by: Wu, Muling, et al.
Published: (2024) -
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
by: Li, Tianlong, et al.
Published: (2024) -
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023) -
UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation
by: Li, Tianlong, et al.
Published: (2023) -
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
by: Zhu, JianHao, et al.
Published: (2024)