| Main Authors: | Liu, Mingchao, Sun, Yu, Sun, Ruixiao, Dong, Xin, Shen, Xiang, Wang, Hongwei, Xiong, Hongyu, Song, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.15251 |
Similar Items
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
by: Lu, Xingyu, et al.
Published: (2025)
Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025)
Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation
by: Zhang, Shutong, et al.
Published: (2026)
BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction
by: Li, Jiangmeng, et al.
Published: (2024)
Self-Supervised Prompt Optimization
by: Xiang, Jinyu, et al.
Published: (2025)
RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition
by: Ran, Kun, et al.
Published: (2026)
Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation
by: Schwartz, Daniel, et al.
Published: (2025)
Class-RAG: Real-Time Content Moderation with Retrieval Augmented Generation
by: Chen, Jianfa, et al.
Published: (2024)
SPIN: Self-Supervised Prompt INjection
by: Zhou, Leon, et al.
Published: (2024)
NeurIPS 2023 LLM Efficiency Fine-tuning Competition
by: Saroufim, Mark, et al.
Published: (2025)
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
by: Sun, Zhiqing, et al.
Published: (2024)
Experience-Guided Reflective Co-Evolution of Prompts and Heuristics for Automatic Algorithm Design
by: Liu, Yihong, et al.
Published: (2025)
Relevance to Utility: Process-Supervised Rewrite for RAG
by: Kim, Jaeyoung, et al.
Published: (2025)
Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation
by: Liu, Yanjiang, et al.
Published: (2026)
Adaptive Content Restriction for Large Language Models via Suffix Optimization
by: Li, Yige, et al.
Published: (2025)
Context-Aware Content Moderation for German Newspaper Comments
by: Krejca, Felix, et al.
Published: (2025)
Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance
by: Wang, Zixuan, et al.
Published: (2025)
Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
by: Sun, Shichao, et al.
Published: (2024)
Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators
by: Cao, Yang Trista, et al.
Published: (2023)
From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
by: Li, Sha, et al.
Published: (2026)
Socio-Culturally Aware Evaluation Framework for LLM-Based Content Moderation
by: Kumar, Shanu, et al.
Published: (2024)
Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
by: Li, Zhuoqun, et al.
Published: (2024)
Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation
by: Lian, Junhong, et al.
Published: (2025)
Adapting LLMs for Efficient Context Processing through Soft Prompt Compression
by: Wang, Cangqing, et al.
Published: (2024)
MIRAGE: Context-Aware Prompt Injection against Mobile GUI Agents via User-Generated Content
by: Guo, Ruoqi, et al.
Published: (2026)
Metamorphic Testing for Audio Content Moderation Software
by: Wang, Wenxuan, et al.
Published: (2025)
Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning
by: He, Guoxiu, et al.
Published: (2025)
SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization
by: Sun, Huashan, et al.
Published: (2025)
Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion
by: Sun, Yu, et al.
Published: (2025)
Executing Natural Language-Described Algorithms with Large Language Models: An Investigation
by: Zheng, Xin, et al.
Published: (2024)
Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency
by: Li, Ruixiao, et al.
Published: (2025)
A Multi-Source Heterogeneous Knowledge Injected Prompt Learning Method for Legal Charge Prediction
by: Sun, Jingyun, et al.
Published: (2024)
Towards the Holographic Characteristic of LLMs for Efficient Short-text Generation
by: Qian, Shun, et al.
Published: (2026)
Fine-tuning vs Prompting, Can Language Models Understand Human Values?
by: Sun, Pingwei
Published: (2024)
Generate, Not Recommend: Personalized Multimodal Content Generation
by: Liu, Jiongnan, et al.
Published: (2025)
Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents
by: Kuang, Jiayi, et al.
Published: (2025)
From Flat to Structural: Enhancing Automated Short Answer Grading with GraphRAG
by: Chu, Yucheng, et al.
Published: (2026)
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content
by: Foo, Jessica, et al.
Published: (2024)
Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation
by: Zhuang, Jun, et al.
Published: (2025)
GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning
by: Liu, Yuze, et al.
Published: (2024)