:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Mingchao, Sun, Yu, Sun, Ruixiao, Dong, Xin, Shen, Xiang, Wang, Hongwei, Xiong, Hongyu, Song, Yang
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.15251
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
by: Lu, Xingyu, et al.
Published: (2025)

Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025)

Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation
by: Zhang, Shutong, et al.
Published: (2026)

BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction
by: Li, Jiangmeng, et al.
Published: (2024)

Self-Supervised Prompt Optimization
by: Xiang, Jinyu, et al.
Published: (2025)

RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition
by: Ran, Kun, et al.
Published: (2026)

Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation
by: Schwartz, Daniel, et al.
Published: (2025)

Class-RAG: Real-Time Content Moderation with Retrieval Augmented Generation
by: Chen, Jianfa, et al.
Published: (2024)

SPIN: Self-Supervised Prompt INjection
by: Zhou, Leon, et al.
Published: (2024)

NeurIPS 2023 LLM Efficiency Fine-tuning Competition
by: Saroufim, Mark, et al.
Published: (2025)

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
by: Sun, Zhiqing, et al.
Published: (2024)

Experience-Guided Reflective Co-Evolution of Prompts and Heuristics for Automatic Algorithm Design
by: Liu, Yihong, et al.
Published: (2025)

Relevance to Utility: Process-Supervised Rewrite for RAG
by: Kim, Jaeyoung, et al.
Published: (2025)

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation
by: Liu, Yanjiang, et al.
Published: (2026)

Adaptive Content Restriction for Large Language Models via Suffix Optimization
by: Li, Yige, et al.
Published: (2025)

Context-Aware Content Moderation for German Newspaper Comments
by: Krejca, Felix, et al.
Published: (2025)

Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance
by: Wang, Zixuan, et al.
Published: (2025)

Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
by: Sun, Shichao, et al.
Published: (2024)

Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators
by: Cao, Yang Trista, et al.
Published: (2023)

From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
by: Li, Sha, et al.
Published: (2026)

Socio-Culturally Aware Evaluation Framework for LLM-Based Content Moderation
by: Kumar, Shanu, et al.
Published: (2024)

Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
by: Li, Zhuoqun, et al.
Published: (2024)

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation
by: Lian, Junhong, et al.
Published: (2025)

Adapting LLMs for Efficient Context Processing through Soft Prompt Compression
by: Wang, Cangqing, et al.
Published: (2024)

MIRAGE: Context-Aware Prompt Injection against Mobile GUI Agents via User-Generated Content
by: Guo, Ruoqi, et al.
Published: (2026)

Metamorphic Testing for Audio Content Moderation Software
by: Wang, Wenxuan, et al.
Published: (2025)

Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning
by: He, Guoxiu, et al.
Published: (2025)

SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization
by: Sun, Huashan, et al.
Published: (2025)

Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion
by: Sun, Yu, et al.
Published: (2025)

Executing Natural Language-Described Algorithms with Large Language Models: An Investigation
by: Zheng, Xin, et al.
Published: (2024)

Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency
by: Li, Ruixiao, et al.
Published: (2025)

A Multi-Source Heterogeneous Knowledge Injected Prompt Learning Method for Legal Charge Prediction
by: Sun, Jingyun, et al.
Published: (2024)

Towards the Holographic Characteristic of LLMs for Efficient Short-text Generation
by: Qian, Shun, et al.
Published: (2026)

Fine-tuning vs Prompting, Can Language Models Understand Human Values?
by: Sun, Pingwei
Published: (2024)

Generate, Not Recommend: Personalized Multimodal Content Generation
by: Liu, Jiongnan, et al.
Published: (2025)

Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents
by: Kuang, Jiayi, et al.
Published: (2025)

From Flat to Structural: Enhancing Automated Short Answer Grading with GraphRAG
by: Chu, Yucheng, et al.
Published: (2026)

LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content
by: Foo, Jessica, et al.
Published: (2024)

Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation
by: Zhuang, Jun, et al.
Published: (2025)

GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning
by: Liu, Yuze, et al.
Published: (2024)