Saved in:
Bibliographic Details
Main Authors: Ma, Luyi, Zhang, Wanjia Sherry, Fan, Zezhong, Thakur, Shubham, Zhao, Kai, Yao, Kehui, Agarwal, Ayush, Iyer, Rahul, Cho, Jason, Xu, Jianpeng, Korpeoglu, Evren, Kumar, Sushant, Achan, Kannan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.12096
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913028949147648
author Ma, Luyi
Zhang, Wanjia Sherry
Fan, Zezhong
Thakur, Shubham
Zhao, Kai
Yao, Kehui
Agarwal, Ayush
Iyer, Rahul
Cho, Jason
Xu, Jianpeng
Korpeoglu, Evren
Kumar, Sushant
Achan, Kannan
author_facet Ma, Luyi
Zhang, Wanjia Sherry
Fan, Zezhong
Thakur, Shubham
Zhao, Kai
Yao, Kehui
Agarwal, Ayush
Iyer, Rahul
Cho, Jason
Xu, Jianpeng
Korpeoglu, Evren
Kumar, Sushant
Achan, Kannan
contents On online advertising platforms, newly introduced promotional ads face the cold-start problem, as they lack sufficient user feedback for model training. In this work, we propose LLM-HYPER, a novel framework that treats large language models (LLMs) as hypernetworks to directly generate the parameters of the click-through rate (CTR) estimator in a training-free manner. LLM-HYPER uses few-shot Chain-of-Thought prompting over multimodal ad content (text and images) to infer feature-wise model weights for a linear CTR predictor. By retrieving semantically similar past campaigns via CLIP embeddings and formatting them into prompt-based demonstrations, the LLM learns to reason about customer intent, feature influence, and content relevance. To ensure numerical stability and serviceability, we introduce normalization and calibration techniques that align the generated weights with production-ready CTR distributions. Extensive offline experiments show that LLM-HYPER significantly outperforms cold-start baselines in NDCG$@10$ by 55.9\%. Our real-world online A/B test on one of the top e-commerce platforms in the U.S. demonstrates the strong performance of LLM-HYPER, which drastically reduces the cold-start period and achieves competitive performance. LLM-HYPER has been successfully deployed in production.
format Preprint
id arxiv_https___arxiv_org_abs_2604_12096
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle LLM-HYPER: Generative CTR Modeling for Cold-Start Ad Personalization via LLM-Based Hypernetworks
Ma, Luyi
Zhang, Wanjia Sherry
Fan, Zezhong
Thakur, Shubham
Zhao, Kai
Yao, Kehui
Agarwal, Ayush
Iyer, Rahul
Cho, Jason
Xu, Jianpeng
Korpeoglu, Evren
Kumar, Sushant
Achan, Kannan
Artificial Intelligence
On online advertising platforms, newly introduced promotional ads face the cold-start problem, as they lack sufficient user feedback for model training. In this work, we propose LLM-HYPER, a novel framework that treats large language models (LLMs) as hypernetworks to directly generate the parameters of the click-through rate (CTR) estimator in a training-free manner. LLM-HYPER uses few-shot Chain-of-Thought prompting over multimodal ad content (text and images) to infer feature-wise model weights for a linear CTR predictor. By retrieving semantically similar past campaigns via CLIP embeddings and formatting them into prompt-based demonstrations, the LLM learns to reason about customer intent, feature influence, and content relevance. To ensure numerical stability and serviceability, we introduce normalization and calibration techniques that align the generated weights with production-ready CTR distributions. Extensive offline experiments show that LLM-HYPER significantly outperforms cold-start baselines in NDCG$@10$ by 55.9\%. Our real-world online A/B test on one of the top e-commerce platforms in the U.S. demonstrates the strong performance of LLM-HYPER, which drastically reduces the cold-start period and achieves competitive performance. LLM-HYPER has been successfully deployed in production.
title LLM-HYPER: Generative CTR Modeling for Cold-Start Ad Personalization via LLM-Based Hypernetworks
topic Artificial Intelligence
url https://arxiv.org/abs/2604.12096