-д хадгалсан:
Номзүйн дэлгэрэнгүй
Үндсэн зохиолч: Shanmugam, Alagappan
Формат: Recurso digital
Хэл сонгох:англи
Хэвлэсэн: Zenodo 2021
Нөхцлүүд:
Онлайн хандалт:https://doi.org/10.5281/zenodo.20131064
Шошгууд: Шошго нэмэх
Шошго байхгүй, Энэхүү баримтыг шошголох эхний хүн болох!
_version_ 1866901748251099136
author Shanmugam, Alagappan
author_facet Shanmugam, Alagappan
contents <p>Streaming platforms have scaled their recommendation engines largely through collaborative filtering (CF), a family of techniques that infers user preferences from behavioral patterns. While CF has proven effective, it carries well known limitations: poor handling of new content with no viewing history, a tendency to reinforce popularity bias, and an inability to explain why a given title was recommended. This article examines how multimodal content understanding, where systems jointly analyze video, audio, and textual signals from the media itself, offers a practical path beyond these constraints. I describe a three pillar framework (visual intelligence, audio intelligence, and semantic intelligence) that produces unified content embeddings, and discuss how these representations address cold start, long tail discovery, and recommendation transparency. This paper draws on lessons from building personalization systems at production scale.</p>
format Recurso digital
id zenodo_https___doi_org_10_5281_zenodo_20131064
institution Zenodo
language eng
publishDate 2021
publisher Zenodo
record_format zenodo
spellingShingle Multimodal Content Understanding as the Next Frontier in Streaming Personalization
Shanmugam, Alagappan
Multimodal Content Understanding
Streaming
Personalization
Platforms
collaborative filtering
recommendation transparency
<p>Streaming platforms have scaled their recommendation engines largely through collaborative filtering (CF), a family of techniques that infers user preferences from behavioral patterns. While CF has proven effective, it carries well known limitations: poor handling of new content with no viewing history, a tendency to reinforce popularity bias, and an inability to explain why a given title was recommended. This article examines how multimodal content understanding, where systems jointly analyze video, audio, and textual signals from the media itself, offers a practical path beyond these constraints. I describe a three pillar framework (visual intelligence, audio intelligence, and semantic intelligence) that produces unified content embeddings, and discuss how these representations address cold start, long tail discovery, and recommendation transparency. This paper draws on lessons from building personalization systems at production scale.</p>
title Multimodal Content Understanding as the Next Frontier in Streaming Personalization
topic Multimodal Content Understanding
Streaming
Personalization
Platforms
collaborative filtering
recommendation transparency
url https://doi.org/10.5281/zenodo.20131064