Saved in:
Bibliographic Details
Main Authors: Shah, Deven Santosh, He, Shiying, Siddiqi, Gosuddin Kamaruddin, Bansal, Radhika
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2301.08146
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914825819389952
author Shah, Deven Santosh
He, Shiying
Siddiqi, Gosuddin Kamaruddin
Bansal, Radhika
author_facet Shah, Deven Santosh
He, Shiying
Siddiqi, Gosuddin Kamaruddin
Bansal, Radhika
contents Local news articles are a subset of news that impact users in a geographical area, such as a city, county, or state. Detecting local news (Step 1) and subsequently deciding its geographical location as well as radius of impact (Step 2) are two important steps towards accurate local news recommendation. Naive rule-based methods, such as detecting city names from the news title, tend to give erroneous results due to lack of understanding of the news content. Empowered by the latest development in natural language processing, we develop an integrated pipeline that enables automatic local news detection and content-based local news recommendations. In this paper, we focus on Step 1 of the pipeline, which highlights: (1) a weakly supervised framework incorporated with domain knowledge and auto data processing, and (2) scalability to multi-lingual settings. Compared with Stanford CoreNLP NER model, our pipeline has higher precision and recall evaluated on a real-world and human-labeled dataset. This pipeline has potential to more precise local news to users, helps local businesses get more exposure, and gives people more information about their neighborhood safety.
format Preprint
id arxiv_https___arxiv_org_abs_2301_08146
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle What's happening in your neighborhood? A Weakly Supervised Approach to Detect Local News
Shah, Deven Santosh
He, Shiying
Siddiqi, Gosuddin Kamaruddin
Bansal, Radhika
Information Retrieval
Computation and Language
Machine Learning
Local news articles are a subset of news that impact users in a geographical area, such as a city, county, or state. Detecting local news (Step 1) and subsequently deciding its geographical location as well as radius of impact (Step 2) are two important steps towards accurate local news recommendation. Naive rule-based methods, such as detecting city names from the news title, tend to give erroneous results due to lack of understanding of the news content. Empowered by the latest development in natural language processing, we develop an integrated pipeline that enables automatic local news detection and content-based local news recommendations. In this paper, we focus on Step 1 of the pipeline, which highlights: (1) a weakly supervised framework incorporated with domain knowledge and auto data processing, and (2) scalability to multi-lingual settings. Compared with Stanford CoreNLP NER model, our pipeline has higher precision and recall evaluated on a real-world and human-labeled dataset. This pipeline has potential to more precise local news to users, helps local businesses get more exposure, and gives people more information about their neighborhood safety.
title What's happening in your neighborhood? A Weakly Supervised Approach to Detect Local News
topic Information Retrieval
Computation and Language
Machine Learning
url https://arxiv.org/abs/2301.08146