Fake News Detection

Social Media is a place where a massive data is generated in seconds and it’s the first place for any news to land. Having said that, there is huge chance to spread Fake news and it’s  one the biggest challenging issue in social media. Following are some of the high level concepts in Fake new detection.

Characterization and Detection are the two aspects of fake news detection.

What is Characterization ?

This phase talks about fake news and its  characteristics. Classified into the following two categories Traditional and Social Media.  

Traditional Media ( Psychology and Social Foundation )

Naıve RealismConsumers believe their perceptions are correct
Confirmation BiasConsumers prefer to receive information that confirms to their views

Social Media ( Malicious Accounts and Echo Chambers )

Malicious AccountsBots ,Cyborg users, Trolls.
Echo ChambersGroups of like minded people spreading the news

What is Detection ?

This  Phase talks about the fake news detection methodologies.  Detection Algorithms are highly classified into two categories Content and Social context based.

Knowledge BasedUsed External Sources for checking the credibility of the news.
Style BasedRelies on the same source but manipulates basing on the writing style.  
Stance BasedUses  user viewpoint
Propagation BasedChecks for the relevance in social media posts.

To train the model  We can get data from different sources . However checking veracity of the news is challenging . Therefore following are  some of the publicly available data sets

BuzzFeedNewsThis dataset comprises a complete sample of news published in Facebook from 9 news agencies over a week close to the 2016 U.S. election from September 19 to 23 and September 26 and 27. Every post and the linked article were fact-checked
LIARThis dataset is collected from fact-checking website PolitiFact through its API
BS DetectorThis dataset is collected from a browser extension called BS detector developed for checking news veracity
CREDBANKThis is a large scale crowdsourced dataset of approximately 60 million tweets that cover 96 days starting from October 2015

REPORT THIS AD

Fake News Related Areas

Rumor ClassificationAims to classify piece  of information as rumor or not and its has different phases as Rumor detection, rumor tracking, stance classification, and veracity classification
Truth DiscoveryDetecting truth from multiple  conflicting sources.
Clickbait DetectionDetecting eye catching teaser  in online Media
Spammer and Bot DetectionDetecting malicious users , Spreading ads, Viruses and Phishing etc.

Future Research 

Fake news detection is one of the emerging research areas . Its broadly outlined into four different areas like Data-oriented, Feature-oriented, Model-oriented, and Application-oriented.

Data-orientedThe main focus here is on data such as benchmark data collection, Fake news validation etc
Feature-orientedFocus here is for detecting fake news from multiple data sources, such as news content and social context.
Model-orientedFocuses more on  practical and effective models for fake news detection, including supervised, semi-supervised and unsupervised models
Application-orientedThis research  goes beyond fake news detection, such as fake new diffusion and intervention

Reference: https://www.kdd.org/exploration_files/19-1-Article2.pdf

Leave a comment