Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale

Published in Language Resources and Evaluation, 2022

The Gab Hate Corpus (GHC) contains 27,665 posts from gab.com, each annotated for "hate-based rhetoric" by at least three annotators. It includes hierarchical labels for dehumanizing and violent speech, targeted groups, and rhetorical framing. The GHC complements existing hate speech datasets with a large, representative collection of richly annotated social media posts.

Download paper here

Recommended citation: Kennedy, Brendan, et al. “Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale.” Language Resources and Evaluation (2022): 1-30.