Analysis Dataset Collection

What data is available? 

Analysis requires data. Over the past few years, there has been a surge in the creation of online information datasets within many research disciplines, including linguistics, computer science, social science, journalism, and psychology. 

This curated collection contains datasets that are particularly relevant to ARTT’s analysis frameworks: News Quality, Psychological Misinformation Tactics, Online Misinformation Harms, and Community Trust Issues. Datasets that specifically contain vaccine-related content are indicated. This collection also contains datasets that are unrelated to vaccines, but part of the larger umbrella of ARTT research. 

Overall, this collection provides a snapshot into the evolving landscape of data available for the evaluation of online information. They are or will be available for use by fellow researchers and practitioners. Datasets selected for inclusion satisfy at least one of the following criteria:

  • Dataset contains qualitative or quantitative content/process analysis on any of the indicators, concepts, or dimensions within the four analysis taxonomies; 
  • Dataset is derived from ARTT-related activities and partnerships focused on the analysis of vaccine-related content;
  • Dataset includes dictionaries or codebooks that support the ARTT Guide’s analysis capabilities.

Our collection is currently a work-in-progress. If there is a dataset you believe should be included, please submit it for consideration using this form.

For other questions or feedback, please contact us at artt [dot] hackshackers [dot] com with the subject line “Dataset Collection.” 

ARTT Analysis Dataset Collection

last updated: 29 April 2022