Ongoing work at the Social Science Research Council (SSRC), including our 2018 “ To Secure Knowledge ” report, identified the inaccessibility of the social data increasingly necessary for robust research as a core concern. The following spreadsheet aggregates a list of datasets available for social media research. This list is by no means exhaustive, nor does it aim to be. Many social science researchers collect and collate their own data, especially from Twitter’s Search API . The datasets listed here have been curated and released for use in studying specific themes or topics. The spreadsheet began as a project of SSRC’s Social Data Initiative, and thus many of the datasets are focused on the topic of elections and politics.
Users can filter, group, and sort datasets by variables such as descriptor tag and access type. If you have a dataset that you do not see listed here, and you would like to submit it for inclusion, please email us at firstname.lastname@example.org. Please include “Social media dataset nomination” in the subject line of your email.
For obvious reasons, we are currently especially interested in adding datasets related to the spread of COVID-19, or of misinformation about the pandemic.