This repository provides access to the TBCOV dataset, which comprises more than two billion multilingual tweets related to the COVID-19 pandemic. Specifically, TBCOV offers 2,014,792,896 tweets collected using more than 800 multilingual keywords over a 14-month period from February 1, 2020 to March 31, 2021. These tweets span 67 international languages and were posted by 87 million unique users across 218 countries worldwide.
Link to the dataset: crisisnlp.qcri.org/tbcov
Link to the paper: arxiv.org/pdf/2110.03664