Corona Virus (COVID-19) Tweets Dataset

Lamsal, Rabindra

Description

Tweets Counter: 22,599,198

This dataset includes CSV files that contain tweet IDs. The tweets have been collected by the LSTM model deployed at sentiment.live. The model monitors the real-time Twitter feed for coronavirus-related tweets using two filters: language “en” and keyword “corona”. As per the Twitter Developer Policy, it is not possible for me to provide information other than the tweet IDs (this dataset was completely redesigned on March 20, 2020, to comply with the data sharing policies set by Twitter).

Note: This dataset should be used solely for non-commercial research purposes (ignore every other LICENSE category given on this page).

Schema of the CSV files:
First column: tweet ID
Second column: sentiment score for the particular tweet

File details (tweets collected in GMT+0; local times mentioned below are in GMT+5:45):

corona_tweets_01.csv: 831,327 tweets (March 20, 2020 01:37 AM - March 20, 2020 10:28 AM)
corona_tweets_02.csv: 870,924 tweets (March 20, 2020 10:31 AM - March 20, 2020 09:43 PM)
corona_tweets_03.csv: 773,729 tweets (March 20, 2020 09:49 PM - March 21, 2020 09:25 AM)
corona_tweets_04.csv: 1,233,340 tweets (March 21, 2020 09:27 AM - March 22, 2020 07:46 AM)
corona_tweets_05.csv: 1,782,157 tweets (March 22, 2020 07:50 AM - March 23, 2020 09:08 AM)
corona_tweets_06.csv: 1,771,295 tweets (March 23, 2020 09:11 AM - March 24, 2020 11:35 AM)
corona_tweets_07.csv: 1,479,651 tweets (March 24, 2020 11:42 AM - March 25, 2020 11:43 AM)
corona_tweets_08.csv: 1,272,592 tweets (March 25, 2020 11:47 AM - March 26, 2020 12:46 PM)
corona_tweets_09.csv: 1,091,429 tweets (March 26, 2020 12:51 PM - March 27, 2020 11:53 AM)
corona_tweets_10.csv: 1,172,013 tweets (March 27, 2020 11:56 AM - March 28, 2020 01:59 PM)
corona_tweets_11.csv: 1,141,210 tweets (March 28, 2020 02:03 PM - March 29, 2020 04:01 PM)

----- March 29, 2020 04:05 PM - March 30, 2020 12:30 PM: Some folk(s) messed around with the server. Tweets for this period won't be available. However, I'll continue adding new tweet IDs. Some preventive measures have been taken. Sorry for the inconvenience. -----

corona_tweets_12.csv: 793,417 tweets (March 30, 2020 02:01 PM - March 31, 2020 10:16 AM)
corona_tweets_13.csv: 1,029,294 tweets (March 31, 2020 10:20 AM - April 01, 2020 10:59 AM)
corona_tweets_14.csv: 920,076 tweets (April 01, 2020 11:02 AM - April 02, 2020 12:19 PM)
corona_tweets_15.csv: 826,271 tweets (April 02, 2020 12:21 PM - April 03, 2020 02:38 PM)
corona_tweets_16.csv: 612,512 tweets (April 03, 2020 02:40 PM - April 04, 2020 11:54 AM)
corona_tweets_17.csv: 685,560 tweets (April 04, 2020 11:56 AM - April 05, 2020 12:54 PM)
corona_tweets_18.csv: 717,301 tweets (April 05, 2020 12:56 PM - April 06, 2020 10:57 AM)
corona_tweets_19.csv: 722,921 tweets (April 06, 2020 10:58 AM - April 07, 2020 12:28 PM)
corona_tweets_20.csv: 554,012 tweets (April 07, 2020 12:29 PM - April 08, 2020 12:34 PM)
corona_tweets_21.csv: 589,679 tweets (April 08, 2020 12:37 PM - April 09, 2020 12:18 PM)
corona_tweets_22.csv: 517,718 tweets (April 09, 2020 12:20 PM - April 10, 2020 09:20 AM)
corona_tweets_23.csv: 601,199 tweets (April 10, 2020 09:22 AM - April 11, 2020 10:22 AM)
corona_tweets_24.csv: 497,655 tweets (April 11, 2020 10:24 AM - April 12, 2020 10:53 AM)

To make it easy for NLP researchers to access the sentiment analysis of each collected tweet, the sentiment score computed with TextBlob [1] has been appended as the second column. New files will be added to this dataset every day. Bookmark this page for further updates.

[1] https://textblob.readthedocs.io/en/dev/
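The two-column schema described above can be read with a few lines of Python. This is only a minimal sketch under stated assumptions: the files are assumed to be plain comma-separated text, the second column is assumed to be a TextBlob polarity value (TextBlob polarity ranges from -1.0 to 1.0), and the names `TweetRow`, `read_rows`, and `label`, the zero threshold, and the sample IDs are all made up for illustration and are not part of the dataset.

```python
import csv
import io
from typing import Iterator, NamedTuple


class TweetRow(NamedTuple):
    tweet_id: str     # kept as a string so large IDs are never rounded
    sentiment: float  # second column; assumed to be a TextBlob polarity score


def read_rows(handle) -> Iterator[TweetRow]:
    """Yield (tweet_id, sentiment) pairs from an open CSV file object."""
    for record in csv.reader(handle):
        if len(record) < 2:
            continue  # skip blank or malformed lines
        try:
            score = float(record[1])
        except ValueError:
            continue  # skip a non-numeric row, such as a header if one exists
        yield TweetRow(record[0].strip(), score)


def label(score: float) -> str:
    """Map a score to a coarse class; the zero threshold is an assumption."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# Made-up sample rows in the documented two-column layout (not real data).
sample = io.StringIO(
    "1240000000000000001,0.25\n"
    "1240000000000000002,-0.5\n"
    "1240000000000000003,0.0\n"
)
rows = list(read_rows(sample))
ids = [row.tweet_id for row in rows]
labels = [label(row.sentiment) for row in rows]
```

To read a downloaded file instead of the sample, pass an opened file, for example `open("corona_tweets_01.csv", newline="")`, to `read_rows`. Because only tweet IDs are shared, the `ids` list is what would be handed to a hydration tool of your choice to retrieve the full tweets.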

Metrics

Dataset Index: 1.4

FAIR Score: 58%

Citations: 0

Mentions: 0

Publication Details

DOI

Publisher: IEEE DataPort

Assigned Domain

Subfield: Safety Research

Field: Social Sciences

Domain: Social Sciences

Confidence Score: 46%

Source: Scholar Data Model

Keywords

COVID-19, Machine Learning, Corona Tweets Dataset, COVID-19 Tweets Dataset, Corona Tweets, COVID-19 Tweets, Corona Twitter Sentiment, COVID-19 Twitter Sentiment

Normalization Factors

FT: 13.46

CTw: 1.00

MTw: 1.00