USC X 24 US Election Twitter/X Dataset

The repository contains multiple directories named part_{part_number}, where each directory consists of chunk files prefixed with a timeline. Each chunk file contains 50,000 tweets related to the US elections 2024. Specifically, each subdirectory labeled with the prefix "part" contains 20 chunk files, resulting in a total of 1,000,000 tweets per part.

Repository Structure

usc-x-24-us-election/
├── part_1/
│   ├── timeline_chunk_1.csv.gz
│   ├── timeline_chunk_2.csv.gz
│   └── ...
├── part_2/
│   ├── timeline_2_chunk_21.csv.gz
│   ├── timeline_2_chunk_22.csv.gz
│   └── ...
├── part_3/
│   ├── timeline_3_chunk_41.csv.gz
│   ├── timeline_3_chunk_42.csv.gz
│   └── ...
└── ...

Cloning the Repository

To clone this repository, use the following command in your terminal:

git clone https://github.com/sinking8/usc-x-24-us-election.git

This will create a local copy of the repository on your machine.

Data Description

Each directory part_{part_number} contains chunk files that are prefixed with the timeline name.
Each chunk file consists of 50,000 tweets related to the US election, allowing for extensive data analysis and processing.

Data Schema (Updated as of 10/04/2025)

Field Name	Data Type	Description
id	object	Unique identifier for each entry.
text	object	Text content of the tweet.
url	object	URL associated with the tweet or content.
epoch	object	Epoch timestamp when the tweet was created.
media	object	Media content included in the tweet (images, videos, etc.).
retweetedTweet	object	Content of the retweeted tweet, if applicable.
retweetedTweetID	object	ID of the retweeted tweet.
retweetedUserID	object	ID of the user who originally tweeted the retweeted content.
id_str	object	ID of the tweet as a string (alternative format).
lang	object	Language of the tweet content.
rawContent	object	Raw unprocessed text of the tweet.
replyCount	object	Number of replies to the tweet.
retweetCount	object	Number of retweets.
likeCount	object	Number of likes.
quoteCount	object	Number of quotes.
conversationId	object	ID of the conversation the tweet is part of.
conversationIdStr	object	Conversation ID as a string.
hashtags	object	Hashtags included in the tweet.
mentionedUsers	object	Users mentioned in the tweet.
links	object	External links included in the tweet.
viewCount	object	View count of the tweet.
quotedTweet	object	Content of the quoted tweet, if applicable.
in_reply_to_screen_name	object	Screen name of the user being replied to.
in_reply_to_status_id_str	object	ID of the tweet being replied to as a string.
in_reply_to_user_id_str	object	User ID of the user being replied to as a string.
location	object	Location information of the tweet or user.
cash_app_handle	object	Cash App handle mentioned in the tweet, if applicable.
user	object	User information or metadata.
date	object	Date of the tweet.
type	object	Type of tweet (e.g., original, reply, retweet,sponsored).
user_id	float64	ID of the user as a float.

Note (Updated as of 13/02/2025)

Ad tweets contain only text, with all other fields remaining as NaN.
Data Available (1st May 2024 - 30th November 2024)

Usage

You can navigate to the relevant part directory and read the chunk files for further analysis. The structure allows you to process tweets in manageable chunks, facilitating easier handling of large datasets.

Contact

Please email ashwinblaze111@gmail.com

Data usage agreement

This dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License (CC BY-NC-SA 4.0). By using this dataset, you agree to abide by the stipulations in the license and cite the following manuscript:

Memos

Check out our memo that provides a deeper insight into the dataset: https://arxiv.org/abs/2411.00376

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
part_1		part_1
part_10		part_10
part_11		part_11
part_12		part_12
part_13		part_13
part_14		part_14
part_15		part_15
part_16		part_16
part_17		part_17
part_18		part_18
part_19		part_19
part_2		part_2
part_20		part_20
part_21		part_21
part_22		part_22
part_23		part_23
part_24		part_24
part_25		part_25
part_26		part_26
part_27		part_27
part_28		part_28
part_29		part_29
part_3		part_3
part_30		part_30
part_31		part_31
part_32		part_32
part_33		part_33
part_34		part_34
part_35		part_35
part_37		part_37
part_38		part_38
part_39		part_39
part_4		part_4
part_40		part_40
part_41		part_41
part_42		part_42
part_43		part_43
part_44		part_44
part_45		part_45
part_46		part_46
part_47		part_47
part_5		part_5
part_6		part_6
part_7		part_7
part_8		part_8
part_9		part_9
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

USC X 24 US Election Twitter/X Dataset

Repository Structure

Cloning the Repository

Data Description

Data Schema (Updated as of 10/04/2025)

Note (Updated as of 13/02/2025)

Usage

Contact

Data usage agreement

Memos

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

USC X 24 US Election Twitter/X Dataset

Repository Structure

Cloning the Repository

Data Description

Data Schema (Updated as of 10/04/2025)

Note (Updated as of 13/02/2025)

Usage

Contact

Data usage agreement

Memos

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages