BigDataTwitter

Team

Tools:

Infrastructure:

This project demonstrates what we learned during the fifth quarter of the Big Data course. We built a pipeline that mines data from Twitter with the Twint library and ingests it into a Confluent Kafka topic. A Python process then enriches the data and writes it to a new topic, from which Logstash forwards it to Elasticsearch for indexing. Kibana is the final component, used to visualize the data.

As shown in the picture below:

To simplify setup, we wrote a Bash installer that speeds up installation and deployment on the nodes or clusters.

Install Instructions

git clone https://github.com/DanielDCM212/BigDataTwitter.git

cd BigDataTwitter

bash InstallHub.sh

Run Service

bash StartService.sh

Run Listening and Enrichment

python3 ListeningKafka.py

python3 Enriqueser.py
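
The enrichment step reads a message from one topic, adds fields, and writes the result to a second topic. The sketch below illustrates that flow only and is not the actual code in Enriqueser.py: the added fields (`hashtags`, `mentions`, `word_count`), the `tweet` key for the message text, and the function names are assumptions, and in-memory `deque` objects stand in for the Kafka topics so the example runs without a broker.

```python
import json
import re
from collections import deque

# Hypothetical patterns for the fields this sketch adds.
HASHTAG_RE = re.compile(r"#(\w+)")
MENTION_RE = re.compile(r"@(\w+)")


def enrich_tweet(raw: dict) -> dict:
    """Return a copy of the record with a few derived fields added.

    Assumes the tweet text is stored under the "tweet" key.
    """
    text = raw.get("tweet", "")
    enriched = dict(raw)
    enriched["hashtags"] = [h.lower() for h in HASHTAG_RE.findall(text)]
    enriched["mentions"] = [m.lower() for m in MENTION_RE.findall(text)]
    enriched["word_count"] = len(text.split())
    return enriched


def run_enrichment(source: deque, sink: deque) -> int:
    """Drain JSON messages from source, enrich them, and append to sink.

    source and sink are stand-ins for the raw and enriched Kafka topics.
    Returns the number of messages processed.
    """
    processed = 0
    while source:
        record = json.loads(source.popleft())
        sink.append(json.dumps(enrich_tweet(record)))
        processed += 1
    return processed


if __name__ == "__main__":
    raw_topic = deque([json.dumps({"tweet": "Learning #Kafka with @Confluent today"})])
    enriched_topic = deque()
    run_enrichment(raw_topic, enriched_topic)
    print(enriched_topic[0])
```

In a real deployment the two `deque` objects would be replaced by a Kafka consumer and producer; keeping `enrich_tweet` as a pure function makes it easy to test without the cluster running.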
