Skip to content
View tuni56's full-sized avatar
🤓
Developing new things
🤓
Developing new things

Block or report tuni56

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tuni56/README.md

Hi, I’m Rocío

I’m a Data Engineer who builds data systems that actually run in production.

I work with streaming data, batch pipelines, and ML workflows, designing systems that can handle failures, limited budgets, and real operational constraints.

I come from an engineering background and transitioned into data engineering by building end-to-end systems: ingestion, processing, observability, and deployment.

My focus is simple: make data reliable, usable, and ready for decisions.

What I Work On

I design and implement systems that:

  • Ingest and process data reliably
  • Support analytics and machine learning use cases
  • Follow cloud-native and event-driven principles
  • Can be operated and understood by real teams

My work lives at the intersection of data engineering, backend systems, and applied machine learning.


Selected Projects

Ecommerce Streaming Data Platform

Real-time, event-driven architecture

I’m watching events arrive in real time: orders placed, payments confirmed, inventory updated.

Data is flowing fast, and if something breaks, the business feels it immediately.

I built a streaming platform that listens to those events, routes them, processes them, and exposes what’s happening through clear observability.

  • Real-time ingestion using Kafka
  • Event-driven processing and routing
  • Routing concepts inspired by AWS Route 53
  • Observability with Grafana
  • Designed for reliability and failure visibility

Tech: Python, Kafka, Event-driven architecture, Grafana
Repository: https://github.com/tuni56/ecommerce-streaming-data-platform


Data Lake Analytics Pipeline

Batch ingestion and analytics

I’m organizing raw data as it arrives, structuring it so analytics teams don’t fight the data.

  • End-to-end ingestion and processing
  • Structured data lake layout
  • Designed for analytics and reporting
  • Automation and data quality checks

Tech: Python, Data Pipelines
Repository: https://github.com/tuni56/datalake-analytics-pipeline


Customer Churn Prediction

Machine learning on AWS

I’m preparing data, training models, evaluating results, and making the workflow reproducible.

  • End-to-end ML lifecycle
  • Built on AWS SageMaker
  • Focus on deployable ML workflows

Tech: Python, AWS SageMaker, Machine Learning
Repository: https://github.com/tuni56/churn-prediction-aws-streamlit


Demand Forecasting System

Predictive analytics for inventory decisions

I’m forecasting demand to support business decisions before problems happen.

  • Time-series forecasting
  • Feature engineering
  • Model training pipelines

Tech: Python, Forecasting Models
Repository: https://github.com/tuni56/demand-forecasting-system


Radio Station Microservices Platform

Distributed systems & event-driven backend

I’m coordinating services that need to talk, fail, recover, and stay consistent.

  • Microservices architecture
  • Event-driven communication using Kafka
  • Coordination with ZooKeeper
  • Built with Java, Spring Boot, Spring Cloud
  • Designed to simulate AWS-managed services locally

Focus: Distributed systems, messaging, service discovery
Tech: Java, Spring Boot, Spring Cloud, Kafka, ZooKeeper


Tech Stack

Data Engineering

  • Python, SQL
  • Kafka
  • Batch and streaming pipelines
  • Data modeling and data flow design

Cloud & Infrastructure

  • AWS (S3, Lambda, DynamoDB, SageMaker, API Gateway)
  • Infrastructure as Code: Terraform
  • IAM and least-privilege design

Machine Learning

  • scikit-learn
  • Time-series forecasting
  • ML pipelines
  • SageMaker workflows

Backend & Systems

  • Java
  • Spring Boot, Spring Cloud
  • Microservices
  • Event-driven architectures

What I’m Focusing On Now

  • AWS-native data architectures
  • Infrastructure as Code with Terraform
  • Observability and system design
  • Data Engineer / Data Platform Engineer roles

Let’s Connect

LinkedIn: https://www.linkedin.com/in/rociobaigorria/
Email: rociomnbaigorria@gmail.com
Location: Argentina (GMT-3) – open to remote roles


Making data accessible to people who actually need to use it.

Data Flow in Motion

This is how I think about systems: flow, pressure, failures, recovery.

GitHub Space Invaders


Pinned Loading

  1. ecommerce-streaming-data-platform ecommerce-streaming-data-platform Public

    Real-time ecommerce streaming data platform using Kafka, AWS Route 53 routing, event-driven architecture, and observability with Grafana.

    Python

  2. datalake-analytics-pipeline datalake-analytics-pipeline Public

    Python

  3. customer-churn-prediction customer-churn-prediction Public

    customer churn prediction using AWS SageMaker

    1

  4. demand-forecasting-system demand-forecasting-system Public

    Build predictive analytics solution for inventory management.

    Python