Skip to content
View PavelGrigoryevDS's full-sized avatar
πŸ’­
Python β€’ SQL β€’ A/B-testing β€’ Visualization β€’ DE β€’ ML β€’ AI
πŸ’­
Python β€’ SQL β€’ A/B-testing β€’ Visualization β€’ DE β€’ ML β€’ AI

Block or report PavelGrigoryevDS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
PavelGrigoryevDS/README.md

πŸ‘‹ Welcome! I'm Pavel

Gmail

πŸ§‘β€πŸ’» About me

I have a technical background and specialize in data analysis with a strong engineering focus on data pipelines and architecture. I design scalable data solutions and extract actionable insights from complex datasets to support strategic decisions and deliver measurable business growth. I truly enjoy bridging data engineering with product analytics.


πŸ› οΈ Languages and Tools

  • Programming Languages: Python, SQL (PostgreSQL, MySQL, ClickHouse), NoSQL (MongoDB).
  • Data Analysis & Visualization:
    • Libraries: Pandas, NumPy, SciPy, Statsmodels, Pingouin, Plotly, Matplotlib, Seaborn.
    • Tools & Frameworks: Dash, Power BI, Tableau, Redash, DataLens, Superset.
  • Big Data & Distributed Computing: Apache Hadoop, Apache Spark, Apache Kafka, Apache Airflow, S3
  • Machine learning and AI: Scikit-learn, MLlib.
  • Time Series Forecasting: Facebook Prophet, Uber Orbit.
  • Natural Language Processing: NLTK, SpaCy, TextBlob.
  • Web scraping: BeautifulSoup, Selenium, Scrapy.
  • DevOps: Linux, Git, Docker.
  • IDEs: VS Code, Google Colab, Jupyter Notebook, Zeppelin, PyCharm.
PythonΒ  PandasΒ  NumPyΒ  PlotlyΒ  PostgreSQLΒ  MySQLΒ  MongoDBΒ  SklearnΒ  VS CodeΒ  JupyterΒ  LinuxΒ  GitΒ  DockerΒ  AirflowΒ  HadoopΒ  SparkΒ  KafkaΒ 

🎯 Skills

  • Data Architecture Design:
    • Designing data lakes, data warehouses, and implementing Data Vault 2.0 modeling principles for scalable and resilient systems.
  • End-to-End Data Pipelines:
    • Building and orchestrating automated ETL/ELT processes from databases to dashboards using Apache Airflow.
  • Working with Big Data:
    • Extensive experience with the Hadoop ecosystem (HDFS, Hive) and Apache Spark for processing and analyzing large-scale datasets.
  • Streaming & Messaging Systems:
    • Working with Apache Kafka for real-time data streaming and integration.
  • Data Transformation & Modeling:
    • Transforming, testing, and documenting data models using dbt to ensure data quality and reliability.
  • Business Requirements Management:
    • Eliciting, gathering, and documenting business requirements to translate stakeholder needs into actionable data solutions.
  • Deep data analysis:
    • Preprocessing, cleaning, and identifying patterns using visualization to support decision-making.
  • Writing complex SQL queries:
    • Working with nested queries, window functions, CASE and WITH statements for data extraction and analysis.
  • Understanding product strategy:
    • Knowledge of product development and improvement principles, including analyzing user needs and formulating recommendations for its growth.
  • Product metrics analysis:
    • LTV, RR, CR, ARPU, ARPPU, MAU, DAU, and other key performance indicators.
  • Conducting A/B testing:
    • Analyzing results using statistical methods to evaluate the effectiveness of changes.
  • Cohort analysis and RFM segmentation:
    • Identifying user behavior patterns to optimize marketing strategies.
  • Data visualization and dashboard development:
    • Creating interactive reports in Tableau, Redash, Power BI, and other tools for presenting analytics.
  • Web scraping:
    • Experience in extracting data from websites using tools and libraries such as BeautifulSoup, Scrapy, and Selenium for information gathering and data analysis.
  • Machine Learning Applications:
    • Capable of building and applying machine learning models for data analysis tasks, including forecasting, classification, and clustering, to uncover deeper insights and enhance decision-making processes.
  • Business and Metric Forecasting:
    • Building and interpreting time series forecasts for key business metrics using libraries like Uber Orbit and Facebook Prophet for intuitive, robust forecasting to support strategic planning and goal-setting.
  • Working with APIs:
    • Integrating and extracting data from various sources via APIs.
  • Process Automation:
    • Automating data workflows and routine tasks using Linux scripting, Apache Airflow and other DevOps tools.

Pinned Loading

  1. awesome-data-analysis awesome-data-analysis Public

    πŸš€ 500+ curated resources for Data Analysis & Data Science: Python, SQL, Statistics, ML, AI, Visualization, Cheatsheets, Roadmaps, Interview Prep. For beginners and experts.

    1.4k 193

  2. olist-deep-dive olist-deep-dive Public

    🌊 Deep Sales Analysis of Olist E-Commerce: EDA | Time Series| Viz | RFM | NLP | Geospatial | Segmentation & Actionable Business Recommendations.

    Jupyter Notebook 9 2