Daniel Chaves

Daniel Chaves

Delivery Manager & Data Engineer

Leading enterprise data initiatives with 6 years of experience delivering AI/ML solutions and managing cross-functional teams of 30+ professionals.

Specializing in modern data stack implementations with Python, SQL, dbt, Airflow, Snowflake and cloud platforms.

Core Expertise

Data Platform Architecture & Engineering
AI/ML Solutions & Data Products
Delivery Management & Team Leadership
Modern Data Stack (dbt, Airflow, Snowflake)

Featured Project

A highlight of my recent technical work and architectural decisions.

Adventure Works Data Pipeline with dbt and BigQuery
dbtBigQuerySQLPython+1

Adventure Works Data Pipeline with dbt and BigQuery

This project implements a data pipeline for the Adventure Works dataset using dbt Core and BigQuery. It includes data ingestion, transformation, and modeling following best practices in data engineering. The project features a comprehensive data model with staging, intermediate, and mart layers, along with automated testing and documentation.

Technical Expertise

A comprehensive overview of my technical skills and technology stack.

Data Platform & Architecture

Extensive experience in designing and implementing modern data platforms from scratch in cloud environments (AWS, GCP, Azure), including data lakes, data warehouses, and data marts.
Proven track record in migrating legacy data systems to modern architectures, implementing robust data governance frameworks, and ensuring data quality and reliability.
Expertise in implementing and optimizing data platforms using tools like dbt, Dagster, Airflow, Snowflake, and Databricks, with a focus on scalability and performance.
Strong experience in data modeling, ETL/ELT processes, and data pipeline development, ensuring efficient data flow and transformation.

Technical Skills

Programming Languages

PythonSQL

Data Engineering

dbtAirflowDagsterSparkKafka

Cloud Platforms

AWSGCPAzure

Databases

SnowflakeBigQueryPostgreSQLMySQLDuckDB

Data Visualization

Power BILookerStreamlit

AI/ML

TensorFlowPyTorchScikit-learn

DevOps

DockerTerraformGit

Data Product Development

Successfully delivered multiple data products including AI/ML solutions, intelligent agents, and generative AI applications, driving significant business value.
Experience in developing and implementing data strategies for enterprise clients, with a focus on pharmaceutical and healthcare sectors.
Proven ability to lead cross-functional teams in developing and deploying data products, from concept to production.
Strong background in data visualization and dashboard development using tools like Power BI, Looker, and Streamlit.

Languages

English (Fluent)Portuguese (Native)Spanish (Intermediate)

Portfolio Projects

A collection of data engineering and data science projects showcasing modern data stack implementations.

IBM Data Science Capstone Project
PythonMachine LearningData ScienceCRISP-DM

IBM Data Science Capstone Project

In this final capstone project, I followed the CRISP-DM process model, including data collection, data wrangling, exploratory data analysis, data visualization, model development, model evaluation and results reporting.

Python Project For Data Engineering
PythonAPIsWeb ScrapingData Collection

Python Project For Data Engineering

As a data engineer working for an international financial analysis company, my job in this project was to collect financial data from various sources such as websites, APIs, and files provided by financial analysis firms.

Rainfall Prediction Using Classification Algorithms: A ML Project
PythonMachine LearningClassificationSVM+1

Rainfall Prediction Using Classification Algorithms: A ML Project

In this project, I used the weatherAUS.csv dataset which includes weather observations from 2008 to 2017. The implemented algorithms were Linear Regression, Decision Tree, Logistic Regression, and SVM. I evaluated each model using performance metrics such as Accuracy Score, Mean Squared Error and R2-Score.