VO MINH HIEU

@hieuvm2911_19322
5.0
Developer
Python PL/SQL Bash Git
Joined: 04/08/2025 1:02:22
Seeking an entry-level role in Data Engineering to apply my knowledge of pipelines, ETL, and real-time data processing. In the short term, I aim to gain hands-on experience with distributed systems and cloud-based data workflows to strengthen my technical foundation. Over the long term, I aspire to become a specialized Data Engineer who can design scalable architectures and lead data infrastructure initiatives that drive business growth.

Education

Vietnam National University - Ho Chi Minh City University of Science (Mar 2025)
Degree: Bachelor of Information Technology
(GPA: 8.2)

Work Experience

Vietnam National University - Ho Chi Minh City University of Science (Oct 2024 - Dec 2024)
- Designed a snowflake-schema data model to analyze U.S. air quality trends from 2021 to 2023
- Developed ETL workflows using SSIS for data extraction, cleaning, transformation, and loading into the data warehouse
- Developed OLAP cubes and MDX queries to explore AQI trends and quarterly state-level statistics
Data Engineering Course project
Personal project - GitHub repository (July 2025 - present)
- Built a real-time CDC pipeline from MySQL to Azure using Debezium, Kafka, Spark Structured Streaming, and Airflow
- Streamed MySQL binlog changes to Kafka, transformed the data with Spark, and stored it in ADLS Gen2 and Azure SQL DWH
- Orchestrated and containerized the entire pipeline with Airflow and Docker Compose
REAL-TIME DATA PIPELINE: MYSQL TO AZURE CLOUD
