Pham Thanh Dat

@phamthanhdat387_19360
5.0
Developer
SQL PYTHON Apache Airflow Apache Spark
Ho Chi Minh City Joined: 04/09/2025 6:59:43
To excel in the field of data engineering and analytics

Education

Ho Chi Minh City University of Technology (HUTECH) Jan 2025
Degree: Bachelor's Degree in Management Information Systems
(GPA: 3.15 / 4)

Work Experience

ShopBack (Mar 2025 - Sep 2025)
Data Engineer Intern
- Orchestrated batch ingestion and cross-region S3 loads with Airflow and PySpark
- Modeled datasets end-to-end in DBT with partitioning/clustering and Hudi time-travel comparisons
- Migrated scan-heavy BI tables from Hudi to Iceberg using DBT and Airflow, validated via QA reports, Glue, and Trino
- Built Metabase dashboards from Trino/S3 logs to monitor query hotspots, S3 scan volume, storage, and cost
- Developed an internal S3-based backup for the Salesforce team with JupyterHub access, replacing a third-party tool and enabling ongoing annual cost savings
