Delen
Bewaren

Zhe

Data engineer; Data platform engineer
nog geen reviews Almere

Over deze freelancer

Data Platform Engineer with 5+ years of experience designing and building production-grade data infrastructure on GCP. Proven track record of owning complex platforms end-to-end — from connector frameworks and streaming pipelines to orchestration, observability, and dbt transformation layers. Strong focus on reliability, scalability, and clean engineering practices.


Opleiding


Werk & Ervaring

D
15-09-2023 — 01-03-2026
Data Engineer
Dexter Energy

Worked as a Data Engineer with a strong focus on data platform engineering, owning core infrastructure for energy market data ingestion, validation, orchestration, and consolidation across the European energy market. Key responsibilities and achievements: - Co-led the design and build of the Market Data Platform (MDP) from scratch — a production-grade, streaming-first platform (Medallion + Kappa-style) managing ~20 data sources and ~200 pipelines, replacing the legacy ingestion system (MDIS) with migration ~90% complete at departure. Built around: a connector-based ingestion framework unifying polling and push-based sources; a YAML-based data contract system as single source of truth for schema, validation, and config; and Redis-backed deduplication enabling idempotent reprocessing and realtime vs backfill separation. - Implemented 7–9 production connectors (EPEX MATS, Energy Quantified, Volue, EEX, APG, GME, IPTO, DAMAS) across varying API constraints and polling patterns. - Delivered under pressure: debugged undocumented EPEX MATS streaming API issues ahead of a conformance deadline; built a PDF parser for DAMAS (Romania), unlocking the company's most profitable market. - Owned production reliability — triaging missing data, upstream outages, schema drift; established post-mortem process and alerting channel from scratch. - Maintained a data consolidation layer using dbt, using MDP outputs as canonical sources to support downstream use cases including DWH, analytical workloads, production model inference, and backtesting. - Operated self-hosted Airflow on GKE; independently completed a major version upgrade; improved observability with Grafana/Prometheus. - Frequently covered ownership and operational gaps during extended periods of team instability (multiple management changes, sustained low headcount), ensuring continuity of critical data services relied upon by Trade Optimisation, Market Forecasting, Flex, and Intraday teams.Worked as a Data Engineer with a strong focus on data platform engineering, owning core infrastructure for energy market data ingestion, validation, orchestration, and consolidation across the European energy market. Key responsibilities and achievements: - Co-led the design and build of the Market Data Platform (MDP) from scratch — a production-grade, streaming-first platform (Medallion + Kappa-style) managing ~20 data sources and ~200 pipelines, replacing the legacy ingestion system (MDIS) with migration ~90% complete at departure. Built around: a connector-based ingestion framework unifying polling and push-based sources; a YAML-based data contract system as single source of truth for schema, validation, and config; and Redis-backed deduplication enabling idempotent reprocessing and realtime vs backfill separation. - Implemented 7–9 production connectors (EPEX MATS, Energy Quantified, Volue, EEX, APG, GME, IPTO, DAMAS) across varying API constraints and polling patterns. - Delivered under pressure: debugged undocumented EPEX MATS streaming API issues ahead of a conformance deadline; built a PDF parser for DAMAS (Romania), unlocking the company's most profitable market. - Owned production reliability — triaging missing data, upstream outages, schema drift; established post-mortem process and alerting channel from scratch. - Maintained a data consolidation layer using dbt, using MDP outputs as canonical sources to support downstream use cases including DWH, analytical workloads, production model inference, and backtesting. - Operated self-hosted Airflow on GKE; independently completed a major version upgrade; improved observability with Grafana/Prometheus. - Frequently covered ownership and operational gaps during extended periods of team instability (multiple management changes, sustained low headcount), ensuring continuity of critical data services relied upon by Trade Optimisation, Market Forecasting, Flex, and Intraday teams.

D
01-05-2020 — 01-09-2023
Data Platform Engineer
FedEx

- Built and maintained an analytical data platform on GCP. - Scheduled ETLs to gather data from various data sources within the organization to the data lake. - Enabled over 50 analysts to build and maintain their scheduled pyspark(-sql) jobs through self-service templates. - Enabled over 50 analysts to explore data from the data lake and their pyspark(-sql) job output via Superset. - Established CI/CD pipelines to ensure quality code delivery. - Powered all these functionalities through the GKE cluster.


Certificeringen


Portfolio


Reviews

nog geen reviews
5 Sterren
0%
4 Sterren
0%
3 Sterren
0%
2 Sterren
0%
1 Sterren
0%

€ 90 / uur
  • Locatie Almere
  • Categorie Techniek & Engineering
    Development & IT
  • Geverifieerd Email
  • Lid Sinds 05-03-2026