Data Engineer / Analytics Engineer
Building reliable data pipelines and analytics-ready datasets. Experienced in end-to-end data orchestration, transformation modeling, and cloud infrastructure.
Building end-to-end data infrastructure
I'm a Data Engineer focused on building reliable, scalable data pipelines that transform raw data into analytics-ready datasets. My work spans the full data engineering lifecycle: from ingestion and orchestration to transformation modeling and serving data for business intelligence.
I specialize in modern data stack technologies including Apache Airflow for orchestration, dbt for transformation modeling, and cloud platforms like AWS for storage and compute. My experience includes building production pipelines processing millions of records daily with strong data quality guarantees.
Beyond infrastructure, I understand the analytics layer—working with business stakeholders to translate requirements into dimensional models, metrics, and dashboards. I'm equally comfortable writing SQL transformations, Python ETL scripts, or deploying containerized workflows with Docker.
Building production data systems in healthcare and fintech
Built production-grade dbt transformation models converting Oracle ERP datasets into Snowflake analytics tables. Standardized SQL logic using Jinja templates, improving reusability and maintainability across finance, supply chain, and marketing analytics. Participated in Agile sprints with Git-based version control and CI/CD deployment cycles. Explored Dagster-based orchestration for pipeline automation. Supported cross-functional analytics teams with clean, well-documented dimensional models.
Developed ETL pipelines using SSIS and SQL Server for major banking clients (BIMB, AmBank). Delivered critical reporting systems including Data Quality Framework (DQF), STATSMART, and Information Security System (ISS). Translated complex business requirements into efficient SQL queries and Java modules. Contributed to regulatory reporting for Bank Negara Malaysia, ensuring compliance and data accuracy in high-stakes financial environments.
Production-grade data pipelines and analytics infrastructure
Problem: Malaysian parents and policymakers lack accessible, standardized data to compare school performance across states and regions. Raw government datasets are fragmented, inconsistent, and difficult to analyze at scale.
Problem: Analyzing flight delays and cancellations requires integrating real-time weather data with historical flight records. Manual data collection is slow and error-prone, preventing timely operational insights.
Open to data engineering and analytics opportunities