
Hi, I'm Rahul Rajeev 👋

Data & Analytics Engineer — GCP · BigQuery · dbt · Terraform

I build cloud-native data platforms: ingestion → warehouse → transformation → activation, with infrastructure as code, keyless CI/CD, and observability baked in from day one.


🚀 Featured projects

A four-part GCP data-engineering portfolio of escalating complexity — each a standalone, tested, CI-green, Terraform-deployed repo using only public/mock data.

| Project | What it shows | Level |
| --- | --- | --- |
| SkyCast | Scheduled API → BigQuery → dbt ELT (Cloud Functions, Cloud Run, Workflows) | Beginner |
| PulseStream | Real-time event streaming: Pub/Sub adapter/mapper/redrive with dead-letter handling | Intermediate |
| TripLake | Cost-efficient BigQuery lakehouse over NYC taxi data: external tables + incremental dbt MERGE | Intermediate |
| OmniPipe | End-to-end governed platform: Datastream CDC, Cloud Workflows, PII governance, reverse-sync, multi-env Terraform | Advanced |

Together they cover scheduled, streaming, batch, and CDC ingestion; dbt transformation; data governance; reverse-ETL; reusable IaC + CI/CD; and observability.
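To make the dead-letter and redrive idea from PulseStream concrete, here is a minimal in-memory sketch. It is not the code from that repo and does not use the Pub/Sub client library: `Message`, `Subscription`, `max_attempts`, `drain`, and `redrive` are hypothetical names chosen for illustration, and a plain Python list stands in for the topic and the dead-letter queue.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Message:
    payload: dict
    attempts: int = 0  # delivery attempts made so far


@dataclass
class Subscription:
    """Hypothetical in-memory stand-in for a subscription with a dead-letter queue."""

    handler: Callable[[dict], None]
    max_attempts: int = 3
    queue: list = field(default_factory=list)
    dead_letters: list = field(default_factory=list)

    def publish(self, payload: dict) -> None:
        self.queue.append(Message(payload))

    def drain(self) -> int:
        """Deliver queued messages until the queue is empty; return the success count."""
        delivered = 0
        while self.queue:
            msg = self.queue.pop(0)
            msg.attempts += 1
            try:
                self.handler(msg.payload)
                delivered += 1
            except Exception:
                # Requeue while attempts remain; otherwise park the message.
                if msg.attempts < self.max_attempts:
                    self.queue.append(msg)
                else:
                    self.dead_letters.append(msg)
        return delivered

    def redrive(self) -> int:
        """Move dead letters back to the main queue with a fresh attempt budget."""
        moved = len(self.dead_letters)
        for msg in self.dead_letters:
            msg.attempts = 0
            self.queue.append(msg)
        self.dead_letters.clear()
        return moved
```

A typical flow under these assumptions: a handler rejects malformed events, they land in `dead_letters` after `max_attempts` tries, the handler is fixed, and `redrive()` followed by `drain()` replays them.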


🧰 Tech I work with

Cloud: Google Cloud (BigQuery, Cloud Run, Cloud Functions, Pub/Sub, Cloud Workflows, Datastream, Cloud SQL, Cloud Storage, Secret Manager, Data Catalog)

Data: dbt, SQL, incremental models, partitioning/clustering, PII governance

Languages: Python (ruff, pytest), SQL

Platform: Terraform, GitHub Actions (Workload Identity Federation), Docker


📫 Get in touch

Pinned

  1. omnipipe (Public, HCL): End-to-end governed GCP data platform with Datastream CDC, Cloud Workflows, dbt with PII governance, reverse-sync, multi-env Terraform & reusable CI

  2. pulsestream (Public, Python): Real-time event ingestion on GCP using Pub/Sub, dead-letters & redrive, with BigQuery, Cloud Run, Terraform & GitHub Actions

  3. triplake (Public, Python): Cost-efficient BigQuery lakehouse over public NYC taxi data, built on GCS, external tables, incremental dbt, Cloud Run, Terraform & GitHub Actions

  4. skycast (Public, Python): Serverless weather analytics on GCP, a scheduled ELT with Cloud Functions, BigQuery, dbt, Terraform & GitHub Actions

  5. hr (Public, Python)

  6. hrms-react (Public, SCSS)