Data Engineer specializing in Web Scraping, PDF Extraction & ETL Pipelines — Python, AWS, Airflow, PostgreSQL. Clean data delivered fast.
I'm a Data Engineer with hands-on production experience building automated data systems — web scrapers, ETL pipelines, and PDF extraction workflows that deliver clean, structured output.
I work with Python, AWS (Lambda, S3, SNS), Apache Airflow, and PostgreSQL. My core strength is taking messy, hard-to-access data — whether it's locked inside JS-rendered websites or buried in PDF documents — and turning it into clean, structured Excel or JSON files that are actually usable.
I've built scrapers for JS-heavy sites (Angular, Vue, Nuxt) using Playwright, extracted data from hundreds of technical PDF documents, and deployed cloud pipelines on AWS that run end to end with no manual intervention.
I take quality seriously. I don't guess, I don't cut corners, and I don't deliver half-finished work. Every output is validated before delivery.
Work Terms
Available Monday to Saturday. I respond within a few hours during business hours (IST). I prefer milestone-based or fixed-price contracts for defined projects. For new clients, I'm happy to start with a small paid pilot before scaling. Payment via Guru SafePay only.