Data Engineer | PySpark • Azure Synapse • SQL • Azure Data Factory • Azure Databricks • Big Data Pipelines • ETL/ELT

Data Engineer with a Ph.D. in Physics, combining strong analytical thinking and scientific problem-solving with hands-on experience in large-scale data systems.
Over the past four years, I’ve designed, developed, and optimized data pipelines that process billions of records daily, with a focus on scalability, data integrity, and operational resilience.
My main expertise includes:
• ETL/ELT pipeline development using PySpark, Databricks, and SQL Server
• Microsoft Azure ecosystem (Data Factory, Data Lake, Synapse, and Databricks integration)
• Big Data processing, performance optimization, and real-time analytics
• Graph-based data mining for fraud detection and relationship analysis
My background in scientific research taught me to approach every challenge with structured reasoning and data-driven decision-making, turning complex problems into efficient, measurable solutions.
Whether you need a robust and scalable data architecture, a reliable ETL pipeline, or cloud migration with best-practice engineering, I deliver solutions that combine technical excellence and business value.
Let’s build something powerful together!

Work Terms

I’m available for both short-term and long-term projects.
Clear and consistent communication through Guru chat, Slack, Google Chat, or Teams.
Regular project updates (daily or weekly).
Preferred working hours: flexible, aligned with the client’s time zone when needed.
Payments through Guru’s SafePay system only.
Code, documentation, and deliverables are provided upon project completion.