Data Engineer @AstraZeneca | Founder @DataMasteryLab | AI & Big Data Architect | YouTuber @CodeWithYu | Teaching 50K+ students worldwide
I build AI-powered, production-grade data systems and architect big data solutions for the future:
- AI & Machine Learning (MLOps, LLMs, Generative AI, Vector Databases)
- Big Data Engineering (Spark, Kafka, Airflow, Flink, dbt)
- Cloud & Distributed Systems
- Real-time Streaming & Intelligent Analytics
- AI-Native Data Platforms & Data Mesh
Building the future of data with:
- Generative AI integration into data pipelines
- Real-time ML systems and intelligent data workflows
- Scalable big data architectures for AI workloads
- LLM fine-tuning and RAG (Retrieval-Augmented Generation)
- Next-gen data platforms and AI-powered analytics
- Data Mastery Lab - My AI & Data educational platform
- YouTube - Code With Yu - End-to-end data engineering tutorials
- Medium - 3K+ followers | Writing about AI, Big Data & Future Tech
- Substack | Writing about Big Data, ML, AI, AI Agents & Future Tech
- LinkedIn - Let's connect!
- Udemy - Teaching AI-powered data engineering & emerging technologies
MSc in Computational Intelligence and Data Analytics | Cranfield University
Empowering the next generation of data & AI professionals to build intelligent, scalable, future-ready solutions.
Let's collaborate on AI and big data projects!
- Real-Time Stock Market Anomaly Detection Using Machine Learning: An End-to-End Data Engineering…
- Building Realtime Data Warehouse with Apache Airflow, Redpanda, Pinot and Superset
- Decodable vs. AWS Managed Service for Apache Flink (MSF): An End-to-End Data Engineering Showdown
- Apache Spark vs Apache Flink: Choosing the Right Tools and Technologies
- End to End Data Engineering for Data Lakehouse with Airflow, Minio, Kafka, Apache Spark, Apache…





