Skip to content

vedanthv/data-engineering-portfolio

Repository files navigation

image

Hello World! I'm Vedanth.

This is a complete portfolio of the projects I have designed with a major focus on implementing various data engineering tech and cloud services across Azure, AWS and GCP.

Feel Free to Connect with me 🤠

LinkedIn | GitHub

Infrastructure and Tech Stack

DE Portfolio Tool Used

Quick Links

Here is an index of projects with the tech, domain and link.

Projects

Project Domain
Pt 1 : Streaming ETL with Airflow Orchestrator and Postgres DB Near Realtime Streaming,SQL Database, Kafka, Cricket!
Pt 2 : Serving Postgres Data using FastAPI Backend API Development, Streaming Data
Pt 3 : Frontend Web App Powered by Streaming Data with Streamlit Frontend Application Development,MVC Architecture
Pt 4 : Realtime Ingestion and Transformation with Apache Druid as OLAP Database Data Ingestion and Big Data Processing
CricAIde : Realtime Cricket Analytics Visualization Data Transformation and BI - Apache Airflow,Pandas,Slowly Changing Dimensions,dbt,Apache Superset
InvestIQ Metrics AWS EC2,Docker,Airflow,ReapidAPI,AWS Lambda,AWS S3,AWS CloudWatch,AWS Redshift,Prometheus,Grafana,PowerBI
User Mingle : Kafka Driven User Profile Streaming Airflow, Zookeeper, Cassandra, Schema Registry, Spark
Grand Prix Data Odyssey Azure Databricks,Spark SQL,Postman,Blob Storage,Unity Catalog,ADF,Azure Devops,Synapse Studio, Delta Lake,PowerBI
Medal Metrics: Tokyo Olympics Data Alchemy ADF,Azure Data Lake Gen2,Blob Storage,Databricks,Synapse Analytics,PowerBI
Airflow with Postgres as OLTP Database [Batch Processing] Beautiful Soup, Apache Airflow, Postgres, Docker