Skip to content
View Sarishc's full-sized avatar

Organizations

@hackforla

Block or report Sarishc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sarishc/README.md

I'm Sarish Chavan, a passionate data enthusiast with a keen eye for transforming data into impactful insights. With a strong foundation in Information Systems from Illinois Institute of Technology and hands-on experience in data engineering and machine learning, I am driven by the excitement of solving complex data problems and creating value through innovative solutions.

🚀 About Me

  • 🎓 Education:
    • Currently pursuing an M.S. in Computer Science from Illinois Institute of Technology.
    • Earned a B.E. in Computer Science from Savitribai Phule Pune University.
  • 💡 Technical Skills:
    • Data Analysis & Visualization: SQL, Python, R, Tableau, Power BI, Excel
    • Cloud & Data Engineering: AWS, GCP, Azure, Snowflake
    • Programming Languages: Python, SQL, Go, C++
    • Machine Learning: Hugging Face, GPT, CNN, Deep Learning, Mistral LLM
  • 🏆 Professional Highlights:
    • Boosted efficiency by 40% and cut down errors through Python and AWS-powered automation at Nice Systems.
    • Spearheaded the creation of an AWS monitoring solution achieving 99.9% uptime, reducing pipeline failures by 30%.
    • Developed data models and predictive solutions with MedTour Easy, enhancing forecasting accuracy and data processing speed.
    • Contributing to impactful civic tech projects at HACK FOR LA by designing and optimizing data pipelines for efficient data processing and analysis.

🛠 Projects

  • Automated Text Extraction & Client-Facing Application**: Designed an efficient pipeline using Python and Azure AI, cutting manual text extraction efforts by 50%.
  • Model Evaluation System for GAIA Benchmark**: Built a Streamlit-based tool to enhance model accuracy assessment and streamline performance comparison.
  • Urban Traffic Safety Analysis**: Refined data pipelines to boost data quality, accuracy, and decision-making speed for traffic safety insights.
  • Open-Source Notion Integrated Action Item Extractor**: Implemented the Mistral-7B model to automate email parsing, reducing manual workload by 50%.
  • Drug Satisfaction Prediction App**: Developed an interactive predictive tool for healthcare providers, achieving an 85% accuracy rate.
  • Drowsiness Detection System**: Engineered a CNN-powered real-time driver fatigue detection system with 99.93% accuracy, recognized at the NES Innovation Awards.

🤔 Fun Facts

  • I'm an advocate of lifelong learning and am currently expanding my knowledge in data engineering and AI model optimization.
  • I love exploring new technologies and am always on the lookout for ways to apply machine learning to solve real-world problems.
  • Outside of tech, I’m a curious explorer who enjoys analyzing data for fun and discovering hidden insights.

🌱 I’m Currently Learning

  • Advanced techniques in cloud data infrastructure, specifically in Snowflake and BigQuery.
  • Experimenting with large language models (LLMs) and exploring model fine-tuning on Hugging Face and OpenAI platforms.

💬 Ask me about

  • Data pipelines, cloud infrastructure, and how to optimize your analytics workflows.
  • Using visualization tools like Tableau and Power BI to tell compelling stories with data.
  • Predictive modeling and implementing machine learning solutions to tackle data-centric challenges.

📫 How to Reach Me

Popular repositories Loading

  1. Detection-of-Arrhytmia Detection-of-Arrhytmia Public

    C++

  2. VuMeterGG VuMeterGG Public

    Forked from DavidGG-dev/VuMeterGG

    Vu meter for PySide6, compatible with QT Designer.

    Python

  3. Reddit-Data-Pipeline-Engineering Reddit-Data-Pipeline-Engineering Public

    Python

  4. Real-Time-Data-Streaming Real-Time-Data-Streaming Public

    Python

  5. Airbnb-Data-Science-Project Airbnb-Data-Science-Project Public

    This Airbnb Data Science project investigates the factors influencing the pricing of listings in Seattle, aiming to provide valuable insights for both hosts and potential guests in the Airbnb marke…

    Jupyter Notebook

  6. MLOS MLOS Public

    Forked from microsoft/MLOS

    MLOS is a project to enable autotuning for systems.

    Python