Skip to content
View xuwenyihust's full-sized avatar

Block or report xuwenyihust

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Flowchart for debugging Spark applications

Shell 104 27 Updated Sep 25, 2024

DeepSeek LLM: Let there be answers

Makefile 1,855 141 Updated Feb 4, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,758 1,749 Updated Jan 17, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,236 3,362 Updated Mar 25, 2024

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

Python 444 124 Updated Sep 16, 2024

MLeap: Deploy ML Pipelines to Production

Scala 1,504 314 Updated Nov 27, 2024

Streamlit — A faster way to build and share data apps.

Python 36,756 3,166 Updated Jan 18, 2025

PawMark is a platform for developers to build, schedule and monitor data pipelines.

JavaScript 22 Updated Dec 10, 2024

Practice machine learning/deep learning.

Jupyter Notebook 1 Updated Oct 9, 2023

Practice and tutorial-style notebooks covering wide variety of machine learning techniques

Jupyter Notebook 3,145 1,808 Updated May 22, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 170,606 44,853 Updated Jan 18, 2025

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 66,643 14,741 Updated Dec 16, 2024

A natural language interface for computers

Python 57,867 4,965 Updated Dec 10, 2024

Open source platform for the machine learning lifecycle

Python 19,258 4,320 Updated Jan 17, 2025

Jupyter handsontable integration

Python 545 67 Updated Jan 4, 2024

The official Notion API client library, but rewritten in Python! (sync + async)

Python 1,863 145 Updated Jan 15, 2025

A Jupyter - Leaflet.js bridge

TypeScript 1,501 363 Updated Dec 5, 2024

Interactive Widgets for the Jupyter Notebook

TypeScript 3,180 947 Updated Oct 22, 2024

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 63,845 14,259 Updated Jan 18, 2025

A better notebook for Scala (and more)

Jupyter Notebook 4,540 397 Updated Jan 10, 2025

Jupyter Interactive Notebook

Jupyter Notebook 11,922 5,069 Updated Jan 7, 2025

Jupyter metapackage for installation, docs and chat

Python 14,981 4,164 Updated Oct 27, 2024

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,561 2,448 Updated Jan 18, 2025

Koalas: pandas API on Apache Spark

Python 3,346 358 Updated Mar 20, 2024

Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…

Scala 86 15 Updated Apr 2, 2024

Examples for High Performance Spark

Scala 506 235 Updated Nov 3, 2024

A data pipeline developing kit.

Java 1 Updated Apr 8, 2021

Flink 中文视频课程(持续更新...)

4,561 1,153 Updated Jun 18, 2020

VIP cheatsheets for Stanford's CS 229 Machine Learning

17,839 3,984 Updated May 20, 2020
Next
Showing results