Skip to content

O'Reilly Strata Data & AI 2020 Tutorial

dennislwei edited this page Feb 14, 2020 · 8 revisions

Introducing the AI Explainability 360 Open-Source Toolkit

Date: Monday, March 16, 2020

Time: 1:30pm-5:00pm

Location: San Jose, CA at the San Jose Convention Center, Room 210 B.

Presenter: Dennis Wei

Conference: O'Reilly Strata Data & AI [session link]

Tutorial slides: To be posted

Content

This tutorial will teach participants to use and contribute to AI Explainability 360 (AIX360), a comprehensive and extensible open-source Python package that supports interpretability and explainability of data and machine learning models.

Our aim is to help educate multiple audiences, including practicing data scientists working in different application domains, business leaders, domain experts, as well as machine learning researchers. To this end, we will present an overview of AI explainability, common terminology, an interactive web demo, and detailed Jupyter notebooks covering use cases in different domains.

A major motivation for creating AIX360 is that there are many ways to explain: data vs. model, direct vs. post-hoc, local vs. global. We will present a taxonomy to help practitioners navigate the explainability space. The toolkit itself includes 10 state-of-the-art algorithms covering different modes of explanation along with two explainability metrics, and the interactive web demo will feature three of these algorithms.

Outline

  • Introduction to AI explainability and AIX360
  • Interactive web demo
  • Taxonomy for choosing explanation algorithms
  • Jupyter notebook examples
    • Consumer lending
    • Health and nutrition
    • Medical images (dermoscopy)
  • Future directions

Audience

The tutorial will be aimed at an audience with different backgrounds and computer science expertise levels. For all audience members and especially those unfamiliar with Python programming, the interactive web demo will serve as a grounded introduction to concepts and capabilities. Through the explainability taxonomy, we will teach all participants which type of explanation method is most appropriate for a given use case, which is beneficial regardless of technical background. The three Jupyter notebook examples in different application domains will allow data scientists and developers to gain hands-on experience with the toolkit, while others will be able to follow along by viewing rendered versions of the notebooks.

Instructions for Attendees

  1. Please join the Slack channel dedicated to this tutorial. It can be used to communicate with the presenter and other attendees.

    1. Join the AIX360 Slack workspace: Instructions
    2. Subscribe to the #oreilly-ai-tutorial-2020 channel. You can do this by clicking on "Channels" in Slack and searching for "oreilly-ai-tutorial-2020".
  2. Please bring your laptop as this is a hands-on tutorial.

  3. Please install the Anaconda Python distribution and AIX360 library ahead of time by following these instructions:

    1. Install Anaconda Python distribution by following instructions here:
      https://www.anaconda.com/distribution/#download-section (Python 3.x version)
    2. Create a Python virtual environment (recommended), clone the AIX360 GitHub repository, and install it by following instructions here:
      https://github.com/IBM/AIX360#setup
    3. Run a Jupyter notebook server locally on your machine by following instructions below. If you created a Python virtual environment in the previous step, make sure to run the command within it.
      https://jupyter-notebook.readthedocs.io/en/stable/notebook.html#starting-the-notebook-server
    4. Register and download FICO and ISIC datasets:
      • FICO Dataset
      • ISIC Dataset(-> "Participate in this phase" -> login (e.g. using your Google/GitHub account) -> Download training and ground truth data zip files)
    5. Try running the following Jupyter notebooks locally on your machine:

Thanks!

Clone this wiki locally