Skip to content

Capstone Project for DS-GA-3001 Probability and Statistics for Data Science

License

Notifications You must be signed in to change notification settings

sharad5/EnergyPrediction-ASHRAE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 

Repository files navigation

EnergyPrediction-ASHRAE

This is a capstone project for Probability and Statistics for Data Science 2 (DS-GA 3001) at NYU Center for Data Science.

Project Intro/Objective

The purpose of this project is to model the Energy consumption of the buildings depending on their basic configuration, weather conditions and usage patterns

Methods Used

  • Inferential Statistics
  • Data Visualization
  • Predictive Modeling
  • Counterfactual Analysis

Technologies

  • Python
  • Pandas, jupyter
  • Sklearn

Motivation

The cost of cooling skyscrapers in the summer is not only high in dollars but also has a considerable environmental impact. However, significant investments are being made to improve building efficiencies to reduce costs and emissions. An accurate model of metered building energy usage( in the areas of chilled water, electric, hot water, and steam meters) will assist in this goal by giving an expected improvement in energy efficiency. Therefore, the project goal is to improve building efficiencies, reduce costs, and decrease environmental impact by developing accurate models of metered building energy usage.

Data Source:

This the largest comprehensive energy dataset with over 20 million points of training data from 2,380 energy meters collected for 1,448 buildings from 16 sources. In this project, we are working with three types of datasets:

  • Buildings Data - This contains some of the building-level attributes such as nearest weather station, building_id (foreign key for building energy usage data), primary_use (indicator of the primary category of activities for the building based on EnergyStar property type definitions), square_feet (gross floor area of the building), year_built (year building was opened), and floor_count (number of floors of the building). This data is useful for attributing the energy usage to some building characteristics.
  • Building Energy Usage - The building energy usage data contains four main features: building_id (foreign key for building metadata), meter (meter id code for electricity, chilled water, steam, and hot water), timestamp (when the measurement was taken), and meter_reading (energy consumption in kWh or equivalent). This data will be used to model the energy usage for buildings
  • Weather Data - This contains several features such as air temperature, cloud coverage, dew temperature,hourly precipitation depth, sea level pressure, wind direction, and wind speed, collected from a meteorological station as close as possible to the building site. This data will be used to investigate the relationship between weather conditions and building energy usage, which is important for understanding energy consumption patterns and identifying potential opportunities for energy conservation.

Inferential Statistics:

  • TBD

Energy Prediction:

  • TBD

Findings and analysis: TBD

Team Members

Name Github
John Sutor johnsutor
Sharad Dargan sharaddargan

About

Capstone Project for DS-GA-3001 Probability and Statistics for Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published