diff --git a/homework.html b/homework.html index e244fdc..c33e73b 100644 --- a/homework.html +++ b/homework.html @@ -550,8 +550,8 @@

Homework #6 -

Homework #7:

+
+

Homework #7 (.qmd)

Due: Tue Apr 02 3:30pm

diff --git a/homework/hw7.html b/homework/hw7.html new file mode 100644 index 0000000..7fc3e19 --- /dev/null +++ b/homework/hw7.html @@ -0,0 +1,3421 @@ + + + + + + + + + + + +Homework #7: Stacking and Boosting + + + + + + + + + + + + + + + + + + + + + + + +
+ +
+

+SYS 6018 | Spring 2024 | University of Virginia +

+ +
+
+

Homework #7: Stacking and Boosting

+
+ + + +
+ +
+
Author
+
+

Your Name Here

+
+
+ +
+
Published
+
+

March 15, 2024

+
+
+ + +
+ + +
+ +
+

Problem 1: Stacking for Kaggle

+

You are to make at least one official entry in the House Prices: Advanced Regression Techniques Kaggle contest using stacking or model averaging; at least one component model must be a boosting model.

+
    +
  • You will need to register in Kaggle (its free)
  • +
  • Read the details of the contest. Understand the data and evaluation function.
  • +
  • Make at least one submission that uses stacking or model averaging.
  • +
  • If you get a score on the public leaderboard of \(\text{RMSE}<0.50\) (note RMSE is calculated on the log scale), you receive full credit, otherwise, you’ll lose 10 points. +
      +
    • I’ll allow teaming. Each team member can produce one component model and then use stacking or model averaging to combine predictions.
    • +
    • You don’t need to team, but must still combine multiple models. At least one of the component models should be boosting.
    • +
  • +
  • Each person submit the following in Canvas: +
      +
    • Code (if teaming, your code and the shared stacking code)
    • +
    • kaggle name (or team name) so we can ensure you had a valid submission.
    • +
    • your score and current ranking on the kaggle leaderboard
    • +
  • +
  • Top 5 scores get 2 bonus points +
      +
    • Teams will split their bonus points among team members
    • +
  • +
+
+
+
+ +
+
+Solution +
+
+
+

Add your code here.

+
+
+
+ +
+ + +
+ + + + \ No newline at end of file diff --git a/homework/hw7.qmd b/homework/hw7.qmd new file mode 100644 index 0000000..5443115 --- /dev/null +++ b/homework/hw7.qmd @@ -0,0 +1,35 @@ +--- +title: "Homework #7: Stacking and Boosting" +author: "**Your Name Here**" +format: sys6018hw-html +--- + +```{r config, include=FALSE} +# Set global configurations and settings here +knitr::opts_chunk$set() # set global chunk options +ggplot2::theme_set(ggplot2::theme_bw()) # set ggplot2 theme +``` + +# Problem 1: Stacking for Kaggle + +You are to make at least one official entry in the [House Prices: Advanced Regression Techniques](https://www.kaggle.com/c/house-prices-advanced-regression-techniques/overview) Kaggle contest **using stacking or model averaging**; at least one component model must be a boosting model. + +- You will need to register in Kaggle (its free) +- Read the details of the contest. Understand the data and evaluation function. +- Make at least one submission that uses **stacking or model averaging**. +- If you get a score on the public leaderboard of $\text{RMSE}<0.50$ (note RMSE is calculated on the log scale), you receive full credit, otherwise, you'll lose 10 points. + - I'll allow [teaming](https://www.kaggle.com/c/house-prices-advanced-regression-techniques/team). Each team member can produce one component model and then use stacking or model averaging to combine predictions. + - You don't need to team, but must still combine multiple models. At least one of the component models should be boosting. +- Each person submit the following in Canvas: + - Code (if teaming, your code and the shared stacking code) + - kaggle name (or team name) so we can ensure you had a valid submission. + - your score and current ranking on the kaggle leaderboard +- Top 5 scores get 2 bonus points + - Teams will split their bonus points among team members + + +::: {.callout-note title="Solution"} + +Add your code here. + +::: \ No newline at end of file diff --git a/index.html b/index.html index 13fbfca..9a03888 100644 --- a/index.html +++ b/index.html @@ -681,7 +681,7 @@

Course Schedule

16 Mar 18 Mon -Ensembles +Ensembles ISL 8.2 HW 6 Due @@ -689,7 +689,7 @@

Course Schedule

17 Mar 20 Wed -Boosting +Boosting ISL 8.2.3; ESL 10.1-10.9
LogitBoost paper diff --git a/lectures/ensembles.pdf b/lectures/ensembles.pdf new file mode 100644 index 0000000..905164b Binary files /dev/null and b/lectures/ensembles.pdf differ