|
1 | 1 | TITLE: Exercise week 47
|
2 |
| -AUTHOR: November 20-24, 2023 |
3 |
| -DATE: Deadline is Sunday November 26 at midnight |
| 2 | +AUTHOR: November 18-22, 2024 |
| 3 | +DATE: Deadline is Friday November 22 at midnight |
4 | 4 |
|
5 | 5 |
|
6 | 6 | ======= Overarching aims of the exercises this week =======
|
7 | 7 |
|
8 |
| -The exercise this week is a simple course survey and feedback. This |
9 |
| -is important for us in order to improve our teaching material, the |
10 |
| -active learning format and anything else related to a succesful |
11 |
| -mastering of central machine learning methods and their applications. |
| 8 | +The exercise set this week is meant as a summary of many of the |
| 9 | +central elements in various machine learning algorithms, with a slight |
| 10 | +bias towards deep learning methods and their training. You don't need to answer all questions. |
12 | 11 |
|
| 12 | +The last weekly exercise (week 48) is a general course survey. |
13 | 13 |
|
14 |
| -=== Why did you choose this course? === |
15 | 14 |
|
16 |
| -=== What was your programming knowledge before you started? === |
17 |
| -And do you feel this course added to your programming competences and skills? |
| 15 | +===== Exercise 1: Linear and logistic regression methods ===== |
18 | 16 |
|
19 |
| -=== How do you judge your own level of knowledge on machine learning before and after this course? === |
| 17 | +o What is the main difference between ordinary least squares and Ridge regression? |
| 18 | +o Which kind of data set would you use logistic regression for? |
| 19 | +o In linear regression you assume that your output is described by a continuous non-stochastic function $f(x)$. Which is the equivalent function in logistic regression? |
| 20 | +o Can you find an analytic solution to a logistic regression type of problem? |
| 21 | +o What kind of cost function would you use in logistic regression? |
20 | 22 |
|
21 |
| -Here you can discuss your level of skill/knowledge at start of course |
22 |
| -and at the end of the course and how these matched the level of |
23 |
| -skill/knowledge needed to complete the projects. |
24 | 23 |
|
25 |
| -=== Did the projects and the teaching material allow you to deepen your insights about Machine Learning? === |
| 24 | +===== Exercise 2: Deep learning ===== |
26 | 25 |
|
27 |
| -Feel free to comment here. |
| 26 | +o What is an activation function and discuss the use of an activation function? Explain three different types of activation functions? |
| 27 | +o Describe the architecture of a typical feed forward Neural Network (NN). |
| 28 | +o You are using a deep neural network for a prediction task. After training your model, you notice that it is strongly overfitting the training set and that the performance on the test isn’t good. What can you do to reduce overfitting? |
| 29 | +o How would you know if your model is suffering from the problem of exploding Gradients? |
| 30 | +o Can you name and explain a few hyperparameters used for training a neural network? |
| 31 | +o Describe the architecture of a typical Convolutional Neural Network (CNN) |
| 32 | +o What is the vanishing gradient problem in Neural Networks and how to fix it? |
| 33 | +o When it comes to training an artificial neural network, what could the reason be for why the cost/loss doesn't decrease in a few epochs? |
| 34 | +o How does L1/L2 regularization affect a neural network? |
| 35 | +o What is(are) the advantage(s) of deep learning over traditional methods like linear regression or logistic regression? |
28 | 36 |
|
| 37 | +===== Exercise 3: Decision trees and ensemble methods ===== |
29 | 38 |
|
30 |
| -=== Project based teaching and active learning === |
| 39 | +o Mention some pros and cons when using decision trees |
| 40 | +o How do we grow a tree? And which are the main parameters? |
| 41 | +o Mention some of the benefits with using ensemble methods (like bagging, random forests and boosting methods)? |
| 42 | +o Why would you prefer a random forest instead of using Bagging to grow a forest? |
| 43 | +o What is the basic philosophy behind boosting methods? |
31 | 44 |
|
32 |
| -This is a project based course and we as teachers would like to keep |
33 |
| -it as it is since we see very clearly that people who attend this |
34 |
| -course have a very good learning outcome. Project based courses are |
35 |
| -however demanding (and expensive seen from the university admin) when |
36 |
| -it comes to proper feedback and evaluations. Feel free to discuss |
37 |
| -whether you found a project-based and active learning approach |
38 |
| -useful. Feel also free to comment upon things we can improve upon or |
39 |
| -alternative ways to assess whether the learning outcomes have been |
40 |
| -achieved. Would you for example prefer a standard 4 hours written exam |
41 |
| -be something you would prefer? Or other alternatives to projects? We |
42 |
| -would very much value your thoughts here since projects are an |
43 |
| -essential part of this course. |
| 45 | +===== Exercise 4: Optimization part ===== |
44 | 46 |
|
45 |
| -=== Usefulness of the weekly exercises === |
| 47 | +o Which is the basic mathematical root-finding method behind essentially all gradient descent approaches(stochastic and non-stochastic)? |
| 48 | +o And why don't we use it? Or stated differently, why do we introduce the learning rate as a parameter? |
| 49 | +o What might happen if you set the momentum hyperparameter too close to 1 (e.g., 0.9999) when using an optimizer for the learning rate? |
| 50 | +o Why should we use stochastic gradient descent instead of plain gradient descent? |
| 51 | +o Which parameters would you need to tune when use a stochastic gradient descent approach? |
46 | 52 |
|
47 |
| -Did the weekly exercises help in getting started with the projects? |
48 |
| -How relevant where they for solving the projects? Feel free to |
49 |
| -elaborate |
50 |
| - |
51 |
| -=== Active learning/lab sessions and lectures === |
52 |
| - |
53 |
| -Was there a good link between lectures and active learning sessions? |
54 |
| -Would you prefer an active learning environment only with no lectures |
55 |
| -or would you prefer a more lecture based format with lab sessions only |
56 |
| -(that is no discussion at the beginning of the lab sessions)? Feel |
57 |
| -free to comment. |
58 |
| - |
59 |
| - |
60 |
| - |
61 |
| -=== How would you improve this course? === |
62 |
| - |
63 |
| -Are there topics which are missing, topics which could have been |
64 |
| -omitted and/or discussed in more depth? Feel free to add your comments |
65 |
| -here such as how to improve to teaching material and more. |
66 |
| - |
67 |
| - |
68 |
| - |
69 |
| -===== Then some basic questions ===== |
70 |
| - |
71 |
| -=== Which is your preferred information chanel, Canvas, Discord, mail or other? === |
72 |
| - |
73 |
| -=== Was the weekly update with plans etc useful? === |
74 |
| - |
75 |
| -=== Was it easy to access the course material? === |
76 |
| - |
77 |
| -=== Which resources and tools did you use? Jupyter-notebooks, GitHub, the various textbooks we have recommended, etc etc === |
78 |
| - |
79 |
| -=== If you did not attend the lectures or the active learning/lab sessions, which resources did you use? === |
80 |
| - |
81 |
| - |
82 |
| -=== Any other topics, impressions, ideas etc you would like to share with us? === |
83 | 53 |
|
84 | 54 |
|
| 55 | +===== Exercise 5: Analysis of results ===== |
| 56 | +o How do you assess overfitting and underfitting? |
| 57 | +o Why do we divide the data in test and train and/or eventually validation sets? |
| 58 | +o Why would you use resampling methods in the data analysis? Mention some widely popular resampling methods. |
0 commit comments