Machine Learning Exam 1 CS 4375 Part 2
This deck covers key concepts and definitions related to machine learning, including error metrics, optimization techniques, regression analysis, and model evaluation.
Key Terms
Term | Definition |
---|---|
MSE or RMSE | Mean squared error and root mean squared error are useful for comparing two models built on the same training set. RMSE is in the same units as Y (see the formulas after the table). |
Loss function | describes how much accuracy we lose in our model, i.e., how far the model's predictions fall from the true values. |
Gradient Descent | One of the most commonly used optimization techniques, though it can get bogged down on large datasets. The algorithm starts with some value for the parameter w and keeps changing it in an iterative loop until it finds a minimum. If the step size is too small it will take too long to converge; if too large, we might step over the minimum (see the gradient descent sketch after the table). |
Residuals vs. Fitted | Plots the residuals against the fitted values with a red trend line. You want to see a fairly horizontal red line; otherwise the plot is showing some variation in the data that your model did not capture (see the plotting sketch after the table). |
Normal Q-Q | If the residuals are normally distributed, you will see a fairly straight diagonal line following the dashed line. |
Scale-Location | You want to see a fairly horizontal line with points distributed equally around it. If not, your data may not have constant variance. |
Residuals vs. Leverage | This plot will indicate leverage points which are influencing the regression line. |
Outlier | a data point with an unusual y value |
Leverage Point | a data point with an unusual x value |
Occam's Razor | When choosing between two likely explanations, pick the simpler one. |
High bias, low variance model | Is likely to underfit and not capture the shape of the data. Happens with simpler models such as linear and logistic regression. |
Low bias, high variance model | captures too much complexity and noise in the data, so it may not generalize well to new data. |
Confounding variable (interaction effect) | a variable that correlates with both the target and a predictor. |
additive assumption | each predictor contributes to the model independently of other predictors. |
Regularization | The added term penalizes large coefficients. It helps prevent overfitting when complex models are fit to small datasets (see the penalty form after the table). |
Deviance residual | a mathematical transformation of the loss function that quantifies a given point's contribution to the overall likelihood (similar to the RSS statistic in linear regression). |
Null deviance | measures the lack of fit of the model, considering only the intercept. |
Residual deviance | measures the lack of fit of the entire model (we want to see Residual Dev <<< Null Dev). |
AIC | Akaike Information Criterion is useful for comparing models. It penalizes overly complex models; lower values are better (see the formula after the table). |
Difference between linear and logistic regression | Whereas a linear regression coefficient quantifies the change in the target variable as the predictor changes, a logistic regression coefficient quantifies the change in the log odds of the target variable (see the log-odds form after the table). |
Problem with a log reg model | Not enough data (e.g., only 8 observations) => feed the model more data. Unbalanced dataset (e.g., a 1-vs-7 class split instead of 4-vs-4) => oversample the minority class or undersample the majority class. |
Kappa | is a statistic that attempts to adjust accuracy by accounting for the possibility of a correct prediction by chance alone. The closer to 1, the better the agreement (see the formula after the table). |
ROC curve | Shows the tradeoff between predicting true positives and avoiding false positives. |
AUC | The area under the ROC curve quantifies predictive value: 0.5 means no predictive value, 1.0 a perfect classifier (see the ROC/AUC sketch after the table). |
Stochastic Gradient Descent | Used when there is a large amount of data, which would bog down ordinary gradient descent. Processes the data in small, randomly chosen batches (see the mini-batch sketch after the table). |
likelihood | It quantifies how likely it is that we would see the data given a class value (e.g., given the Survived instances). |
prior | the prior distribution is learned from the dataset, e.g., the proportion of each class in the training data (see the Naive Bayes sketch after the table). |
Strengths of Naive Bayes | • Works well with small data sets • Easy to implement • Easy to interpret • Handles high dimensions well |
Weaknesses of Naive Bayes | • May be outperformed by other classifiers for larger data sets • Guesses are made for values in the test set that did not occur in the training data • If the predictors are not independent, the naive assumption that they are may limit the performance of the algorithm |
Logistic Regression Strengths | • Separates classes well if they are linearly separable • Computationally inexpensive • Nice probabilistic output |
Logistic Regression Weaknesses | Prone to underfitting; not flexible enough to capture complex non-linear decision boundaries |
Linear Regression Strengths | Relatively simple algorithm with an intuitive explanation because the coefficients quantify the effect of predictors on the target variable. Works well when the data follows a linear pattern. Has low variance. |
Linear Regression Weaknesses | High bias because it assumes a linear shape to the data. |
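To make the MSE / RMSE entry concrete, these are the standard definitions for n observations with predictions ŷᵢ; RMSE is just the square root of MSE, which is why it ends up in the same units as Y:

```latex
\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\bigl(y_i - \hat{y}_i\bigr)^2,
\qquad
\mathrm{RMSE} = \sqrt{\mathrm{MSE}}
```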
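A minimal sketch of the Gradient Descent card, using linear regression with an MSE loss; the function name, learning rate, and toy data are illustrative assumptions, not anything fixed by the deck.

```python
import numpy as np

def gradient_descent(X, y, lr=0.01, epochs=1000):
    """Batch gradient descent for linear regression with an MSE loss (a sketch)."""
    n, d = X.shape
    w = np.zeros(d)                           # start with some value for w
    for _ in range(epochs):                   # iterative loop
        grad = (2.0 / n) * X.T @ (X @ w - y)  # gradient of MSE with respect to w
        w -= lr * grad                        # lr is the step size: too small = slow,
                                              # too large = we may step over the minimum
    return w

# Toy usage: y = 3x, so the learned weight should come out close to 3.
X = np.arange(1.0, 6.0).reshape(-1, 1)
y = 3.0 * X.ravel()
print(gradient_descent(X, y))
```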
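The four diagnostic-plot cards (Residuals vs. Fitted through Residuals vs. Leverage) describe the panels R's plot() produces for an lm fit. Here is a rough Python/statsmodels analogue of the first two panels on made-up data, in case you want to reproduce them outside R; the data and variable names are assumptions for illustration.

```python
import matplotlib.pyplot as plt
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = rng.uniform(0, 10, 100)
y = 2 * x + rng.normal(0, 1, 100)            # roughly linear data with noise

model = sm.OLS(y, sm.add_constant(x)).fit()

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.scatter(model.fittedvalues, model.resid)          # Residuals vs. Fitted
ax1.axhline(0, color="red")                           # want residuals hugging this line
ax1.set(title="Residuals vs. Fitted", xlabel="Fitted values", ylabel="Residuals")
sm.qqplot(model.resid, line="45", fit=True, ax=ax2)   # Normal Q-Q
ax2.set_title("Normal Q-Q")
plt.show()
```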
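The "added term" in the Regularization card, written out for ridge (L2) regression as one common choice; λ controls how strongly large coefficients are penalized, and larger λ trades a little extra bias for lower variance:

```latex
\min_{w}\;\sum_{i=1}^{n}\bigl(y_i - \hat{y}_i\bigr)^2 \;+\; \lambda\sum_{j=1}^{d} w_j^{2}
```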
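The usual form of the Akaike Information Criterion, where k is the number of estimated parameters and L̂ is the maximized likelihood; the 2k term is the complexity penalty, and lower AIC is better:

```latex
\mathrm{AIC} = 2k - 2\ln\hat{L}
```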
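The log-odds form behind the linear vs. logistic regression card: logistic regression models the log odds of the target as a linear function of the predictors, so each coefficient wⱼ is the change in log odds per unit change in xⱼ (and exponentiating a coefficient gives the corresponding odds ratio):

```latex
\log\frac{P(y=1 \mid x)}{1 - P(y=1 \mid x)} = w_0 + w_1 x_1 + \cdots + w_d x_d
```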
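Assuming the Kappa card refers to Cohen's kappa (the usual classification-agreement statistic), its formula, where p_o is the observed accuracy and p_e is the accuracy expected by chance alone:

```latex
\kappa = \frac{p_o - p_e}{1 - p_e}
```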
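A small scikit-learn sketch of the ROC curve and AUC cards; the synthetic dataset and the logistic regression classifier are just placeholders for whatever model is being evaluated.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve, roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic binary classification data; any probabilistic classifier would do.
X, y = make_classification(n_samples=500, n_features=5, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
probs = clf.predict_proba(X_test)[:, 1]            # P(class = 1) for each test row

fpr, tpr, thresholds = roc_curve(y_test, probs)    # points tracing the ROC curve
print("AUC:", roc_auc_score(y_test, probs))        # 0.5 = no predictive value, 1.0 = perfect
```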
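A mini-batch variant of the earlier gradient descent sketch, matching the Stochastic Gradient Descent card: each epoch shuffles the rows and updates w on small random batches instead of the whole dataset. Again a sketch with assumed names and toy data.

```python
import numpy as np

rng = np.random.default_rng(0)

def sgd(X, y, lr=0.01, epochs=200, batch_size=2):
    """Mini-batch stochastic gradient descent with an MSE loss (a sketch)."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        order = rng.permutation(n)                  # random order each epoch
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            grad = (2.0 / len(idx)) * X[idx].T @ (X[idx] @ w - y[idx])
            w -= lr * grad                          # one cheap step per batch
    return w

X = np.arange(1.0, 6.0).reshape(-1, 1)
y = 3.0 * X.ravel()
print(sgd(X, y))   # should again land near 3
```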
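A short scikit-learn sketch tying together the likelihood, prior, and Naive Bayes cards. GaussianNB is used here because the iris predictors are continuous; the course examples may use a different Naive Bayes variant, so treat this as an illustration only.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

nb = GaussianNB().fit(X_train, y_train)   # class priors come from the class frequencies,
print(nb.class_prior_)                    # i.e., the prior distribution learned from the data
print(nb.score(X_test, y_test))           # accuracy on held-out data
```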