Machine Learning week 6 quiz: Machine Learning System Design

Published: 2025/3/21 · Category: 编程问答 (Programming Q&A)

Machine Learning System Design

5 questions

1.

You are working on a spam classification system using regularized logistic regression. "Spam" is a positive class (y = 1) and "not spam" is the negative class (y = 0). You have trained your classifier and there are m = 1000 examples in the cross-validation set. The chart of predicted class vs. actual class is:

                      Actual Class: 1    Actual Class: 0
Predicted Class: 1          85                 890
Predicted Class: 0          15                  10

For reference:

  • Accuracy = (true positives + true negatives) / (total examples)
  • Precision = (true positives) / (true positives + false positives)
  • Recall = (true positives) / (true positives + false negatives)
  • F1 score = (2 * precision * recall) / (precision + recall)

What is the classifier's recall (as a value from 0 to 1)?

Enter your answer in the box below. If necessary, provide at least two values after the decimal point.
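
As a worked check of the four reference formulas, the counts in the table can be plugged into a small helper. This is a minimal sketch, not part of the original quiz: the function name `classification_metrics` and the zero-division fallbacks are assumptions made for illustration.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F1 from confusion-matrix counts.

    Hypothetical helper for illustration; returns 0.0 where a ratio is undefined.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Counts read from the table: rows are predicted class, columns are actual class.
metrics = classification_metrics(tp=85, fp=890, fn=15, tn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Recall uses only the "Actual Class: 1" column (true positives and false negatives), so the large false-positive count affects precision and accuracy but not recall.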

2.

Suppose a massive dataset is available for training a learning algorithm. Training on a lot of data is likely to give good performance when two of the following conditions hold true.

Which are the two?

  • The classes are not too skewed.
  • A human expert on the application domain can confidently predict y when given only the features x (or more generally, if we have some way to be confident that x contains sufficient information to predict y accurately).
  • Our learning algorithm is able to represent fairly complex functions (for example, if we train a neural network or other model with a large number of parameters).
  • When we are willing to include high order polynomial features of x (such as $x_1^2$, $x_2^2$, $x_1 x_2$, etc.).

3.

Suppose you have trained a logistic regression classifier which is outputting $h_\theta(x)$.

Currently, you predict 1 if $h_\theta(x) \geq \text{threshold}$, and predict 0 if $h_\theta(x) < \text{threshold}$, where the threshold is set to 0.5.

Suppose you decrease the threshold to 0.1. Which of the following are true? Check all that apply.

  • The classifier is likely to now have higher recall.
  • The classifier is likely to have unchanged precision and recall, but higher accuracy.
  • The classifier is likely to now have higher precision.
  • The classifier is likely to have unchanged precision and recall, but lower accuracy.
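
To see how moving the threshold changes the two metrics, the sketch below evaluates precision and recall at 0.5 and at 0.1. The scores and labels are made-up illustrative data, not taken from the quiz, and `precision_recall` is a hypothetical helper name.

```python
def precision_recall(scores, labels, threshold):
    """Predict 1 when score >= threshold, then return (precision, recall)."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    pairs = list(zip(predictions, labels))
    tp = sum(1 for p, y in pairs if p == 1 and y == 1)
    fp = sum(1 for p, y in pairs if p == 1 and y == 0)
    fn = sum(1 for p, y in pairs if p == 0 and y == 1)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Made-up classifier outputs h_theta(x) and true labels (assumption for illustration).
scores = [0.95, 0.80, 0.60, 0.40, 0.30, 0.20, 0.15, 0.05]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

precision_high, recall_high = precision_recall(scores, labels, threshold=0.5)
precision_low, recall_low = precision_recall(scores, labels, threshold=0.1)
print(f"threshold 0.5: precision={precision_high:.3f}, recall={recall_high:.3f}")
print(f"threshold 0.1: precision={precision_low:.3f}, recall={recall_low:.3f}")
```

Lowering the threshold can only turn predicted negatives into predicted positives, so recall cannot go down. Precision usually drops because more negatives are swept in as well, although that depends on the data.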

4.

Suppose you are working on a spam classifier, where spam emails are positive examples (y=1) and non-spam emails are negative examples (y=0). You have a training set of emails in which 99% of the emails are non-spam and the other 1% is spam. Which of the following statements are true? Check all that apply.

  • If you always predict non-spam (output y=0), your classifier will have 99% accuracy on the training set, and it will likely perform similarly on the cross validation set.
  • If you always predict non-spam (output y=0), your classifier will have an accuracy of 99%.
  • A good classifier should have both a high precision and high recall on the cross validation set.
  • If you always predict non-spam (output y=0), your classifier will have 99% accuracy on the training set, but it will do much worse on the cross validation set because it has overfit the training data.
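
The always-predict-0 baseline described in these options can be checked numerically. The sketch below assumes a hypothetical set of 1000 emails with the stated 99% / 1% split; the set size is an illustrative choice, not a figure from the quiz.

```python
# Hypothetical data set: 10 spam (y=1) and 990 non-spam (y=0) emails.
labels = [1] * 10 + [0] * 990

# Baseline "classifier" that ignores its input and always outputs y=0.
predictions = [0] * len(labels)

correct = sum(1 for p, y in zip(predictions, labels) if p == y)
accuracy = correct / len(labels)

true_positives = sum(1 for p, y in zip(predictions, labels) if p == 1 and y == 1)
actual_positives = sum(labels)
recall = true_positives / actual_positives

print(f"accuracy: {accuracy:.2f}")
print(f"recall:   {recall:.2f}")
```

The baseline learns nothing from the training data, so its accuracy on any other set depends only on that set's class balance. Its recall is zero because it never flags a single spam email, which is why accuracy alone is misleading on skewed classes.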

5.

Which of the following statements are true? Check all that apply.

  • It is a good idea to spend a lot of time collecting a large amount of data before building your first version of a learning algorithm.
  • If your model is underfitting the training set, then obtaining more data is likely to help.
  • On skewed datasets (e.g., when there are more positive examples than negative examples), accuracy is not a good measure of performance and you should instead use F1 score based on the precision and recall.
  • After training a logistic regression classifier, you must use 0.5 as your threshold for predicting whether an example is positive or negative.
  • Using a very large training set makes it unlikely for the model to overfit the training data.
