You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent. Which data transformation strategy would likely improve the performance of your classifier?

Question

Yunus Emre Akay · Accepted Answer

Oversample the fraudulent transaction 10 times.

Yunus Emre Akay · Answer

Write your data in TFRecords.

Yunus Emre Akay · Answer

Z-normalize all the numeric features.

Yunus Emre Akay · Answer

Use one-hot encoding on all categorical features.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 69 - Professional Machine Learning Engineer discussion

Suggested answer: C

0 comments