Question 142 - Professional Machine Learning Engineer discussion

Your organization manages an online message board. A few months ago, you discovered an increase in toxic language and bullying on the message board. You deployed an automated text classifier that flags certain comments as toxic or harmful. Now some users are reporting that benign comments referencing their religion are being misclassified as abusive. Upon further inspection, you find that your classifier's false positive rate is higher for comments that reference certain underrepresented religious groups. Your team has a limited budget and is already overextended. What should you do?

A.
Add synthetic training data where those phrases are used in non-toxic ways.
B.
Remove the model and replace it with human moderation.
C.
Replace your model with a different text classifier.
D.
Raise the threshold for comments to be considered toxic or harmful.
Suggested answer: A

Explanation:

The classifier has a high false positive rate for comments that reference certain underrepresented religious groups: it cannot reliably distinguish toxic from non-toxic language when those groups are mentioned. A likely cause is that the training data contains too few examples of non-toxic comments referencing those groups, producing a biased model. Adding synthetic training data in which those phrases appear in non-toxic contexts helps the model generalize better and lowers the false positive rate. Synthetic data is artificially generated data that mimics the characteristics of real data; it is commonly used to augment a dataset when real examples are scarce or imbalanced. This option also fits the constraint of a limited, overextended team: generating targeted synthetic examples is far cheaper than replacing the model (C), moving to full human moderation (B), or raising the toxicity threshold (D), which would let more genuinely toxic comments through without addressing the underlying bias.
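For illustration only, here is a minimal sketch of this kind of targeted augmentation, assuming a simple scikit-learn text pipeline; the identity terms, templates, and toy dataset are hypothetical and not part of the exam material:

```python
# Minimal sketch: augment a toy toxicity dataset with synthetic non-toxic
# comments that mention over-flagged identity terms, then retrain a simple
# classifier. All terms, templates, and examples below are hypothetical.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Existing (imbalanced) training data: label 1 = toxic, 0 = non-toxic.
texts = [
    "you are an idiot",                # toxic
    "have a great day everyone",       # non-toxic
    "members of group X are awful",    # toxic mention of a group
]
labels = [1, 0, 1]

# Hypothetical identity terms the classifier currently over-flags.
identity_terms = ["group X", "group Y"]

# Templates that use those terms in clearly benign contexts.
benign_templates = [
    "I am proud to be a member of {}.",
    "Our {} community center hosts a food drive every week.",
    "Happy holidays to everyone celebrating in the {} community!",
]

# Generate synthetic non-toxic examples (label 0) for each term.
synthetic_texts = [t.format(term) for term in identity_terms
                   for t in benign_templates]
texts += synthetic_texts
labels += [0] * len(synthetic_texts)

# Retrain on the augmented data.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(texts, labels)

# A benign mention should now lean non-toxic (0).
print(model.predict(["I am proud to be a member of group X."]))
```

After retraining, re-measuring the false positive rate separately for each group is the cheapest way to confirm that the fairness gap has actually narrowed.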

References:

Preparing for Google Cloud Certification: Machine Learning Engineer, Course 5: Responsible AI, Week 3: Fairness

Google Cloud Professional Machine Learning Engineer Exam Guide, Section 4: Ensuring Solution Quality, 4.4: Evaluating fairness and bias in ML models

Official Google Cloud Certified Professional Machine Learning Engineer Study Guide, Chapter 9: Responsible AI, Section 9.3: Fairness and Bias
