ExamGecko

Question 81 - Professional Cloud DevOps Engineer discussion


You support a popular mobile game application deployed on Google Kubernetes Engine (GKE) across several Google Cloud regions. Each region has multiple Kubernetes clusters. You receive a report that none of the users in a specific region can connect to the application. You want to resolve the incident while following Site Reliability Engineering practices. What should you do first?

A. Reroute the user traffic from the affected region to other regions that don't report issues.
B. Use Stackdriver Monitoring to check for a spike in CPU or memory usage for the affected region.
C. Add an extra node pool that consists of high memory and high CPU machine type instances to the cluster.
D. Use Stackdriver Logging to filter on the clusters in the affected region, and inspect error messages in the logs.
Suggested answer: A

Explanation:

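Google's SRE practice is to stop the user impact of an incident first and only then look for the root cause (unless the root cause happens to be identified early on). Rerouting traffic to healthy regions (A) restores service for the affected users right away. Checking metrics (B) or logs (D) is diagnosis, which can follow once users are being served again, and adding a node pool (C) assumes a resource-exhaustion cause that has not been established.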
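To make the "mitigate first" step concrete, here is a minimal Python sketch of what rerouting could look like as arithmetic: the affected region's share of traffic is spread proportionally over the remaining regions. The function name `reroute_weights`, the region names, and the weight values are all hypothetical and chosen for illustration. This is not a Google Cloud API, and a real setup would apply the result through whatever load balancer or traffic director is in use.

```python
def reroute_weights(weights, unhealthy):
    """Move traffic away from unhealthy regions.

    weights:   dict mapping region name -> current traffic weight (hypothetical values)
    unhealthy: set of region names that should receive no traffic
    Returns a new dict with the same total weight, where unhealthy regions
    get 0.0 and healthy regions are scaled up proportionally.
    """
    healthy = {r: w for r, w in weights.items() if r not in unhealthy}
    healthy_total = sum(healthy.values())
    if healthy_total <= 0:
        # Nothing can absorb the traffic; rerouting is not a valid mitigation.
        raise ValueError("no healthy region left to absorb traffic")

    # Scale factor that keeps the overall total unchanged.
    scale = sum(weights.values()) / healthy_total
    return {r: (w * scale if r in healthy else 0.0) for r, w in weights.items()}


# Illustrative example: three regions, one reported as unreachable.
current = {"us-east1": 50, "europe-west1": 30, "asia-east1": 20}
mitigated = reroute_weights(current, unhealthy={"asia-east1"})
print(mitigated)
```

Only after a step like this has restored service would the investigation in options B and D begin, with the affected region drained and safe to inspect.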

asked 18/09/2024
JED MEDIA