Google Professional Data Engineer Practice Test - Questions and Answers, Page 24

Question 231

You plan to deploy Cloud SQL using MySQL. You need to ensure high availability in the event of a zone failure. What should you do?

Question 232

Your company is selecting a system to centralize data ingestion and delivery. You are considering messaging and data integration systems to address the requirements. The key requirements are:

The ability to seek to a particular offset in a topic, possibly back to the start of all data ever captured

Support for publish/subscribe semantics on hundreds of topics

Retain per-key ordering

Which system should you choose?

Question 233

You are planning to migrate your current on-premises Apache Hadoop deployment to the cloud. You need to ensure that the deployment is as fault-tolerant and cost-effective as possible for long-running batch jobs. You want to use a managed service. What should you do?

Question 234

Your team is working on a binary classification problem. You have trained a support vector machine (SVM) classifier with default parameters and obtained an area under the curve (AUC) of 0.87 on the validation set. You want to increase the AUC of the model. What should you do?

Question 235

You need to deploy additional dependencies to all nodes of a Cloud Dataproc cluster at startup using an existing initialization action. Company security policies require that Cloud Dataproc nodes do not have access to the Internet, so public initialization actions cannot fetch resources. What should you do?

Question 236

You need to choose a database for a new project that has the following requirements:

Fully managed

Able to automatically scale up

Transactionally consistent

Able to scale up to 6 TB

Able to be queried using SQL

Which database do you choose?

Question 237

You work for a mid-sized enterprise that needs to move its operational system transaction data from an on-premises database to GCP. The database is about 20 TB in size. Which database should you choose?

Question 238

You need to choose a database to store time series CPU and memory usage for millions of computers. You need to store this data in one-second interval samples. Analysts will be performing real-time, ad hoc analytics against the database.

You want to avoid being charged for every query executed and ensure that the schema design will allow for future growth of the dataset. Which database and data model should you choose?

Question 239

You want to archive data in Cloud Storage. Because some data is very sensitive, you want to use the "Trust No One" (TNO) approach to encrypt your data to prevent the cloud provider staff from decrypting your data. What should you do?

Question 240

You have data pipelines running on BigQuery, Cloud Dataflow, and Cloud Dataproc. You need to perform health checks and monitor their behavior, and then notify the team managing the pipelines if they fail. You also need to be able to work across multiple projects. Your preference is to use managed products or features of the platform. What should you do?

Total 377 questions