Question 233 - Professional Data Engineer discussion
You are planning to migrate your current on-premises Apache Hadoop deployment to the cloud. You need to ensure that the deployment is as fault-tolerant and cost-effective as possible for long-running batch jobs. You want to use a managed service. What should you do?
A. Deploy a Cloud Dataproc cluster. Use a standard persistent disk and 50% preemptible workers. Store data in Cloud Storage, and change references in scripts from hdfs:// to gs://
B. Deploy a Cloud Dataproc cluster. Use an SSD persistent disk and 50% preemptible workers. Store data in Cloud Storage, and change references in scripts from hdfs:// to gs://
C. Install Hadoop and Spark on a 10-node Compute Engine instance group with standard instances. Install the Cloud Storage connector, and store the data in Cloud Storage. Change references in scripts from hdfs:// to gs://
D. Install Hadoop and Spark on a 10-node Compute Engine instance group with preemptible instances. Store data in HDFS. Change references in scripts from hdfs:// to gs://
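Every option mentions changing references in scripts from hdfs:// to gs://. A minimal sketch of what that rewrite could look like in Python is shown here. It assumes a hypothetical bucket named example-bucket and assumes the HDFS directory layout is mirrored under the bucket root. The function name hdfs_to_gs is illustrative only and is not part of any Google Cloud tooling.

```python
import re

# Hypothetical bucket name, chosen for illustration; not from the question.
BUCKET = "example-bucket"

# Matches "hdfs://", an optional namenode authority (host[:port]),
# and an optional path that runs until whitespace or a quote.
HDFS_URI = re.compile(r"hdfs://[^/\s'\"]*(/[^\s'\"]*)?")


def hdfs_to_gs(text, bucket=BUCKET):
    """Rewrite every hdfs:// URI in text to a gs:// URI under bucket."""
    def repl(match):
        # Keep the path; drop the namenode authority, which has no
        # equivalent in a Cloud Storage URI.
        path = match.group(1) or "/"
        return f"gs://{bucket}{path}"

    return HDFS_URI.sub(repl, text)


if __name__ == "__main__":
    line = 'spark.read.text("hdfs://namenode:8020/logs/2020/*.txt")'
    print(hdfs_to_gs(line))
```

The sketch only rewrites URI strings. It does not copy any data, and jobs that rely on HDFS-specific behavior would still need to be reviewed separately.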