
Google Professional Data Engineer Practice Test - Questions Answers, Page 13

Which is not a valid reason for poor Cloud Bigtable performance?

A. The workload isn't appropriate for Cloud Bigtable.
B. The table's schema is not designed correctly.
C. The Cloud Bigtable cluster has too many nodes.
D. There are issues with the network connection.
Suggested answer: C

Explanation:

Having too many nodes does not degrade performance; the actual problem is the opposite, a cluster that doesn't have enough nodes. If your Cloud Bigtable cluster is overloaded, adding more nodes can improve performance. Use the monitoring tools to check whether the cluster is overloaded.

Reference: https://cloud.google.com/bigtable/docs/performance

Which is the preferred method to use to avoid hotspotting in time series data in Bigtable?

A. Field promotion
B. Randomization
C. Salting
D. Hashing
Suggested answer: A

Explanation:

By default, prefer field promotion. Field promotion avoids hotspotting in almost all cases, and it tends to make it easier to design a row key that facilitates queries.

Reference: https://cloud.google.com/bigtable/docs/schema-design-timeseries#ensure_that_your_row_key_avoids_hotspotting
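
Field promotion can be illustrated with plain string handling. The sketch below is not Bigtable client code; it only builds row keys and sorts them, relying on the fact that Bigtable stores rows in sorted row-key order. The device IDs, the timestamps, the `#` separator, and both function names are assumptions made for illustration.

```python
def timestamp_first_key(timestamp: str, device_id: str) -> str:
    """Row key that starts with the timestamp (hypothetical layout)."""
    return f"{timestamp}#{device_id}"


def field_promoted_key(device_id: str, timestamp: str) -> str:
    """Row key with the device ID promoted from a column into the key prefix."""
    return f"{device_id}#{timestamp}"


# Made-up sample readings: (device ID, timestamp).
readings = [
    ("device-17", "2024-01-01T00:00:00"),
    ("device-42", "2024-01-01T00:00:00"),
    ("device-17", "2024-01-01T00:00:01"),
    ("device-42", "2024-01-01T00:00:01"),
]

# Sorting the keys mimics the order in which rows would be stored.
naive = sorted(timestamp_first_key(ts, dev) for dev, ts in readings)
promoted = sorted(field_promoted_key(dev, ts) for dev, ts in readings)

# With promoted keys, all rows for one device share a prefix.
device_17_rows = [key for key in promoted if key.startswith("device-17#")]
```

With timestamp-first keys, every new write sorts at the end of the key range, which is the pattern that concentrates load. With promoted keys, writes are spread across one key range per device, and a query for a single device becomes a prefix scan.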

When you design a Google Cloud Bigtable schema it is recommended that you _________.

A. Avoid schema designs that are based on NoSQL concepts
B. Create schema designs that are based on a relational database design
C. Avoid schema designs that require atomicity across rows
D. Create schema designs that require atomicity across rows
Suggested answer: C

Explanation:

All operations are atomic at the row level. For example, if you update two rows in a table, it's possible that one row will be updated successfully and the other update will fail. Avoid schema designs that require atomicity across rows.

Reference: https://cloud.google.com/bigtable/docs/schema-design#row-keys
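
The consequence of row-level atomicity can be shown with a toy in-memory model. This is only a sketch: `ToyTable`, its `mutate_row` method, the `fail` flag, and the account row keys are all hypothetical and are not part of any Bigtable client library.

```python
class ToyTable:
    """In-memory stand-in for a table; NOT the Bigtable client API."""

    def __init__(self):
        self.rows = {}

    def mutate_row(self, row_key, cells, fail=False):
        """Apply every cell to one row, or none of them if the write fails."""
        if fail:
            raise RuntimeError(f"write to {row_key!r} failed")
        row = dict(self.rows.get(row_key, {}))
        row.update(cells)
        self.rows[row_key] = row


table = ToyTable()
table.mutate_row("account#alice", {"balance": 100})
table.mutate_row("account#bob", {"balance": 0})

# A transfer modelled as two separate row writes. The second write fails,
# but the first has already been applied.
table.mutate_row("account#alice", {"balance": 50})
try:
    table.mutate_row("account#bob", {"balance": 50}, fail=True)
except RuntimeError:
    pass
split_state = (
    table.rows["account#alice"]["balance"],
    table.rows["account#bob"]["balance"],
)

# Alternative layout: values that must change together live in one row.
table.mutate_row("transfer-pair#alice#bob", {"alice": 50, "bob": 50})
```

After the failed second write, `split_state` holds an inconsistent pair of balances. Keeping values that must change together in a single row avoids depending on atomicity across rows.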

Which of the following is NOT a valid use case to select HDD (hard disk drives) as the storage for Google Cloud Bigtable?

A. You expect to store at least 10 TB of data.
B. You will mostly run batch workloads with scans and writes, rather than frequently executing random reads of a small number of rows.
C. You need to integrate with Google BigQuery.
D. You will not use the data to back a user-facing or latency-sensitive application.
Suggested answer: C

Explanation:

Integration with BigQuery is not one of the criteria for choosing HDD storage. For example, if you plan to store extensive historical data for a large number of remote-sensing devices and then use the data to generate daily reports, the cost savings for HDD storage may justify the performance tradeoff. On the other hand, if you plan to use the data to display a real-time dashboard, it probably would not make sense to use HDD storage: reads would be much more frequent in this case, and reads are much slower with HDD storage.

Reference: https://cloud.google.com/bigtable/docs/choosing-ssd-hdd
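
The HDD criteria from this question can be written as a small check. This is a sketch under assumptions: the function name and parameters are hypothetical, and combining the three criteria with a logical AND is an interpretation, not an official rule. BigQuery integration is deliberately not a parameter, because it does not bear on the choice.

```python
def hdd_may_be_reasonable(stored_tb: float, mostly_batch: bool,
                          latency_sensitive: bool) -> bool:
    """Return True when the HDD criteria listed in the question all hold.

    Assumption: all three criteria must be met at once.
    """
    return stored_tb >= 10 and mostly_batch and not latency_sensitive


# Historical sensor archive used for daily reports (made-up 50 TB figure).
archive_ok = hdd_may_be_reasonable(50, mostly_batch=True, latency_sensitive=False)

# Real-time dashboard over the same data: frequent, latency-sensitive reads.
dashboard_ok = hdd_may_be_reasonable(50, mostly_batch=False, latency_sensitive=True)
```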

Cloud Bigtable is Google's ______ Big Data database service.

A. Relational
B. MySQL
C. NoSQL
D. SQL Server
Suggested answer: C

Explanation:

Cloud Bigtable is Google's NoSQL Big Data database service. It is the same database that Google uses for services such as Search, Analytics, Maps, and Gmail.

It suits low-latency, high-throughput workloads, including Internet of Things (IoT), user analytics, and financial data analysis.

Reference: https://cloud.google.com/bigtable/

When you store data in Cloud Bigtable, what is the recommended minimum amount of stored data?

A. 500 TB
B. 1 GB
C. 1 TB
D. 500 GB
Suggested answer: C

Explanation:

Cloud Bigtable is not a relational database. It does not support SQL queries, joins, or multi-row transactions. It is not a good solution for less than 1 TB of data.

Reference: https://cloud.google.com/bigtable/docs/overview#title_short_and_other_storage_options

If you're running a performance test that depends upon Cloud Bigtable, all the choices except one below are recommended steps. Which is NOT a recommended step to follow?

A. Do not use a production instance.
B. Run your test for at least 10 minutes.
C. Before you test, run a heavy pre-test for several minutes.
D. Use at least 300 GB of data.
Suggested answer: A

Explanation:

If you're running a performance test that depends upon Cloud Bigtable, be sure to follow these steps as you plan and execute your test:

Use a production instance. A development instance will not give you an accurate sense of how a production instance performs under load.

Use at least 300 GB of data. Cloud Bigtable performs best with 1 TB or more of data. However, 300 GB of data is enough to provide reasonable results in a performance test on a 3-node cluster. On larger clusters, use 100 GB of data per node.

Before you test, run a heavy pre-test for several minutes. This step gives Cloud Bigtable a chance to balance data across your nodes based on the access patterns it observes.

Run your test for at least 10 minutes. This step lets Cloud Bigtable further optimize your data, and it helps ensure that you will test reads from disk as well as cached reads from memory.

Reference: https://cloud.google.com/bigtable/docs/performance
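
The data-size guidance above reduces to simple arithmetic. The helper below is a sketch; the function and constant names are hypothetical, and applying the 300 GB floor to clusters smaller than 3 nodes is an assumption, since the explanation only covers 3-node and larger clusters.

```python
MIN_TEST_DATA_GB = 300   # stated minimum for a test on a 3-node cluster
GB_PER_NODE = 100        # stated per-node amount for larger clusters
MIN_TEST_MINUTES = 10    # stated minimum test duration


def min_test_data_gb(nodes: int) -> int:
    """Smallest data volume (GB) suggested for a performance test.

    Assumption: the 300 GB floor also applies to clusters under 3 nodes.
    """
    if nodes < 1:
        raise ValueError("a cluster needs at least one node")
    return max(MIN_TEST_DATA_GB, GB_PER_NODE * nodes)


three_node = min_test_data_gb(3)
ten_node = min_test_data_gb(10)
```

For a 3-node cluster, both rules give 300 GB; beyond that, the per-node rule dominates.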

Cloud Bigtable is a recommended option for storing very large amounts of ____________________________?

A. multi-keyed data with very high latency
B. multi-keyed data with very low latency
C. single-keyed data with very low latency
D. single-keyed data with very high latency
Suggested answer: C

Explanation:

Cloud Bigtable is a sparsely populated table that can scale to billions of rows and thousands of columns, allowing you to store terabytes or even petabytes of data. A single value in each row is indexed; this value is known as the row key.

Cloud Bigtable is ideal for storing very large amounts of single-keyed data with very low latency. It supports high read and write throughput at low latency, and it is an ideal data source for MapReduce operations.

Reference: https://cloud.google.com/bigtable/docs/overview

Google Cloud Bigtable indexes a single value in each row. This value is called the _______.

A. primary key
B. unique key
C. row key
D. master key
Suggested answer: C

Explanation:

Cloud Bigtable is a sparsely populated table that can scale to billions of rows and thousands of columns, allowing you to store terabytes or even petabytes of data. A single value in each row is indexed; this value is known as the row key.

Reference: https://cloud.google.com/bigtable/docs/overview
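
What it means for the row key to be the only indexed value can be sketched with a toy sorted store. This is an illustration, not how Bigtable is implemented; `ToySortedTable`, its methods, and the sample keys are hypothetical.

```python
import bisect


class ToySortedTable:
    """Toy table indexed only by row key, with keys kept in sorted order."""

    def __init__(self):
        self._keys = []   # sorted list of row keys (the only index)
        self._rows = {}   # row key -> dict of column values

    def put(self, row_key, cells):
        if row_key not in self._rows:
            bisect.insort(self._keys, row_key)
        self._rows[row_key] = cells

    def get(self, row_key):
        """Point lookup by row key."""
        return self._rows.get(row_key)

    def scan_prefix(self, prefix):
        """Return (key, cells) pairs for all keys that start with prefix."""
        start = bisect.bisect_left(self._keys, prefix)
        result = []
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break
            result.append((key, self._rows[key]))
        return result


table = ToySortedTable()
table.put("user2#login", {"count": 7})
table.put("user1#login", {"count": 3})
table.put("user1#purchase", {"count": 1})

user1_rows = table.scan_prefix("user1#")
```

Lookups by row key and scans over a key prefix are cheap in this model. Finding rows by a column value, such as `count`, would require reading every row, which is why row key design carries so much weight.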

What is the HBase Shell for Cloud Bigtable?

A. The HBase shell is a GUI-based interface that performs administrative tasks, such as creating and deleting tables.
B. The HBase shell is a command-line tool that performs administrative tasks, such as creating and deleting tables.
C. The HBase shell is a hypervisor-based shell that performs administrative tasks, such as creating and deleting new virtualized instances.
D. The HBase shell is a command-line tool that performs only user account management functions to grant access to Cloud Bigtable instances.
Suggested answer: B

Explanation:

The HBase shell is a command-line tool that performs administrative tasks, such as creating and deleting tables. The Cloud Bigtable HBase client for Java makes it possible to use the HBase shell to connect to Cloud Bigtable.

Reference: https://cloud.google.com/bigtable/docs/installing-hbase-shell

Total 372 questions