
Professional Data Engineer Certification Exam

Question 1 to 50

Question 1

Your company built a TensorFlow neural-network model with a large number of neurons and layers. ...

Question 2

You are building a model to make clothing recommendations. You know a user's fashion preference i...

Question 3

You designed a database for patient records as a pilot project to cover a few hundred patients in...

Question 4

You create an important report for your large team in Google Data Studio 360. The report uses Goo...

Question 5

An external customer provides you with a daily dump of data from their database. The data flows i...

Question 6

Your weather app queries a database every 15 minutes to get the current temperature. The frontend...

Question 7

You are creating a model to predict housing prices. Due to budget constraints, you must run it on...

Question 8

You are building a new real-time data warehouse for your company and will use Google BigQuery strea...

Question 9

Your company is using WILDCARD tables to query data across multiple tables with similar names. Th...

Question 10

Your company is in a highly regulated industry. One of your requirements is to ensure individual ...

Question 11

You are designing a basket abandonment system for an ecommerce company. The system will send a me...

Question 12

Your company handles data processing for a number of different clients. Each client prefers to us...

Question 13

You want to process payment transactions in a point-of-sale application that will run on Google C...

Question 14

You want to use a database of information about tissue samples to classify future tissue samples ...

Question 15

You need to store and analyze social media postings in Google BigQuery at a rate of 10,000 messag...

Question 16

Your startup has never implemented a formal security policy. Currently, everyone in the company h...

Question 17

Your company is migrating their 30-node Apache Hadoop cluster to the cloud. They want to re-use H...

Question 18

Business owners at your company have given you a database of bank transactions. Each row contains...

Question 19

Your company's on-premises Apache Hadoop servers are approaching end-of-life, and IT has decided ...

Question 20

You work for a car manufacturer and have set up a data pipeline using Google Cloud Pub/Sub to cap...

Question 21

Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion s...

Question 22

Your company has hired a new data scientist who wants to perform complicated analyses across very...

Question 23

You are deploying 10,000 new Internet of Things devices to collect temperature data in your wareh...

Question 24

You have spent a few days loading data from comma-separated values (CSV) files into the Google Bi...

Question 25

You want to use Google Stackdriver Logging to monitor Google BigQuery usage. You need an instant ...

Question 26

You are working on a sensitive project involving private user data. You have set up a project on ...

Question 27

You are building a model to predict whether or not it will rain on a given day. You have thousand...

Question 28

Your company is performing data preprocessing for a learning algorithm in Google Cloud Dataflow. ...

Question 29

Your company is streaming real-time sensor data from their factory floor into Bigtable and they h...

Question 30

Your company's customer and order databases are often under heavy load. This makes performing ana...

Question 31

You have a Google Cloud Dataflow streaming pipeline running with a Google Cloud Pub/Sub subscriptio...

Question 32

Your company is running their first dynamic campaign, serving different offers by analyzing real-...

Question 33

Your software uses a simple JSON format for all messages. These messages are published to Google ...

Question 34

Flowlogistic Case Study: Company Overview. Flowlogistic is a leading logistics and supply chain p...

Question 35

Flowlogistic Case Study: Company Overview. Flowlogistic is a leading logistics and supply chain p...

Question 36

Flowlogistic Case Study: Company Overview. Flowlogistic is a leading logistics and supply chain p...

Question 37

Flowlogistic Case Study: Company Overview. Flowlogistic is a leading logistics and supply chain p...

Question 38

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 39

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 40

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 41

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 42

Your company has recently grown rapidly and is now ingesting data at a significantly higher rate tha...

Question 43

You work for a large fast food restaurant chain with over 400,000 employees. You store employee i...

Question 44

You are deploying a new storage system for your mobile application, which is a media streaming se...

Question 45

You work for a manufacturing plant that batches application log files together into a single log ...

Question 46

You work for an economic consulting firm that helps companies identify economic trends as they ha...

Question 47

You are designing the database schema for a machine learning-based food ordering service that wil...

Question 48

Your company is loading comma-separated values (CSV) files into Google BigQuery. The data is full...

Question 49

Your company produces 20,000 files every hour. Each data file is formatted as a comma separated v...

Question 50

You are choosing a NoSQL database to handle telemetry data submitted from millions of Internet-of...

Question 51 to 100

Question 51

You are training a spam classifier. You notice that you are overfitting the training data. Which ...

Question 52

You are implementing security best practices on your data pipeline. Currently, you are manually e...

Question 53

You are using Google BigQuery as your data warehouse. Your users report that the following simple...

Question 54

Your globally distributed auction application allows users to bid on items. Occasionally, users p...

Question 55

Your organization has been collecting and analyzing data in Google BigQuery for 6 months. The maj...

Question 56

You have enabled the free integration between Firebase Analytics and Google BigQuery. Firebase no...

Question 57

Your company is currently setting up data pipelines for their campaign. For all the Google Cloud ...

Question 58

You architect a system to analyze seismic data. Your extract, transform, and load (ETL) process r...

Question 59

An online retailer has built their current application on Google App Engine. A new initiative at ...

Question 60

You launched a new gaming app almost three years ago. You have been uploading log files from the ...

Question 61

Your analytics team wants to build a simple statistical model to determine which customers are mo...

Question 62

Your company receives both batch- and stream-based event data. You want to process the data using...

Question 63

You have some data, which is shown in the graphic below. The two dimensions are X and Y, and the ...

Question 64

You are integrating one of your internal IT applications and Google BigQuery, so users can query ...

Question 65

You are building a data pipeline on Google Cloud. You need to prepare data using a casual method ...

Question 66

You set up a streaming data insert into a Redis cluster via a Kafka cluster. Both clusters are ru...

Question 67

You are developing an application that uses a recommendation engine on Google Cloud. Your solutio...

Question 68

You are selecting services to write and transform JSON messages from Cloud Pub/Sub to BigQuery fo...

Question 69

Your infrastructure includes a set of YouTube channels. You have been tasked with creating a proc...

Question 70

You are designing storage for very large text files for a data pipeline on Google Cloud. You want...

Question 71

You are developing an application on Google Cloud that will automatically generate subject labels...

Question 72

You are designing storage for 20 TB of text files as part of deploying a data pipeline on Google ...

Question 73

You are designing storage for two relational tables that are part of a 10-TB database on Google C...

Question 74

Your financial services company is moving to cloud technology and wants to store 50 TB of financi...

Question 75

An organization maintains a Google BigQuery dataset that contains tables with user-level data. Th...

Question 76

Government regulations in your industry mandate that you have to maintain an auditable record of ...

Question 77

Your neural network model is taking days to train. You want to increase the training speed. What ...

Question 78

You are responsible for writing your company's ETL pipelines to run on an Apache Hadoop cluster. ...

Question 79

Your company maintains a hybrid deployment with GCP, where analytics are performed on your anonym...

Question 80

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 81

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...

Question 82

MJTelco Case Study: Company Overview. MJTelco is a startup that plans to build networks in rapidl...