Professional Data Engineer Certification Exam
Question 1 to 50
Question 1
Your company built a TensorFlow neutral-network model with a large number of neurons and layers. ...
Question 2
You are building a model to make clothing recommendations. You know a user's fashion preference i...
Question 3
You designed a database for patient records as a pilot project to cover a few hundred patients in...
Question 4
You create an important report for your large team in Google Data Studio 360. The report uses Goo...
Question 5
An external customer provides you with a daily dump of data from their database. The data flows i...
Question 6
Your weather app queries a database every 15 minutes to get the current temperature. The frontend...
Question 7
You are creating a model to predict housing prices. Due to budget constraints, you must run it on...
Question 8
You are building new real-time data warehouse for your company and will use Google BigQuery strea...
Question 9
Your company is using WILDCARD tables to query data across multiple tables with similar names. Th...
Question 10
Your company is in a highly regulated industry. One of your requirements is to ensure individual ...
Question 11
You are designing a basket abandonment system for an ecommerce company. The system will send a me...
Question 12
Your company handles data processing for a number of different clients. Each client prefers to us...
Question 13
You want to process payment transactions in a point-of-sale application that will run on Google C...
Question 14
You want to use a database of information about tissue samples to classify future tissue samples ...
Question 15
You need to store and analyze social media postings in Google BigQuery at a rate of 10,000 messag...
Question 16
Your startup has never implemented a formal security policy. Currently, everyone in the company h...
Question 17
Your company is migrating their 30-node Apache Hadoop cluster to the cloud. They want to re-use H...
Question 18
Business owners at your company have given you a database of bank transactions. Each row contains...
Question 19
Your company's on-premises Apache Hadoop servers are approaching end-of-life, and IT has decided ...
Question 20
You work for a car manufacturer and have set up a data pipeline using Google Cloud Pub/Sub to cap...
Question 21
Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion s...
Question 22
Your company has hired a new data scientist who wants to perform complicated analyses across very...
Question 23
You are deploying 10,000 new Internet of Things devices to collect temperature data in your wareh...
Question 24
You have spent a few days loading data from comma-separated values (CSV) files into the Google Bi...
Question 25
You want to use Google Stackdriver Logging to monitor Google BigQuery usage. You need an instant ...
Question 26
You are working on a sensitive project involving private user data. You have set up a project on ...
Question 27
You are building a model to predict whether or not it will rain on a given day. You have thousand...
Question 28
Your company is performing data preprocessing for a learning algorithm in Google Cloud Dataflow. ...
Question 29
Your company is streaming real-time sensor data from their factory floor into Bigtable and they h...
Question 30
Your company's customer and order databases are often under heavy load. This makes performing ana...
Question 31
You have Google Cloud Dataflow streaming pipeline running with a Google Cloud Pub/Sub subscriptio...
Question 32
Your company is running their first dynamic campaign, serving different offers by analyzing real-...
Question 33
Your software uses a simple JSON format for all messages. These messages are published to Google ...
Question 34
Flowlogistic Case Study -Company Overview -Flowlogistic is a leading logistics and supply chain p...
Question 35
Flowlogistic Case Study -Company Overview -Flowlogistic is a leading logistics and supply chain p...
Question 36
Flowlogistic Case Study -Company Overview -Flowlogistic is a leading logistics and supply chain p...
Question 37
Flowlogistic Case Study -Company Overview -Flowlogistic is a leading logistics and supply chain p...
Question 38
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 39
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 40
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 41
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 42
Your company has recently grown rapidly and now ingesting data at a significantly higher rate tha...
Question 43
You work for a large fast food restaurant chain with over 400,000 employees. You store employee i...
Question 44
You are deploying a new storage system for your mobile application, which is a media streaming se...
Question 45
You work for a manufacturing plant that batches application log files together into a single log ...
Question 46
You work for an economic consulting firm that helps companies identify economic trends as they ha...
Question 47
You are designing the database schema for a machine learning-based food ordering service that wil...
Question 48
Your company is loading comma-separated values (CSV) files into Google BigQuery. The data is full...
Question 49
Your company produces 20,000 files every hour. Each data file is formatted as a comma separated v...
Question 50
You are choosing a NoSQL database to handle telemetry data submitted from millions of Internet-of...
Question 51 to 100
Question 51
You are training a spam classifier. You notice that you are overfitting the training data. Which ...
Question 52
You are implementing security best practices on your data pipeline. Currently, you are manually e...
Question 53
You are using Google BigQuery as your data warehouse. Your users report that the following simple...
Question 54
Your globally distributed auction application allows users to bid on items. Occasionally, users p...
Question 55
Your organization has been collecting and analyzing data in Google BigQuery for 6 months. The maj...
Question 56
You have enabled the free integration between Firebase Analytics and Google BigQuery. Firebase no...
Question 57
Your company is currently setting up data pipelines for their campaign. For all the Google Cloud ...
Question 58
You architect a system to analyze seismic data. Your extract, transform, and load (ETL) process r...
Question 59
An online retailer has built their current application on Google App Engine. A new initiative at ...
Question 60
You launched a new gaming app almost three years ago. You have been uploading log files from the ...
Question 61
Your analytics team wants to build a simple statistical model to determine which customers are mo...
Question 62
Your company receives both batch- and stream-based event data. You want to process the data using...
Question 63
You have some data, which is shown in the graphic below. The two dimensions are X and Y, and the ...
Question 64
You are integrating one of your internal IT applications and Google BigQuery, so users can query ...
Question 65
You are building a data pipeline on Google Cloud. You need to prepare data using a casual method ...
Question 66
You set up a streaming data insert into a Redis cluster via a Kafka cluster. Both clusters are ru...
Question 67
You are developing an application that uses a recommendation engine on Google Cloud. Your solutio...
Question 68
You are selecting services to write and transform JSON messages from Cloud Pub/Sub to BigQuery fo...
Question 69
Your infrastructure includes a set of YouTube channels. You have been tasked with creating a proc...
Question 70
You are designing storage for very large text files for a data pipeline on Google Cloud. You want...
Question 71
You are developing an application on Google Cloud that will automatically generate subject labels...
Question 72
You are designing storage for 20 TB of text files as part of deploying a data pipeline on Google ...
Question 73
You are designing storage for two relational tables that are part of a 10-TB database on Google C...
Question 74
Your financial services company is moving to cloud technology and wants to store 50 TB of financi...
Question 75
An organization maintains a Google BigQuery dataset that contains tables with user-level data. Th...
Question 76
Government regulations in your industry mandate that you have to maintain an auditable record of ...
Question 77
Your neural network model is taking days to train. You want to increase the training speed. What ...
Question 78
You are responsible for writing your company's ETL pipelines to run on an Apache Hadoop cluster. ...
Question 79
Your company maintains a hybrid deployment with GCP, where analytics are performed on your anonym...
Question 80
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 81
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...
Question 82
MJTelco Case Study -Company Overview -MJTelco is a startup that plans to build networks in rapidl...