Skip to main content

Question 63

  • You work for a large retailer and have been asked to segment your customers by their purchasing habits. The purchase history of all customers has been uploaded to BigQuery. You suspect that there may be several distinct customer segments, however you are unsure of how many, and you don’t yet understand the commonalities in their behavior. You want to find the most efficient solution. What should you do?
  • A. Create a k-means clustering model using BigQuery ML. Allow BigQuery to automatically optimize the number of clusters.
  • B. Create a new dataset in Dataprep that references your BigQuery table. Use Dataprep to identify similarities within each column.
  • C. Use the Data Labeling Service to label each customer record in BigQuery. Train a model on your labeled data using AutoML Tables. Review the evaluation metrics to understand whether there is an underlying pattern in the data.
  • D. Get a list of the customer segments from your company’s Marketing team. Use the Data Labeling Service to label each customer record in BigQuery according to the list. Analyze the distribution of labels in your dataset using Data Studio.


References 
https://cloud.google.com/bigquery/docs/kmeans-tutorial

https://towardsdatascience.com/how-to-use-k-means-clustering-in-bigquery-ml-to-understand-and-describe-your-data-better-c972c6f5733b