Create a regression model with BigQuery DataFrames
Create a linear regression model that predicts the body mass of penguins by using the BigQuery DataFrames API.
Explore further
For detailed documentation that includes this code sample, see the following:
Code sample
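The following is a minimal sketch of the workflow this page describes, not a definitive implementation. It assumes the `bigframes` package is installed and that a Google Cloud project with BigQuery access is configured. The column names and the species string are assumptions based on the public `bigquery-public-data.ml_datasets.penguins` table, and the constant and function names are illustrative. Rows with a missing body mass are treated here as the test set.

```python
# Sketch only: names below are illustrative, and the column names and
# species string are assumed from the public penguins table.

PENGUINS_TABLE = "bigquery-public-data.ml_datasets.penguins"
ADELIE = "Adelie Penguin (Pygoscelis adeliae)"
FEATURE_COLUMNS = [
    "island",
    "culmen_length_mm",
    "culmen_depth_mm",
    "flipper_length_mm",
    "sex",
]
LABEL_COLUMN = "body_mass_g"


def train_and_predict():
    """Train a linear regression on Adelie penguins and predict missing masses."""
    # Imported inside the function so that loading this module does not
    # itself require bigframes or BigQuery credentials.
    import bigframes.pandas as bpd
    from bigframes.ml.linear_model import LinearRegression

    # Load the public penguins table into a BigQuery DataFrame.
    df = bpd.read_gbq(PENGUINS_TABLE)

    # Keep only the Adelie species, then drop the now-constant species column.
    adelie = df[df["species"] == ADELIE].drop(columns=["species"])

    # Rows without any nulls form the training data.
    training = adelie.dropna()
    features = training[FEATURE_COLUMNS]
    label = training[[LABEL_COLUMN]]

    # Rows whose body mass is missing are the ones to predict (assumed test set).
    test = adelie[adelie[LABEL_COLUMN].isnull()]

    # Create and train the model.
    model = LinearRegression()
    model.fit(features, label)

    # Evaluation metrics, computed here on the training data itself.
    score = model.score(features, label)

    # Predict body mass for the rows where it is missing.
    predictions = model.predict(test)
    return score, predictions
```

Calling `train_and_predict()` runs the whole workflow against BigQuery and returns the evaluation metrics together with the predictions, both as BigQuery DataFrames.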
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
This example demonstrates creating a linear regression model to predict penguin body mass using the BigQuery DataFrames API.
The code uses the bigquery-public-data.ml_datasets.penguins dataset, focusing on the Adelie Penguin species.
The script loads the data, filters by species, drops irrelevant columns, handles null values, and splits the data into training and test sets.
A LinearRegression model is created, trained, and scored using the specified feature and label columns, and predictions are made on the test set.
The sample uses the BigQuery DataFrames library with Python.