Select the predict tool and provide your DataRobot API token, which is used to retrieve a list of your DataRobot deployments. Note: your token is obtained from the DataRobot application under your user profile option for "API Key Management".
2. Provide a column name for the prediction score (see the field at the top of the screen):
3. Select the deployment—your data will be scored against the model in this deployment. If the model used for scoring is a Time Series Model, then you must indicate this by checking the Time Series Model checkbox, and then go to the Options tab to specify the Forecast Point and, optionally, the Series Id—both of these options are explained below.
Note: deployments for time-aware and custom models are not currently supported.
By default, the new column for the prediction score will be created as “Target” in the dataset. To change this name, click the Options tab and provide a different name in the “Prediction Column” field.
After you select the deployment, the prediction runs. The new column (with the name you provided in step 1) is created and provides the prediction score. In addition, the “Target Prediction Value” column is also generated to provide the associated prediction value for each score. For multiclass predictions, the prediction values are returned per classification. For example, if classifying images into “apple”, “orange” or “pear”, then three additional columns are returned--one value for each corresponding score.
Examples of use case prediction values:
Predict the probability that a hospital patient may be readmitted after discharge. The prediction column will contain a binary value of 1 or 0 to indicate if the patient is likely to be readmitted or not readmitted.
Classify a set of images into one of three fruits: oranges, pears or apples. The prediction column will contain one of three values: orange, pear, apple.
Forecast sales based on forecast dates. The prediction column in this case will contain the sales dollar amount.
For binary and time series prediction deployments, the Options tab provides additional options, which are explained below.
For Times Series predictions, you must also provide the Forecast Point, which is the point you are making a prediction from; a relative time “if it was now…” DataRobot trains models using all potential forecast points in the training data. In production, it is typically the most recent time.
IMPORTANT: the format of this date must be ISO 2014-08-12T00:00:00Z
Optionally, if your dataset has multiseries data, for example a dataset that contains multiple time series to forecast the sales for multiple stores, then you can specify a column as a Series Id to group the data and return the predictions, separately, for each group.
For Binary predictions, the Options tab provides Prediction Explanations that help you to understand why a prediction was returned:
For , for example, “Why did this patient score a 1 for possibility of readmission”; “why was this image identified as an apple".
Additionally, Low and High Threshold values can be set so that explanations are only generated for scores outside of the threshold.
See Prediction Explanations for complete details on values returned for Predictions.