The Data Prep (Paxata) documentation is now available on the DataRobot public documentation site. See the Data Prep section for user documentation and connector information. After the 2021.2 SP1 release, the content on this site will be removed and replaced with a link to the DataRobot public documentation site.
Use this article to remove rows of data from your Project.
As you prepare your data, you will find times when you want to retain a specific subset of data. The best way to accomplish this is to remove the rows of data that don’t meet your needs. The following is an overview of the elements you work with when you remove rows from your Project.
The data in your Project, you will see your data change as you prepare it.
Filters your data down to a subset that you want to isolate. See theData Filtergramsarticle.
The tool that removes rows from your Project.
Remove rows from your Project
Follow these steps to remove rows of data from your Project.
Add aFiltergramto isolate the rows you want to remove. See theData Filtergramsarticle.
Result:TheData Previewdisplays the records that match the criteria of your filter.
Result:TheFilters on the Current Datasetpanel becomes theFilters for Remove Rowspanel.
Result:The filtered rows are removed. TheData Previewis blank because the filter used is still active and the data has been removed.
To display your updated dataset, do one of the following:
On theFiltergram, clickx clear.
Result:TheFiltergramis cleared. TheData Previewdisplays the updated data.
Pax Pro Tip: Publish the data from the removed rows to a separate AnswerSet for later reference. See the next section for details.
Note: If you update or add to your dataset after the removing rows, the remove rows Step will be applied to the new data in the following ways: 1. If you selected histogram ranges or individual values to remove rows, rows from the new data will only be removed if they satisfy that exact condition. 2. If you selected rows with a string search or dynamic percentile range, then the tool will recompute based on the new data.
Capture data from rows you’ve removed
Follow these steps to add a Lens to publish the rows you removed.
Click the step prior to theRemove Rowsstep.
Result: TheData Previewdisplays all rows, including the ones that existed prior to applying theRemove tool.