AutoClean involves transforming your input data by providing examples of the clean data in the output column.
We automatically create a program to fill the remaining cells in the output column as we sense a pattern across the provided examples.
Step 1: Once you have mapped the input column(s) to the output column, click on any cell in the output (cleaned) column to open the Data Cleanup panel on the right.
Step 2: Select AutoClean your Data at the bottom of the panel.
Step 3: Click into any cell in the output (cleaned) column and provide an example of the clean data for that specific row. Press enter or click the purple enter button to submit the training example. We will learn from this example and output values for each cell based on your training example(s).
Sometimes the transformation is simple enough to output the correct value for each cell, but you may need to provide additional training examples to refine the program.
Step 4: Use the toggles at the top of the page to filter for rows with errors and/or rows flagged for review and provide additional training examples where required.
Step 5: Once you are done, go to the bottom of the right panel andclick Save, then move onto the next output column in the left panel. If you want to restart and choose a different Data Cleanup method for this column, click Start Over.
Training examples have black borders. If you want to adjust the AutoClean program for that column, click on these cells to delete or edit them.
Flagged for Review
Cells that have been flagged for review have orange borders. We automatically detect potential values that are incorrect. Review these cells and provide another training example if they are incorrect.
Provide Example, Timeout, or Error
Cells that have validation errors, timeouts, or cells that require an additional training example have red borders. Click on these cells and provide another training example to resolve the errors and improve the AutoClean program.
Editing or Deleting Training Examples
If you want to edit a training example, click on the cell that contains the training example, edit the training example, then press enter to submit the revised example.
If you want to delete a specific training example, click on the cell that contains the training example, then click on the delete button to the right of that cell. You can also hover over the cell and click “Delete Example” in the tool tip.
Note: if you want to clear all training examples for a specific column, open the Data Cleanup panel and click Start Over. From here you can select AutoClean again and provide new training examples.
Inserting a Training Example with No Data
If you want to provide a training example for a row that does not have data, click on the output cell, leave the cell blank, then press enter.