AutoClean

Last updated 10 months ago

Contents

  1. What is AutoClean?
  2. Getting Started
  3. AI Value Mapping through AutoClean (Enums)
  4. Osmos AutoClean Operations

What is AutoClean?

AutoClean lets end users clean data with one click. Your Osmos Uploader comes with prebuilt AutoClean capability for the scenarios detailed below, and developers can also transform data on the fly by creating their own AutoClean functions. AutoClean also lets end users quickly map values to a dropdown list of enum options.

AutoClean is a feature for the Standard Mode Uploader.

Getting Started

Step 1: Once you have mapped the input (source) column(s) to the output (destination) column, turn the AutoClean toggle on.

Step 2: If there is data that can be cleaned, the Uploader displays a sample of the data to be cleaned. Review the sample of new values and select Accept.

If there is no data to clean, you will receive a message that no data can be AutoCleaned.

Step 3: AutoCleaned data stays marked in purple so you can see that it was changed by the AutoClean process. Any field can still be edited directly in the field itself.

Step 4: To review all of your AutoCleaned records, select the Filter at the top.

AI Value Mapping through AutoClean

Value Mapping lets end users map enumerations ("enums"). Enums are part of the Uploader validation: users map a group of source values to a specific valid option. In Standard Mode, the list of valid options is configured in your validation, and the mapping itself happens in AutoClean.

AI Value Mapping in Action
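
As a rough illustration of what a value mapping does once it has been accepted, the sketch below applies a mapping from source values to valid options. It does not reproduce how Osmos proposes the mapping; the function name `apply_value_mapping`, the hand-written `mapping` dictionary, and the sample options are all hypothetical and exist only for this example.

```python
def apply_value_mapping(values, mapping, valid_options):
    """Replace each source value with its mapped valid option.

    Returns the cleaned list plus the values that could not be mapped.
    Hypothetical helper for illustration, not part of any Osmos API.
    """
    cleaned, unmapped = [], []
    for value in values:
        if value in valid_options:
            # Already a valid option, keep as-is.
            cleaned.append(value)
        elif mapping.get(value) in valid_options:
            # A mapping exists and points at a valid option.
            cleaned.append(mapping[value])
        else:
            # No usable mapping: leave the value untouched and report it.
            cleaned.append(value)
            unmapped.append(value)
    return cleaned, unmapped


# Assumed sample data: the valid options would come from your validation.
valid_options = {"Active", "Inactive"}
mapping = {"A": "Active", "active": "Active", "disabled": "Inactive"}

cleaned, unmapped = apply_value_mapping(
    ["A", "Inactive", "disabled", "unknown"], mapping, valid_options
)
```

Here `"A"` and `"disabled"` are rewritten to valid options, `"Inactive"` passes through, and `"unknown"` is left as-is and reported so it can be fixed by hand.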

Osmos AutoClean Operations

The lists below describe the cleanup operations AutoClean performs, depending on the data type of the destination field and whether the field is nullable or required.

Integer

  Nullable:
  1. Strip non-numeric symbols
    • Example: $8 -> 8
  2. Round to the nearest whole number
    • Example: 8.34 -> 8
    • Example: $8.79 -> 9

  Required:
  1. Strip non-numeric symbols
    • Example: $8 -> 8
  2. Round to the nearest whole number
    • Example: 8.34 -> 8
    • Example: $8.79 -> 9
  3. If no value is in the source data, enter 0
  4. If source data is not parse-able as a number, it will remain unaltered and show an error

Float

  Nullable:
  1. Strip non-numeric symbols
    • Example: $8.79 -> 8.79

  Required:
  1. Strip non-numeric symbols
    • Example: $8.79 -> 8.79
  2. If no value is in the source data, set to 0.0
  3. If source data is not parse-able as a float, it will remain unaltered and show an error

Date

  Nullable: If data is not parse-able as a date, it will be set to null

  Required: If data is not parse-able as a date, it will remain unaltered and show an error

Datetime

  Nullable: If data is not parse-able as a date and time, it will be set to null

  Required: If data is not parse-able, it will remain unaltered and show an error

Boolean

  Nullable:
  • 0, F, False, N, No (case insensitive) will map to false
  • 1, T, True, Y, Yes (case insensitive) will map to true
  If data is none of the above, the output will be set to null

  Required:
  • 0, F, False, N, No (case insensitive) will map to false
  • 1, T, True, Y, Yes (case insensitive) will map to true
  If data is none of the above, it will remain unaltered and show an error

Text

  Nullable: Text will remain unaltered by Osmos AutoClean

  Required: Text will remain unaltered by Osmos AutoClean
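
The rules above for required Integer and Float fields, and for Boolean fields, can be approximated in a few lines of Python. This is a sketch only, not Osmos's implementation. It assumes that "non-numeric symbols" means everything except digits, a decimal point, and a minus sign, that ties round half up, and that surrounding whitespace is ignored. All function names are hypothetical.

```python
import re
from decimal import Decimal, InvalidOperation, ROUND_HALF_UP

TRUE_VALUES = {"1", "t", "true", "y", "yes"}
FALSE_VALUES = {"0", "f", "false", "n", "no"}


def _strip_non_numeric(text):
    # Assumption: keep only digits, the decimal point, and a minus sign.
    return re.sub(r"[^0-9.\-]", "", text)


def clean_required_integer(raw):
    """Return (value, error) for a required Integer field."""
    text = "" if raw is None else str(raw).strip()
    if text == "":
        return 0, None  # no source value: enter 0
    try:
        number = Decimal(_strip_non_numeric(text))
    except InvalidOperation:
        return raw, "not parse-able as a number"  # left unaltered
    # Assumption: ties round half up (the source only says "nearest").
    return int(number.quantize(Decimal("1"), rounding=ROUND_HALF_UP)), None


def clean_required_float(raw):
    """Return (value, error) for a required Float field."""
    text = "" if raw is None else str(raw).strip()
    if text == "":
        return 0.0, None  # no source value: set to 0.0
    try:
        return float(_strip_non_numeric(text)), None
    except ValueError:
        return raw, "not parse-able as a float"  # left unaltered


def clean_boolean(raw, required):
    """Return (value, error) for a Boolean field."""
    key = "" if raw is None else str(raw).strip().lower()
    if key in TRUE_VALUES:
        return True, None
    if key in FALSE_VALUES:
        return False, None
    if required:
        return raw, "not a recognised boolean"  # left unaltered
    return None, None  # nullable: set to null
```

For example, `clean_required_integer("$8.79")` strips the dollar sign and rounds to 9, while `clean_boolean("maybe", required=False)` yields a null value with no error.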

Note: For Uploader embedded deployments, you can update the configuration schema to always have AutoClean on for specific fields.

Note: You can also set up your own instances of AutoClean to perform custom cleanup operations on input data by setting up Server Side Validation Webhooks for the destination connector of your uploader. Please review the AutoClean Developer Docs for more info.