LogoLogo
Back to OsmosDeveloper DocsOsmos BlogWhat's New
  • Welcome to Osmos
  • 👋Getting Started with Osmos
    • Terminology
  • 🎉What's New
  • 🧩Osmos API Reference
  • ⌨️Osmos Chat
  • 👩‍💻Developer Docs
    • Manage API Keys
    • Embedding an Osmos Uploader
    • Embedding Uploader Jobs Table
    • Turning on Advanced Mode Uploader
    • Customizing Uploader Styling
    • Passing Parameterized Fields
    • Configuring Uploader's "Recall" functionality
    • Optional Uploader Settings
    • Uploader Submission Callback
    • Configuring AutoClean for your Uploader
    • Uploader Client-Side Validation
      • Data Validators
      • Checking for Duplicate values in a field
      • Creating Dropdown-Controlled Fields
      • Dynamic Dropdown Options
      • Dropdown Interaction with Validation Functions
    • Validation and Transformation Webhooks
      • OpenAPI Validation Webhook Testing
    • Parser Webhook for file based connectors
  • 🔠Datasets
    • Osmos Datasets
      • Uploading Data to your Table
      • Creating Primary and Foreign keys
      • Osmos Dataset Destination Connector
      • Osmos Dataset Source Connector
      • Dataset Edits
    • Datasets Query Builder
      • Query Builder Metadata
    • Performing Look Ups
      • Performing Joins
        • Types of Joins
  • ⏏️Uploader
    • Creating an Osmos Uploader
      • Testing your Osmos Uploader
    • Uploader Validation Summary
    • Advanced Mode
      • Overview
      • Process
    • Standard Mode
      • Overview
      • AutoClean
      • Process
    • AI AutoMapping
    • Uploaders Page
    • Uploader Details Page
  • 🔀Pipelines
    • Step 1. Select the Source
    • Step 2. Select a Destination
    • Step 3. Map & Transform Data
    • Step 4. Schedule the Pipeline
    • Step 5. Review & Confirm
    • Pipelines Page
    • Pipeline Details Page
  • ⏩Data Transformations
    • AutoMap
    • Column Mapping & Data Cleanup Panel
    • QuickFixes
    • AI Value Mapping
    • AI AutoClean
    • Lookups
      • Performing Lookups
    • SmartFill
    • Formulas
      • Date & Time Formulas
        • DateTime Format Specifiers
        • Timezone specifiers
      • Math Formulas and Operators
      • Logical Formulas & Operators
        • True & False Casting
      • Text Formulas
      • Other Formulas
    • Deduplication
  • ↘️Source Connectors
    • Amazon S3
    • Azure Blob Storage
    • BigQuery
    • Email
    • FTP
    • Google Cloud Storage (GCS)
    • Google Drive
    • Google Sheets
    • HTTP API (Call an Osmos API)
    • HTTP API (Osmos Calls Your API)
    • Osmos Dataset
    • Snowflake
    • Accessing Sources behind firewall
  • ↖️Destination Connectors
    • Amazon S3
    • BigQuery
    • FTP
    • Google Cloud Storage (GCS)
    • Google Drive
    • Google Sheets
    • HTTP API (Call an Osmos API)
    • HTTP API (Osmos Calls Your API)
      • Passing Dynamic Tokens in the API Header
    • MySQL
    • Osmos Dataset
    • PostgreSQL
    • Snowflake
    • Accessing Destinations behind firewall
  • 🗂️Projects
  • ⚙️Administration
    • Email Notifications
  • 🔒Security
  • 📞Support
  • Back to Osmos.io
Powered by GitBook
On this page
  • Function
  • AutoClean in Action
  • AutoClean Configuration Example
  • AutoClean Operations
  • Customizing AutoClean Output

Was this helpful?

  1. Developer Docs

Configuring AutoClean for your Uploader

PreviousUploader Submission CallbackNextUploader Client-Side Validation

Last updated 9 months ago

Was this helpful?

Function

AutoClean is a capability which allows end users clean data with one click. Your Osmos Uploader comes with prebuilt AutoClean capability for certain scenarios detailed below.

AutoClean is not available in Advanced Mode. For more detail on how to clean messy data in Advanced Mode, please see our documentation.

In step 2 of an upload, Map and Transform Data, AutoClean is displayed as a toggle above the "our Field" pane.

AutoClean in Action

AutoClean Configuration Example

On your schema fields, you can add an attribute called autoCleanMode that will let you configure whether or not AutoClean is on or off by default, or disabled entirely, for a given field. The options are:

  • 'auto': Toggle is off by default, can be turned on. This is the default setting if you don't specify a mode for a field.

  • 'onForced': AutoClean is on by default and cannot be turned off.

  • 'onButAllowDisable': AutoClean is on by default and can be turned off.

  • 'disabled': AutoClean is off and cannot be turned on for this field.


 Osmos.configure({
        schema: {
          fields: [
            {
              name: 'date',
              displayName: 'date',
              description: '<Your field description here>',
              // Disabled for this field, cannot be turned on
              autoCleanMode: 'disabled',
            },
            {
              name: 'datetime',
              displayName: 'datetime',
              description: '<Your field description here>',
              // autoCleanMode defaults to 'auto' if you don't set it
            },
            {
              name: 'bool',
              displayName: 'bool',
              description: '<Your field description here>',
              // Toggle starts off, and you can turn it on
              autoCleanMode: 'auto',
            },
            {
              name: 'int',
              displayName: 'int',
              description: '<Your field description here>',
              // Toggle starts on, and you cannot turn it off
              autoCleanMode: 'onForced',
            },
            {
              name: 'double',
              displayName: 'double',
              description: '<Your field description here>',
              // Toggle starts on, and you can turn it off
              autoCleanMode: 'onButAllowDisable',
            },
          ],
        },
        token: 'your_token',
        uploadDescription:
          '<Include a description of your uploader upload here which will be shown to users that click the uploader>',
        hideUploadSchema: true,
        hideUploadDescription: true,
        disableAdvancedMode: true,
      });

AutoClean Operations

The table below describes what cleanup operations will be performed by AutoClean, depending on the data type of the destination field, and whether or not the field is required.

Destination Field Type
Nullable
Required

Integer

  1. Strip non-numeric symbols

    • Example: $8 -> 8

  2. Round to the nearest whole number

    • Example: 8.34 -> 8

    • Example: $8.79 -> 9

  1. Strip non-numeric symbols

    • Example: $8 -> 8

  2. Round to the nearest whole number

    • Example: 8.34 -> 8

    • Example: $8.79 -> 9

  3. If no value is in the source data, enter 0

  4. If source data is not parse-able as a number, it will remain unaltered and show an error

Float

  1. Strip non-numeric symbols

    • Example: $8.79 -> 8.79

  1. Strip non-numeric symbols

    • Example: $8.79 -> 8.79

  2. If no value is in the source data, set to 0.0

  3. If source data is not parse-able as a float, it will remain unaltered and show an error

Date

If data is not parse-able as a date, it will be set to null

If data is not parse-able as a date, it will remain unaltered and show an error

Datetime

If data is not parse-able as a date and time, it will be set to null

If data is not parse-able, it will remain unaltered and show an error

Boolean

  • 0, F, False, N, No (case insensitive) will map to false

  • 1, T, True, Y, Yes (case insensitive) will map to true

If data is none of the above, the output will be set to null

  • 0, F, False, N, No (case insensitive) will map to false

  • 1, T, True, Y, Yes (case insensitive) will map to true

If data is none of the above, it will remain unaltered and show an error

Text

Text will remain unaltered by Osmos AutoClean

Text will remain unaltered by Osmos AutoClean

Customizing AutoClean Output

By default, AutoClean will run, then send the resulting data to the webhook. The webhook can then return final writeback values, errors, or warnings to the Osmos data mapping page.

You can override or augment AutoClean output data by setting up for the destination connector of your uploader.

If you don't want AutoClean to be performed before data is sent to the webhook for a given field you can set that field's autoCleanMode to 'disabled' as mentioned in the section above. In which case the input data will be sent to your webhook unaltered. When autoCleanMode is set to disabled, writeback values from the validation webhook will not be applied.

👩‍💻
Server Side Validation Webhooks
AutoClean Configuration Example
Advanced Mode