LogoLogo
Back to OsmosDeveloper DocsOsmos BlogWhat's New
  • Welcome to Osmos
  • 👋Getting Started with Osmos
    • Terminology
  • 🎉What's New
  • 🧩Osmos API Reference
  • ⌨️Osmos Chat
  • 👩‍💻Developer Docs
    • Manage API Keys
    • Embedding an Osmos Uploader
    • Embedding Uploader Jobs Table
    • Turning on Advanced Mode Uploader
    • Customizing Uploader Styling
    • Passing Parameterized Fields
    • Configuring Uploader's "Recall" functionality
    • Optional Uploader Settings
    • Uploader Submission Callback
    • Configuring AutoClean for your Uploader
    • Uploader Client-Side Validation
      • Data Validators
      • Checking for Duplicate values in a field
      • Creating Dropdown-Controlled Fields
      • Dynamic Dropdown Options
      • Dropdown Interaction with Validation Functions
    • Validation and Transformation Webhooks
      • OpenAPI Validation Webhook Testing
    • Parser Webhook for file based connectors
  • 🔠Datasets
    • Osmos Datasets
      • Uploading Data to your Table
      • Creating Primary and Foreign keys
      • Osmos Dataset Destination Connector
      • Osmos Dataset Source Connector
      • Dataset Edits
    • Datasets Query Builder
      • Query Builder Metadata
    • Performing Look Ups
      • Performing Joins
        • Types of Joins
  • ⏏️Uploader
    • Creating an Osmos Uploader
      • Testing your Osmos Uploader
    • Uploader Validation Summary
    • Advanced Mode
      • Overview
      • Process
    • Standard Mode
      • Overview
      • AutoClean
      • Process
    • AI AutoMapping
    • Uploaders Page
    • Uploader Details Page
  • 🔀Pipelines
    • Step 1. Select the Source
    • Step 2. Select a Destination
    • Step 3. Map & Transform Data
    • Step 4. Schedule the Pipeline
    • Step 5. Review & Confirm
    • Pipelines Page
    • Pipeline Details Page
  • ⏩Data Transformations
    • AutoMap
    • Column Mapping & Data Cleanup Panel
    • QuickFixes
    • AI Value Mapping
    • AI AutoClean
    • Lookups
      • Performing Lookups
    • SmartFill
    • Formulas
      • Date & Time Formulas
        • DateTime Format Specifiers
        • Timezone specifiers
      • Math Formulas and Operators
      • Logical Formulas & Operators
        • True & False Casting
      • Text Formulas
      • Other Formulas
    • Deduplication
  • ↘️Source Connectors
    • Amazon S3
    • Azure Blob Storage
    • BigQuery
    • Email
    • FTP
    • Google Cloud Storage (GCS)
    • Google Drive
    • Google Sheets
    • HTTP API (Call an Osmos API)
    • HTTP API (Osmos Calls Your API)
    • Osmos Dataset
    • Snowflake
    • Accessing Sources behind firewall
  • ↖️Destination Connectors
    • Amazon S3
    • BigQuery
    • FTP
    • Google Cloud Storage (GCS)
    • Google Drive
    • Google Sheets
    • HTTP API (Call an Osmos API)
    • HTTP API (Osmos Calls Your API)
      • Passing Dynamic Tokens in the API Header
    • MySQL
    • Osmos Dataset
    • PostgreSQL
    • Snowflake
    • Accessing Destinations behind firewall
  • 🗂️Projects
  • ⚙️Administration
    • Email Notifications
  • 🔒Security
  • 📞Support
  • Back to Osmos.io
Powered by GitBook
On this page
  • Contents
  • What is AI AutoClean?
  • Getting Started
  • Scenario
  • Features
  • Performing AutoClean
  • Scenario 1: Uploader, Advanced Mode

Was this helpful?

  1. Data Transformations

AI AutoClean

PreviousAI Value MappingNextLookups

Last updated 9 months ago

Was this helpful?

Contents

  1. What is AI AutoClean?

  2. Getting Started

What is AI AutoClean?

With AI Autoclean, users can provide simple instructions using everyday language to tell Osmos how to transform the data.

For example, this can be as simple as, “Convert to US phone number format in (XXX) XXX-XXXX format” or "Format as a US phone number with area code in parenthesis."

Here’s an example of a more sophisticated transformation:

AutoClean can be used throughout Osmos in both Uploaders and Pipelines. It is most commonly used with an Uploader and is located under the QuickFix menu. AutoClean democratizes the transformation process, making it viable to transition the work from sophisticated data experts to everyday users!

Getting Started

Scenario

Customer ABC is ingesting employee data. The employee data includes employee type, which needs to empower end-users to tell Osmos how to clean the data using natural language. In this scenario, see how to use AutoClean to write and apply AutoClean Quickflix.

Features

  • Uploader Advanced Mode

Performing AutoClean

Scenario 1: Uploader, Advanced Mode

Step 1: Go to your newly created Uploader and upload your data. Once the AutoMapping is complete, select the field you wish to initiate the Auto Clean. In our scenario it is Employee_Type.

Step 2: Select Transform on the left-hand side column mapping field. In this scenario, it will be the destination Employee_Type field.

Step 3: Select the Auto Clean QuickFix. Enter how you would like to clean the field.

AI AutoClean is available for Advanced Mode Uploader and Pipelines, the user experience is the same. Note, that AutoClean is most commonly used with Uploaders. .

⏩
See the scenario below