AutoMap

Getting started on your Data Transformation with AutoMap.

Last updated 9 months ago

Overview

Data transformation begins with column mapping. AutoMap makes this typically manual process faster and more accurate by automatically suggesting mappings. It uses generative AI to significantly reduce the manual column mapping required during data ingestion. For example, it can immediately map “MSRP” to “Price” and “Energy” to “Calories”. No additional setup is required.

To automate this process, the underlying engine needs to understand the semantics of not just the field names but also the values in each field. The goal is to minimize the need for manual mapping by data specialists. Files often have hundreds of columns to map, and matching similar names is not enough; understanding the semantics is crucial. Generative AI is well suited to automating this time-consuming and laborious process.

Osmos uses LLMs (Large Language Models) to map the source schema to the destination schema. For example, a semantic understanding of the data allows Osmos to determine that a "Drug Code" column maps to "NDC" (which stands for National Drug Code).
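
To make the idea concrete, here is a minimal Python sketch of why matching on names alone is not enough. It is not how Osmos implements AutoMap: AutoMap uses LLMs, whereas this toy combines `difflib` string similarity with a small hand-written `SYNONYMS` table that stands in for semantic knowledge. The function names, the table, and the `threshold` value are all hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for semantic knowledge. A real system would not
# rely on a fixed table like this one.
SYNONYMS = {
    "msrp": "price",
    "energy": "calories",
    "drug code": "ndc",
}


def name_similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity score between two column names."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def suggest_mapping(source_cols, dest_cols, threshold=0.6):
    """Suggest a destination column (or None) for each source column."""
    dest_by_lower = {d.lower(): d for d in dest_cols}
    suggestions = {}
    for src in source_cols:
        # 1. "Semantic" match via the hypothetical synonym table.
        synonym = SYNONYMS.get(src.lower())
        if synonym in dest_by_lower:
            suggestions[src] = dest_by_lower[synonym]
            continue
        # 2. Fall back to plain string similarity on the column names.
        best = max(dest_cols, key=lambda d: name_similarity(src, d))
        if name_similarity(src, best) >= threshold:
            suggestions[src] = best
        else:
            suggestions[src] = None  # left for manual mapping
    return suggestions


if __name__ == "__main__":
    source = ["MSRP", "Energy", "Product Name", "Drug Code"]
    destination = ["Price", "Calories", "product_name", "NDC"]
    for src, dest in suggest_mapping(source, destination).items():
        print(f"{src} -> {dest}")
```

In this sketch, "Product Name" is matched by string similarity, while "MSRP", "Energy", and "Drug Code" share too few characters with their targets and are matched only through the synonym table. That gap is the part a semantic model is meant to close.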

Getting Started

Step 1: Once the data is loaded, the AutoMapping process kicks off automatically.

AutoMap in Action

Step 2: You can then review the mapped data and make any adjustments.

Step 3: If you want to clean up or modify the data in the cleaned column (or resolve errors), click any cell in that column to open the Data Cleanup panel in the left side panel.

The left side panel toggles between column mapping and data cleanup.

Review the available options in the Data Cleanup panel and select the one that best fits your data cleanup needs for this column:

- QuickFixes: One-click data-cleanup buttons that let you easily clean up your data for the most common scenarios for that data type (e.g. Date, Text, Numeric).

- AI Value Mapping: LLM-driven data transformation that automatically maps source data to its nearest semantic match in the destination schema.

- Lookups: Data transformation that searches for and returns specific data based on record matching. It often involves accessing a set of values or records stored in a Dataset Table.

- SmartFill: Simple-to-use, AI-powered data cleanup that learns and detects a pattern from examples of the clean data.

- Formulas: Spreadsheet-style formulas used for complex transformations and data cleanup.

Note: If you are using the Uploader and need to edit specific cells in the output column after applying QuickFixes, click on any cell in the output column and provide a single-cell edit.

Step 4: Use the tabs at the top of the page to filter for rows with errors and rows flagged for review to confirm all issues with your data have been addressed.

Step 5: Once you have repeated Steps 1 through 4 for each of the required output columns (and any of the optional columns you want to include), you can proceed to the next step.
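
As a rough illustration of what the Lookups option does conceptually (record matching against a table of reference values), here is a short Python sketch. It does not use any Osmos API: the table is a plain list of dictionaries, and `build_index`, `lookup`, and the field names are hypothetical. Rows without a match carry an error message, similar in spirit to the rows you would filter for in the errors tab.

```python
def build_index(table, key):
    """Index a list of record dicts by the value of `key`."""
    return {row[key]: row for row in table}


def lookup(rows, source_field, index, return_field):
    """For each row, return (value, error); error is None on a match."""
    results = []
    for row in rows:
        match = index.get(row[source_field])
        if match is None:
            # No matching record: keep the row but flag it.
            results.append((None, f"no match for {row[source_field]!r}"))
        else:
            results.append((match[return_field], None))
    return results


if __name__ == "__main__":
    # Hypothetical reference table and incoming rows.
    price_table = [
        {"sku": "A1", "price": 9.99},
        {"sku": "B2", "price": 4.5},
    ]
    incoming = [{"sku": "A1"}, {"sku": "C3"}]

    index = build_index(price_table, "sku")
    for value, error in lookup(incoming, "sku", index, "price"):
        print(value, error)
```

Building the index once keeps each lookup a constant-time dictionary access, which matters when the incoming file has many rows.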