MinIO

Transfer data at scale from your warehouse to MinIO

Supported syncing

Object Type	Description	Supported Sync Modes
Any data set	Sync data from a source to MinIO as CSV, JSON, or Parquet files	Insert, All, Diff

All: All mode creates one file with all the rows in the query results, every time the sync runs.
Insert: Insert mode creates one file with the rows that were added since the last sync.
Diff: Diff mode creates three files: one for rows added, one for rows changed, and one for rows removed since the last sync.

For more information about sync modes, refer to the sync modes docs.
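To make the Diff mode split concrete, here is a minimal sketch of how rows could be sorted into added, changed, and removed sets between two runs. It assumes each row carries a unique key column (called id here, a hypothetical name) and is only an illustration, not Hightouch's actual implementation.

```python
def diff_rows(previous, current, key="id"):
    """Split rows into added, changed, and removed lists, matched on a key column."""
    prev_by_key = {row[key]: row for row in previous}
    curr_by_key = {row[key]: row for row in current}
    # Added: keys present now that were absent in the previous run
    added = [row for k, row in curr_by_key.items() if k not in prev_by_key]
    # Changed: keys present in both runs whose row contents differ
    changed = [
        row for k, row in curr_by_key.items()
        if k in prev_by_key and row != prev_by_key[k]
    ]
    # Removed: keys present in the previous run that are gone now
    removed = [row for k, row in prev_by_key.items() if k not in curr_by_key]
    return added, changed, removed


# Hypothetical query results from two consecutive sync runs
previous = [{"id": 1, "email": "a@example.com"}, {"id": 2, "email": "b@example.com"}]
current = [{"id": 2, "email": "b2@example.com"}, {"id": 3, "email": "c@example.com"}]
added, changed, removed = diff_rows(previous, current)
```

In this sketch, All mode would correspond to writing current in full on every run, and Insert mode to writing only added.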

The order of rows in uploaded files may differ from how they appear in your model. This is expected behavior and applies to all sync modes. For more information, refer to the row ordering docs.

Prerequisites

To get started, you need:

a MinIO bucket
MinIO access key and secret key for use by Hightouch

Connect to MinIO

Go to the Destinations overview page and click the Add destination button. Select MinIO and click Continue. You can then authenticate Hightouch to MinIO by entering your Bucket Name. Enter only the name of the bucket, not a URL.

Sync configuration

Once you've set up your MinIO destination and have a model to pull data from, you can set up your sync configuration to begin syncing data. Go to the Syncs overview page and click the Add sync button to begin. Then, select the relevant model and the MinIO destination you want to sync to.

Select file format

Hightouch supports syncing JSON, CSV, and Parquet files to MinIO.

Enter filename

The filename or object key field lets you specify the parent directory and the name of the file you want to use for your results. You can include timestamp variables in the filename, surrounding each with {}. Hightouch supports these timestamp variables:

YYYY: Represents the full year in four digits.
YY: The last two digits of the year.
MM: Two-digit month format (01-12).
DD: Two-digit day format (01-31).
HH: Two-digit hour format in 24-hour clock (00-23).
mm: Two-digit minute format (00-59).
ss: Two-digit second format (00-59).
ms: Three-digit millisecond format (000-999).
X: Unix timestamp in seconds.
x: Unix timestamp in milliseconds.

All dates and times are UTC.

For example, you could enter upload/{YYYY}-{MM}-{DD}-{HH}-{mm}-result.json to dynamically include the year, month, day, hour, and minute in each uploaded file. Hightouch would insert each file in the upload directory, which would need to already exist in your bucket.
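The substitution described above can be sketched in a few lines. This is an illustrative approximation of the behavior, not Hightouch's code: the render_filename function and the run time are hypothetical, and placeholders it doesn't recognize (such as {model.id}) are left untouched.

```python
import re
from datetime import datetime, timezone


def render_filename(template, when):
    """Replace {TOKEN} timestamp placeholders using a datetime converted to UTC."""
    when = when.astimezone(timezone.utc)
    seconds = int(when.timestamp())
    values = {
        "YYYY": f"{when.year:04d}",
        "YY": f"{when.year % 100:02d}",
        "MM": f"{when.month:02d}",
        "DD": f"{when.day:02d}",
        "HH": f"{when.hour:02d}",
        "mm": f"{when.minute:02d}",
        "ss": f"{when.second:02d}",
        "ms": f"{when.microsecond // 1000:03d}",
        "X": str(seconds),
        "x": str(seconds * 1000 + when.microsecond // 1000),
    }
    # Tokens are case-sensitive (MM is month, mm is minute); unknown ones pass through
    return re.sub(
        r"\{([A-Za-z]+)\}",
        lambda m: values.get(m.group(1), m.group(0)),
        template,
    )


# Hypothetical sync run time
run_time = datetime(2024, 3, 5, 14, 7, 9, tzinfo=timezone.utc)
filename = render_filename("upload/{YYYY}-{MM}-{DD}-{HH}-{mm}-result.json", run_time)
print(filename)  # upload/2024-03-05-14-07-result.json
```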

You can also use other variable values to include sync metadata in the filename:

{model.id}
{model.name}
{sync.id}
{sync.run.id}
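As a rough illustration of how these metadata variables resolve, the sketch below fills them with plain string replacement. The fill_metadata helper and all of the values are hypothetical; how Hightouch formats or sanitizes the real values isn't covered here.

```python
def fill_metadata(template, metadata):
    """Replace each {name} placeholder with the matching metadata value."""
    for name, value in metadata.items():
        template = template.replace("{" + name + "}", str(value))
    return template


# Hypothetical model and sync metadata
metadata = {"model.name": "customers", "sync.id": 42, "sync.run.id": 1001}
path = fill_metadata("exports/{model.name}/{sync.id}-{sync.run.id}.csv", metadata)
print(path)  # exports/customers/42-1001.csv
```

Including {sync.run.id} in the filename gives each run a distinct path, which is one way to avoid overwriting earlier results.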

If a file already exists at the path you entered at the time of a sync, Hightouch overwrites it. To keep different versions of the same results file, you can enable versioning in your bucket, or your app can copy the data to another location.

If you are using an audience and want to include the audience name in the filename, use {model.name} as well.

Set filename offset

By default, Hightouch uses the timestamp of the sync run to fill in timestamp variables. You can optionally include an offset in seconds. For example, if you want the filename's date to be 24 hours before the sync takes place, enter -86400 (24 hours * 60 minutes * 60 seconds). If you want the filename's date to be one hour after the sync takes place, enter 3600 (60 minutes * 60 seconds).
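The offset arithmetic can be checked with a short sketch. The apply_offset helper and the run time are hypothetical; the example only shows how a signed number of seconds shifts the timestamp used for the filename.

```python
from datetime import datetime, timedelta, timezone


def apply_offset(run_time, offset_seconds):
    """Shift the sync run timestamp by a signed number of seconds."""
    return run_time + timedelta(seconds=offset_seconds)


# Hypothetical sync run shortly after midnight UTC
run_time = datetime(2024, 3, 5, 0, 30, tzinfo=timezone.utc)

day_before = apply_offset(run_time, -86400)  # 24 * 60 * 60 seconds earlier
hour_after = apply_offset(run_time, 3600)    # 60 * 60 seconds later

print(day_before.strftime("%Y-%m-%d"))     # 2024-03-04
print(hour_after.strftime("%Y-%m-%d-%H"))  # 2024-03-05-01
```

A negative offset like this is useful when a sync that runs just after midnight should produce a file named for the previous day.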

CSV options

If you're syncing to a CSV file, you have additional configuration options:

Delimiter: Your options are comma (,), semicolon (;), pipe (|), tilde (~), and tab.
(Optional) Whether to include a CSV header row in the exported files.
(Optional) Whether to include a byte order mark (BOM) in the exported files. The BOM is the character U+FEFF.
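To show what these three options change in the output, here is a minimal sketch built on Python's standard csv module. The write_csv function, its parameter names, and the sample row are hypothetical, and the quoting and line-ending choices are assumptions rather than a description of the files Hightouch produces.

```python
import csv
import io


def write_csv(rows, delimiter=",", include_header=True, include_bom=False):
    """Serialize a list of dict rows to UTF-8 CSV bytes."""
    if not rows:
        return b""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(rows[0].keys()),
        delimiter=delimiter,
        lineterminator="\n",  # assumption: Unix line endings
    )
    if include_header:
        writer.writeheader()
    writer.writerows(rows)
    text = buffer.getvalue()
    if include_bom:
        # U+FEFF becomes the bytes EF BB BF when encoded as UTF-8
        text = "\ufeff" + text
    return text.encode("utf-8")


rows = [{"id": 1, "email": "a@example.com"}]
piped = write_csv(rows, delimiter="|", include_bom=True)
no_header = write_csv(rows, delimiter="\t", include_header=False)
```

A BOM mainly helps spreadsheet applications detect UTF-8, while some parsers treat it as part of the first column name, so it is worth checking what the consumer of the file expects.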

Columns to sync

You can export all columns exactly as your model returns them or choose to export specific ones.

If you need to rename or transform any column values you're syncing, you can use the advanced mapper to do so. If you choose this option, Hightouch only syncs the fields you explicitly map.

For example, you could selectively export the customer_id, email, first_name, and last_name columns, mapping them to the destination fields id, email, and name, where name is a templated concatenation of first_name and last_name. Hightouch writes only these mapped fields to the file and ignores all other columns from your results.
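The effect of such a mapping on a single row can be sketched as follows. The map_row function is hypothetical, and joining the two name parts with a space is an assumption about the template; the point is only that unmapped columns do not reach the output.

```python
def map_row(row):
    """Keep only explicitly mapped fields, renaming and combining columns."""
    return {
        "id": row["customer_id"],
        "email": row["email"],
        # Assumed template: first and last name joined by a single space
        "name": f'{row["first_name"]} {row["last_name"]}',
    }


# Hypothetical model row with one column (plan) that is not mapped
source_row = {
    "customer_id": 7,
    "email": "ada@example.com",
    "first_name": "Ada",
    "last_name": "Lovelace",
    "plan": "pro",
}
mapped = map_row(source_row)
print(mapped)  # {'id': 7, 'email': 'ada@example.com', 'name': 'Ada Lovelace'}
```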
