Skip to main content

How to Create a New Nexadata Dataset

Learn how to create a new dataset in Nexadata using the CSV format with support for JSON and Parquet coming soon.

Updated this week

When working with Nexadata, you can create new datasets to manage your data pipelines effectively. This guide walks you through the process of adding a new dataset, including defining the source's name, format, and connection options.

Step-by-step Instructions

Step 1: Open the "Create New Dataset" Form

  • Navigate to the Nexadata dashboard and click on Add New Dataset.

  • This will bring up the dataset creation form.

Step 2: Enter a Name for Your Dataset

  • Provide a name in the Name field. While not required, it is recommended that this name be unique.

Step 3: Select Data Connection

  • From the Data Connection dropdown, choose the appropriate connection. You may see options like "sample data" or your organization’s available connections.

Step 4: Choose Data Format

  • Currently, Nexadata supports the following data formats:

    • Delimited data support includes CSV (Comma-Separated Values), TSV (Tab-Separated Values), and Semicolon.

    • Support for JSON and Parquet formats is coming soon, so they cannot be used at this time.

Step 5: Specify details based on the Data Connection

Please see this article on supported Nexadata connections.

Step 6: Submit the Form

  • Once all fields are filled, click the Submit button at the bottom of the form.

  • Nexadata will now register your new dataset.

Coming Soon: JSON and Parquet Support

Nexadata will soon support JSON and Parquet data formats. Once available, you can choose these options in the Data Format section.

Troubleshooting

  • If the S3 Bucket or S3 Path fields are not populating, ensure that your AWS permissions are correctly configured.

  • Double-check your data file for any formatting issues (e.g., using the wrong delimiter for CSV files).

Did this answer your question?