The Remove transformation in Nexadata Pipelines allows you to delete one or more selected columns from your dataset. This transformation is useful for removing unnecessary data, simplifying your dataset, and focusing only on the information that matters for your analysis. Depending on the level of control required, you can configure the transformation using Natural Language Mode or Advanced Mode.
Inputs for the Remove Transformation
Name of the Transformation: In Natural Language Mode, the transformation name is automatically generated, but you can update it later in Advanced Mode. For example, you might rename it to "Remove Sensitive Columns" or "Delete Unused Fields".
Columns to Remove: Select one or more columns to delete from the dataset. For example, you may choose columns such as "Employee_SSN", "Address", or "Notes" to remove unnecessary or sensitive information.
Using Natural Language Mode
In Natural Language Mode, describe the columns you want to remove, and Nexadata will automatically configure the transformation. The transformation name is auto-generated but can be modified later in Advanced Mode.
Example Instructions in Natural Language Mode
Remove the Employee_SSN and Address columns.
Delete the Notes column.
Remove all columns related to personal information.
Delete columns Salary and Bonus.
Remove Department_Code and Project_ID columns.
Note: If Natural Language Mode doesn’t fully capture your requirements, you can switch to Advanced Mode to make adjustments.
Using Advanced Mode
In Advanced Mode, you have complete control over the Remove transformation, allowing you to manually select which columns to delete. Advanced Mode allows for detailed control, ensuring the transformation precisely aligns with your analytical needs.
Steps in Advanced Mode
Name of the Transformation: Enter or update a custom name, such as "Delete Unused Fields" or "Remove Confidential Columns".
Columns to Remove: Select the columns you want to delete. For example, you might select "Employee_SSN", "Address", or "Email".
Example Use Case
The Remove transformation is ideal for streamlining datasets by removing extraneous or sensitive columns. For example, suppose you have an employee dataset with columns Employee_SSN, Address, and Phone_Number that you want to remove for privacy. For example:
Transformation Name: Remove Sensitive Information
Columns to Remove: Employee_SSN, Address, Phone_Number
This configuration will delete the Employee_SSN, Address, and Phone_Number columns, ensuring the dataset only contains relevant information.
Summary
The Remove transformation in Nexadata Pipelines enables you to delete unnecessary columns, simplifying your dataset and focusing on essential information. Use Natural Language Mode for a quick setup or Advanced Mode for detailed control. This transformation is ideal for refining datasets by eliminating unwanted columns.