Import the raw dataset into the pipeline so that its records are available to the cleaning and transformation steps that follow.
Import
data
Deduplicate the imported dataset: where several records share the same primary key, keep the first occurrence and drop the rest, so the output is consistent and ready for analysis.
Transform
Deduplicate rows
Keep first
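The "Keep first" behaviour of this step can be sketched in plain JavaScript. This is only an illustration of the technique, not the node's actual implementation; the helper name `dedupeKeepFirst`, the key field `id`, and the sample records are all assumptions.

```javascript
// Hypothetical helper: returns a new array holding only the first record
// seen for each value of keyField. Input order is preserved.
function dedupeKeepFirst(records, keyField) {
  const seen = new Set();
  const result = [];
  for (const record of records) {
    const key = record[keyField];
    if (seen.has(key)) continue; // a record with this key was already kept
    seen.add(key);
    result.push(record);
  }
  return result;
}

// Illustrative data only; "id" stands in for the primary key.
const sampleRows = [
  { id: 1, name: "Ada" },
  { id: 2, name: "Grace" },
  { id: 1, name: "Ada (duplicate)" },
];
const uniqueRows = dedupeKeepFirst(sampleRows, "id");
console.log(uniqueRows.length);
```

A `Set` gives constant-time membership checks, so the pass stays linear in the number of records. Note that keys are compared by strict identity, so `1` and `"1"` would count as different keys.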
Export the deduplicated dataset to Excel (XLSX) format for spreadsheet-based reporting and stakeholder analysis.
Code
JavaScript
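The original does not show what this Code step contains. One possible sketch, assuming a Node.js environment with the SheetJS `xlsx` package installed, is below. The helper names `toSheetRows` and `writeXlsx`, the sheet name, and the output path are hypothetical.

```javascript
// Hypothetical helper: turns an array of records into an array of arrays,
// with a header row taken from the keys of the first record.
function toSheetRows(records) {
  if (records.length === 0) return [];
  const headers = Object.keys(records[0]);
  const body = records.map((record) =>
    headers.map((header) => record[header] ?? null) // missing values become empty cells
  );
  return [headers, ...body];
}

// Assumption: the SheetJS "xlsx" package is available. It is loaded lazily
// so the helper above can be used without it.
function writeXlsx(records, filePath) {
  const XLSX = require("xlsx");
  const worksheet = XLSX.utils.aoa_to_sheet(toSheetRows(records));
  const workbook = XLSX.utils.book_new();
  XLSX.utils.book_append_sheet(workbook, worksheet, "Deduplicated");
  XLSX.writeFile(workbook, filePath);
}

// Example call (path is illustrative):
// writeXlsx(dedupedRecords, "deduplicated.xlsx");
```

Taking the headers from the first record assumes every record has the same fields; a dataset with ragged records would need the union of all keys instead.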
Export the deduplicated dataset to CSV format for lightweight sharing and for loading into other systems and analytics pipelines.
Code
JavaScript
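As with the other export steps, the body of this Code step is not shown. A minimal sketch of CSV serialization without external libraries might look like the following; the function names and the choice of RFC 4180 style quoting with CRLF line endings are assumptions, and `writeCsv` additionally assumes Node.js.

```javascript
// Quote a field only when it contains a comma, a double quote, or a line
// break; embedded double quotes are doubled.
function escapeCsvField(value) {
  const text = value === null || value === undefined ? "" : String(value);
  return /[",\r\n]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Hypothetical helper: serializes records to CSV text, with the header row
// taken from the keys of the first record.
function toCsv(records) {
  if (records.length === 0) return "";
  const headers = Object.keys(records[0]);
  const lines = [headers.map(escapeCsvField).join(",")];
  for (const record of records) {
    lines.push(headers.map((header) => escapeCsvField(record[header])).join(","));
  }
  return lines.join("\r\n") + "\r\n";
}

// Assumption: Node.js, so the built-in fs module is available.
function writeCsv(records, filePath) {
  require("fs").writeFileSync(filePath, toCsv(records), "utf8");
}

// Example call (path is illustrative):
// writeCsv(dedupedRecords, "deduplicated.csv");
```

Quoting only where needed keeps the file small and readable while still surviving commas and quotes inside field values.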
Export the deduplicated dataset to JSON format as machine-readable output for APIs and automation.
Code
JavaScript
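The JSON export is the simplest of the three, since JavaScript serializes plain records natively. The sketch below is one way it could be written, not the step's actual code; `toJson` and `writeJson` are hypothetical names, and `writeJson` assumes Node.js.

```javascript
// Hypothetical helper: pretty-prints the records as a JSON array with
// two-space indentation and a trailing newline.
function toJson(records) {
  return JSON.stringify(records, null, 2) + "\n";
}

// Assumption: Node.js, so the built-in fs module is available.
function writeJson(records, filePath) {
  require("fs").writeFileSync(filePath, toJson(records), "utf8");
}

// Example call (path is illustrative):
// writeJson(dedupedRecords, "deduplicated.json");
```

Indented output is easier to inspect by hand; passing no indentation argument to `JSON.stringify` would produce a more compact file for machine consumers.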