The normalization settings in the Model step allow you to specify advanced settings for source data tables. All normalization settings are applied at the table level. To modify multiple tables, you must apply normalization settings for each table individually.
In the Model step, Crux profiles the data source and detects normalization settings, including:
File encoding
Column separator
Row separator
Quote character
Escape character
Header lines to skip
Rows to skip
Trailing lines to skip
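To make these settings concrete, the following is a minimal Python sketch of how they might shape the parsing of a delimited text file. It is an illustration only, not Crux's implementation: the `NormalizationSettings` class, its field names, and the `parse_table` function are hypothetical. It also assumes that "header lines to skip" counts lines before the header row and that "rows to skip" counts data rows after the header; the actual semantics in Crux may differ.

```python
import csv
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class NormalizationSettings:
    # Hypothetical container: field names mirror the settings listed above,
    # not any actual Crux API.
    encoding: str = "utf-8"
    column_separator: str = ","
    row_separator: str = "\n"
    quote_char: str = '"'
    escape_char: Optional[str] = None
    header_lines_to_skip: int = 0    # assumed: lines before the header row
    rows_to_skip: int = 0            # assumed: data rows after the header
    trailing_lines_to_skip: int = 0  # lines dropped from the end of the file


def parse_table(raw: bytes, s: NormalizationSettings) -> Tuple[List[str], List[List[str]]]:
    """Decode raw bytes and return (header, data_rows) under the given settings."""
    text = raw.decode(s.encoding)

    # Split on the configured row separator. This simple approach does not
    # preserve row separators embedded inside quoted fields.
    lines = text.split(s.row_separator)
    if lines and lines[-1] == "":
        lines.pop()  # drop the empty piece left by a final row separator

    # Trim leading and trailing lines that are not part of the table.
    lines = lines[s.header_lines_to_skip:]
    if s.trailing_lines_to_skip:
        lines = lines[:-s.trailing_lines_to_skip]

    reader = csv.reader(
        lines,
        delimiter=s.column_separator,
        quotechar=s.quote_char,
        escapechar=s.escape_char,
    )
    rows = list(reader)
    if not rows:
        return [], []
    return rows[0], rows[1 + s.rows_to_skip:]


if __name__ == "__main__":
    sample = b"# export\nid|name\n1|'a|b'\n2|c\nTOTAL 2\n"
    settings = NormalizationSettings(
        column_separator="|",
        quote_char="'",
        header_lines_to_skip=1,
        trailing_lines_to_skip=1,
    )
    header, data = parse_table(sample, settings)
    print(header)
    print(data)
```

In this sketch the comment line and the trailing summary line are removed before the header is read, and the quote character keeps the pipe inside 'a|b' from being treated as a column separator.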
After completing table profiling, you can modify these settings and re-run the Model step. This is useful in the following scenarios:
Table schema could not be identified
Table schema needs correction
Inaccurate delimiters are detected in a CSV, TXT, or TSV file
You would like to apply advanced settings
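As an illustration of why automatic delimiter detection can need a manual correction, the sketch below uses `csv.Sniffer` from Python's standard library. This is not how Crux profiles a table; the `detect_separator` function and its `override` parameter are hypothetical and stand in for a user-supplied column separator.

```python
import csv
from typing import Optional


def detect_separator(
    sample: str,
    candidates: str = ",;\t|",
    override: Optional[str] = None,
) -> Optional[str]:
    """Return the column separator for a text sample.

    A user-supplied override always wins. Otherwise the separator is
    guessed from the sample, and None is returned if no guess is possible.
    """
    if override is not None:
        return override
    try:
        dialect = csv.Sniffer().sniff(sample, delimiters=candidates)
    except csv.Error:
        # Raised when the sniffer cannot settle on a delimiter.
        return None
    return dialect.delimiter


if __name__ == "__main__":
    sample = "a;b;c\n1;2;3\n4;5;6\n"
    print(detect_separator(sample))                # guessed from the sample
    print(detect_separator(sample, override="|"))  # manual correction
```

Heuristic detection of this kind depends on the sample it sees, which is why an explicit setting is useful when the guess is wrong or no guess can be made.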
To apply normalization settings, follow these steps:
Select the data source format: Delimited text, JSON, XML, Custom, or Raw.
Specify the file encoding, set the column and row separators, and define any header lines, rows, or trailing lines to skip.
Click the Apply button to save the normalization settings.
Click the Profile button to identify schema and metadata based on the applied normalization settings.