Crux

In the Model step, Crux scans the selected tables to review the delivery date ranges and file sizes of all underlying files. This allows you to confirm the delivery schedule and ensure the data is what you intend to work with. A file size chart shows the distribution of file sizes, helping you decide if Crux should further model the data into a cleaner structure. Note that larger files may increase the time required for the modeling process.

The modeling process at the table level includes five steps:

Prepare. Crux checks the source data to ensure it’s ready for processing.

Download files. Crux downloads files from the source and starts preparing them for profiling.

Profile contents. This is the core of the modeling process, where Crux:

- Analyzes file formats, encodings, delimiters, and other structural details of the raw files
- Identifies patterns in the raw files
- Infers the delivery schedule based on the frequency of file modifications

Create schemas. If the table is to be delivered as “raw,” this step is skipped. Otherwise, Crux:

- Defines the schema for standardized files, determining field types and formats based on the data.
- Estimates critical statistical information about the field values.
- Refines and groups file patterns based on content.

Generate data pipelines. The Crux workflow manager initiates the pipeline generation process, defining the extraction schedule and setting up data source parameters.

1. Prepare. Crux checks the source data to ensure it’s ready for processing.
2. Download files. Crux downloads files from the source and starts preparing them for profiling.
3. Profile contents. This is the core of the modeling process, where Crux:
 - Analyzes file formats, encodings, delimiters, and other structural details of the raw files
 - Identifies patterns in the raw files
 - Infers the delivery schedule based on the frequency of file modifications
4. Create schemas. If the table is to be delivered as “raw,” this step is skipped. Otherwise, Crux:
 - Defines the schema for standardized files, determining field types and formats based on the data.
 - Estimates critical statistical information about the field values.
 - Refines and groups file patterns based on content.
5. Generate data pipelines. The Crux workflow manager initiates the pipeline generation process, defining the extraction schedule and setting up data source parameters.

Learn about the steps involved in the modeling process.

Understanding the Model step

Find answers and get help from Intercom Support and Community Experts

This site employs cookies and other technologies that we and our third party vendors use to monitor and record personal information about you and your interactions with the site (including content viewed, cursor movements, screen recordings, and chat contents) for the purposes described in our Cookie Policy. By continuing to visit our site, you agree to our {websiteTermsLink}, {privacyPolicyLink} and {cookiePolicyLink}.

This site uses cookies and similar technologies ("cookies") as strictly necessary for site operation. We and our partners also would like to set additional cookies to enable site performance analytics, functionality, advertising and social media features. See our {cookiePolicyLink} for details. You can change your cookie preferences in our Cookie Settings.

We use cookies to make our site work and also for analytics and advertising purposes. You can enable or disable optional cookies as desired. See our {cookiePolicyLink} for more details.

You have the right to opt out of the sale of your personal information. See our {cookiePolicyLink} for more details about how we use your data.

Your Privacy Choices

We use cookies to enhance your experience. You can customize your cookie preferences below. See our {cookiePolicyLink} for more details.

Cookie Settings

Link, Press control-option-right-arrow to exit

Empty Help Center

Uh oh. That page doesn’t exist.

Disappointed

Neutral

Smiley

Thinking...

Searching through sources...

Analyzing...

Tickets submitted through the messenger or by a support agent in your conversation will appear here.