You define your data model in Data Workbench > Master Segments.

In Treasure Data Workflows (Data Workbench > Workflows) a series of workflows are automatically generated to create and refresh the master segment data model. Data in the data model is refreshed according to the schedule that you define for the master segment.

Creating and Defining Master Segments

  1. From the TD Console, select Data Workbench > Master Segments.

  2. Select New Master Segment.

  3. Specify Master Table , Attribute Tables , and Behavior Tables.

  4. Specify the schedule by which the master segment data is refreshed. We recommend that you run a refresh on a daily basis, to renew the dataset. The master table needs to be updated as well, before master segment creation begins.

  5. Optionally, specify the default timezone.

  6. Specify the Master database and table name. For example:

  7. Optionally, specify the resource pool. This allows you to define how your allotted Treasure Data resources are used to process the data.

    We generally recommend Hive only, because it’s designed for large-scale joins and known for robustness, while Presto and Hive might perform faster for smaller data sets.

  8. Optionally, specify the attribute tables. The Foreign Key and Join Key is used to join Master Table with Attribute Tables. Include a td_ column here for automatic data enrichment.

  9. Optionally, specify tables from which behaviors can be derived. Use the Name field to give a user-friendly name to the specific column. Include a td_ column here for automatic data enrichment.

  10. Save, name, and run your master segment.

  11. Optionally, view the progress of the workflow that is created by defining the master segment. From the main navigation, select Data Workbench > Workflows and select the Master Segment filter.

  12. To verify that the process ran and created your master segment, navigate to Profiles.