The options below appear on the Standard Data Partition dialog.

 

Use Partition Variable

Select this option when assigning each record to a specific data set using an added variable in the data set. Each observation should be assigned a t, v, or s to delineate Training, Validation, or Test, respectively.

Select a partition variable from the Variables in Input Data list, then next to Use partition variable, click > to add the appropriate variable as the partition variable.

Set seed

Random partitioning uses the system clock as a default to initialize the random number seed. By default, this option is selected to specify a seed for random number generation for the partitioning. Setting this option results in the same records being assigned to the same set on successive runs. The default seed entry is 12345.

Pick up rows randomly

When this option is selected, Analytic Solver Data Science randomly selects observations to be included in the Training, Validation, and Test sets.

Automatic percentages

If Pick up rows randomly is selected under Partitioning Options, Automatic percentages is enabled. Select this option to accept the defaults of 60% and 40% for the percentages of records to be included in the Training and Validation Sets. This is the default selection.

Specify percentages

If Pick up rows randomly is selected under Partitioning Options, this option is enabled to manually enter percentages for Training, Validation, and Test Sets. Records are randomly allocated to each set according to these percentages.

Equal percentages

If Pick up rows randomly is selected under Partitioning Options, Equal percentages enabled, and Analytic Solver Data Science allocates 33.33% of the records in the database to Training, Validation, and Test Sets.