This section illustrates how to use XLMiner's partition utility with the example data set, Wine.xlsx.
On the XLMiner ribbon, from the Applying Your Model tab, select Help - Examples, then Forecasting/Data Mining Examples, and open the Wine.xlsx example data set. Select a cell within this data set, then from the Data Mining tab, select Partition - Standard Partition to open the Standard Data Partition dialog.
From the Variables In Input Data list, highlight all variables, then click > to include them in the partitioned data. Click OK to accept the remainder of the default settings. Of the observations (Partitioning percentages when picking up rows randomly), 60% will be assigned to the Training Set, and 40% will be assigned to the Validation Set.
The Data_Partition worksheet is inserted to the right of the Data worksheet.
There are 107 observations assigned to the Training Set, and 71 observations assigned to the Validation Set (i.e., 60% and 40% of the observations, respectively).
It is also possible to specify which sets each observations should be assigned. In column 0, enter t, v, or s to indicate the assignment of each record to either the Training Set (t), the Validation Set (v), or the Test Set (s), as shown in the screenshot below.
On the XLMiner ribbon, from the Data Mining tab, select Partition - Standard Partition to open the Standard Data Partition dialog.
From the Selected Variables list, select a partition variable. Under Partitioning Options, select Use partition variable, then click < next to Partition Variable. XLMiner uses the values in the Partition Variable column to create the Training, Validation, and Test Sets. Records with a t in the O column will be designated as training records. Records with a v in the O column will be designated as validating records, and records with an s will be designated as testing records. Now highlight all remaining variables in the Selected Variables list, and click < to include them in the partitioned data. Click OK to create the partitions.
The worksheet Data_Partition1 is inserted to the right of the Data worksheet. If you inspect the results, you will find that all records assigned a t now belong to the Training Set, all records assigned a v now belong to the Validation Set, and all records assigned an s now belong to the Test Set.