Data Analysis

Analytic Solver Data Science includes several methods for data analysis, which let you:

Create charts, with 8 chart types available,
Use Feature Selection for dimensionality reduction,
Apply Monte Carlo simulation methods, along with charting like that found in @RISK or Analytic Solver Simulation, to analyze data,
Use transformation techniques that handle
- missing data,
- binning continuous data,
- creating dummy variables,
- transforming categorical data,
Use Principal Components Analysis to remove superfluous or redundant variables,
Apply two types of clustering techniques, k-Means and hierarchical,
Perform Text Mining on a set of documents.

Exploring Your Data

Click the Explore icon to apply Monte Carlo simulation methods to your data, use Feature Selection to help decide which variables to include in your classification or prediction models, or use the Chart Wizard to create one or more charts of your data. The Feature Selection tool uses a variety of statistics and data analysis measures to give insight into which variables are the most important or relevant for your classification or prediction model. Analytic Solver Data Science includes 8 types of charts: bar charts, line charts, scatterplots, boxplots, histograms, parallel coordinates charts, scatterplot matrix charts, and variable charts. This menu also allows you to view or edit previously created charts.
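To make the idea of ranking variables by relevance concrete, here is a minimal sketch in Python. It assumes the absolute Pearson correlation with the outcome as the relevance measure, which is only one possible statistic. The data and the names `pearson`, `features`, `outcome`, and `ranking` are hypothetical, and the code illustrates the concept rather than the Feature Selection tool itself.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical candidate input variables and outcome variable.
features = {
    "hours_studied": [1, 2, 3, 4, 5],
    "shoe_size": [9, 7, 10, 8, 9],
}
outcome = [52, 58, 65, 71, 78]

# Rank variables from most to least related to the outcome.
ranking = sorted(
    features,
    key=lambda name: abs(pearson(features[name], outcome)),
    reverse=True,
)
print(ranking)
```

In this sketch, variables near the top of `ranking` would be the stronger candidates for inclusion in a model. Correlation only captures linear relationships, so other measures may rank the same variables differently.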

Transforming Your Data

Click the Transformation icon when data manipulation is required. In most large databases or datasets, some variables are bound to be missing data. Analytic Solver Data Science includes routines for dealing with these missing values by allowing you to either delete the full record or substitute a value of your choice. Analytic Solver Data Science also includes a routine for binning continuous data for use with prediction and classification methods that do not support continuous data. Continuous variables can be binned using several user-specified options. Non-numeric data can be transformed into dummy variables when a variable has up to 30 distinct values. If more than 30 categories exist for a single variable, use the Reduce Categories routine to decrease the number of categories to 30. Finally, use Principal Components Analysis to remove highly correlated or superfluous variables from large databases.
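The missing-value, binning, and dummy-variable transformations described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the product's implementation: the records, the equal-width binning rule, and the helper names `drop_missing`, `fill_missing`, `equal_width_bin`, and `dummy_variables` are all hypothetical.

```python
from statistics import mean

# Hypothetical records; None marks a missing value.
records = [
    {"age": 25, "color": "red"},
    {"age": None, "color": "blue"},
    {"age": 47, "color": "red"},
    {"age": 62, "color": "green"},
]

def drop_missing(rows, field):
    """Delete every record whose `field` is missing."""
    return [r for r in rows if r[field] is not None]

def fill_missing(rows, field, value):
    """Replace missing values of `field` with a user-chosen value."""
    return [{**r, field: value if r[field] is None else r[field]} for r in rows]

def equal_width_bin(x, lo, hi, n_bins):
    """Return the 0-based index of the equal-width bin that contains x."""
    if x >= hi:
        return n_bins - 1
    width = (hi - lo) / n_bins
    return max(0, int((x - lo) // width))

def dummy_variables(rows, field):
    """Create one 0/1 column per distinct category of `field`."""
    categories = sorted({r[field] for r in rows})
    return [{f"{field}_{c}": int(r[field] == c) for c in categories} for r in rows]

dropped = drop_missing(records, "age")                  # option 1: delete the record
filled = fill_missing(records, "age",
                      mean(r["age"] for r in dropped))  # option 2: substitute a value
bins = [equal_width_bin(r["age"], 20, 70, 5) for r in filled]
dummies = dummy_variables(records, "color")
```

Here the substituted value is the mean of the observed ages, but any value could be passed to `fill_missing`. Equal-width bins are only one way to discretize a continuous variable.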

Using Cluster Analysis

Click the Cluster icon to access two types of clustering techniques: k-Means clustering and hierarchical clustering. Both methods give insight into a database or dataset by performing a cluster analysis, which can be used to measure the degree of similarity (or dissimilarity) between the individual objects being clustered.
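As a rough illustration of how k-Means groups similar objects, the following Python sketch implements the basic Lloyd's iteration with Euclidean distance as the dissimilarity measure. The function `kmeans`, the sample points, and the choice of starting centroids are assumptions made for this example and do not describe how Analytic Solver Data Science implements the method.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm: alternate point assignment and centroid update."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(iterations):
        # Assign each point to its nearest centroid (Euclidean distance).
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Move each centroid to the mean of the points assigned to it.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centroids.append(old)  # leave an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # assignments have stabilized
        centroids = new_centroids
    return labels, centroids

# Hypothetical 2-D observations forming two visible groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
labels, centroids = kmeans(points, centroids=[points[0], points[3]])
print(labels, centroids)
```

The starting centroids are fixed here so the run is repeatable. In practice the result of k-Means depends on the initial centroids, so different starting points can produce different clusters.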

Text Mining

Click the Text icon to use the Text Miner tool to analyze a collection of text documents for patterns and trends. (In the Cloud app, this tool is included in the Text section of the Ribbon.) These algorithms can categorize documents, reveal links between documents that were not otherwise noted, and create visual maps of the documents. Analytic Solver Data Science takes an integrated approach to text mining by combining text processing and analysis in a single package. While Analytic Solver Data Science is effective for mining “pure text” such as a set of documents, it is especially useful for “integrated text and data mining” applications such as maintenance reports, evaluation forms, or any situation where a combination of structured data and free-form text data is available.