Wolfram Language & System Documentation Center

"DecisionTree" (Machine Learning Method)

Method for Predict, Classify and LearnDistribution.
Use a decision tree to model class probabilities, value predictions or probability densities.

Details & Suboptions

A decision tree is a flow chart–like structure in which each internal node represents a "test" on a feature, each branch represents the outcome of the test, and each leaf represents a class distribution, value distribution or probability density.
For Classify and Predict, the tree is constructed using the CART algorithm.
For LearnDistribution, the splits are determined using an information criterion trading off the likelihood and the complexity of the model.
The following options can be given:

	"DistributionSmoothing"	1	regularization parameter
	"FeatureFraction"	1	the fraction of features to be randomly selected for training (only in Classify and Predict)

Examples

open all close all

Basic Examples (3)

Train a predictor function on labeled examples:

Wolfram Language code: p = Predict[{1, 2, 3, 4} -> {.3, .4, .6, 9}, Method -> "DecisionTree"]

Look at the information about the predictor:

Wolfram Language code: Information[p]

Extract option information that can be used for retraining:

Wolfram Language code: Information[p, "MethodOption"]

Predict a new example:

Wolfram Language code: p[1.3]

Generate some data and visualize it:

Wolfram Language code:

data = Table[x -> Sin[x] + RandomVariate[NormalDistribution[0, .2]], {x, RandomReal[{-10, 10}, 400]}];
ListPlot[List@@@data]

Train a predictor function on it:

Wolfram Language code: p = Predict[data, Method -> "DecisionTree"]

Compare the data with the predicted values and look at the standard deviation:

Wolfram Language code:

Show[Plot[
	{p[x], 
	p[x] + StandardDeviation[p[x, "Distribution"]], p[x] - StandardDeviation[p[x, "Distribution"]]}, 
	{x, -2, 6}, 
	PlotStyle -> {Blue, Gray, Gray}, 
	Filling -> {2 -> {3}}, 
	Exclusions -> False, 
	PerformanceGoal -> "Speed", PlotLegends -> {"Prediction", "Confidence Interval"}], ListPlot[List@@@data, PlotStyle -> Red, PlotLegends -> {"Data"}]]

Learn a distribution using the method "DecisionTree":

Wolfram Language code: data = RandomVariate[NormalDistribution[], 1000];

Wolfram Language code: ld = LearnDistribution[data, Method -> "DecisionTree"]

Visualize the PDF obtained:

Wolfram Language code: Plot[PDF[ld, x], {x, -5, 5}, Filling -> Bottom]

Obtain information about the distribution:

Wolfram Language code: Information[ld]

Options (4)

"DistributionSmoothing" (2)

Use the "DistributionSmoothing" option to train a classifier:

Wolfram Language code: c = Classify[{1, 2, 3, 4, 5, 6} -> {1, 1, 3, 3, 1, 3}, Method -> {"DecisionTree", "DistributionSmoothing" -> .3}]

Use the mushrooms training set to train a classifier with the default value of "DistributionSmoothing":

Wolfram Language code: data = ExampleData[{"MachineLearning", "Mushroom"}, "TrainingData"];

Wolfram Language code: classifier = Classify[data, Method -> "DecisionTree"];

Train a second classifier using a large "DistributionSmoothing":

Wolfram Language code: smoothed = Classify[data, Method -> {"DecisionTree", "DistributionSmoothing" -> 100}]

Compare the probabilities for examples from a test set:

Wolfram Language code: testdata = ExampleData[{"MachineLearning", "Mushroom"}, "TestData"];

Wolfram Language code:

sample = RandomSample[testdata, 4];
Dataset@<|"AutomaticClassifier" -> 
	classifier[sample[[All, 1]], "Probabilities"], 
	"SmoothedClassifier" -> smoothed[sample[[All, 1]], "Probabilities"]|>

"FeatureFraction" (2)

Use the "FeatureFraction" option to train a classifier:

Wolfram Language code:

c = Classify[{{1, 2.3, 4, 5.3}, {2, 2.3, 2.4, 5}, {2, 2.3, 2.4, 5}, {1, 3, 4, -5.2}, {2, -5, -3.2, 5}, {2, 1.3, -8.1, 3.3}} -> {1, 1, 3, 3, 1, 3}, Method -> {"DecisionTree", "FeatureFraction" -> .5}]

Use the mushrooms training set to train two classifiers with different values of "FeatureFraction":

Wolfram Language code: data = ExampleData[{"MachineLearning", "Mushroom"}, "TrainingData"];

Wolfram Language code: c1 = Classify[data, Method -> {"DecisionTree", "FeatureFraction" -> 1}]

Wolfram Language code: c2 = Classify[data, Method -> {"DecisionTree", "FeatureFraction" -> .1}]

Look at the accuracy of these classifiers on a test set:

Wolfram Language code: testdata = ExampleData[{"MachineLearning", "Mushroom"}, "TestData"];

Wolfram Language code: ClassifierMeasurements[c1, testdata, "Accuracy"]

Wolfram Language code: ClassifierMeasurements[c2, testdata, "Accuracy"]

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

"DecisionTree" (Machine Learning Method)

Details & Suboptions

Examples

Basic Examples (3)

Options (4)

"DistributionSmoothing" (2)

"FeatureFraction" (2)

"DecisionTree" (Machine Learning Method)

Details & Suboptions

Examples

Basic Examples (3)

Options (4)

"DistributionSmoothing" (2)

"FeatureFraction" (2)

See Also

Related Links

History