Suppose there are two decision trees🌲: one with only a single split👆🏻 and the other with many leaves🖐🏻. Which do you think is more likely to result from fitting the real estate training data?
In fact, both decision trees can be derived from the real estate training data.
1---🌷The Decision Tree 1
Decision Tree 1 probably makes more sense, because it captures the reality that houses with more bedrooms🛌 tend to sell at higher prices than houses with fewer bedrooms. Its biggest shortcoming is that it doesn't capture most of the factors affecting house price, such as number of bathrooms, lot size, and location.
With that explained, we can already guess what Decision Tree 2 looks like and what it does.
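To make that shortcoming concrete, here is a minimal sketch of a one-split tree written as a plain Python function. The threshold (more than 2 bedrooms) and both prices are invented for illustration, and `predict_tree_1` and `lot_size_sqft` are hypothetical names. They do not come from any real data.

```python
# Hypothetical one-split tree: the threshold and prices are made up for illustration.
def predict_tree_1(bedrooms: int) -> int:
    """Predict a house price from the number of bedrooms only."""
    if bedrooms > 2:
        return 188_000  # assumed average price of houses with more bedrooms
    return 178_000      # assumed average price of houses with fewer bedrooms

# Two houses that differ in lot size but not in bedroom count.
small_lot = {"bedrooms": 3, "lot_size_sqft": 4_000}
large_lot = {"bedrooms": 3, "lot_size_sqft": 12_000}

# The tree never looks at lot size, so both houses get the same prediction.
print(predict_tree_1(small_lot["bedrooms"]))
print(predict_tree_1(large_lot["bedrooms"]))
```

Both houses receive the same predicted price, because the only question this tree asks is about bedrooms.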
2---💐The Decision Tree 2
Decision Tree 2 has more “splits”, so it can capture more factors. Trees with more splits are called "deeper" trees.
Function of Decision Tree 2: it predicts the price of any house by tracing through the tree, always picking the path that corresponds to that house's characteristics.
Appearance of Decision Tree 2: the predicted price for the house is at the bottom of the tree. The point at the bottom where we make a prediction is called a leaf.🌿 (The splits and the values at the leaves are determined by the data.)
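The tracing described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the tree is written by hand as nested dictionaries, and every feature name, threshold, and price in `tree_2` is hypothetical. In a real model the splits and leaf values would be learned from the training data.

```python
# Hypothetical deeper tree: every feature name, threshold, and price is invented.
tree_2 = {
    "feature": "bedrooms", "threshold": 2,
    "no": {
        "feature": "lot_size_sqft", "threshold": 8_500,
        "no": {"price": 146_000},   # leaf
        "yes": {"price": 188_000},  # leaf
    },
    "yes": {
        "feature": "lot_size_sqft", "threshold": 11_500,
        "no": {"price": 170_000},   # leaf
        "yes": {"price": 233_000},  # leaf
    },
}

def predict(node: dict, house: dict) -> int:
    """Trace from the root to a leaf, following the branch that matches the house."""
    while "price" not in node:  # keep going until we reach a leaf
        exceeds = house[node["feature"]] > node["threshold"]
        node = node["yes" if exceeds else "no"]
    return node["price"]

house = {"bedrooms": 3, "lot_size_sqft": 12_000}
print(predict(tree_2, house))  # 233000
```

The example house has more than 2 bedrooms and a lot larger than 11,500 square feet, so the trace takes the "yes" branch twice and ends at the 233,000 leaf.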
Let's get more specific. It's time to Examine Your Data.