Using information theory
n To implement Choose-Attribute in the
DTL algorithm
n Entropy:
I(P(v1), … , P(vn)) = Σi=1 -P(vi) log2 P(vi)
n For a training set containing p positive
examples and n negative examples: