Learning from Observations

Using information theory


•		To implement Choose-Attribute in the DTL algorithm

•		Entropy:

	I(P(v₁), …, P(v_n)) = Σ_{i=1}^{n} −P(v_i) log₂ P(v_i)
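
As a rough illustration of the entropy formula, here is a minimal Python sketch. The function name `entropy` and the list-of-probabilities input are assumptions made for this example and do not come from the slides. Terms with P(v_i) = 0 are skipped, following the usual convention that they contribute nothing to the sum.

```python
import math


def entropy(probs):
    """Entropy in bits of a discrete distribution given as a list of probabilities."""
    if abs(sum(probs) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    # Skip zero-probability outcomes: p * log2(p) tends to 0 as p tends to 0.
    return sum(-p * math.log2(p) for p in probs if p > 0)


print(entropy([0.5, 0.5]))  # fair coin: 1.0 bit
print(entropy([1.0, 0.0]))  # certain outcome: 0.0 bits
```

A fair coin carries one full bit of information, while an outcome that is already certain carries none.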

•		For a training set containing p positive examples and n negative examples: