There's a shut connection involving machine learning and compression. A program that predicts the posterior probabilities of a sequence offered its complete record can be used for best data compression (by using arithmetic coding around the output distribution).A call tree exhibiting survival probability of passengers to the Titanic Decision tree l