Topic extraction from news archive using TF*PDF algorithm | Synapse