Clustering and classification of document structure: a machine learning approach