Extracting concepts from file names: a new file clustering criterion | Synapse