An automatic finite-sample robustness metric: when can dropping a little data change conclusions? Part I: definitions and experiments | Synapse