As machine learning (ML) technologies move from their discrete existence in research to being highly applied technologies across society, critical scholars have begun to address the epistemological conditions that shape the emergence of such systems and their societal implications. In this paper, we investigate a specific epistemological condition of ML, namely, how ML systems rely on ongoing negotiations and agreements of ‘good enough’ to be deployed. We do so by drawing on ethnographic fieldwork with the British Broadcasting Corporation (BBC) – a large data- and value-driven organisation. In studying the epistemological function and politics of ‘good enough’, we take an AI lab studies approach, following the Recommendations Team's efforts to materialise ‘good enoughness’ and make it negotiable as they develop and modify recommender systems that aim to better serve the BBC's audiences. Through our ethnographic account, we demonstrate how the team relies on various metrics and qualitative evaluations to inform provisional performance thresholds before submitting the ML systems to A/B testing to establish whether one of them is ‘good enough’ to deploy. By following these processes of establishing ‘good enough’, we see how these negotiations are entangled in various, sometimes competing organisational objectives, as well as particular data and technical infrastructures. By extension, we show how the metricised performance scores of AB testing are negotiated in practice by readjusting performance thresholds to manoeuvre different values and constraints. Ultimately, our paper shows that establishing ‘good enough’ is a political endeavour of adjusting seemingly objective evaluation criteria to find the best-fitting metrics and ‘right’ thresholds.
Hansen et al. (Wed,) studied this question.