
Emergent Unfairness in Algorithmic Fairness-Accuracy Trade-Off Research

A. Feder Cooper, afc78@cornell.edu, Cornell University, Department of Computer Science, Ithaca, NY, USA, 14850 and Ellen Abrams, ema85@cornell.edu, Cornell University, Society for the Humanities, Ithaca, NY, USA, 14850
(2021)
Abstract.

Across machine learning (ML) sub-disciplines, researchers make explicit mathematical assumptions in order to facilitate proof-writing. We note that, specifically in the area of fairness-accuracy trade-off optimization scholarship, similar attention is not paid to the normative assumptions that ground this approach. Such assumptions presume that 1) accuracy and fairness are in inherent opposition to one another, 2) strict notions of mathematical equality can adequately model fairness, 3) it is possible to measure the accuracy and fairness of decisions independent from historical context, and 4) collecting more data on marginalized individuals is a reasonable solution to mitigate the effects of the trade-off. We argue that such assumptions, which are often left implicit and unexamined, lead to inconsistent conclusions: While the intended goal of this work may be to improve the fairness of machine learning models, these unexamined, implicit assumptions can in fact result in emergent unfairness. We conclude by suggesting a concrete path forward toward a potential resolution.

algorithmic fairness, machine learning, societal implications of AI
journalyear: 2021; copyright: acmlicensed; conference: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, May 19–21, 2021, Virtual Event, USA; booktitle: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES ’21), May 19–21, 2021, Virtual Event, USA; price: 15.00; doi: 10.1145/3461702.3462519; isbn: 978-1-4503-8473-5/21/05; ccs: Social and professional topics → Computing / technology policy; ccs: Applied computing → Computers in other domains

1. Introduction

Optimization is a problem formulation technique that lies at the core of multiple engineering domains. Given some fixed or limited resource, we can model how best to use it to solve a problem. The optimal solution is the one that either minimizes some cost function or maximizes some utility function—functions that measure how well the model performs on a particular objective. Often there is more than one objective to satisfy simultaneously, and those objectives can be in tension with one another. In this case, it is possible to pose the problem as optimizing a trade-off (Yang, 2010).

For an intuitive example, consider a company that has a fixed amount of steel, which it can use to build cars and planes, which it then sells to earn a profit. The company has to decide how to allocate the steel to maximize that profit and can formulate the decision as an optimization problem. The blue curve in Figure 1 models possible ways to do this optimally; picking a specific point on the curve corresponds to the company’s choice for how to balance the trade-off between how many cars and how many planes to produce.

Figure 1. Illustrating a trade-off. Given a fixed amount of steel, a company can build a combination of cars and planes. To utilize the steel optimally for maximizing profit, it can choose any combination on the blue curve. Any combination above the curve is not possible because there is insufficient steel; any combination below it is suboptimal because the same amount of steel could produce either more cars or more planes.
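To make the steel example concrete, the following is a minimal sketch of the frontier in Figure 1; every number in it (the steel budget, per-unit steel requirements, and per-unit profits) is hypothetical and chosen purely for illustration.

```python
# A minimal sketch of the steel trade-off in Figure 1; all numbers are
# hypothetical and chosen purely for illustration.
import numpy as np

STEEL_BUDGET = 1000.0    # tons of steel available
STEEL_PER_CAR = 1.0      # tons needed per car
STEEL_PER_PLANE = 25.0   # tons needed per plane
PROFIT_CAR, PROFIT_PLANE = 2.0, 60.0  # profit per unit (arbitrary units)

# Trace the frontier: for each choice of planes, the leftover steel
# determines the maximum number of cars (the points on the blue curve).
planes = np.arange(0, int(STEEL_BUDGET / STEEL_PER_PLANE) + 1)
cars = (STEEL_BUDGET - planes * STEEL_PER_PLANE) / STEEL_PER_CAR
profit = PROFIT_CAR * cars + PROFIT_PLANE * planes

# Picking a point on the curve corresponds to choosing how to balance the
# trade-off; here we pick the most profitable one.
best = int(np.argmax(profit))
print(f"{cars[best]:.0f} cars, {planes[best]} planes, profit {profit[best]:.0f}")
```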

Optimization science has informed much of the last decade’s spectacular spate of statistical machine learning (ML) publications. It is at the core of how many ML algorithms learn. For example, learned classifiers use training data examples to fit a curve that optimizes for both classifying those examples correctly and generalizing to new, unclassified examples (Bishop, 1995; Hastie et al., 2009). This process of automating classification decisions has in the past been framed as optimal in another sense; automation brought with it the hope of rooting out the suboptimal whims of human decision-making—of eliminating the ugliest human biases, such as sexism and racism, that afflict high-impact decision processes. Yet, as has been well-documented, this hope cannot be passively realized merely by substituting humans with automated decision agents. Issues with biased data and biased model selection processes can in the worst case magnify, rather than replace, human biases (Abebe et al., 2020; Forde et al., 2021; Selbst et al., 2019).

In response, there has been a widespread push to actively engineer algorithmic fairness. This has become remarkably urgent as automated decision systems are being deployed in domains that have a significant impact on the quality of human life—in deciding judicial bail conditions, loan allocation, college admissions, hiring outcomes, COVID vaccine distribution, and more (Monahan and Skeem, 2016; Barocas and Selbst, 2014; Ajunwa, 2020; Angwin et al., 2016). Models no longer just need to be correct; they also need to be fair.

It has become common to position accuracy in opposition to fairness and to formalize a mathematical trade-off between the two. For example, in the context of criminal justice and bail decisions, the accuracy of decisions has been framed as how to best “maximize public safety,” in contrast to satisfying “formal fairness constraints” that aim to reduce racial disparities in decision outcomes (Corbett-Davies et al., 2017). This kind of problem formulation is the norm in a growing area of research, in which the trade-off between fairness and accuracy in ML is considered “inherent” or “unavoidable.” Prior work suggests various ways of implementing the trade-off: At best, under particular conditions, the tension between the two can be dissolved to achieve both; at worst, fairness is sacrificed in favor of accuracy, while the remaining cases fall somewhere in the middle (Dutta et al., 2020; Chen et al., 2018; Bakker et al., 2019; Menon and Williamson, 2018; Noriega-Campero et al., 2019).

1.1. Our Contribution

Our work looks both at and beyond the fairness-accuracy trade-off, drawing from prior critiques of algorithmic fairness and studies of sociotechnical systems. We examine the choice—and our work here will show that it is a choice, not a requirement—to model assumptions that cast fairness in direct opposition to accuracy. Regardless of the particulars of specific implementations, this framing does not just involve math, but also implicates normative concerns regarding how to value fairness and accuracy both independently and in relation to each other (Flanagan and Nissenbaum, 2014; Friedman and Hendry, 2019).

Our contribution is to extract and explore patterns of these concerns across trade-off scholarship that arise at three different stages: the initial modeling assumption to treat accuracy and fairness together in an optimization problem, the move from abstract framing to concrete problem formulation, and the “optimal solutions” that result from those formulations. More specifically, we examine how the choice to operationalize the relationship between fairness and accuracy using the language of optimization inherently puts the two in conflict, rather than leaving open the legitimate possibility for them to actually be in accord. We discuss how this choice fails to take full account of social criteria when drawing the boundaries of a problem (Selbst et al., 2019), and relates to broader trends of techno-“solutionism,” in which math is (mistakenly) bestowed special authority to “solve” social problems (Abebe et al., 2020).

Beyond these overarching framing assumptions, there are other underlying, unexamined normative assumptions (not just explicit mathematical ones) that take root in how the trade-off is formalized: That strict notions of mathematical equality can model fairness, that it is possible to measure the accuracy and fairness of decisions independent from historical context, and that collecting more data on marginalized individuals—a practice called active fairness—is a reasonable solution to mitigate the effects of the trade-off.

If we take the time to clarify these implicit assumptions, we note that the conclusions that follow can actually perpetuate unfairness: The mathematical proofs may be sound—a particular choice of fairness metric may even be optimized—but the implicit normative assumptions and accompanying broader normative results suggest that these methods will not ensure fairer outcomes in practical applications. In summary, we argue that

  • Using the language of optimization situates fairness and accuracy in intrinsic opposition, generally privileging the latter over the former and necessarily foreclosing the possibility of examining the ways they can instead reinforce each other (Section 3).

  • Underlying mathematical assumptions bring unexamined normative dimensions, which can actually result in emergent unfairness (Section 4).

  • In light of these observations, algorithmic fairness researchers will conduct more robust research if they first clarify their normative assumptions (Section 5).

  • Due to the extent of emergent unfairness from conceiving of fairness and accuracy in trade-off, it is worth revisiting this formulation altogether (Section 6).

2. The Fairness-Accuracy Trade-Off

We begin by providing the background necessary for understanding the problem formulation of the fairness-accuracy trade-off. Before clarifying what the trade-off actually characterizes, we address each component in turn, summarizing common quantifiable metrics that ML researchers map to the values of “accuracy” and “fairness.”

2.1. Accuracy Metrics

In brief, accuracy measures how often a ML model correctly predicts or infers a decision outcome after training. So, to understand accuracy, we need to understand how ML models are trained. For the classification problems that dominate much of the fairness literature, training the model usually entails fitting a curve to a set of training data points for which classification labels are already known. After training, when supplied with a previously-unseen data point, the model can infer the classification label for that data point. For example, in building a model that infers whether or not to grant an applicant a loan, the curve-fitting training process occurs with past data concerning loan-granting decisions; inference corresponds to the model receiving a new loan applicant’s data and classifying whether that applicant should receive a loan or not—a decision ultimately corresponding to whether or not the loan-granting institution believes the applicant will repay or default on the loan.

A model’s accuracy tends to be measured during a validation process in between training and inference. Rather than using all labeled data for training, researchers reserve a portion to validate how well the trained model classifies unseen data points with known classification labels. In other words, accuracy is often a measure of label alignment: It is the percentage of correctly classified validation data points, where correctness is determined by whether the model’s classification decision matches the known label. There are other metrics that researchers use, such as Chernoff information (Dutta et al., 2020); however, label alignment is a popular accuracy metric, in part due to its simplicity.
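As a minimal sketch of accuracy as label alignment, the snippet below holds out a validation split and reports the percentage of validation labels a trained classifier reproduces; the synthetic dataset and the choice of logistic regression are placeholders, not a reflection of any particular trade-off paper.

```python
# A minimal sketch of accuracy as label alignment: the fraction of held-out
# validation points whose predicted label matches the known label.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a labeled dataset (placeholder for real data).
X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=0)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
val_predictions = model.predict(X_val)

# Accuracy = percentage of validation labels the model reproduces.
print(f"Validation accuracy: {accuracy_score(y_val, val_predictions):.2%}")
```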

This simplicity can be misleading, both in terms of what the math is actually measuring and the normative implications of that measurement. The broader algorithmic fairness community (and corresponding community of critics) has paid ample attention to this issue in relation to fairness (Binns, 2018; Selbst et al., 2019; Powles and Nissenbaum, 2018; Abdurahman, 2019); however, in fairness-accuracy trade-off literature, where accuracy is also explicitly centered as a value, parallel analyses of accuracy have been relatively sparse. We examine this in Section 4; for now, we emphasize that something as simple as a percentage metric can raise normative concerns (Forde et al., 2021).

One can see this from work in the broader ML community, in which accuracy issues often get cast as a problem of label bias: The classification labels in the training and validation data can be incorrect, in terms of some abstract notion of “ground truth.”¹ [¹ The implications of “true” classification, including the simplifying assumptions that inform such classification, are out of scope for our purposes. We refer the reader to the rich literatures in sociology and science & technology studies on categorization and classification, notably (Velocci, 2021; Bowker and Star, 1999).] As an innocuous example, consider a labeled image dataset of dogs and cats. The individual who labeled the dataset incorrectly (though perhaps understandably) marked Pomeranians as cats. This mis-labeling in turn leads the learned ML model to mistakenly identify Pomeranians as cats.

The results of mis-labeling can be devastating for applications that impact human lives. Consider again an automated decision system that grants and denies loan applications. In the US, systemic racism against Black loan applicants, specifically manifested in the practice of redlining, has entailed denying loans to qualified Black applicants. These applicants, in terms of “ground truth,” should have been granted loans, but were instead intentionally marked as likely defaulters. As with the example above, this mis-labeling would lead to inaccurate classification. The models trained on that data would mistakenly identify Black non-defaulters as defaulters—a mis-classification that could be used to wrongfully deny a loan.² [² We have simplified this example, which also involves using zip code as a proxy for race, to make our point that mis-labeled data impacts the accuracy of trained models.] Unlike the former example, this one involves intentional mis-labeling; however, even if the mis-labeling were unintentional, it would have the same effect on classification decisions. Regardless of intention, the learned model would systematically, incorrectly classify Black individuals as defaulters.

2.2. Fairness Metrics

Algorithmic fairness has dozens of mathematical definitions that can inform optimization problem formulation.³ [³ Similar to our discussion of accuracy and classification, our work will not focus on the normative limits of defining fairness mathematically, such as the challenges of formulating fairness problems that account for intersectional protected identities (Selbst et al., 2019; Chen et al., 2018; Dwork et al., 2018; Buolamwini and Gebru, 2018; Kroll et al., 2017; Hoffmann, 2019) and how many fairness definitions reflect what we ought to believe instead of (the arguably more useful, from a policy perspective) what we ought to do (Hellman, 2020b).] All definitions, regardless of the specifics, involve some treatment of protected attributes, such as race and sex, along which decision outcomes can be evaluated for “fair” treatment. Broadly speaking, there are two families of fairness metrics: Those that measure individual-focused fairness and those that evaluate it in terms of groups defined by protected attributes.

Individual fairness, as the name suggests, centers analyzing automated decisions in terms of the individual (Dwork et al., 2012; Joseph et al., 2016; Bakker et al., 2019). In contrast, group fairness centers demographic group membership and typically aims to ensure that membership in a protected class does not correlate with decision outcomes (Dwork et al., 2018; Chen et al., 2018). A particularly popular metric is Hardt et al. (2016)’s formulation of equality of opportunity, which in essence only requires that there is no discrimination based on demographics among those who truly belong to the positive class. For the example of granting loans, paying back the loan is the positive class and defaulting is the negative class. Equality of opportunity corresponds to making sure that the rate of mistakenly classifying non-defaulters as defaulters is similar across demographic groups. We discuss this further in Section 4; for now, it is important to note that this formulation is inextricably tied to accuracy. Training a model to reduce mistaken negative classification directly depends on training data that contains negative labels. Those labels, however, may not align with what is correct in terms of “ground truth”—e.g., Black individuals erroneously marked as defaulters.
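For concreteness, the following is a minimal sketch of how an equality-of-opportunity check might be computed from predictions, labels, and a group attribute; the arrays are invented toy values, and, per the caveat above, the “true” labels here inherit whatever bias sits in the recorded data.

```python
# A minimal sketch of an equality-of-opportunity check for the loan example:
# compare false negative rates (non-defaulters wrongly classified as
# defaulters) across demographic groups. The arrays are invented toy values.
import numpy as np

y_true = np.array([1, 1, 1, 1, 0, 0, 1, 1, 1, 0, 0, 1])  # 1 = repaid the loan
y_pred = np.array([1, 0, 1, 1, 0, 0, 1, 0, 0, 0, 1, 1])  # model's decisions
group = np.array(["A"] * 6 + ["B"] * 6)                   # protected attribute

def false_negative_rate(y_true, y_pred):
    """Fraction of actual repayers the model marks as defaulters."""
    repayers = y_true == 1
    return float(np.mean(y_pred[repayers] == 0)) if repayers.any() else float("nan")

for g in np.unique(group):
    mask = group == g
    print(f"Group {g}: FNR = {false_negative_rate(y_true[mask], y_pred[mask]):.2f}")
# Equality of opportunity asks that these per-group rates be (nearly) equal.
```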

Lastly, it is also worth noting that multiple different mathematical notions of fairness cannot be satisfied simultaneously. This incompatibility has been formalized as impossibility results (Kleinberg et al., 2017; Chouldechova, 2017) and has placed a more significant emphasis on how computer scientists choose which fairness metric to study (Friedler et al., 2016). While these findings are extremely well-cited, they are not surprising when considering fairness beyond its mathematical definition as a metric. In a pluralistic world, values like fairness depend on the time and place in which they are defined; outside of math, different, incompatible definitions can hold simultaneously, depending on context (Berlin, 2013).

2.3. The “Inherent” Trade-Off

Algorithmic fairness tends to pose accuracy and fairness in an “inherent” or “unavoidable” trade-off: An increase in fairness necessarily comes with a decrease in accuracy; increasing accuracy necessarily decreases fairness (Chen et al., 2018; Menon and Williamson, 2018; Bakker et al., 2019; Corbett-Davies et al., 2017; Dwork et al., 2018; Sabato and Yom-Tov, 2020; Zhao and Gordon, 2019).

How did this trade-off problem formulation come about? Much of the literature that engages this trade-off does so empirically; that is, the authors perform or cite experiments that convey the intuition that a trade-off exists, so they decide that a mathematical trade-off is an appropriate way to model the problem. However, most do not characterize or quantify the trade-off theoretically. The work that has attempted theoretical treatment suggests that—at least in theory—the existence of the fairness-accuracy trade-off can be less rigidly described (Dutta et al., 2020; Wick et al., 2019). Nevertheless, the general practice in the field is to tacitly accept the trade-off as fact, regardless of the particular fairness and accuracy metrics under consideration.

Computer scientists further observe that the ramifications of this trade-off, particularly in high-stakes domains, can be significant. As a result, they sometimes wade into the murkiness of how to optimize the trade-off implementation in specific “sensitive” applications. For example, several computer scientists have noted that in areas like healthcare, trade-off implementations should favor accuracy, as privileging fairness can have “devastating” consequences, such as missing cancer diagnoses at higher rates (Chen et al., 2018; Srivastava et al., 2019).

Other researchers have posited why this trade-off exists in the first place. For example, one popular explanation comes from Friedler et al. (2016). The authors of this paper reason that there is an “abstract construct space” that represents the features we actually want to measure but cannot observe (e.g., intelligence). Instead, we see features in the “observed space” of the actual world (e.g., SAT score), and there is a mapping from features in the construct space to features in the observed space (e.g., SAT score is the mapped feature in the observed space, standing in place for intelligence in the construct space). According to Friedler et al. (2016), the trade-off between classification accuracy and fairness exists in the real world due to “noisier mappings” for less privileged groups from the construct space to the observed space. They contend that this noise comes from historic differences, particularly in opportunity and representation, which makes positive and negative data points less distinguishable (in comparison to privileged groups) for the learned classifier. It is this decrease in separability that leads to less fair classification for less privileged groups. In the example of SATs, this means that the scores are less reliable in terms of conveying information about intelligence for underprivileged groups. While this posited explanation may seem reasonable, work that engages with it rarely (if ever) supports the explanation with data, as it is usually not the specific fairness problem under mathematical consideration (Dutta et al., 2020).
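The separability argument can be illustrated with a small simulation; everything in it (the latent “construct” score, the group-dependent noise levels, and the classifier) is invented solely to illustrate the posited mechanism, not to validate it with real data.

```python
# A toy simulation of the "noisier mapping" explanation: the same latent
# construct is observed through more noise for one group, which reduces
# separability and, in turn, accuracy for that group. All quantities here
# are invented solely to illustrate the posited mechanism.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5000

construct = rng.normal(0.0, 1.0, size=2 * n)      # unobservable "construct space" feature
y = (construct > 0).astype(int)                   # label defined by the construct

group = np.array([0] * n + [1] * n)               # 0 = privileged, 1 = less privileged
noise_sd = np.where(group == 0, 0.3, 1.5)         # noisier mapping for group 1
observed = construct + rng.normal(0.0, noise_sd)  # "observed space" feature

model = LogisticRegression().fit(observed.reshape(-1, 1), y)
pred = model.predict(observed.reshape(-1, 1))

for g in (0, 1):
    acc = np.mean(pred[group == g] == y[group == g])
    print(f"Group {g}: accuracy with respect to the construct-defined label = {acc:.2f}")
```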

3. Applying a Sociotechnical Lens

In the field of Science and Technology Studies, the term “sociotechnical” is used to signal the impossibility of understanding the social world separately from the technological, and vice versa (Bijker et al., 1987; Hughes, 1993; Hecht, 2002; Wajcman, 2004). This means that, among other things, technologies like algorithms are never separate from the systems, institutions, ideologies, cultures, and societies in which they are embedded. It also means that the choice to formulate algorithmic fairness as an optimization problem produces a particular kind of knowledge about fairness that cannot be detached from its broader social context. In our critiques of emergent unfairness in Section 4, we explicitly examine the fairness-accuracy trade-off model as a sociotechnical system.

We are not the first to examine algorithmic fairness using the concept of a sociotechnical system. For example, Selbst et al. (2019) use the concept of sociotechnical systems to draw attention to five pitfalls that imperil well-meaning data scientific approaches to fairness. One that they identify is called the “Framing Trap,” which the authors describe as the “failure to model the entire system over which a social criterion, such as fairness, will be enforced.” A sociotechnical lens, they argue, might suggest new ways for researchers to draw the boundaries of fairness problems to include social relations and dynamics that may have otherwise been excluded.

Defining fairness and accuracy in trade-off both exemplifies falling into the Framing Trap and presents an additional set of nuanced consequences. For example, defining trade-offs along a curve (Figure 1) necessarily imposes a give-and-take relationship between the factors under consideration, as those factors are cast in opposition to one another. In the fairness-accuracy trade-off, fairness and accuracy are framed as inherently competing goals, in which we must give up some of one in order to gain some of the other (even if “some” cannot be quantified definitively). Given this framing, it is consistent that much of the literature uses language that describes the “cost of fairness” (Chen et al., 2018; Menon and Williamson, 2018; Corbett-Davies et al., 2017; Dutta et al., 2020), depending on where on the trade-off optimization curve a particular implementation lies. This cost, however, cuts both ways: It is similarly reasonable to talk about the “cost of accuracy.” Yet, with few exceptions (Corbett-Davies et al., 2017), the literature in this area chooses not to discuss costs in this way. This decision is perhaps due to the tendency within the field of ML more broadly to privilege accuracy during model design; nonetheless, this framing shifts the burden of defensibility to fairness, in the sense that it implies that fairness’s “costs” require justification.

The particulars of the trade-off choice present additional complications. Beyond failing “to model the entire system,” the fairness-accuracy trade-off formulation also forecloses the very reasonable possibility that accuracy is generally in accord with fairness (unless one specifies particular conditions under which it is possible to demonstrate that the trade-off does not exist (Dutta et al., 2020; Wick et al., 2019)). In other words, a trade-off model conceals the idea that the accurate thing to do could be complementary with the fair thing to do. For example, as Hellman (2020b) notes, it is possible to view accuracy and fairness as complementary values, where the former reflects what one “ought to believe” and the latter reflects what one “ought to do.” In Section 4, we expand on how fairness and accuracy can be considered in accord (Friedman and Hendry, 2019; Flanagan et al., 2008). Additionally, we suggest that the boundaries of the trade-off problem need to be redrawn to account for social and technical considerations in historical context. A suitable frame for fairness research can neither be blind to historical context nor ignore the future.

Lastly, our analysis demonstrates some of the challenges of using trade-off and optimization tools in algorithmic fairness research. Borrowing tools from adjacent fields in computer science not only affects the results of fairness research, but it also helps to characterize the disciplinary status of fairness research itself. In this case, the operationalization of fairness as a mathematical problem helps situate questions of fairness within the realm of “science,” thus conferring a particular legitimacy that science connotes (Gieryn, 1983; Porter, 1995). Situating the fairness question as a scientific question and ensconcing it in the language of trade-offs and optimization suggests that it is reasonable to try to solve for an “optimal,” best answer. Framing the problem as a trade-off problem to be “solved” using math falls under the highly-critiqued practice of technological “solutionism” (Selbst et al., 2019; Abebe et al., 2020)—the notion that technology is uniquely capable of solving social problems. This tendency toward solutionism grants special legitimacy to algorithmic fairness research, legitimacy absent in other fields that have tackled but have not “solved” the fairness problem from a social perspective. We contend that it is unlikely that the same ideas used to solve problems of steel allocation (Figure 1) will transfer without issue to questions of fair hiring practices. We suggest that attending to the sociotechnical context of each situation may help prevent the emergent unfairness we identify in the following sections.

4. Emergent Unfairness

Choosing to convey fairness and accuracy as a trade-off is a mathematical modeling assumption. As we discussed in Section 2, researchers in this area observe a pattern in their empirical results concerning accuracy and fairness, and deem a trade-off a useful way to formulate the mathematical problem of characterizing the relationship between the two. As we suggest above, fairness and accuracy metrics have normative dimensions; so, too, does the modeling assumption that poses them in trade-off.

There are also numerous other mathematical assumptions, each of which carries its own implicit normative dimensions. We observe that, based on these implicit assumptions, fairness-accuracy trade-off scholarship is plagued with gaps and oversights. These issues can lead to conclusions that actually perpetuate unfairness. Each paper in fairness-accuracy trade-off scholarship makes dozens of particular assumptions, and it is not possible to be exhaustive regarding each mathematical assumption’s corresponding normative assumptions. Instead, we isolate three patterns of implicit, unexamined assumptions in the discipline, and the emergent unfairness that can result: Unfairness from assuming 1) strict notions of equality can substitute for fairness, 2) historical context is irrelevant when formulating the trade-off, and 3) collecting more data on marginalized groups is a reasonable mechanism for alleviating the trade-off.

4.1. Unfairness from Assuming Fairness = Equality

One assumption prevalent in fairness-accuracy trade-off literature concerns how different papers choose to measure fairness. Most of the work in this subfield relies on parity-based definitions. Algorithmic fairness definitions like this effectively make the modeling assumption to represent “fairness” as “equality.”⁴ [⁴ While such equality-based notions of fairness dominate the literature more generally, not just concerning the fairness-accuracy trade-off, there are a growing number of exceptions. For example, some recent work frames fairness in terms of Rawlsian social welfare (Rawls, 1971; Heidari et al., 2018; Joseph et al., 2016).] To be clear, we mean “equality” in the strict sense of the values of metrics being as equal as possible, by minimizing some form of measured inequality. This kind of strict equality can stand in for “fairness” in terms of what is actually being modeled; fairness is being framed as a problem of strict equality. This is easily discernible in the popular equality of opportunity metric (Section 2), used in numerous trade-off papers (Chen et al., 2018; Dutta et al., 2020; Bakker et al., 2019; Noriega-Campero et al., 2019), which tries to minimize discrepancies in false negative classification decisions among different demographic groups; it literally tries to make those rates as equal as possible.

However, what is fair and what is equal are not always the same thing, and framing them as equivalent can actually lead to unfair outcomes (Kasy and Abebe, 2021). For example, when addressing historic or systemic inequity, it can be necessary to take corrective or reparative action for some demographic groups in order to create the conditions of more-equal footing. Such actions necessarily diverge from equality in the strict mathematical sense, so strictly parity-based fairness metrics cannot capture this kind of nuance.

The ongoing debate in the United States around the fairness of affirmative action policy can help illustrate this distinction, as well as the complications that arise when defining fairness as equality. In brief, affirmative action is a social policy aimed at increasing the representation of historically marginalized groups in university student and workforce populations; it attempts to implement a fairer playing field by providing individuals from marginalized backgrounds with the chance to have the same opportunities as those from more privileged backgrounds.

While affirmative action has existed as official policy in the US for decades (Kennedy, 1961), it is extremely contentious. Many Americans, who do not feel personally responsible for systemic discrimination against BIPOC⁵ populations, feel that affirmative action puts them at a disadvantage. [⁵ An acronym for “Black, Indigenous, and People of Color,” used particularly in the US and Canada.] They claim that the policy is unfair, and in fact is responsible for “reverse discrimination” (Budryk, 2020; Newkirk, 2017; Pham, 2018). This belief comes in part from the idea that affirmative action does not lead to “equal” comparisons in the strictest sense—comparing SAT scores or GPAs point for point. Instead, in the language of Friedler et al. (2016), one could say that affirmative action attempts to repair or normalize for the “noisy mappings” that these scores convey for unprivileged populations in order to promote fairer outcomes.⁶ [⁶ It is also interesting to note that this controversy has found its way into the language of algorithmic fairness literature. Dwork et al. (2012); Dwork et al. (2018) use the term “fair affirmative action” in their work; they seem to be attempting to distinguish their notion from some imagined, alternative, unfair variant. This term is arguably redundant, since affirmative action is fundamentally about trying to promote fairer outcomes, even if that notion of fairness does not align with strict-equality-based notions in algorithmic fairness.]

In short, the goal of affirmative action illustrates how notions of fairness and equality can diverge. The policy’s existence is predicated on the notion that strictly equal treatment, without attending to past inequity, can potentially perpetuate unfairness.

4.2. Unfairness from Assuming the Irrelevance of Context

Fundamentally, the issue with the assumption that strict notions of equality can stand in for fairness has to do with how the assumption treats—or rather discounts—context. Fairness metrics like equality of opportunity are only able to evaluate the local, immediate decision under consideration. As discussed above using the example of affirmative action, this type of equality cannot accommodate reparative interventions that attempt to correct for past inequity. This similarly implicates issues with how we measure accuracy, since such metrics measure the correctness of current classification decisions in relation to past ones. We next examine this issue, as it presents fundamental contradictions in the formulation of the fairness-accuracy trade-off problem.

4.2.1. Ignoring the Past

As discussed in Section 1, optimization involves minimizing a loss function. In statistical terminology, minimizing the expected loss depends on the true class labels (Bishop, 1995). In fairness-related application domains we rarely, if ever, have access to true class labels. To return to an earlier example, Black people have systematically been denied loans in the US due to their race. In many cases, while a Black person’s “true” label should be that they would not default on a loan, past loan-granting decisions (and therefore the corresponding data) mark them as a defaulter. This captures the problem of label bias: Misalignment between the “ground truth” label and the actual, observed label in real world data (Section 2). In a sense, this bias is what has motivated the entire field of algorithmic fairness in the first place: Automated decision systems that do not account for systemic discrimination in training data end up magnifying that discrimination (Barocas et al., 2018; Abebe et al., 2020); to avoid this, such systems need to be proactive about being fair.
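To make explicit why minimizing expected loss depends on the true class labels, recall the standard decision-theoretic expression (following, e.g., Bishop, 1995); the notation below is the textbook one, not notation taken from the trade-off papers discussed here:

\[
\mathbb{E}[L] \;=\; \sum_{k}\sum_{j} \int_{\mathcal{R}_j} L_{kj}\, p(\mathbf{x}, \mathcal{C}_k)\,\mathrm{d}\mathbf{x},
\]

where \(\mathcal{R}_j\) is the region of input space assigned to class \(\mathcal{C}_j\) and \(L_{kj}\) is the loss incurred for deciding \(\mathcal{C}_j\) when the true class is \(\mathcal{C}_k\). The joint density \(p(\mathbf{x}, \mathcal{C}_k)\) is defined over the true classes; when recorded labels diverge from those true classes, any empirical estimate of this quantity, and hence of accuracy, inherits that divergence.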

Label bias presents an inherent issue with how we measure accuracy: If labels are wrong, particularly for individuals in the groups for which we want to increase fairer outcomes, then there are cases where misclassification is in fact the correct thing to do. In other words, how we measure accuracy is not truly accurate.

Yet, in the fairness-accuracy trade-off literature, it is very common to assume label bias can be ignored. Much of the work in this space does not mention label bias at all, or claims that it is out of scope for the research problem under consideration (Chen et al., 2018). This presents a contradiction: Simultaneously acknowledging that labels in the observed space are noisy representations of the ground truth (i.e., there is bias in the labels), but then explicitly assuming those labels in the training data (i.e., the observed space labels) are the same as the true labels (Dutta et al., 2020).⁷ [⁷ Wick et al. (2019) is a notable exception, acknowledging this contradiction in stark terms: “…there is a pernicious modeling-evaluating dualism bedeviling fair machine learning in which phenomena such as label bias are appropriately acknowledged as a source of unfairness when designing fair models, only to be tacitly abandoned when evaluating them.”] In other words, because the labels are biased, the corresponding accuracy measurements that depend on them are also biased; unfairness from the past produces inaccuracy (Hellman, 2020a), which this work explicitly ignores in its trade-off formulation.

If accuracy measurements are conditioned on past unfairness, what is the trade-off between fairness and accuracy actually measuring? What does it mean to “increase” or “decrease accuracy” in this context? If accuracy measurements encode past unfairness for unprivileged groups, the fairness-accuracy trade-off is effectively positioning fairness in trade-off with unfairness, which is tautological. Giving validity to an accuracy metric that has a dependency on past unfairness inherently advantages privileged groups; it is aligned with maintaining the status quo, as there is no way to splice out the past unfairness on which it is conditioned. In the words of Hellman (2020b), this can lead to even more unfair outcomes via “compounding injustice.” To the best of our knowledge, no prior work has explicitly attempted to avoid this scenario—to model and extract this past unfairness.

4.2.2. Being Blind to the Future

Similarly, studying specific, local classification decisions in terms of balancing fairness and accuracy does not provide insight about the more global, long-term effects that such decisions potentially have. This also presents a contradiction: Some scholarship concerning the trade-off explicitly aims to support the goals of policymakers, but policymaking by its very nature takes a long-tailed view. Current policy interventions do not just have a local impact, but rather also have desired cascading effects that carry into the future.

Ironically, this contradiction is clearly spelled out in some of the trade-off literature as an intentional assumption. For example, Corbett-Davies et al. (2017) explicitly state: “Among the rules that satisfy a chosen fairness criterion, we assume policymakers would prefer the one that maximizes immediate utility.” They intentionally examine the “proximate costs and benefits” of the fairness-accuracy trade-off, assuming that this is the temporal resolution that would be most useful to policymakers. This approach enables simplifying mathematical assumptions, as it does not require evaluating how the specific automated decision under consideration has potential ramifications in the future. In examining risk-assessment decisions for granting bail, Corbett-Davies et al. (2017) specifically do not need to look at how the immediate decision to detain someone may in fact be predictive of (even causally linked to) future arrests. However, if such decisions are applied unfairly across racial demographic groups (even if somewhere slightly “fairer” on the optimization curve), then they would just repeat patterns of bias existing in past data.

4.3. Unfairness of “Active Fairness” Trade-Off Remedies

Some work regarding the fairness-accuracy trade-off goes beyond observing, characterizing, or implementing the trade-off for different applications, as we have discussed above. These authors agree that the trade-off is inherent, but note that its effects can perhaps be mitigated by increasing both accuracy and fairness—essentially, moving the trade-off optimization curve up and to the right (Figure 1). The trade-off still exists in this scenario, but is perhaps less of an issue, since the models perform better overall in terms of both accuracy and fairness.

Concretely, authors recommend a technique they call active feature acquisition or active fairness (Noriega-Campero et al., 2019), which promotes the idea that “data collection is often a means to reduce discrimination without sacrificing accuracy” (Chen et al., 2018)—that collecting more features for the unprivileged group will help ensure fairer outcomes (Dutta et al., 2020; Bakker et al., 2019). The rationale is that additional feature collection alleviates the bias in the existing data for unprivileged groups, which will result in reduced bias in the classification results for those groups. Moreover, the authors note that gathering more features for the unprivileged group leads to these benefits without impacting the privileged group; the privileged group’s accuracy and fairness metrics remain unchanged.

Setting aside that it might not even be possible to collect more features in practice, there are important implicit assumptions in this choice of solution. In particular, it seems like this work poses data collection as a value-neutral solution to ensure greater fairness. This assumption is clearly false. It is widely accepted, particularly in sociotechnical literature, that data collection is often a form of surveillance (Zuboff, 2018; Clarke, 1994; Brayne, 2017; Cohen, 2011). It is not a neutral act, and is generally not equally applied across demographic groups in the US.

The choice to collect more data raises a normative question directly in contradiction with their goal for increased fairness for unprivileged groups: Do we really want to collect more data on unprivileged groups—groups that already tend to be surveilled at higher rates than those with more privilege? In the US in particular there is a long history of tracking non-white and queer individuals: Black Americans, from Martin Luther King to Black Lives Matter activists; Japanese Americans during World War II; non-white Muslims, particularly since 9/11; Latine individuals in relation to immigration status; trans, particularly trans-feminine, people in sports and bathroom use (Bedoya, 2016; DeSilver et al., 2020; Speri, 2019; Painter, 2011; Conrad, 2009; of Representatives, 2021). In a more global treatment of fairness, is it fair to collect more data on these populations just to ensure we are optimizing some local fairness metric?

One could make the argument that machine learning, broadly speaking, pushes toward greater surveillance. The field is pushing to train larger and larger models, which tend to require training on larger and larger datasets (Kaplan et al., 2020). These data-hungry methods in turn push for greater data collection and surveillance in general (Zuboff, 2018). Yet, the proposed techniques in active fairness to alleviate the fairness-accuracy trade-off stand apart: They specifically advocate for increasing data collection of already-surveilled groups. They tend to leave the data for the privileged group untouched in order to decrease classification disparities between groups. In essence, this unfairly shifts the burden of producing fair classification results to the unprivileged group, affording the additional privilege (i.e., even less relative surveillance) to the already privileged group. Put another way, their solution to the fairness-accuracy optimization problem introduces another, unexplored objective function—an objective function concerning the burden of surveillance, whose solution in this case causes residual unfairness for the marginalized group.

Some work in active fairness does acknowledge that additional data collection is not costless; however, this work often discusses it as a necessary cost for increased fairness rather than investigating it as a potential source for increased unfairness (Chen et al., 2018). Noriega-Campero et al. (2019) states that it would be useful to model the cost of each feature in the dataset, where costs implicate monetary, privacy, and opportunity concerns. Beyond noting this idea, they do not attempt to formalize it in their work.

Bakker et al. (2019) goes a step further by including cost in their problem formulation, associating a vector of costs with each feature. However, it is unclear how they pick the values of those costs and, perhaps more importantly, they make the assumption that the vector of costs is the same for each individual in the population. This assumes that different values for different features do not incur different social impacts, which is demonstrably not the case. For example, consider individuals of transgender identity: Trans people, in comparison to cis people, face significant discrimination in response to disclosing their identity. In the language of Bakker et al. (2019), it is more costly for trans people to have features about gender identity collected than it is for cis people. In fact, such disparate costs can be thought of as the basis for needing to acknowledge and do policymaking using protected demographic attributes in the first place.

5. Toward a Resolution

5.1. Making Normative Assumptions Explicit

Writing mathematical proofs requires assumptions. For example, in machine learning, when writing proofs about an algorithm’s properties, it is common to assume that the function being optimized is convex. Assumptions like this enable us to guarantee certain logical conclusions about an algorithm’s behavior, such as bounds on its convergence rate. While fairness-accuracy trade-off researchers are accustomed to stating mathematical assumptions like this, and ensuring that sound mathematical conclusions follow, we have shown that they do not pay similar attention to normative assumptions and their ensuing contradictory conclusions. We contend that researchers should take the time to make explicit such assumptions underlying their work. Being rigorous and clear about normative assumptions enables them to be reviewed just as rigorously as mathematical assumptions.

We do not suggest that making such assumptions explicit is a sufficient solution on its own. Nevertheless, it would still help facilitate greater scrutiny about the appropriateness of proposed algorithmic fairness solutions. For example, as we discussed in Section 3, this would allow for considering that fairness and accuracy could in fact be in accord (Hellman, 2020b; Selbst et al., 2019). ML researchers should engage the assistance of social scientists if they believe they lack the expertise to do this work independently. Moreover, this process should facilitate researchers being introspective about how their individual backgrounds might inform the assumptions they bring into their work. This would be one necessary (though not on its own sufficient) way to address critiques that fairness research is dominated by white voices (Abdurahman, 2019).

Moreover, clarifying implicit normative assumptions could facilitate rethinking how we measure accuracy. As we note in Section 4, common accuracy metrics are tied to unfairness. In an attempt to decouple accuracy from unfairness, one could, for example, put a Bayesian prior on existing unfairness and try to correct for it. To the best of our knowledge, no algorithmic fairness scholarship has attempted to do this: to explicitly assume and model the existing unfairness due to a history of discrimination against certain demographics.⁸ [⁸ Such modeling, of course, would not come without concern, as it would require introducing a different set of mathematical modeling assumptions that carry their own normative implications.]
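As one deliberately toy sketch of what such modeling might look like (it is not drawn from any existing fairness paper, and it corrects only an aggregate rate rather than individual labels), one could place a Beta prior on a hypothetical group-specific “flip rate,” the probability that a creditworthy member of the group was recorded as a defaulter, update it with a small hypothetical audit, and use the posterior to adjust the observed default rate.

```python
# A deliberately toy sketch of placing a prior on historical label bias.
# It assumes (hypothetically) that a small audit reveals how often
# creditworthy members of one group were recorded as defaulters; a Beta
# prior on that "flip rate" is updated with the audit and used to adjust
# the group's apparent default rate. Not drawn from any existing fairness paper.
import numpy as np

# Hypothetical audit: of 50 people recorded as defaulters, 18 were found
# on review to have been creditworthy (i.e., their labels were "flipped").
audited_recorded_defaulters = 50
audited_flips = 18

# Beta(1, 1) prior on the flip rate, updated via the conjugate Beta-Binomial rule.
alpha = 1 + audited_flips
beta = 1 + (audited_recorded_defaulters - audited_flips)
posterior_flip_rate = np.random.default_rng(0).beta(alpha, beta, size=10_000)

# Observed (biased) default rate for the group in the full historical data.
observed_default_rate = 0.40

# Adjusted rate: remove the estimated share of wrongly recorded defaulters.
adjusted_default_rate = observed_default_rate * (1 - posterior_flip_rate)

print(f"Posterior mean flip rate: {posterior_flip_rate.mean():.2f}")
print(f"Adjusted default rate: {adjusted_default_rate.mean():.2f} "
      f"(vs. observed {observed_default_rate:.2f})")
```

Whether such an audit is feasible, and whether a rate-level correction is meaningful for individual decisions, are among the concerns the footnote above flags.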

5.2. Tweaking Normative Assumptions for Robustness

Making normative assumptions explicit could also help facilitate more robust ML fairness research. When investigating algorithmic robustness, researchers are generally comfortable with relaxing or changing certain mathematical proof assumptions and reasoning out the resulting changes (or stasis) in algorithm behavior. As the economist Edward Leamer (1983) notes:

…an inference is not believable if it is fragile, if it can be reversed by minor changes in assumptions. … A researcher has to decide which assumptions or which sets of alternative assumptions are worth reporting.

As a test of normative robustness, we similarly recommend that fairness-accuracy trade-off researchers perturb their normative assumptions and investigate how this may alter normative outcomes. For example, when considering surveillance of the unprivileged via active feature acquisition as an appropriate mechanism for alleviating the trade-off, it would be useful to state this as an explicit assumption, and then consider surveillance as a constraint for the problem. In other words, one could ask, how much surveillance is tolerable for increased fairness? Perhaps none, but perhaps there is a small set of high quality features that could be collected to serve this purpose, rather than just indiscriminately collecting a lot of additional features.
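As a purely illustrative sketch of “surveillance as a constraint” (the symbols below, a per-feature surveillance cost \(c_i\), a budget \(B\), and a minimum accuracy \(a_{\min}\), are hypothetical placeholders rather than a formulation from the literature), one could write:

\[
\max_{\theta,\, S} \; \mathrm{Fairness}(\theta, S)
\quad \text{subject to} \quad
\mathrm{Accuracy}(\theta, S) \ge a_{\min},
\qquad
\sum_{i \in S} c_i \le B,
\]

where \(S\) is the set of additionally acquired features and \(\theta\) the model parameters. Setting \(B = 0\) encodes the position that no additional surveillance is tolerable; a small \(B\) spent on a few carefully chosen, high-quality features corresponds to the alternative raised above.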

6. Conclusion: Reconsidering the Fairness-Accuracy Trade-off

As Passi and Barocas (2019) note, “Whether we consider a data science project fair often has as much to do with the formulation of the problem as any property of the resulting model.” Furthermore, the work of problem formulation is “rarely worked out with explicit normative considerations in mind” (Passi and Barocas, 2019). As we have shown in this article, not attending explicitly to these considerations can lead to contradictory, unintended results: Formulating a trade-off between fairness and accuracy involves a variety of normative assumptions that can in fact lead to various forms of emergent unfairness.

Our recommendations in Section 5 to make normative assumptions explicit aim to remedy this particular type of unfair outcome. However, we recognize that in the case of the fairness-accuracy trade-off, fully applying these recommendations may not be sufficient in itself. Rather, in clarifying the normative concerns of the trade-off, it is quite possible to reasonably conclude that the effects of emergent unfairness outweigh any benefits that come from choosing this particular problem formulation. As such, when it comes to the critical issue of algorithmic fairness, it may be time to reconsider the framing of trade-offs altogether.

Acknowledgements.
Thank you to our anonymous reviewers for their valuable feedback. Additionally, we would like to thank the following individuals for feedback on earlier drafts and iterations of this work: Professor Rediet Abebe, Bilan A.H. Ali, Harry Auster, Kate Donahue, Professor Chris De Sa, Jessica Zosa Forde, Professor Deborah Hellman, and Kweku Kwegyir-Aggrey. We would also like to thank the Artificial Intelligence Policy and Practice initiative at Cornell University, the John D. and Catherine T. MacArthur Foundation, and the Cornell Humanities Scholars Program.

References

  • Abdurahman (2019) J. Khadijah Abdurahman. 2019. A Response to Racial Categories of Machine Learning by Sebastian Benthall and Bruce Haynes. https://medium.com/@blacksirenradio/fat-be-wilin-deb56bf92539
  • Abebe et al. (2020) Rediet Abebe, Solon Barocas, Jon Kleinberg, Karen Levy, Manish Raghavan, and David G. Robinson. 2020. Roles for Computing in Social Change. In Proceedings of the Conference on Fairness, Accountability, and Transparency (Barcelona, Spain) (FAT* ’20). Association for Computing Machinery, New York, NY, USA, 252–260.
  • Ajunwa (2020) Ifeoma Ajunwa. 2020. The Paradox of Automation as Anti-Bias Intervention. Cardozo Law Review 41, 1671 (2020).
  • Angwin et al. (2016) Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. 2016. Machine Bias. https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing.
  • Bakker et al. (2019) Michael A. Bakker, Alejandro Noriega-Campero, Duy Patrick Tu, Prasanna Sattigeri, Kush R. Varshney, and Alex ’Sandy’ Pentland. 2019. On Fairness in Budget-Constrained Decision Making. KDD Workshop on Explainable Artificial Intelligence (2019).
  • Barocas et al. (2018) Solon Barocas, Moritz Hardt, and Arvind Narayanan. 2018. Fairness and Machine Learning. fairmlbook.org. http://www.fairmlbook.org.
  • Barocas and Selbst (2014) Solon Barocas and Andrew D. Selbst. 2014. Big Data’s Disparate Impact. SSRN eLibrary (2014).
  • Bedoya (2016) Alvaro M. Bedoya. 2016. The Color of Surveillance. Slate (18 January 2016). https://slate.com/technology/2016/01/what-the-fbis-surveillance-of-martin-luther-king-says-about-modern-spying.html
  • Berlin (2013) Isaiah Berlin. 2013. The Pursuit of the Ideal. In The Crooked Timber of Humanity, Henry Hardy (Ed.). Princeton University Press, Chapter 1, 1–20.
  • Bijker et al. (1987) Wiebe E. Bijker, Thomas P. Hughes, and Trevor J. Pinch. 1987. The Social Construction of Technological Systems: New Directions in the Sociology and History of Technology. MIT Press, Cambridge, Mass.
  • Binns (2018) Reuben Binns. 2018. Fairness in Machine Learning: Lessons from Political Philosophy. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 149–159.
  • Bishop (1995) Christopher M. Bishop. 1995. Neural Networks for Pattern Recognition. Oxford University Press, Inc., USA.
  • Bowker and Star (1999) Geoffrey C. Bowker and Susan Leigh Star. 1999. Sorting Things Out: Classification and Its Consequences. MIT Press, Cambridge, Mass.
  • Brayne (2017) Sarah Brayne. 2017. Big Data Surveillance: The Case of Policing. American Sociological Review 82, 5 (2017), 977–1008.
  • Budryk (2020) Zack Budryk. 2020. Two white students sue UT-Austin claiming they were denied admission based on race. The Hill (22 July 2020). https://thehill.com/blogs/blog-briefing-room/news/508608-two-white-students-sue-ut-austin-claiming-they-were-denied
  • Buolamwini and Gebru (2018) Joy Buolamwini and Timnit Gebru. 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 77–91.
  • Chen et al. (2018) Irene Y. Chen, Fredrik D. Johansson, and David Sontag. 2018. Why is My Classifier Discriminatory?. In Proceedings of the 32nd International Conference on Neural Information Processing Systems (Montréal, Canada) (NIPS’18). Curran Associates Inc., Red Hook, NY, USA, 3543–3554.
  • Chouldechova (2017) Alexandra Chouldechova. 2017. Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments. Big Data 5, 2 (2017), 153–163.
  • Clarke (1994) R. Clarke. 1994. The Digital Persona and Its Application to Data Surveillance. Inf. Soc. 10 (1994), 77–92.
  • Cohen (2011) Nicole Cohen. 2011. The Valorization of Surveillance: Towards a Political Economy of Facebook. Democratic Communiqué 22 (2011).
  • Conrad (2009) Kathryn Conrad. 2009. Surveillance, Gender and the Virtual Body in the Information Age. Surveillance & Society 6, 4 (2009), 380–387.
  • Corbett-Davies et al. (2017) Sam Corbett-Davies, Emma Pierson, Avi Feller, Sharad Goel, and Aziz Huq. 2017. Algorithmic Decision Making and the Cost of Fairness. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Halifax, NS, Canada) (KDD ’17). Association for Computing Machinery, New York, NY, USA, 797–806.
  • DeSilver et al. (2020) Drew DeSilver, Michael Lipka, and Dalia Fahmy. 2020. 10 things we know about race and policing in the U.S. Pew Research Center (3 June 2020). https://www.pewresearch.org/fact-tank/2020/06/03/10-things-we-know-about-race-and-policing-in-the-u-s/
  • Dutta et al. (2020) Sanghamitra Dutta, Dennis Wei, Hazar Yueksel, Pin-Yu Chen, Sijia Liu, and Kush Varshney. 2020. Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daumé III and Aarti Singh (Eds.). PMLR, 2803–2813.
  • Dwork et al. (2012) Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. 2012. Fairness through Awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (Cambridge, Massachusetts) (ITCS ’12). Association for Computing Machinery, New York, NY, USA, 214–226.
  • Dwork et al. (2018) Cynthia Dwork, Nicole Immorlica, Adam Tauman Kalai, and Max Leiserson. 2018. Decoupled Classifiers for Group-Fair and Efficient Machine Learning. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 119–133.
  • Flanagan et al. (2008) M. Flanagan, Daniel Howe, and H. Nissenbaum. 2008. Embodying values in technology: Theory and practice. Information Technology and Moral Philosophy (01 2008), 322–353.
  • Flanagan and Nissenbaum (2014) Mary Flanagan and Helen Nissenbaum. 2014. Values at Play in Digital Games. The MIT Press.
  • Forde et al. (2021) Jessica Zosa Forde, A. Feder Cooper, Kweku Kwegyir-Aggrey, Chris De Sa, and Michael Littman. 2021. Model Selection’s Disparate Impact in Real-World Deep Learning Applications. ICML 2021 Science of Deep Learning Workshop.
  • Friedler et al. (2016) Sorelle A. Friedler, Carlos Scheidegger, and Suresh Venkatasubramanian. 2016. On the (im)possibility of fairness. arXiv:1609.07236 [cs.CY]
  • Friedman and Hendry (2019) Batya Friedman and David G. Hendry. 2019. Value Sensitive Design: Shaping Technology with Moral Imagination. The MIT Press.
  • Gieryn (1983) Thomas F. Gieryn. 1983. Boundary-Work and the Demarcation of Science from Non-Science: Strains and Interests in Professional Ideologies of Scientists. American Sociological Review 48, 6 (1983), 781–795.
  • Hardt et al. (2016) Moritz Hardt, Eric Price, and Nathan Srebro. 2016. Equality of Opportunity in Supervised Learning. arXiv:1610.02413 [cs.LG]
  • Hastie et al. (2009) Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference and Prediction (2 ed.). Springer.
  • Hecht (2002) Gabrielle Hecht. 2002. Rupture-Talk in the Nuclear Age: Conjugating Colonial Power in Africa. Social Studies of Science 32, 5-6 (2002), 691–727.
  • Heidari et al. (2018) Hoda Heidari, Claudio Ferrari, Krishna Gummadi, and Andreas Krause. 2018. Fairness Behind a Veil of Ignorance: A Welfare Analysis for Automated Decision Making. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc., 1265–1276.
  • Hellman (2020a) Deborah Hellman. 2020a. Big Data and Compounding Injustice. Working draft.
  • Hellman (2020b) Deborah Hellman. 2020b. Measuring Algorithmic Fairness. Virginia Law Review 106, 4 (2020).
  • Hoffmann (2019) Anna Lauren Hoffmann. 2019. Where fairness fails: data, algorithms, and the limits of antidiscrimination discourse. Information, Communication & Society 22, 7 (2019), 900–915.
  • Hughes (1993) Thomas Parker Hughes. 1993. Networks of Power: Electrification in Western Society, 1880-1930 (1st ed.). Johns Hopkins University Press.
  • Joseph et al. (2016) Matthew Joseph, Michael J. Kearns, Jamie Morgenstern, Seth Neel, and Aaron Roth. 2016. Rawlsian Fairness for Machine Learning. CoRR abs/1610.09559 (2016). arXiv:1610.09559
  • Kaplan et al. (2020) Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, and Dario Amodei. 2020. Scaling Laws for Neural Language Models. arXiv:2001.08361 [cs.LG]
  • Kasy and Abebe (2021) Maximilian Kasy and Rediet Abebe. 2021. Fairness, Equality, and Power in Algorithmic Decision-Making. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (Virtual Event, Canada) (FAccT ’21). Association for Computing Machinery, New York, NY, USA, 576–586. https://doi.org/10.1145/3442188.3445919
  • Kennedy (1961) John F. Kennedy. 1961. Executive Order 10925.
  • Kleinberg et al. (2017) Jon M. Kleinberg, Sendhil Mullainathan, and Manish Raghavan. 2017. Inherent Trade-Offs in the Fair Determination of Risk Scores. In 8th Innovations in Theoretical Computer Science Conference, ITCS 2017, January 9-11, 2017, Berkeley, CA, USA (LIPIcs, Vol. 67), Christos H. Papadimitriou (Ed.). Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 43:1–43:23.
  • Kroll et al. (2017) Joshua A. Kroll, Joanna Huey, Solon Barocas, Edward W. Felten, Joel R. Reidenberg, David G. Robinson, and Harlan Yu. 2017. Accountable Algorithms. University of Pennsylvania Law Review 165, 633 (2017).
  • Leamer (1983) Edward Leamer. 1983. Let’s Take the Con Out of Econometrics. American Economic Review 73, 1 (1983), 31–43.
  • Menon and Williamson (2018) Aditya Krishna Menon and Robert C Williamson. 2018. The Cost of Fairness in Binary Classification. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 107–118.
  • Monahan and Skeem (2016) John Monahan and Jennifer L. Skeem. 2016. Risk Assessment in Criminal Sentencing. Annual Review of Clinical Psychology 12, 1 (2016), 489–513.
  • Newkirk (2017) Vann R. Newkirk. 2017. The Myth of Reverse Racism. The Atlantic (5 August 2017). https://www.theatlantic.com/education/archive/2017/08/myth-of-reverse-racism/535689/
  • Noriega-Campero et al. (2019) Alejandro Noriega-Campero, Michiel A. Bakker, Bernardo Garcia-Bulle, and Alex ‘Sandy’ Pentland. 2019. Active Fairness in Algorithmic Decision Making. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society (Honolulu, HI, USA) (AIES ’19). Association for Computing Machinery, New York, NY, USA, 77–83.
  • Florida House of Representatives (2021) Florida House of Representatives. 2021. CS/HB 1475: Sex-specific Student Athletic Teams or Sports. https://www.flsenate.gov/Session/Bill/2021/1475/BillText/c1/PDF
  • Painter (2011) Nell Irvin Painter. 2011. The History of White People. W. W. Norton.
  • Passi and Barocas (2019) Samir Passi and Solon Barocas. 2019. Problem Formulation and Fairness. In Proceedings of the Conference on Fairness, Accountability, and Transparency (Atlanta, GA, USA) (FAT* ’19). Association for Computing Machinery, New York, NY, USA, 39–48.
  • Pham (2018) Elyse D. Pham. 2018. Fellow Asian-Americans, Back Off of Affirmative Action. The Harvard Crimson (15 October 2018). https://www.thecrimson.com/article/2018/10/15/pham-back-off-affirmative-action/
  • Porter (1995) Theodore M. Porter. 1995. Trust in Numbers: The Pursuit of Objectivity in Science and Public Life. Princeton University Press.
  • Powles and Nissenbaum (2018) Julia Powles and Helen Nissenbaum. 2018. The Seductive Diversion of Solving Bias in Artificial Intelligence. Medium (December 2018).
  • Rawls (1971) John Rawls. 1971. A Theory of Justice. Belknap Press of Harvard University Press, Cambridge, Massachusetts.
  • Sabato and Yom-Tov (2020) Sivan Sabato and Elad Yom-Tov. 2020. Bounding the Fairness and Accuracy of Classifiers from Population Statistics. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daumé III and Aarti Singh (Eds.). PMLR, 8316–8325.
  • Selbst et al. (2019) Andrew D. Selbst, Danah Boyd, Sorelle A. Friedler, Suresh Venkatasubramanian, and Janet Vertesi. 2019. Fairness and Abstraction in Sociotechnical Systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency (Atlanta, GA, USA) (FAT* ’19). Association for Computing Machinery, New York, NY, USA, 59–68.
  • Speri (2019) Alice Speri. 2019. The FBI Spends a Lot of Time Spying on Black Americans. The Intercept (29 October 2019). https://theintercept.com/2019/10/29/fbi-surveillance-black-activists/
  • Srivastava et al. (2019) Megha Srivastava, Hoda Heidari, and Andreas Krause. 2019. Mathematical Notions vs. Human Perception of Fairness: A Descriptive Approach to Fairness for Machine Learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Anchorage, AK, USA) (KDD ’19). Association for Computing Machinery, New York, NY, USA, 2459–2468.
  • Velocci (2021) Beans Velocci. 2021. Binary Logic: Race, Expertise, and the Persistence of Uncertainty in American Sex Research. PhD dissertation, Yale University. Chapter 1, “Unsolved Problems of Anomalous Sex: Managing Sexual Multiplicity in Nineteenth-Century Animal Studies”.
  • Wajcman (2004) Judy Wajcman. 2004. TechnoFeminism. Polity.
  • Wick et al. (2019) Michael Wick, Swetasudha Panda, and Jean-Baptiste Tristan. 2019. Unlocking Fairness: a Trade-off Revisited. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc., 8783–8792.
  • Yang (2010) Xin-She Yang. 2010. Engineering Optimization: An Introduction with Metaheuristic Applications (1st ed.). Wiley Publishing.
  • Zhao and Gordon (2019) Han Zhao and Geoff Gordon. 2019. Inherent Tradeoffs in Learning Fair Representations. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc., 15675–15685.
  • Zuboff (2018) Shoshana Zuboff. 2018. The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power (1st ed.). Profile Books.