The homogeneity of the star forming environment of the Milky Way disk over time
Abstract
Stellar abundances and ages afford the means to link chemical enrichment to galactic formation. In the Milky Way, individual element abundances show tight correlations with age, which vary in slope across ([Fe/H]-[/Fe]). Here, we step from characterising abundances as measures of age, to understanding how abundances trace properties of stellar birth-environment in the disk over time. Using measurements from 27,000 APOGEE stars (R=22,500, SNR200), we build simple local linear models to predict a sample of elements (X = Si, O, Ca, Ti, Ni, Al, Mn, Cr) using (Fe, Mg) abundances alone, as fiducial tracers of supernovae production channels. Given [Fe/H] and [Mg/H], we predict these elements, [X/H], to about double the uncertainty of their measurements. The intrinsic dispersion, after subtracting measurement errors in quadrature is dex. The residuals of the prediction (measurement model) for each element demonstrate that each element has an individual link to birth properties at fixed (Fe, Mg). Residuals from primarily massive-star supernovae (i.e. Si, O, Al) partially correlate with guiding radius. Residuals from primarily supernovae Ia (i.e. Mn, Ni) partially correlate with age. A fraction of the intrinsic scatter that persists at fixed (Fe, Mg), however, after accounting for correlations, does not appear to further discriminate between birth properties that can be traced with present-day measurements. Presumably, this is because the residuals are also, in part, a measure of the typical (in)-homogeneity of the disk’s stellar birth environments, previously inferred only using open-cluster systems. Our study implies at fixed birth radius and time, there is a median scatter of dex in elements generated in supernovae sources.
1 Introduction
Chemical abundances from surveys like APOGEE (Majewski et al., 2017), GALAH (Buder et al., 2018) and LAMOST (Zhao et al., 2012), combined with a reference set of ages from Kepler (Borucki et al., 2010) provide the means to link chemical enrichment to galactic formation over time. Temporally, the disk formed inside-out (e.g. Frankel et al., 2019): we observe there to be older stars in the inner galaxy and younger stars in the outer regions. Chemically, the disk shows a negative metallicity gradient across Galactic radius, that has presumably weakened over time due to radial migration (e.g. Roškar et al., 2008; Frankel et al., 2018; Minchev et al., 2018).
For stars in the disk, studies of individual element distributions, [X/Fe], have revealed that at fixed metallicity, [Fe/H], there are very small intrinsic scatters ( dex) around weak age-individual abundance relations (e.g. Nissen, 2015; Bedell et al., 2018; Ness et al., 2019; Hayden et al., 2020; Casamiquela et al., 2021; Espinoza-Rojas et al., 2021). The age-abundance relations vary in their slopes at different values in the chemical plane defined by Supernovae type Ia (SNIa) and Supernovae type II (SNII) sources (Fe, Mg); although the intrinsic scatter in the disk population is comparably small across chemical cells (or bins) in the [Fe/H]-[Mg/Fe] plane (Yuxi et al., 2021). Previously, we have addressed the question of how individual abundances at fixed [Fe/H] and stellar ages can link to birth radius (e.g. Ratcliffe et al., 2021; Ness et al., 2019). Here, we turn this question around to ask; at fixed birth radius and age, what is the intrinsic scatter in individual elements? That is, what is the abundance scatter set by the time and place of the star forming environment?
To address this question we first measure how well a set of elements that are produced by supernovae sources are predicted by two fiducial supernovae sources. For the disk, a two-process model defined by (Fe, Mg) can well describe the smooth individual abundance variation across the disk (Weinberg et al., 2018; Griffith et al., 2021a; Weinberg et al., 2021). Here we use Fe and Mg as fiducial SNIa and supernovae SNII sources, respectively, to quantify how well we can predict an ensemble of eight other supernovae produced elements (Si, O, Ca, Ti, Ni, Al, Mn, Cr), given these two elements alone. We then examine the residual scatter around these predictions and estimate how much of this scatter is not correlated with astrophysical sources. That is, how much scatter is intrinsic to the birth place of stars in the disk. Section 2 introduces our data, Section 3, our method, Section 4 our results, and in Sections 5 and 6 we include a brief discussion and summary of future prospects.
2 Data
We assemble a high fidelity sample of APOGEE DR16 red giant stars with ten abundance measurements from the ASPCAP pipeline (García Pérez et al., 2016). We use X = Fe, Mg, Si, O, Ca, Ti, Ni, Mn, Cr, Al (we take the reported abundances with respect to Fe and convert them to [X/H] by [Fe/H])111we test using both [X/H] and [X/Fe] and for local linear regression and all the results and inferences are the same. These ten elements are understood to be produced in supernovae channels, specifically via massive star and white-dwarf explosions. (e.g. Kobayashi et al., 2020a). We perform the following quality cuts to obtain our sample of 27,000 stars:
K
dex
dex
dex
SNR
Flags ASPCAPBAD and ROTATION not set
Most of these stars are located near the Sun, with a Galactic radius of , but range from , in total. The stars span all ages, from stars as young as Gyr, with a mean distribution of age Gyr. At Signal to Noise (SNR) 200 these ten elements are measured to a precision of between dex, on average. We restrict our sample to heliocentric velocities of based on an initial assessment of how precisely two fiducial channels (Fe, Mg) can predict the other abundances [X/H]. We find that abundance residuals (measured abundance-predicted abundance) that we calculate show strong correlations with velocity. This is presumably not related to the star, but nuisance imprints in the spectra that are a significant fraction of the residual amplitude, from sky emission and tellurics and diffuse interstellar bands (Holtzman et al., 2018, and McKinnon in preparation).
We examine the correlations between individual element abundance residuals and orbital actions and ages. The residuals are calculated as the difference between the ASPCAP measurements and the model’s prediction. The ages we use are derived from spectra using data-driven modeling and the Kepler reference objects (Yuxi et al., 2021). These have individual uncertainties of Gyr. The orbital actions and spatial coordinates are from the ASTRO-NN catalogue (Leung & Bovy, 2019). Additionally, we use the open cluster members from the OCCAM catalogue of open clusters (Donor et al., 2020) to assess the residuals in sites of known inter and intra birth origin. For examining the open cluster stars we relax our SNR limit to SNR 100 and remove our radial velocity restriction. This increases the number of field stars that we work with to 80,000 and increases our open cluster sample by a factor of two (to 100 open cluster stars in nine clusters). We note that our less restrictive cuts negligibly change our overall results and conclusions for the full ensemble analyses. However, using the higher fidelity sample of stars is a more precise examination of the data, with a lesser contribution from nuisance signals i.e. artifacts that correlate with radial velocity.
3 Method
We want to determine, given stellar evolutionary state and two abundances (Fe, Mg), how well we can predict other elements generated via supernovae channels. To do this, we use linear regression. A local linear regression model is like basic regression (Hastie et al., 2001), but an individual model is built for each object on a subset of data i.e. a local model. Each local model is constructed by defining a neighborhood around each object in some data space (e.g. see Sayeed et al., 2021, for an example of this using Kepler data). This approach takes on the following steps for our stars, where the parameters that we select as predictors are = (, , [Mg/H], [Fe/H]) and our goal is to predict eight abundances X = (Si, O, Ca, Ti, Ni, Al, Mn, Cr):

1. Standardization of parameters: we first standardize the parameters that we wish to use for calculating a distance between each star and every other star. We use these distance measurements to define a neighbourhood for every star. The parameters we select are evolutionary state and the elements from two fiducial supernovae channels: = (, , [Mg/H], [Fe/H]). To standardize each parameter, we subtract the mean and divide by the standard deviation, calculated for the N stars, for each;
y = ( - .
2. Distance metric: For each star in our set of N objects, we determine, via a comparison, the distance to each other star in the sample using the four standardized parameters, = ( , , [Fe/H] [Mg/H]). For any two stars, the value is given by the following equation:
(1) |
where the indices and denote the two stars, is index of each the standardized parameters, and the standardized parameter measurements
with standardized uncertainty .
3. Define the local model: For each star, , we then take the nearest neighbours with the smallest values (where sets the size of the neighbourhood). In practice the model is fairly insensitive to the choice of and between produces comparable results. We select k=100.
4. Regression: using the local model for each star of nearest neighbours, and excluding the star used to define the neighborhood, we use the linear regression function in python’s package (Pedregosa et al., 2011). This modeling step learns the relationship between the parameters of = (, , [Fe/H], [Mg/Fe]) and each of the eight abundances [X/H] (separately), parameterized for each [X/H]) with five coefficients for each local model (the intercept and one for each Y). Then abundances [X/H] are predicted from the five parameters.
5. Predict: for each star, we then (separately) predict each of the eight abundances defined above [X/H] using the four parameters of = (, , [Fe/H], [Mg/H]) and the coefficients determined in step (4). This gives our prediction for [X/H], that we can compare to the measured abundance from APSCAP.
6. Repeat: this procedure for every star 1 to N.
4 Results
4.1 Local Linear Model Predictions
The mean summary of our element predictions compared to their measurements using local linear modeling is shown in Figure 1, for our stars. The x-axis shows the ASPCAP abundances for the stars and the y-axis the prediction using a linear regression of the local neighbourhood (modeled with the predicted object removed). A number of measurements are reported in each sub-panel. The name of each element [X/H] is shown in the top left hand corner of each sub-panel. The intrinsic scatter (in a data analysis sense) of each element’s prediction, which is the is shown in square brackets: This is the root mean square (rms) difference between the predicted and measured abundance [X/H], and the mean measurement uncertainty (which is typically the same for each star) . This bias of the predicted versus the measured abundance [X/H] is also included at top left (bias =0 as expected from linear regression). We note that the predicted abundance are unbiased for [Fe/H] . However, this is not the case for the halo which is a far more chemically diverse population at fixed (Fe, Mg). The scatter around the 1:1 line is also relatively metallicity independent. The mean measurement uncertainty is reported in the bottom right hand corner of each sub-panel. We note that the measurement uncertainty on the predictors in our model (, , , ) propagate negligibly to the intrinsic scatter that we measure. We note that the systematic uncertainty (accuracy) of the individual element abundance measurements is much larger than the intrinsic dispersion values. Including and as predictors is done so as to remove systematic biases imputed by stellar model approximations (e.g. Jofré et al., 2019). We note that while these will remove systematics in the model from model atmospheres, they will not remove biases from anything that is inherited at the level of the spectra itself (see Figure 6).

We examine the correlation with intrinsic dispersion as a function of [Fe/H] as shown in Figure 2. The 1- intrinsic dispersion of the [X/H] measurement compared to the prediction is shown for each element. There are unique trends in each element; some on the order of a few percent and some on the order of a factor of two, across [Fe/H]. The elements Cr and O show the most dramatic increase with decreasing [Fe/H] and the element Mn shows the most significant increase at higher [Fe/H]. Beyond noting that the intrinsic dispersion varies across metallicity [Fe/H], we do not draw strong astrophysical conclusions from these trends. This is because the total scatter of the [X/H] prediction and subsequently the intrinsic dispersion values that we calculate are sensitive to the accuracy of the abundance measurement errors, which vary across the [Fe/H] plane. For all elements, the measurement errors increase with decreasing metallicity, with the mean error for elements Cr and O increasing the most dramatically by a factor of more than two.
Each element’s mean intrinsic dispersion across all [Fe/H] (or intrinsic scatter) from Figure 1 are compared in Figure 3. The dashed line indicates the median intrinsic dispersion for these eight elements, which is dex. That is, using (Fe, Mg) alone the other elements produced in supernovae can be predicted on average to this precision, but with a range of dex (for Ni) to dex (for Cr).

The intrinsic scatter for each element [X/H] is a quantification of the effective individuality of the element. That is, how much additional information it captures beyond (Fe, Mg) alone. The intrinsic scatter also represents the diversity of the star forming environment. For example, given the (Fe, Mg) of a star alone, to learn something additional from Si, from Figure 1, we see that we need to measure the Si abundance to dex.
4.2 Residuals: what do these mean?
Elements are individuals (e.g. Freeman & Bland-Hawthorn, 2002; Buder et al., 2018; Kobayashi et al., 2020b; Blancato, 2018; Ting & Weinberg, 2021), but as summarised in Figure 3, only marginally (compared to typical measurement uncertainties), for supernovae sources. While the ten elements considered, of Fe, Mg, Si, O, Ca, Ti, Ni, Al, Mn, Cr, are produced in supernovae, this is (i) via different SNII compared to SNIa fractions, (ii) with different [Fe/H] dependencies and (iii) with different mass dependencies (e.g. Woosley & Weaver, 1995; Kobayashi et al., 2020b; Matteucci, 2021, and references therein). For each element shown in Figure 1, we calculate the residual of that element = measured abundance from ASPCAP model’s prediction. A positive residual means the measured abundance is higher than predicted by the local neighbourhood and a negative residual means the abundance is less than that expected in the local neighbourhood. As the residuals quantify how far each star deviates from the model’s prediction, in principle these may be valuable tracers of birth environment, that are independent of (Fe, Mg). That is, because the element abundances ([Fe/H], [Mg/H]) are the predictors, the metallicity dependence of the yield has been removed, so scatter is caused by other conditions of the star forming environment.
Figure 4 shows a matrix of the correlations of the residuals of the predictions. This Figure shows that intra-family elements (e.g. Si and O, produced predominatly in SNII) appear to be correlated and inter-family (e.g. Cr and Si, produced predominantly in SNIa and SNII, respectively) appear anti-correlated. Note, all correlations are fairly weak (a Pearson correlation coefficient of 1 would be a perfectly correlated set of two error-free distributions).

We now investigate correlations of the residuals with age and present-day orbital properties (time and place). The correlations between the element residuals, and their trends with age and orbit are revealing as to the difference in the star forming environment that has given rise to their abundances. Furthermore such correlations of the residuals would indicate additional resolving power in the eight elements in Figure 1 to identify birth environment beyond Fe and Mg alone (Ting & Weinberg, 2021).
4.3 Correlations of residuals with age and orbit
Figure 5 shows the correlation between the element abundance residuals and the angular momentum (at left) and vertical action (at right), and , respectively. We do not include the radial action , as the residual trends are flat and the range is limited, given as these are disk stars. The (smoothed) running mean of action-residual trends are shown and the shaded regions represent the 1- confidence on the mean. The axes are scaled to be the amplitude of the intrinsic dispersion measured as reported in Figure 1. The axis scaling highlights that the spatial/orbital correlations explain only (small) part of the total magnitude of this scatter.


Similarly to Figure 5, Figure 6, at left, shows the residual of each element versus age. There are 15,000 stars with ages (Yuxi et al., 2021) that are used to make the panel showing the age correlation. The right panel of Figure 6 shows a larger sample of 80,000 stars with an SNR cut of 100 (and no velocity restriction imposed). Figure 6 at right shows the correlation between residual amplitude for these 80,000 stars and heliocentric velocity. This is the most significant variable accounting for correlations in the abundance residuals – and likely due to nuisance signals in the spectra near the limits of the precision. Initial results suggest that residual telluric and sky line and other non-stellar features remain in the apogee spectra at the level of percent across many wavelengths. Because these features are in the (approximate) observer frame, the stellar rest frame spectra have contamination (which impacts measured abundances) that correlates with stellar heliocentric velocity (see McKinnon et al., in preparation). The relations are non-monotonic and are non trivial to remove, and therefore separate from the abundance measurements themselves.
We note that if we repeat the exercise of the calculating the residuals of the [X/H] predictions and subsequently examining orbit and age correlations with a lower fidelity sample e.g. SNR 100 and remove any velocity limit, we find effectively the same results to those shown for our higher fidelity sample here. However, we measure higher intrinsic scatters for a few of the elements (shown in the next Section in Figure 8).


4.4 Intrinsic variance not explained by astrophysical properties
We see from Figures 5 and 6 that orbital parameters and age correlations explain only part of the intrinsic scatter around the abundance predictions shown in Figure 1 (residual amplitude). To test how much the intrinsic scatter decreases once we remove known correlations, we add to the regression model the additional parameters shown in Figures 5 and 6, of , , age and . We do this to try to assess how much of the variance remains once we account for other sources of correlation. We notice that additional small correlations are seen in the residuals with mean fiber number and SNR, and subsequently add these as additional parameters to our model as well. Furthermore, as we are interesting in finding a model with a minimum level of intrinsic dispersion of the measurement minus the predicted [X/H], we explore additional parameter cuts. We find that introducing a restricted vertical extent of the stellar orbits, , gives another 5 percent reduction in . We therefore restrict this sample to disk stars with , which leaves a total of 13,000 stars.

In Figure 7, we show the summary of the intrinsic dispersion for each element from the model with the additional parameters included as predictors. Including this additional set of parameters in the model reduces the intrinsic dispersion of the eight elements to a median level of dex. This demonstrates that an estimate of the residual variance that is not able to be explained with age or orbit up to dex for these eight elements. The local linear model does not, however, entirely remove the correlations of these variables. Specifically, for Ni and Mn partial trends with age persist at the dex level. Therefore, the median intrinsic dispersion of dex represents an upper limit. We note that for this model we found a larger neighborhood, with = 300, gave a marginally lower intrinsic dispersion for the element Cr in particular, by about 15 percent. This is not surprising that a larger might be favoured given the larger number of variables used in the model.
We verify that the smaller intrinsic dispersion we obtain is a consequence of including additional predictors in the model by using our initial four parameter model applied to this subset of 13,000 stars. The intrinsic dispersion obtained with the smaller subset of 13,000 stars and four parameter model is near-identical to that obtained with the 27,000 fiducial sample, with a median dex compared to dex for the full sample. The subsequent reduction in the scatter of the elements around the model with additional parameters, is therefore a consequence of including these additional correlated variables in the model regression and not the different samples.
To validate these results, we do an additional calculation as follows, to estimate how much of the overall intrinsic scatter we measure for each element is not a consequence of correlation with physical parameters and velocity: , , age and . For each of these four parameters, we subtract in quadrature the average 1- dispersion around the mean-trends reported in Figures 5 and 6, from the rms-dispersion as measured around the prediction of the model in Figure 1, for each element. This gives an overall measure of the amplitude of the scatter that is explained by each of , , age and (combined with the measurement errors) for each element. The quadrature sum of these for each element represents the total scatter (combined with the measurement errors) accounted for by these joint parameters (ignoring any correlation between them, so as to obtain a lower limit), . We then take the quadrature difference for each element between the intrinsic dispersion of the model’s prediction summarised in Figure 3 and the quadrature sum of the scatter explained by the four parameters; . This gives us how much intrinsic variance is not explained by these parameters. On average, we find this to be dex, for the eight elements.
Is it possible that there are correlations with birth radius that would explain the remaining intrinsic scatter, but as stars migrate over time this is simply lost (e.g. Sellwood & Binney, 2002; Roškar et al., 2008)? We examine the and trends for only the youngest populations. Although the residual-action gradients are steeper (see Figure A1 in the Appendix), these are still only are able to explain a fraction of the intrinsic scatter measured in each element. We therefore think this is unlikely, as this would be preserved in the age-residual gradients, particularly for the youngest stars. Stars of a given birth radius likely follow tracks in [Fe/H]-age (Minchev et al., 2018; Frankel et al., 2018). Yet, we see from Figure 6 that the age correlations are rather weak (including in narrow ranges of [Fe/H]), with the exception of Mn and Al which still only account for part of the variance. We therefore conclude that dex is an effective measure of the amplitude of dispersion in these eight elements in the disk at birth time and radius.
4.5 The residual scatter of open clusters
We test the intrinsic scatter of residual abundances for the eight individual elements, [X/H], using known sites of common birth origin – open clusters. Cluster members are selected from our set of 27,000 stars as those with a membership probability of as reported in the APOGEE OCCAM open cluster catalogue (Donor et al., 2020). This gives us a total of 46 stars distributed among the following four clusters: NGC 188, NGC 2682, NGC 6705 and NGC 6819. For each element, in each cluster, we calculate the mean measurement error for the set of stars, and the 1- dispersion of the residuals in each [X/H]. The quadrature difference of the scatter of the residuals and the measurement uncertainties gives an average intrinsic dispersion of dex for the eight elements and four clusters (with a range range of dex for Si to dex for O). This mean value of dex is consistent with our inference from the field stars, of dex scatter in elements at fixed birth radius and age.

We now use a more lenient selection of stars to study the residuals in more open clusters. We use the larger set of 80,000 stars with 100. From this sample, we obtain 107 stars from nine different clusters with a membership probability of and more than eight stars per cluster. Figure 8 shows a summary of the residual amplitude away from the model’s prediction, for each element, in each cluster. These are ordered from youngest to oldest cluster 222see (Salaris et al., 2004; Janes & Phelps, 1994; Donati et al., 2015; Meibom et al., 2009; Montgomery et al., 1993; Sung et al., 1999). For reference, in the context of Figures 5 and 6, these clusters have the following distributions in parameters: Age = Gyr ( Gyr), ), ), ). Although selected with a minimum SNR 100, 70 percent of the open cluster stars have SNR 200.

In Figure 8, the mean residuals are colored by mean [Fe/H] of the cluster stars (which range from dex [Fe/H] 0.1 dex). The thin error bars around the mean report the 1- dispersion of the residual distributions. The thick, coloured error bars show the intrinsic dispersion of the residuals (from the quadrature difference of the 1- dispersion and the mean error of the cluster stars). The red dashed lines in Figure 8 show the intrinsic dispersion measurements calculated for each element using the set of stars with the model from Figure 7. As described in Section 4.3, this comprises a sample of 13,000 field stars of the disk (SNR 200), with age, orbital actions (and also velocity) included in the model for as predictors for each element. This serves as a baseline comparison for the open cluster sample results (where the stars in open clusters have the same orbital and age properties so these are implicitly accounted for in examining the clusters). About 50 percent of the time, the intrinsic dispersion of the open cluster residuals exceeds that of the field model. The mean intrinsic dispersion of the residuals for the eight elements and nine clusters is dex (with a variation of dex on average for the distribution of all clusters and all elements, with individual elements measuring between dex). The smaller variance compared to the field (red dashed lines in Figure 8) for some clusters in some elements indicates that the cluster comprises a more homogeneous environment, where elements are more precisely predicted by (Fe, Mg), compared to the field. However, on average, the intrinsic dispersion of the clusters is not markedly different from the field.
Looking at the distribution of the mean residual values in Figure 8, it is clear that the cluster stars have biased mean residuals with respect to the field. On average, the intrinsic dispersion of each measurement is smaller when the bias is larger. The mean of the residuals in the field sample is shown with the black line at zero, for reference. For individual elements, the set of clusters are approximately distributed around zero-mean, with the exception of Al and Cr. Individual clusters themselves, however, show bias away from zero-mean. This must be due to some difference between the clusters and the field distribution. Part of the bias in each element for the clusters can be explained by the correlations reported in Figures 5 and 6, and the interplay between these correlations.
Figure 9 shows the residuals of the four elements with the largest scatter in the cluster means (Ti, Al, Mn and Cr) ordered in increasing vertical action, (for Ti and Cr), angular momentum, (for Al), as well as dispersion in heliocentric velocity, (for Mn). We look at the correlation with velocity dispersion due to its association with tracing cluster mass (e.g. Poovelil et al., 2020). Figure 9 reveals that the correlations with the orbital actions appear stronger than in the field. The residuals for Al show a relationship across that echoes some of the structured variation in the mean seen in the young field stars of the disk, in Figure A1. However, NGC 6705 has an anomalously low Al residual and the mean residual value for three of the clusters exceeds the 1- intrinsic dispersion of the field. The residuals for the elements Ti and, in particular, Cr, show a correlation with . The correlation between the residual of Cr and , in particular, is not similarly seen in the field sample. The relationship between the residual in Cr and suggests that the amplitude of Cr at fixed (Mg,Fe) is a tag of birth height from the plane of the disk, that is erased with cluster dissolution and heating. The element residuals for Cr and Al have the largest intrinsic dispersion values, by a factor of about two, compared to the other elements (with dex). The correlations with the (presumably birth) orbital actions seen here for these elements likely account for part of that scatter measured in the field.
The residuals of the element Mn show a correlation with age in Figure 8 that is overall consistent with the trends seen in Figure 6, with the exception of NGC 188, which is the oldest cluster with age Gyr. From Figure 6, old stars are biased to have negative residual values for Mn. Yet, the mean Mn residual for NGC 188 is dex (and near-identical to that of the youngest cluster, NGC 6705). Interestingly, the two clusters, NGC 188 and NGC 6705 are the most metal rich clusters and show similar velocity dispersion values. These two clusters also have anomalously low residuals in Ca.
We examine the cluster with the highest overall intrinsic dispersion in the elements, NGC 2243. This cluster shows a dramatically larger intrinsic dispersion in the residual of Cr, compared to the other clusters. Interestingly, this cluster has a very high radial action compared to the other clusters, with . The other eight clusters have . NGC 2243 is also the most metal-poor cluster and Figure 2 shows that for the field sample, the residual scatter for six of the eight elements increases with decreasing metallicity, most substantially for the element Cr (by a factor of about two between and ).
In summary, the analysis of the open clusters shows that there is, on average, an intrinsic scatter of dex in the individual element residuals [X/H] predicted from (Fe, Mg) in groups of stars known to be born together. This is the same order of magnitude as the intrinsic dispersion in the field, measured after accounting for orbital and age correlations. However, the clusters also show mean residuals that are biased away from zero. Part of this is explained by the expected correlations between the residuals and age and orbital actions. However, the correlations appear to be more substantial in the open clusters compared to the field, for some elements (and most notably perhaps for Cr). This implies that the residuals for these elements are correlated with, and thereby discriminative of, mean birth location in the disk.
4.6 The holistic view of residual scatter in the Milky Way disk and halo
We now demonstrate the potential of the information expressed in the element residuals from the prediction using (Fe, Mg) elements alone, across the Milky Way, more broadly. We proceed by comparing the residual scatter in the set of eight elements for disk stars to halo stars. The stars are selected to have SNR 100, and disk and halo separated only via a metallicity criteria; stars more metal rich than are assigned as disk ( 80,000 stars) and stars more metal poor than dex are assigned as halo (3000 stars). Figure 10 shows the spatial plane of Galactocentric radius and height , for stars of the disk at left, and halo at right, with at least 5 stars per bin. Each bin in the plane is colored by mean residual amplitude, as shown in each Figure’s colorbar. The colorbar in each panel is scaled to the 1- standard deviation of the element’s residual). The disk is smooth in residual trends, with some gradients seen for the elements radially, as explored in Figure 5, in particular seen in Si. In the halo, the residual scatter is two or more times larger than in the disk, and groups of stars in the spatial plane have common and distinguishing mean residual amplitudes (but also in some cases significant scatter within individual sub-structure). Figure 10 at right shows the stars that are grouped in spatial location have different mean residual amplitudes. The targeted APOGEE halo fields are labeled, and these correspond to the groups seen on the sky with similar residual amplitudes. Note the modeling of the halo and the residual amplitude is also sensitive to the neighbourhood ; for Figure 10, the is the same as the disk, of =100. A full exploration of this is beyond the scope of the paper, but different have different utility in examining halo substructure.


Figure 11 shows 36,000 stars selected with SNR 200, including 500 stars with dex (with the same and cuts as in Section 2). The stars are coloured in the [Fe/H]-[Mg/Fe] plane by their residuals (the measured [X/H] the predicted [X/H]). This demonstrates the striking structure in the residuals in the population that is not disk material. Presumably, the halo shows this interesting residual variability and structure as it is comprised of stars from many different star formation environments (e.g. Scannapieco et al., 2021; Naidu et al., 2021; Bonaca et al., 2020; Hallakoun & Maoz, 2021; Ishigaki et al., 2021; Naidu et al., 2020; Buder et al., 2021a). Note where some elements show large positive residuals, others show large negative residuals. The elements Mn and Al are particularly powerful diagnostics of and within the halo sub-systems, compared to the disk; stars with negative residuals in [Al/H] look to be a potential additional chemical tag of the Gaia-Enceladus-Sausage system (Helmi et al., 2018; Belokurov et al., 2018). This is not necessarily surprising given as Al and Mn have been used as diagnostic of halo substructure previously (e.g. Horta et al., 2021; Das et al., 2020). Using the residuals effectively maps individual [X/H] measurements to a common chemical reference frame. Additionally, stars with low-[Mg/Fe] at higher metallicities, dex show high residual amplitudes; a probable signature of these stars having an origin ex-situ the Milky Way disk.

5 Discussion
We demonstrated that using two abundances (Fe, Mg) for disk stars, we can predict an ensemble of other elements produced in supernovae to high precision, with dex. This reveals the (small) amplitude of the individuality of these elements, near the limits of the precision of many large surveys (e.g. García Pérez et al., 2015; Kollmeier et al., 2017; Buder et al., 2018; Jofré et al., 2019). The stars we examine comprise different metallicities, ages and birth radius, having migrated across the disk over time (e.g. Bensby et al., 2014; Minchev et al., 2018; Frankel et al., 2018, 2019; Ratcliffe et al., 2021; Sharma et al., 2021). From a galaxy evolution perspective, the extent to which (Fe, Mg) do not predict the other eight elements we study is, in part, due to environmental differences at birth in the radial and temporal extent of the disk, as told by the level of individuality of the elements (see also Ting & Weinberg, 2021; Weinberg et al., 2021). However, we note that the correlations with present day radius and stellar age are small and that the most dominant correlations for some elements are imputed by artifacts in the spectra. We propose that the residual scatter we measure, after accounting for measurement uncertainties and astrophysical correlations, affords a way of assessing the intrinsic scatter of elements at fixed birth radius and time in the disk. We note however, that the amplitude of the intrinsic scatter that we report for the elements is sensitive to the measurement errors, and this is the biggest source of uncertainty in our estimate.
We explore correlations between the residuals of the measured [X/H] compared to the model’s prediction from (Fe, Mg), as shown in Figures 5 and 6. Looking in detail at these correlations, we see different behaviours for each element. We note that as the correlations between elements themselves are contaminated by systematic imprints (e.g. seen in Figure 6 and McKinnon et al., in preparation) we focus on only these age and orbital correlation, and not inter-element correlations. A positive residual means that there is excess [X/H] compared to the model’s prediction and a negative value means there is less than the prediction from (Fe, Mg) alone. The iron-peak elements of Mn and Ni show the largest age correlations, with positive residuals at youngest ages and negative residuals for old stars. This is presumably as younger stars are born in environments that have relatively higher contributions of SNIa. The primarily massive star, SNII elements, show (very small) correlations with mean guiding radius, measured with . The elements Ca and Si correlate in opposite directions with . The Si (explosive element) abundance is higher than the Ca (hydrostatic) element abundance at smaller guiding radius (see also Blancato, 2018), yet their age relations are relatively flat. Regardless of age, this hints at a very small difference in the initial mass distribution of stars in the inner and outer parts of the Galaxy (e.g. Griffith et al., 2021b; Krumholz, 2014; Matteucci, 2021). Quantitative modeling, however, is needed to understand and explain the underlying parameters of the environment that give rise to the systematic changes in individual abundance distributions over time and radius at fixed (Fe, Mg) as seen in the distributions of the residuals of these elements (e.g. Johnson et al., 2021; Spitoni et al., 2021; Matteucci, 2021; Philcox & Rybizki, 2019; Rybizki et al., 2017, and references therein).
Overall, Figures 5 and 6 reveal that present day orbits and age explain only a fraction of the intrinsic scatter. After removing the astrophysical correlations of orbit and age with individual element residuals and accounting for errors, a level of dex in residual scatter around the model’s prediction remains. This is a likely a measure of the scatter in the elements that are produced in supernovae, at fixed birth radius and age. Correspondingly, this is a complementary result to studies that use open clusters – stars known to be born together, which report an intrinsic scatter of dex for these elements within clusters (e.g. Bovy, 2016; Ness, 2018; Liu et al., 2019; Souto et al., 2019; Price-Jones & Bovy, 2018; Cheng et al., 2021; Kos et al., 2021).
We examine the residual distributions of the eight elements, predicted using (Fe, Mg, , ) in open cluster systems. We find that these sites, comprising stars of common birth origin, have similar residual intrinsic dispersion measurements to the field, on average. Furthermore, that there are trends between the amplitude of the intrinsic dispersion of the residuals and [Fe/H], as seen in the field, and reported in Figure 2. However, the clusters show bias with respect to the field. That is, non-zero mean residual values. Some of this bias is consistent with correlations between the residual amplitudes and the orbital actions and age, as seen in the field. However, some correlations in the cluster population are not observed in the field population. The significant trend of the residual of Cr and the orbital action , that is seen for the cluster stars and not in the field, implies that the cluster dissolution process, heating and/or radial migration removes this relationship. Using the cluster sample, the residual for Cr is revealed to be a likely tag of mean birth . Presumably, in general, we see stronger correlations with orbital actions and the cluster stars compared to the field, as the open clusters have not migrated significantly nor dissolved (Spina et al., 2021).
Although the residuals of the abundances at fixed (Fe, Mg) have very small amplitudes in the disk, the variance in the halo is substantially larger. Therefore in the halo, residual abundances may be useful to constrain and differentiate star formation environments and progenitors. Figure 10 also reveals the large residual variance at very low [Mg/Fe] ( dex, dex), which is presumably ex-situ material. In the disk, where the intrinsic variance is very small, stars that are significant outliers from the model’s prediction may be interesting objects to study. These may have had their abundances modified over the course of their evolution, due to dynamical effects like rotation, binarity and planet accretion.
6 Conclusions
The detailed star formation environment across the Galactic disk over time is traced by the variance in element abundances, [X/H], at fixed fiducial supernovae contributions of (Fe, Mg). We quantify this variance by building local linear models to predict eight abundance ratios [X/H] (for X = Si, O, Ca, Ti, Ni, Al, Mn, Cr) that are produced in supernovae, given the chemical abundances of [Fe/H] and [Mg/H] and evolutionary state parameters of and . Investigation of correlations between the residual of the model (measured [X/H] predicted [X/H]) and astrophysical parameters reveals that the residuals in part correlate with orbit and age. However, the residual measurement, which is near the measurement precision limits, is also hampered by nuisance signals in the spectra, traced by radial velocity correlations. These signals will be important to model and remove in order to fully exploit the information content in stellar spectra (e.g. Feeney et al., 2021; de Mijolla et al., 2021; Wheeler et al., 2021; Price-Jones & Bovy, 2018). Overall, we find that (Fe, Mg) predict the other abundances with a median intrinsic dispersion of dex, which decreases to dex once additional non-chemical parameters are added to the model.
We claim, based on this study that disk stars in the Milky Way have dex, on average, scatter in elements synthesised in supernovae, at fixed birth radius and time. This corresponds to percent of the individual element variance at fixed (Fe, Mg), for elements produced in supernovae. We report some weak dependence of the intrinsic scatter of the residuals of each element on [Fe/H]. The intrinsic scatter of individual elements, at fixed birth radius and time, plus any [Fe/H] dependence, has implications for initial cluster mass distributions and timescales of formation (which are presumably similar regardless of metallicity and age), as well as the mechanics of element mixing (e.g. Bland-Hawthorn et al., 2010; Krumholz et al., 2019; Armillotta et al., 2018). Chemical evolution modeling to fit these data will inform the environmental parameters that give rise to the trends we see in individual elements across both present-day radius and age (see Matteucci, 2021, and references therein). We expect new survey data will enable us to expand this assessment of the intrinsic scatter at fixed radius and time in the disk, to examine different elements and nucleosynthetic families (e.g. De Silva et al., 2015; Buder et al., 2021b; Kollmeier et al., 2017).
Finally, we highlight that this study has focused on the disk population ( dex), where the joint element abundance variance is marginal (Feeney et al., 2021; Rampalli et al., 2021; Yuxi et al., 2021; Ness et al., 2019; Weinberg et al., 2019). This is not the case in the halo ( dex) where residual abundances calculated here are likely much stronger tags of common birth environment.
7 Acknowledgements
Melissa K. Ness acknowledges support from a Sloan Foundation Fellowship. We thank the dynamics group at Flatiron for helpful discussions with respect to this work during its development.
8 Appendix
Figure A1 shows the 3300 stars with ages Gyr and the running mean of their residuals in the eight elements and and . These trends between residuals of the elements and orbital actions are stronger for the younger population compared to the full field distribution.


References
- Armillotta et al. (2018) Armillotta, L., Krumholz, M. R., & Fujimoto, Y. 2018, MNRAS, 481, 5000
- Bedell et al. (2018) Bedell, M., Bean, J. L., Meléndez, J., et al. 2018, ApJ, 865, 68
- Belokurov et al. (2018) Belokurov, V., Erkal, D., Evans, N. W., Koposov, S. E., & Deason, A. J. 2018, MNRAS, 478, 611
- Bensby et al. (2014) Bensby, T., Feltzing, S., & Oey, M. S. 2014, A&A, 562, A71
- Blancato (2018) Blancato. 2018, arXiv e-prints, arXiv:1810.12325
- Bland-Hawthorn et al. (2010) Bland-Hawthorn, J., Krumholz, M. R., & Freeman, K. 2010, ApJ, 713, 166
- Bonaca et al. (2020) Bonaca, A., Conroy, C., Cargile, P. A., et al. 2020, ApJ, 897, L18
- Borucki et al. (2010) Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977
- Bovy (2016) Bovy, J. 2016, ApJ, 817, 49
- Buder et al. (2018) Buder, S., Asplund, M., Duong, L., et al. 2018, MNRAS, arXiv:1804.06041
- Buder et al. (2021a) Buder, S., Lind, K., Ness, M. K., et al. 2021a, arXiv e-prints, arXiv:2109.04059
- Buder et al. (2021b) Buder, S., Sharma, S., Kos, J., et al. 2021b, MNRAS, 506, 150
- Casamiquela et al. (2021) Casamiquela, L., Soubiran, C., Jofré, P., et al. 2021, A&A, 652, A25
- Cheng et al. (2021) Cheng, C. M., Price-Jones, N., & Bovy, J. 2021, MNRAS, 506, 5573
- Das et al. (2020) Das, P., Hawkins, K., & Jofré, P. 2020, MNRAS, 493, 5195
- de Mijolla et al. (2021) de Mijolla, D., Ness, M. K., Viti, S., & Wheeler, A. J. 2021, ApJ, 913, 12
- De Silva et al. (2015) De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604
- Donati et al. (2015) Donati, P., Cocozza, G., Bragaglia, A., et al. 2015, MNRAS, 446, 1411
- Donor et al. (2020) Donor, J., Frinchaboy, P. M., Cunha, K., et al. 2020, AJ, 159, 199
- Espinoza-Rojas et al. (2021) Espinoza-Rojas, F., Chanamé, J., Jofré, P., & Casamiquela, L. 2021, arXiv e-prints, arXiv:2105.01096
- Feeney et al. (2021) Feeney, S. M., Wandelt, B. D., & Ness, M. K. 2021, MNRAS, 501, 3258
- Frankel et al. (2018) Frankel, N., Rix, H.-W., Ting, Y.-S., Ness, M., & Hogg, D. W. 2018, ApJ, 865, 96
- Frankel et al. (2019) Frankel, N., Sanders, J., Rix, H.-W., Ting, Y.-S., & Ness, M. 2019, ApJ, 884, 99
- Freeman & Bland-Hawthorn (2002) Freeman, K., & Bland-Hawthorn, J. 2002, ARA&A, 40, 487
- García Pérez et al. (2015) García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., Shetrone, M., & Mészáros, S. e. a. 2015, ArXiv e-prints, arXiv:1510.07635
- García Pérez et al. (2016) García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144
- Griffith et al. (2021a) Griffith, E., Weinberg, D. H., Johnson, J. A., et al. 2021a, ApJ, 909, 77
- Griffith et al. (2021b) —. 2021b, ApJ, 909, 77
- Hallakoun & Maoz (2021) Hallakoun, N., & Maoz, D. 2021, MNRAS, arXiv:2009.05047
- Hastie et al. (2001) Hastie, T., Tibshirani, R., & Friedman, J. 2001, The Elements of Statistical Learning, Springer Series in Statistics (New York, NY, USA: Springer New York Inc.)
- Hayden et al. (2020) Hayden, M. R., Sharma, S., Bland-Hawthorn, J., et al. 2020, arXiv e-prints, arXiv:2011.13745
- Helmi et al. (2018) Helmi, A., Babusiaux, C., Koppelman, H. H., et al. 2018, Nature, 563, 85
- Holtzman et al. (2018) Holtzman, J. A., Hasselquist, S., Shetrone, M., et al. 2018, AJ, 156, 125
- Horta et al. (2021) Horta, D., Schiavon, R. P., Mackereth, J. T., et al. 2021, MNRAS, 500, 1385
- Ishigaki et al. (2021) Ishigaki, M. N., Hartwig, T., Tarumi, Y., et al. 2021, MNRAS, arXiv:2107.04194
- Janes & Phelps (1994) Janes, K. A., & Phelps, R. L. 1994, AJ, 108, 1773
- Jofré et al. (2019) Jofré, P., Heiter, U., & Soubiran, C. 2019, ARA&A, 57, 571
- Johnson (2019) Johnson, J. A. 2019, Science, 363, 474
- Johnson et al. (2021) Johnson, J. W., Weinberg, D. H., Vincenzo, F., et al. 2021, arXiv e-prints, arXiv:2103.09838
- Kobayashi et al. (2020a) Kobayashi, C., Karakas, A. I., & Lugaro, M. 2020a, ApJ, 900, 179
- Kobayashi et al. (2020b) —. 2020b, ApJ, 900, 179
- Kollmeier et al. (2017) Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, ArXiv e-prints, arXiv:1711.03234
- Kos et al. (2021) Kos, J., Bland-Hawthorn, J., Buder, S., et al. 2021, MNRAS, 506, 4232
- Krumholz (2014) Krumholz, M. R. 2014, Phys. Rep., 539, 49
- Krumholz et al. (2019) Krumholz, M. R., McKee, C. F., & Bland-Hawthorn, J. 2019, ARA&A, 57, 227
- Leung & Bovy (2019) Leung, H. W., & Bovy, J. 2019, arXiv e-prints, arXiv:1902.08634
- Liu et al. (2019) Liu, F., Asplund, M., Yong, D., et al. 2019, arXiv e-prints, arXiv:1902.11008
- Majewski et al. (2017) Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94
- Matteucci (2021) Matteucci, F. 2021, arXiv e-prints, arXiv:2106.13145
- Meibom et al. (2009) Meibom, S., Grundahl, F., Clausen, J. V., et al. 2009, AJ, 137, 5086
- Minchev et al. (2018) Minchev, I., Anders, F., Recio-Blanco, A., et al. 2018, MNRAS, 481, 1645
- Montgomery et al. (1993) Montgomery, K. A., Marschall, L. A., & Janes, K. A. 1993, AJ, 106, 181
- Naidu et al. (2020) Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48
- Naidu et al. (2021) —. 2021, arXiv e-prints, arXiv:2103.03251
- Ness (2018) Ness, M. 2018, PASA, 35, e003
- Ness et al. (2019) Ness, M. K., Johnston, K. V., Blancato, K., et al. 2019, ApJ, 883, 177
- Nissen (2015) Nissen, P. E. 2015, A&A, 579, A52
- Pedregosa et al. (2011) Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, Journal of Machine Learning Research, 12, 2825
- Philcox & Rybizki (2019) Philcox, O. H. E., & Rybizki, J. 2019, ApJ, 887, 9
- Poovelil et al. (2020) Poovelil, V. J., Zasowski, G., Hasselquist, S., et al. 2020, ApJ, 903, 55
- Price-Jones & Bovy (2018) Price-Jones, N., & Bovy, J. 2018, MNRAS, 475, 1410
- Rampalli et al. (2021) Rampalli, R., Ness, M., & Wylie, S. 2021, arXiv e-prints, arXiv:2108.02218
- Ratcliffe et al. (2021) Ratcliffe, B. L., Ness, M. K., Buck, T., et al. 2021, arXiv e-prints, arXiv:2107.08088
- Roškar et al. (2008) Roškar, R., Debattista, V. P., Quinn, T. R., Stinson, G. S., & Wadsley, J. 2008, ApJ, 684, L79
- Rybizki et al. (2017) Rybizki, J., Just, A., & Rix, H.-W. 2017, A&A, 605, A59
- Salaris et al. (2004) Salaris, M., Weiss, A., & Percival, S. M. 2004, A&A, 414, 163
- Sayeed et al. (2021) Sayeed, M., Huber, D., Wheeler, A., & Ness, M. K. 2021, AJ, 161, 170
- Scannapieco et al. (2021) Scannapieco, C., Cescutti, G., & Chiappini, C. 2021, arXiv e-prints, arXiv:2107.05653
- Sellwood & Binney (2002) Sellwood, J. A., & Binney, J. J. 2002, MNRAS, 336, 785
- Sharma et al. (2021) Sharma, S., Hayden, M. R., & Bland-Hawthorn, J. 2021, MNRAS, arXiv:2005.03646
- Souto et al. (2019) Souto, D., Allende Prieto, C., Cunha, K., et al. 2019, arXiv e-prints, arXiv:1902.10199
- Spina et al. (2021) Spina, L., Ting, Y. S., De Silva, G. M., et al. 2021, MNRAS, 503, 3279
- Spitoni et al. (2021) Spitoni, E., Verma, K., Silva Aguirre, V., et al. 2021, A&A, 647, A73
- Sung et al. (1999) Sung, H., Bessell, M. S., Lee, H.-W., Kang, Y. H., & Lee, S.-W. 1999, MNRAS, 310, 982
- Ting & Weinberg (2021) Ting, Y.-S., & Weinberg, D. H. 2021, arXiv e-prints, arXiv:2102.04992
- Weinberg et al. (2018) Weinberg, D. H., Holtzman, J. A., Hasselquist, S., et al. 2018, arXiv e-prints, arXiv:1810.12325
- Weinberg et al. (2019) —. 2019, ApJ, 874, 102
- Weinberg et al. (2021) Weinberg, D. H., Holtzman, J. A., Johnson, J. A., et al. 2021, arXiv e-prints, arXiv:2108.08860
- Wheeler et al. (2021) Wheeler, A. J., Hogg, D. W., & Ness, M. 2021, ApJ, 908, 247
- Woosley & Weaver (1995) Woosley, S. E., & Weaver, T. A. 1995, ApJS, 101, 181
- Yuxi et al. (2021) Yuxi, Lu, Ness, M., Buck, T., & Zinn, J. 2021, arXiv e-prints, arXiv:2102.12003
- Zhao et al. (2012) Zhao, G., Zhao, Y., Chu, Y., Jing, Y., & Deng, L. 2012, arXiv e-prints, arXiv:1206.3569