Spin Parity of Spiral Galaxies III – Dipole Analysis of the Distribution of SDSS Spirals with 3D Random Walk Simulations
Abstract
Observation has not yet determined whether the distribution of spin vectors of galaxies is truly random. It is unclear whether is there any large-scale symmetry-breaking in the distribution of the vorticity field in the universe. Here, we present a formulation to evaluate the dipole component of the observed spin distribution, whose statistical significance can be calibrated by the expected amplitude for 3D random walk (random flight) simulations.
We apply this formulation to evaluate the dipole component in the distribution of Sloan Digital Sky Survey (SDSS) spirals. Shamir (2017a) published a catalog of spiral galaxies from the SDSS DR8, classifying them with his pattern recognition tool into clockwise and counterclockwise (Z-spiral and S-spirals, respectively). He found significant photometric asymmetry in their distribution. We have confirmed that this sample provides dipole asymmetry up to a level of .
However, we also found that the catalog contains a significant number of multiple entries of the same galaxies. After removing the duplicated entries, the number of samples shrunk considerably to 45%. The actual dipole asymmetry observed for the ’cleaned’ catalog is quite modest, . We conclude that SDSS data alone does not support the presence of a large-scale symmetry-breaking in the spin vector distribution of galaxies in the local universe. The data are compatible with a random distribution.
1 Introduction
For many decades, investigation of the formation and evolution of galaxies has been primary subject of astrophysics. Semi-analytic simulations of structure formation in the CDM model of the universe to reproduce clustering and merging of galaxies provide the current standard picture of galaxy formation (White & Rees, 1978; Steinmetz & Navarro, 2002). Ferreira (2020) suggests that ultralight dark matter may reconcile certain remaining problems which the standard models fail to explain. Recent high-spatial resolution simulations enable tracking of the formation of the spiral structure of individual galaxies (Robertson et al. , 2004; Agertz et al. , 2011; Ceverino et al. , 2017; Shimizu et al. , 2019).
On the other hand, there are other classical models of galaxy formation, such as the primordial whirl scenario (von Weizsaecker, 1955), the pancake shock scenario (Peebles, 1969; Zeldovich, 1970), and the tidal torque scenario (White, 1984). Each provides naive inference regarding the statistical distribution of the spin vectors of galaxies.
If the spin vectors of individual galaxies were produced by splitting of large-scale primordial whirls, there would be some remaining coherent spin alignment parallel to the primordial whirl vectors, producing an observable dipole anisotropy.
If the galaxies were formed mainly at the equatorial planes of primordial collapsing pancakes, spin vectors can mostly be expected to be parallel to the equatorial planes, possibly producing a quadrupole anisotropy seen from the observer located within a cluster of galaxies.
If an individual galaxy started spinning owing to the tidal torque from a galactic cluster mass assembly, the galaxy’s spin vector should initially be perpendicular to the line joining the galaxy and the center of gravity of the cluster mass assembly. Such an initial correlation would, however, be diluted soon by orbital mixing.
Figure 1 illustrates potential anisotropy that might be induced from different galactic formation scenarios (Sugai & Iye, 1995). While the orbital mixing and merging of galaxies could wipe out these initial spin vector anisotropy, if any, observational verification of any symmetry-breaking in the distribution of spin vectors would be a significant evidence.

There were hot debates in the middle of the last century over whether the spiral structure of galaxies winds in a trailing way or in a leading way. It has been generally believed that the spiral structure of galaxies is in general ”trailing” rather than ”leading.” Iye et al. (2019) found corroborative evidence from their survey of 146 spiral galaxies that all spiral structure in these galaxies is indeed ”trailing.” This confirmation provides a basis for us to use the spiral winding direction projected on the sky, either S-wise spirals or Z-wise spirals, to judge the sign of the line-of-sight component of a galaxy’s spin vector. This determines whether the spin vector is pointing away or toward us.
Borchkhadze & Kogoshvili (1976) pointed out the number dominance of ”S”spirals over ”reverse S” spirals, hereafter referred as Z-spirals in the present paper, in the 7,563 galaxies sampled from the Abastumani catalog of Bright Galaxies. MacGillivray & Dodd (1985) found a similar trend and a marginal dependence on the super galactic hemisphere. Iye & Sugai (1991) and Sugai & Iye (1995) used S/Z parity information on spiral galaxies to study the distribution of spin angular momentum vectors in the assembly of galaxies. However the data sets available at that time were not large enough.
The Galaxy Zoo 1 catalog, a morphological classification of SDSS galaxies, was compiled by a public poll. It showed a similar predominance of S spirals as previous studies, but Hayes et al. (2017) interpreted this as being due to human selection bias rather than a human chirality bias or physical reality.
Shamir (2017a) published tables of 82,244 clockwise (Z-spirals) and 80,272 counterclockwise (S-spirals) galaxies by using his Ganalyzer algorithm (Shamir, 2011a, b) for a dataset of 740,908 galaxies classified as spiral galaxies from three million SDSS Data Release 8 galaxies (Kuminski & Shamir, 2016). The statistics show over-dominance of Z-spirals from a random distribution at . Shamir (2017b) reported finding a significant ”photometric anisotropy” such that the mean magnitudes of S/Z spirals behave differently depending on their right ascension with a possible asymmetry axis at () in J2000. This axis corresponds to the direction () near the galactic pole (). Note that Shamir’s catalog shows dominance of Z-spirals instead of the S-spiral dominance reported by previous workers.
In the present paper, we develop a formulation to analyze the dipole anisotropy of S/Z spin distribution of galaxies and apply this formulation to reanalyze the Shamir’s SDSS catalog.
2 Formulation of Dipole Analysis
2.1 Dipole Component of Spin Distribution
Number statistics can be used to study the S/Z number asymmetry from equipartition. The significance level of the number asymmetry is given by
(1) |
Another option is to look for any statistically significant dipole in the spatial distribution of S/Z-spirals.
Let be the spin vector of the -th galaxy with galactic longitude , galactic latitude , and distance . The distance to the galaxy can be approximated for the nearby universe by , where is the speed of light, is the spectroscopic/photometric redshift of the -th galaxy and is the Hubble constant.
Measuring the 3D vector is nontrivial, but one can easily judge the sign of . This can be done by examining the spiral winding sense to see if it is S-wise () or Z-wise (), since all spiral galaxies can be assumed to be trailing (Iye et al. , 2019, : Paper I). The net effect of misidentifying S-spirals vs Z-spirals and improbable presence of leading spiral arms would be reduction of the dipole strength, should it exist. For simplicity, hereafter we assume that all the have an unit length, namely .
By compiling the spin parity, , together with the coordinates of spiral galaxies from image archives such as SDSS, Pan-Starrs1, ESO-DR2, DES, Subaru HSC, and others, we estimate that one can produce a spin parity catalog of up to a million spiral galaxies in the 3D volume within 1 Gpc of the Earth. We are developing an analysis scheme to probe any partial volume within this volume, not necessarily centered on the Earth. This will be discussed in a forthcoming paper. For the moment, however, we consider a local volume centered on the Earth.
To analyze the spin vector distribution in terms of spherical harmonics expansion, one can first examine on its dipole component , quadrupole component and higher components.
If we define a unit vector to a fiducial pole , one can calculate the inner product
(2) |
where the angle is the angle between the direction of the -th galaxy and the direction of the fiducial pole vector . By determining the direction of the vector for which the amplitude takes the largest value, one can derive the dipole vector of the observed distribution.
In fact, can be obtained simply by calculating a vector , which is the vector sum of the unit radial vectors pointing to the direction of the -th galaxy multiplied by the helicity . The inner product of with takes the maximum value when is pointing in parallel to , and then is .
2.2 Perfect Dipole Distribution
We assume an extreme case of perfect dipole segregation, where all S-spirals are in the northern hemisphere and all Z-spirals are in the southern hemisphere as shown in Figure 2(a). The weight factor of Equation 2 shows that the galaxies toward the dipole axis with small add to while those near the equator with reduce . It is easy to see by integration that the expected mean amplitude of all sky sampling is 0.5. Perfect but random dipole distribution produces fluctuation around this expected mean amplitude of 0.5. Monte Carlo simulation shows that the associated standard deviation from this mean amplitude for perfect random dipole distributions is .
2.3 Effect of Non-uniform Sky Coverage for a Perfect Dipole Distribution
Although the currently available data from imaging surveys are growing rapidly, the data do not yet fill the entire sky uniformly. There may be observational biases that affect evaluation of S/Z number asymmetry and/or the evaluation of the observed dipole vector in the S/Z distribution. Let us examine the effect of non-uniform sky coverage for the perfect dipole distribution case mentioned in the previous subsection.
When the sky sampling is limited to a certain narrow cone direction, the dipole amplitude can take any value in the range depending on the direction to the sample. The largest amplitude is obtained when the sampling is toward the pole, or , and the lowest amplitude is observed toward the equator . Therefore depending on the direction of the biased small sky sampling, the resulting can be larger or smaller than the whole sky sampling.
Consider a non-uniform sampling of the complete dipole distribution covering only the northern hemisphere as shown in Figure 2(b). There would be a conspicuous number count asymmetry. However, the dipole strength observed would be equal to that of the whole sky sampling. If the sampling were limited to the eastern hemisphere as shown in Figure 2(c), there would be no number asymmetry. The dipole amplitude, again, would be equal to that of full sky sampling.
The number asymmetry and the dipole amplitude thus provide complementary information to analyze large scale symmetry-breaking in the spin distribution of galaxy ensembles.

2.4 Random Flight Simulation for an Isotropic Distribution
For a uniformly randomly distributed set of vectors with , the resultant vector sum will have an isotropic distribution with non-zero amplitude, unless the summation has incidentally completely canceled .
As can be calculated by , our problem is equivalent to a well-studied mathematical problem, 3D random walk (random flight). Namely, it is equivalent to find the distribution of distance to the final point vector of a particle that starts from the origin and takes steps of a 3D random walk. One can evaluate the mean amplitude of and the standard deviation around the mean amplitude as a function of .
Chandrasekhar (1943) established that the probability density function of the final displacement vector of random flights for a large will be a 3D Gaussian distribution.
(3) |
where is the expected mean square displacement, which is in our case .
The distribution of , therefore, follows the chi-squared distribution for three degrees of freedom. The distribution of , hence, follows the chi distribution111Weisstein, Eric W. ”Chi Distribution.” From MathWorld–A Wolfram Web Resource. https://mathworld.wolfram.com/ChiDistribution.html, a square root of chi- squared distribution. The expected mean amplitude is
(4) |
The associated standard deviation from this expected mean distance is given by
(5) | |||||
We can use these formulae to evaluate the statistical significance of the observed spin dipole strength in any ensemble of spiral galaxies using
(6) |
2.5 Detectability of Dipole Component
Consider an ensemble of galaxies where a perfect dipole system and a uniform random system are mixed with fractions and , respectively. The observed dipole vector would be, on average, a vector sum of with an amplitude and a random vector with an amplitude in an arbitrary direction. To discern the intrinsic dipole from the random dipole with a statistical significance of -sigma, the following relation is required
(7) |
This implies
(8) |
The Equation 8 indicates that between one hundred thousand or one million spirals are necessary to detect a 3%, or 1% residual inherent dipole system at confidence level, respectively.
Real data are not always obtained uniformly. The quantitative discussion of the non-uniform sampling of a random distribution in the general case is not straightforward. We examine, however, the actual S/Z data of SDSS spirals in the next section and compare the observed dipole amplitude with those expected from simulated random distributions.
3 Application to the S/Z Distribution of SDSS Spiral Galaxies
3.1 Reanalysis of Shamir’s Spin Catalog
To apply our formulation of dipole anisotropy to real data, we studied two samples using Shamir’s catalog based on SDSS photometric data (Shamir, 2017a)222https://data-portal.hpc.swin.edu.au/dataset/data-for-galaxy-assymetry-experiment. The first sample retains all 162,516 spirals from the original catalog. The original sample shows an S/Z dipole signal of , with its axis pointing toward . This axis coincides with that reported in Shamir (2020a), (), which is () for an SDSS sample, considering the estimation error of about in both coordinates.
For calibration, we made 50,000 independent Monte Carlo simulations by assigning randomly to the 162,516 spirals and measured the simulation’s . The resulting shows an isotropic distribution with an ensemble mean amplitude of and an associated amplitude standard deviation of , as shown in Table 1. The observed from the original catalog, therefore, has an amplitude, that is larger than the mean amplitude by . The number dominance of Z-spirals over S-spirals here is .
The second sample we studied is a volume-limited sample retaining 111,867 spirals with measured redshift in the range . To avoid any possible effect of local peculiar motions, 162 nearby spirals at were removed from this volume-limited sample. This sample shows a stronger S/Z dipole signal with its axis pointing toward .
The value observed for this sample, together with that of 50,000 simulated mock samples is shown in Figure 3. The observed from the original catalog limited to a volume defined by redshift, therefore, has an amplitude, that is larger than the mean amplitude by .This amplitude happens to be close to the value reported in Shamir (2020a). The number dominance for this sample is .


3.2 Analysis of New Cleaned Catalog
Upon re-examining Shamir’s original catalog, we found significant duplication of entries in its tables. Apparently, Shamir (2017a) used PhotoObjAll to search the SDSS catalog. According to the Table Description of DR8333http://skyserver.sdss.org/dr8/en/help/docs/tabledesc.asp, the view of PhotoPrimary, instead of PhotoObjAll, should be used to avoid duplicate detections. PhotoObjAll returns every entry from the searched images without checking for duplication of the same object.
In fact, 34,198 spiral galaxies were found to have multiple entries with their coordinates within 3 arcsec distance of each another. For instance, one of the galaxies in his catalog, SDSS J031945.63-000437.9 (objid=1237666300555427954) at , has 89 separate entries. We confirmed by visual inspection that all of them are the same galaxy.
A total of 106 spirals had contradicting S/Z classification among the duplicated entries. We rectified these contradictions by assigning a spin parity via visual inspection in some cases. In other cases, where the voting was significantly split, we adhered to the outcome of majority voting.
After removing the duplicated entries, the number of spiral galaxies is reduced to 72,888, that is of the original number. The number of spirals in the volume-limited sample range is 48,089 (23,819 S-spirals and 24,270 Z-spirals), 43% of the 111,867 counted in the original catalog. The measured amplitudes for these ’cleaned’ samples with and without the volume limitation are summarized in Table 1.
The observed dipole for the entire cleaned sample set has with its axis pointing toward . 50,000 random mock simulations for the cleaned sample shows a mean amplitude and a standard deviation . Therefore, the observed dipole deviates from the expected mean strength only at a level of . Number dominance of Z-spirals over S-spirals is also only at the level.
The observed dipole for the cleaned sample limited to the redshift range , is with its axis pointing toward . Results of 50,000 random mock simulations for the cleaned sample shows a mean amplitude and a standard deviation , as shown in Figure 4.
Therefore, the observed dipole deviates from the expected mean strength only at the level. Column (13) of Table 1, obtained from Equation 6 for uniform sampling is not necessarily correct for non-uniform sampling. However, the fact that Column (13) shows values close to the values in column (12), obtained from actual sampling, suggests their relevance. Finally, the number dominance of Z-spirals over S-spirals is also only at the level.
Comparison of Figures 3 and 4 clearly shows that the apparent pseudo dipole signal observed in Figure 3 came from massively duplicated data in the original catalog.
As a final remark, Longo (2011) made a similar analysis of his sample of 15,158 spirals with from SDSS DR6. He used the terms left-handed (S-spiral in the present paper) and right-handed (Z-spiral). He found a dipole asymmetry by plotting the distribution of . The dipole strength under his definition was , found with an axis pointing at at . The relation of his study to the current work has yet to be investigated.
(1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | (10) | (11) | (12) | (13) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Sample | redshift range | N | N(S) | N(Z) | stddev | |||||||
Original | All | 162,516 | 80,272 | 82,244 | 4.89 | 0.00489 | 189 | +15 | 0.00225 | 0.00105 | 2.52 | 2.70 |
Original | 111,867 | 54,870 | 56,997 | 6.36 | 0.00773 | 138 | -38 | 0.00271 | 0.00126 | 4.00 | 4.28 | |
Cleaned | All | 72,888 | 36,191 | 36,697 | 1.87 | 0.00535 | 192 | +79 | 0.00336 | 0.00154 | 1.29 | 1.34 |
Cleaned | 48,089 | 23,819 | 24,270 | 2.06 | 0.00468 | 106 | +57 | 0.00414 | 0.00188 | 0.29 | 0.27 |
4 Conclusion
We present a formulation to quantify the observed dipole amplitude of the S/Z spin parity distribution of spirals in an ensemble of galaxies. We show that the statistical significance of this quantity can be calibrated with that expected from 3D random flight simulations.
Notably, we found that the S/Z spin catalog published by Shamir (2017a) contains a significant amount of duplicated data, which at least partly caused the increased level of dipole asymmetry that we observed. After removing the duplicated entries from the catalog, we found that the distribution is compatible with random distribution. We conclude that the SDSS sample of spiral galaxies does not show large scale anisotropy in the spin distribution of galaxies.
Recently, Shamir (2020b) studied another dataset based on spectroscopic sample to detect a significant dipole signal. Previously Shamir (2012) also presented that a dataset based on SDSS SpecObj shows a significant dipole signal, while the direction of the axis is different. The different results from the different datasets will be investigated elsewhere.
The current authors are compiling a large coherent dataset of spiral winding evaluation as derived from several modern large image datasets, including the SDSS (Aguado et al. , 2018), the Panoramic Survey Telescope and Rapid Response System (Pan-Starrs1: Chambers et al. , 2016), the Hyper Suprime-Cam (HSC: Miyazaki et al. , 2018) of Subaru Telescope, and the Dark Energy Survey (Abbott et al. , 2018) by deep learning algorithm. Tadaki et al. (2020) developed a deep learning algorithm to judge S/Z winding of 76,635 spiral galaxies from the Hyper Suprime-Cam Subaru Strategic Program Data Release 2 dataset (HSC SSP DR2: Aihara et al. , 2019) sample, with further objective to generate a galactic spin data catalog.
References
- Abbott et al. (2018) Abbott, T. M. C. Abdalla, F. B., Allam, S. et al. , 2018, ApJS, 239, 18
- Agertz et al. (2011) Agertz, O., Teyssier, R. & Moore, B. 2011, MNRAS, 410, 1391.
- Aguado et al. (2018) Aguado, D. S., Aumada, R. Almeida, A.et al. , 2018, ApJS, 240, 23
- Aihara et al. (2019) Aihara, H.,, AlSayyad,Y., Ando,M. et al. 2019, PASJ, 71, 114
- Borchkhadze & Kogoshvili (1976) Borchkhadze, T. M. & Kogoshvili, N. G. 1976, A&A, 53, 431
- Ceverino et al. (2017) Ceverino, D., Primack, J., Dekel, A. et al. , 2017, MNRAS, 467, 2664
- Chambers et al. (2016) Chambers, K. C., Magnier, E. A., Metcalfe, N. et al. 2016, arXiv:1612.05560
- Chandrasekhar (1943) Chandrasekhar, S. 1943, Rev. Modern Physics,15, 1
- Ferreira (2020) Ferreira,E.G.M., 2020, arXiv:2005.03254v1
- Hayes et al. (2017) Hayes, W. B., Davis, D., & Silva, P. 2017, MNRAS, 466, 3928
- Iye & Sugai (1991) Iye, M. and Sugai, H. 1991, ApJ, 374, 112
- Iye et al. (2019) Iye, M., Tadaki, K. and Fukumoto, H. 2019, ApJ, 886, 133
- Kuminski & Shamir (2016) Kuminski, E. & Shamir, L. 2016, ApJS, 223, 20
- Longo (2011) Longo, M. 2011, Physics Letter B, 669, 224
- MacGillivray & Dodd (1985) MacGillivray, T. M. & Dodd, R. J. 1985, A&A, 145, 269
- Miyazaki et al. (2018) Miyazaki, S., Komiyama, Y., Kawanomoto, S. et al. 2018, PASJ, 70, 1
- Peebles (1969) Peebles, P. J. E., 1969, ApJ, 155, 393
- Robertson et al. (2004) Robertson, B., Yoshida, N., Springel, V. et al. 2004, ApJ, 606, 32
- Shamir (2011a) Shamir, L. 2011a, The Astrophysics Space Code Library, p.ascl:1105.011
- Shamir (2011b) Shamir, L. 2011b, ApJ, 736, 141
- Shamir (2012) Shamir, L. 2012, Phys. Let. B, 715, 25
- Shamir (2017a) Shamir, L. 2017a, PASA, 34, e011
- Shamir (2017b) Shamir, L. 2017b, PASA, 34, e044
- Shamir (2020a) Shamir, L. 2020a, Astron. N., 341, 324
- Shamir (2020b) Shamir, L. 2020b, Ap&SS, 365, 136
- Shimizu et al. (2019) Shimizu I., Todoroki, K., Yajima, H. and Nagamine, K., 2019, MNRAS, 484, 2632-2655
- Steinmetz & Navarro (2002) Steinmetz, M. & Navarro, J. F. 2002, New A, 7,155.
- Sugai & Iye (1995) Sugai, H. & Iye, M. 1995, MNRAS, 276, 327
- Tadaki et al. (2020) Tadaki, K., Iye, M., Fukumoto, H., Hayashi, M., Rusu, C. E., Shimakawa, R. and Tosaki T. 2020, MNRAS, xxx, yyy
- von Weizsaecker (1955) von Weizsaecker, C. F., 1955, Zeitshrift fur Astronomie, 35, 252
- White & Rees (1978) White, S. & Rees, M. 1978, MNRAS, 183, 341.
- White (1984) White, S., 1984, ApJ, 286, 38
- Zeldovich (1970) Zeldovich, Y. B., 1970, A&A. 5, 84