A post in which some value judgements are made about the situation with wide binaries

A post in which some value judgements are made about the situation with wide binaries

I have tried very hard to remain objective and even handed, but I find that I weary of the wide binary debate. I don’t know what the right answer will turn out to be. But I do have opinions.

For starters, it is a big Galaxy. There is just too much to know. When I wrote about the Milky Way earlier this year, the idea was to set up an expectation value for wide binaries in the solar neighborhood. That devolved into at least eight other posts on the Milky Way itself, because our Galaxy is too damn interesting, and has its own controversies. So it occurs to me that I never really got on with the regularly scheduled program.

In my assessment, the radial acceleration at the solar circle is 2.2 x 10-10 m/s/s, which in terms of the MOND acceleration scale is 1.8 a0. We live on the Newtonian side of the transition to the MOND regime. The ideal place to test MOND with wide binaries would be the deep MOND regime, well below a0. That is in a part of the Galaxy that is far, far away, and not currently accessible to us. What is accessible are wide binaries in the solar neighborhood (within 250 pc, about 1% of the Galaxy’s radius) as mapped by Gaia. Locally, the MOND effect is modest, but nonzero. We’re close enough to the transition for there to be a small, detectable effect.

Local binaries that are widely separated enough for their internal acceleration to drop below a0 find themselves in the regime dominated by the field of the rest of the Galaxy and subject to the so-called External Field Effect (EFE). This situation is illustrated in the lower right panel below.

Mass estimators in different regimes of acceleration. The top row illustrates pure Newtonian (left) and MOND (right) regimes. The bottom row illustrates the case of small systems embedded in larger systems. A low acceleration system embedded in a Newtonian external field is Newtonian (left) while a very low acceleration system embedded in a merely low acceleration system is quasi-Newtonian (right). Wide binaries fall in the last category.

Intriguingly, orbits in the EFE regime remain Keplerian. The rotation curves of nearby binaries, if you could map them, are not expected to be flat in MOND. They should, however, experience enhanced speeds, with a boost to the effective value of Newton’s constant: G → γG. The value of γ depends on the sum of internal and external acceleration as well as the shape of the interpolation function when near a0. That’s one reason to prefer to do this experiment in the deep MOND regime, where the shape of the interpolation function doesn’t matter. But that’s not where we live. For Galactic data and viable possibilities* for the interpolation function, a reasonable expectation value is γ = 1.4 ± 0.1. This is what the wide binary papers attempt to measure. So, what do they find?

Hernandez et al. find γ = 1.0±0.1 for 466 close binaries with 2D separations less than 0.01 pc (about 2000 AU) and γ = 1.5±0.2 for 108 wide binaries with 2D separations greater than 0.01 pc. A purely Newtonian result (γ = 1) is recovered in the high acceleration regime of relatively close binaries where this is expected to be the case. For wider binaries, one finds a boost value consistent with the prediction of MOND and differing from Newton with modest significance (2.6σ).

Chae reports+ γ = 1.49(+0.21/-0.19) for 2,463 “pure” binaries in the low acceleration regime, consistent with his earlier result γ = 1.43±0.06 for 26,615 wide binaries. The larger numbers make the formal error smaller, hence a formally more significant departure from Newton. Many of these binaries are impure in the sense of being triples with one member being itself a close binary as discussed previously, an effect that has to be modeled in large samples. The point of the smaller samples is to select true binaries so that this modeling is unnecessary. For his smaller pure binary sample, Chae finds a smooth transition from γ ≈ 1 at high acceleration (10-8 m/s/s ≈ 100a0) through γ ≈ 1.11 around 7a0 to γ ≈ 1.49 at local Galactic saturation (1.8 a0).

Banik et al. use a slightly different language. Translating, they find γ = 1 at high confidence (16σ)$ from 8,611 wide binaries with separations from 2,000 to 30,000 AU. Newtonian behavior persists at all scales and accelerations; they find no significant deviations from γ = 1 anywhere. Note that despite going out very far, to 30,000 AU, they do not reach especially low accelerations because the EFE of the Galaxy is effectively constant in the solar neighborhood. There is no getting away from the Galaxy’s 1.8 a0. They also do not reach particularly high, purely Newtonian accelerations: 2,000 AU is in the transition regime where MOND effects are perceptible.

Here is Fig. 11 from Banik et al., the key figure Dr. Banik was advocating in his comments to the previous post:

Fig. 11 of Banik et al. shows the median dimensionless characteristic velocity as a function of dimensionless binary separation for several bins of the data (solid lines). The predicted MOND effect increases with separation until saturating in the Galactic field (dashed lines). This is not seen in most of the data, with only a hint in the highest velocity bin that represents only a few percent of the data.

A flat line in this plot indicates no boost in velocity with diminishing acceleration, so one can clearly see the source of the claim that Newton works better than MOND. There is little indication that the velocity increases at wider separations. Effectively, γ ≈ 1 pretty much everywhere.

Both Chae and Hernandez have pointed out that the lack of a constraint on the high acceleration Newtonian regime is problematic. Orbits are Keplerian in the quasi-Newtonian regime, so the behavior looks Newtonian. Lacking an anchor in the high acceleration regime, it is conceivable that the analysis of Banik is detecting the predicted MOND quasi-Newtonian behavior and defining it to be purely Newtonian. It’s just a modest offset in qualitatively similar behavior. In this context, it is worth noting that Chae and Hernandez independently measure γ ≈ 1 at high acceleration as well as γ ≈ 1.5 at low acceleration: they [claim to] detect the difference between these regimes in a way Banik does not probe.

Now let’s look at the plot that gave me the heebie-jeebies with data on it, Banik et al.’s Fig. 12:

The left two panels of Fig. 12 from Banik et al. showing the probability of observing a particular dimensionless velocity in two bins of radial separation: 2,000 to 3,000 AU (top) and 5,000 to 12,000 AU (bottom). The histograms are the Gaia data. The black lines show the Newtonian prediction while the blue lines show that of MOND. These predictions depend on many things besides the underlying theory, sampling over many astrophysical complications like the distribution of stellar masses, orbital orientations to the line of sight, orbital phase, orbital eccentricity, the close binary fraction, and probably other things that I don’t instantly recall.

One can see the basis of the concern. At high acceleration, the prediction of Newton and MOND are identical. The top bin is the closest we get to that, yet there is a clear difference in the predictions. This bin is in the transition region; there is no bin at sufficiently high acceleration for the predictions to align and provide the self-calibration that both Chae and Hernandez independently exploit.

Looking at the data in the top panel, it clearly agrees better with the Newtonian prediction. I can believe that; what concerns me is the lack grounding at still higher acceleration where the black and blue lines should coincide. I do not have a sufficiently clear understanding of all the machinations (and their inevitable foibles) that go into the predicted lines to trust that this constitutes a definitive test.

Looking at the data in the bottom panel, it clearly agrees better with the prediction of MOND. The histogram of the data follows the blue line of MOND more closely than the Newtonian black line. This is so obvious that I wondered if the colors were wrong – maybe there had been some inadvertent switcheroo in the line color in the plotting code. Apparently not, as this point is addressed in the text of Banik et al.: [in the bottom panel,] “MOND performs somewhat better in a handful of pixels around the peak region…” Yes. Yes it does. That’s… a really weird way of putting it. They go on to say “…though given the uncertainties, the Newtonian model is not that far off.” One could just as well say that about MOND in the top panel. Just looking at this figure, one might conclude that they have detected MOND at large separations.

One of the things that gives me the heebie-jeebies about this figure is that there isn’t much difference in the location of the peaks of the distributions. Some, yes, but not much: Newton and MOND apparently predict very nearly the same typical velocity. Yet that is what Fig. 11 traces: the typical (median) normalized velocity. That only tells a tiny bit of the story that is in Fig. 12, and does not appear to be a particularly sensitive indicator of the effect we’re testing for.

Returning to the matter of statistics, the attentive reader might have noted that I have not said much about the number of binaries included in each analysis. These range from a few hundred to many thousands to tens of thousands. More is better, right?

In this case, I think not. There is always a tension between data quality and quantity. Quantity helps with the statistics, but only so long as there is a signal to be dug out. At some point, it becomes a matter of garbage in, garbage out. In this respect, I am inclined to agree with Ernest Rutherford:

If your experiment requires statistics, you ought to have done a better experiment.

Ernest Rutherford$

I suspect that we’re squeezing the stone of statistics too hard here. When we do this, we get the appearance of a signal when really we’re just grinding metal. I am reminded that any time we do a big experiment like this (dark matter searches are a great example), the first thing we learn about are all the false signals we didn’t anticipate. That happens no matter how well we construct the experiment, and I give Banik et al. credit for planning this out ahead of time. That doesn’t guarantee that everything comes out right on the first attempt. There is just so much junk that the universe can and does throw at us that it is easy to imagine that the samples with large numbers of binaries bring with them too much junk (e.g., false binaries). If the fraction of junk is high, then it will look like junk at all scales – there will be no trend with increasing separation even if there is a signal buried in junk.

Consequently, I am at present inclined to trust more the super-clean sample of Hernandez – the high quality binaries where there is a chance that we’re actually measuring what we want to measure. There are only a few hundred such binaries, so the statistical confidence is modest (2.6σ). I worry that in setting the highly restrictive standards necessary to select the best binaries that we might unintentionally omit objects that could change the answer. But at least there is some confidence that these are real binaries that stands above the accumulation of all the gratuitously enormous amount of junk the universe has to throw our way.

I hope the principal scientists can come to agreement about what the data show and not just wind up having the same argument over and over forever more. That’s what usually happens. I ask all parties to remember that it is important to retain the ability to change one’s mind. All of them have demonstrated the ability to do this previously, and somebody will need to do it again.


*In principle, one might also hope to distinguish between specific theories of MOND. Examples of modified gravity like AQUAL and QUMOND give slightly different predictions. To be able to do this seems… optimistic at this point.

In modified inertia theories, the interpolation function is a chimera that depends on each orbital trajectory, so its effective realization may differ between the nearly circular motions of rotation curves and eccentric wide binaries. If so, the interpolation function defined by external galaxies may not be relevant to the problem. Ultimately, we need a theory that automatically results in the MONDian phenomenology in galaxies. Whatever it is wide binaries are doing should help inform this theory development as well as test alternatives and hopefully exclude some of them.

+The work of Chae has gone through some revisions in response to a referee, but the basic findings are unchanged. Having read an earlier version, I appreciate the clarity provided by the additions: this is a case where the refereeing process was beneficial. (I was not the referee of this or any of these papers. Editors keep me busy enough as it is, thank you very much.)

$Outside of long-established observations like the value of Gauss’s constant, there is no such thing as 16σ confidence in astronomy. The tails of real probability distributions are never as tiny as a pure Gaussian. This is a remarkably naive assertion.

Wide binary debate heats up again

Wide binary debate heats up again

One of the most interesting and contentious results concerning MOND this year has been the dynamics of wide binaries. When last I wrote on this topic, way back at the end of August, Chae (2023) and Hernandez (2023) both had new papers finding evidence for MONDian behavior in wide binaries. Since that time, they each have written additional papers on the subject. These independent efforts both report strong evidence for MONDian behavior in wide binaries, so for all of October it seemed like Game Over for conventional* dark matter.

I refrained from writing a post then because I was still waiting to see if there would be a contradictory paper. Now there is. And boy, is it contradictory! Where Hernandez et al. find 2.6σ evidence for non-Newtonian behavior and Chae finds ~5σ evidence for non-Newtonian behavior, both consistent with MOND, Banik et al. find purely Newtonian behavior and claim to exclude MOND at 19σ. That’s pretty high confidence!

Well, which is it, young feller? You got proof of non-Newtonian dynamics, or you want to insist that’s impossible?

After the latest results appeared, a red-hot debate [re]ignited on e-mail, largely along the lines of what was discussed at the conference in St. Andrews. Banik et al say that they can reproduce the MOND-like signal of Chae, but that it goes away when the data quality restriction is applied to physical velocity uncertainties (arguing that this is what you want to know) rather than to raw observational uncertainties. Chae and Hernandez counter that the method Banik et al. apply is not grounded in the Newtonian regime where everyone agrees on what should happen, so they could be calibrating the signal away. This is one thing that I had the impression that everyone had agreed to work on in St. Andrews, but it doesn’t appear that we’re there yet.

Banik et al. do a carefully planned Bayesian analysis. This approach in principle allows one to separate many effects simultaneously, one of which is close binaries (CB**). I look at the impact that close binaries have on the analysis, and it gives me the heebie-jeebies:

One panel from Fig. 10 of Banik et al.

This figure illustrates the probability of measuring a characteristic velocity in MOND for the noted range of projected sky separation. If it is just wide binaries (WB), you get the blue line. If there are some close binaries, the expected distribution changes dramatically. This change is rather larger than the signal expected from the nominal difference in gravity. You can in principle fit for everything simultaneously, but extracting the right small signal when there is a big competing signal can be tricky. Bayesian analyses can help, but they are also a double-sided sledge-hammer: a powerful tool with which to pound the data, but also a tool that can bounce back and smack you in the face. Having done such analyses, and been smacked around a few times (and having seen others get smacked around), looking at this plot really does give me the heebie-jeebies. There are lots of ways in which this can go wrong – or even just overstate the confidence of a correct result.

Everyone uses Bayesian methods these days.***

I expect people are expecting me to comment on this hot mess. Some have already asked me to do so. I really don’t want to. I’ve already said more than I should.

There are very earnest, respectable people doing this work; I don’t think anyone is being intentionally misleading. Somebody must be wrong, but it isn’t my job to sort out who. Moreover, these are long and involved analyses; it will take me time to read all the papers and make sense of them. Maybe once I do, I’ll have something more cogent to say.

I make no promises.


*By conventional dark matter, I mean new particles that only communicate with baryons via gravity.

**CB: In principle, some of the wide binaries detected by Gaia will also be close binaries, in the sense that one of the two widely separated stars is itself not a single star but an unrecognized close binary. We know this happen in nature: the nearest star system, αCentauri, is an example. The main A&B components compose a close binary with Proxima Centauri being widely separated. Modeling how often this happens in the Gaia data gives me the willies.

***To paraphrase Churchill: Many forms of statistics have been tried, and will be tried in this science of sin and woe. No one pretends+ that Bayes is perfect or all-wise. Indeed it has been said that Bayes is the worst form of statistics except for all those other forms that have been tried from time to time.

+Lots of people pretend that Bayes is perfect and all-wise.