Did COVID-19 come from a laboratory?

2.1k

Ṁ4.3m

2040

43%

chance

Rootclaim debate released

-13.0%

ACX article published https://www.astralcodexten.com/p/practically-a-book-review-rootclaim

-12.0%

This market resolves once we have a definitive answer to this question. (i.e. "I've looked at all notable evidence presented by both sides and have upwards of 98% confidence that a certain conclusion is correct, and it doesn't seem likely that any further relevant evidence will be forthcoming any time soon.")

This will likely not occur until many years after Covid is no longer a subject of active political contention, motivations for various actors to distort or hide inconvenient evidence have died down, and a scientific consensus has emerged on the subject. For exactly when it will resolve, see /IsaacKing/when-will-the-covid-lab-leak-market

I will be conferring with the community extensively before resolving this market, to ensure I haven't missed anything and am not being overconfident in one direction or another. As some additional assurance, see /IsaacKing/will-my-resolution-of-the-covid19-l

(For comparison, the level of evidence in favor of anthropogenic climate change would be sufficient, despite the existence of a few doubts here and there.)

If we never reach a point where I can safely be that confident either way, it'll remain open indefinitely. (And Manifold lends you your mana back after a few months, so this doesn't negatively impact you.)

"Come from a laboratory" includes both an accidental lab leak and an intentional release. It also counts if COVID was found in the wild, taken to a lab for study, and then escaped from that lab without any modification. It just needs to have actually been "in the lab" in a meaningful way. A lab worker who was out collecting samples and got contaminated in the wild doesn't count, but it does count if they got contaminated later from a sample that was supposed to be safely contained.

In the event of multiple progenitors, this market resolves YES only if the lab leak was plausibly responsible for the worldwide pandemic. It won't count if the pandemic primarily came from natural sources and then there was also a lab leak that only infected a few people.

I won't bet in this market.

#COVID origins

#COVID

#Past Events

40 Comments

Dr. Tom Inglesby, Director, Johns Hopkins Center for Health Security, Bloomberg School of Public Health:

"It remains a priority to study the origins of COVID. While the evidence for accidental origin of COVID is not definitive, there is a substantial body of strong circumstantial evidence. US intelligence agencies remain divided. The WHO Director has said publicly there is not enough information to conclude one way or another, and the WHO SAGO committee just concluded its recent report saying it did not have sufficient evidence to definitively make judgements about COVID, and it is seeking additional clinical, epidemiologic and laboratory information. Dr Kadlec’s paper includes data and references to reports around biosafety and laboratory practices that are not referenced by SAGO, and so represents a new source of information, references and reports."

https://archive.ph/g2Ymu

@MikePa67d It's odd you can't find any quotes endorsing Kadlec's report's conclusion that SARS-CoV-2 was clandestine mind control virus engineering research gone wrong.

It's almost as if everyone knows it's nuts even if they don't know just how divorced Kadlec's report is from reality on the facts.

bought Ṁ100 YES at 43%

Since @MikePa67d is quoting random reviews of Kadlec's report from people who hopefully haven't actually read it and aren't aware of all of its errors, here's an interesting comparison.

First Muddy Waters report:

The RGD motif is not found in strains identified as RaTG-13 nor GD Pangolin. It is, however, found in a number of other SARS-related coronaviruses, including Rco319, BANAL-52, BANAL-236, RshSTT182, Rs7924, RmYN08, RsYN04, and several others. Therefore, this motif is not an unusual feature unique to SARS-CoV-2, unlike the furin cleavage site. Its clinical implications are significant.

From the latest report (Muddy Waters Version 2, part B, if you're following along):

The SARS-CoV-2 and Pangolin-GD strain also share an integrin-binding protein [sic] at the distal end of the RBD [the "RGD motif"]. This sequence is novel and was not identified in a previous SARS-related virus or coronavirus prior to the pandemic.

So, in the first report, Kadlec made a point about how the sequence wasn't unusual, although it might have clinical significance. He was right about the first part, at least. It's not unusual at all.

But now it's suspiciously rare because Kadlec has decided to add an unhinged story about Chinese research into mind control viruses that went wrong. In reality, this "RGD" sequence motif has been found in SARS2-like coronaviruses in bats in China, Vietnam, Laos, Cambodia, Japan, and Great Britain.

Dr. Roger Brent: So, while the report falls short of being a case for the prosecution, it continues to fill out a picture of a city engaged in active research efforts on coronaviruses and nipahviruses -- surely by 2020 the largest concentration of such virological R&D in the world-- with research animals and samples being schlepped across town among four different sites. A city in which, in January 2020, workers at one of those sites, a genomics facility at Huazhong Agricultural University uploaded to NCBI a reassemble-able genome of BAC vector into which had been inserted an entire, hitherto unknown, still unacknowledged, MERS-like coronavirus, complete with T7 promoter and Hepatitis D ribozyme and polyA, and so good to go to further engineer or to recover live virus. Presumably, accidentally sequenced, as a contaminant in their uploaded rice genomic datasets (6).

https://www.linkedin.com/pulse/muddy-waters-2-out-roger-brent-sglzc?utm_source=share&utm_medium=member_android&utm_campaign=share_via

Andrew Weber, a former assistant defense secretary in the Obama administration now at the Council on Strategic Risks, said that Kadlec’s report adds to the “scientific and documentary evidence for an accidental release” of the virus, and highlights the need for the U.S. and other countries to implement “stringent safety measures and limit risky biological research.” https://share.google/gypq49gZo2Rzjcc9i

A claim occasionally pops up in lab leak literature that the SARS-CoV-2 spike receptor binding domain is suspiciously well suited for human infections and poorly suited for binding its receptor, ACE2, in bats. A new paper tested this and found that it’s not true.

The results showed that the binding of SARS-CoV-2 S to hACE2 or Rp-bACE2 was significantly above that of BANAL-52 and BANAL-103 S, and the binding of BANAL-52 S to the receptor was the lowest, especially to Rp-bACE2

https://journals.asm.org/doi/10.1128/jvi.01007-25

In other words, the SARS-CoV-2 receptor binding domain works fine against bat and human receptors. Exactly what you expect for the virus that happens to have spilled over somewhat recently. Not what you expect from extensive engineering experiments or something else to specifically develop a human targeted virus.

This isn’t a strawman argument — it was the core lab leak theory until Fall 2021 that the SARS2 receptor binding domain was suspiciously different from those found in bats. The furin cleavage site was once far less important to lab leak theorists… until it was the only thing left.

https://www.city-journal.org/article/robert-kadlec-covid-19-pandemic-report-bioweapons

Say what you want about Judith Miller, but she's got her sources in the conservative side of the intelligence community. She's convinced by Robert Kadlec's wildly erroneous report concluding that SARS-CoV-2 was an accident in a Chinese program to develop a mind control bioweapon. Does that mean it's impossible to find anyone with a less wacky story to tell?

Putting aside how Kadlec's report is fundamentally nuts, here's another specific error. From the same page as the one in the comment below.

Kadlec imagines that secret work was carried out by a Beijing-based lab in Wuhan. Kadlec explains that his rationale is a suspicious failure to say where work was done in some papers:

However, and perhaps significantly, General Zhou’s AMMS team did not identify where they conducted these animal vaccine challenge studies (with humanized mice and NHPs) as other vaccine study groups have.

Here, he's referencing two papers. The location of the work is described in the supplementary information for both papers. It's Beijing, not Wuhan:

All experiments involving infectious of SARS-CoV-2 were performed under Biosafety Level 3 facilities in AMMS.

and...

All experiments involving infectious SARS-CoV-2 were performed in biosafety level 3 (BSL-3) containment laboratory in Beijing Institute of Microbiology and Epidemiology, AMMS.

FYI, Kadlec has been nominated as Assistant Secretary of Defense for Nuclear Deterrence, Chemical, & Biological Defense Policy & Programs 🤯

Folks here who are too young to remember 2001-2003 vividly might want to go revisit what happens to the intel community in the USA when the President declares what the answer is and then everyone is tasked with figuring out how to prove it.

bought Ṁ100 YES from 44% to 45%

FYI for anyone who took the Vanity Fair and ProPublica reporting seriously (e.g. COVID-19 Origins: Investigating a “Complex and Grave Situation” Inside a Wuhan Lab), Robert Kadlec's report linked by @George below shows that it was based on faulty information. Specifically this part of the Vanity Fair reporting:

We analyzed WIV documents, consulted with experts in CCP communications, asked biocontainment experts to help analyze documents, and reviewed with independent scientists the possible evidence that certain vaccine research may have begun far earlier than acknowledged.

Kadlec's report assumes that the vaccination schedule for that research would've required 40 days after a booster dose to collect enough data for a February 24, 2020 patent application. E.g. with this figure:

This is wrong. The data in the patent application only includes samples from 12 days after a single dose.

In other words, this moves the timeline up several weeks from what Kadlec and his investigators assumed. It's no longer suspicious. Case closed.

Edit: There's TONS of stuff in Kadlec's report that is far, far worse than this, but this is the one thing that people seemed to have taken seriously and still do. Now we learn that it was always bogus.
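For anyone who wants to check the timeline arithmetic above, here is a minimal sketch in Python. It uses only the three numbers given in the comment (the February 24, 2020 filing date, the assumed 40 days after a booster, and the 12 days after a single dose). It assumes, as an upper bound, that samples could have been collected as late as the filing day itself. The interval between a first dose and a booster is not stated here, so it is left out rather than guessed.

```python
from datetime import date, timedelta

# Figures taken from the comment above.
FILING_DATE = date(2020, 2, 24)        # patent application date
DAYS_AFTER_BOOSTER_ASSUMED = 40        # what the report assumed was needed
DAYS_AFTER_SINGLE_DOSE_IN_PATENT = 12  # what the patent data actually covers

# Assumption (upper bound): sampling could run right up to the filing day.
latest_booster_if_40_days = FILING_DATE - timedelta(days=DAYS_AFTER_BOOSTER_ASSUMED)
latest_dose_if_12_days = FILING_DATE - timedelta(days=DAYS_AFTER_SINGLE_DOSE_IN_PATENT)

# Minimum shift in the latest possible dosing date between the two readings.
# The real shift is larger by whatever prime-to-booster interval was assumed,
# which is not given here.
minimum_shift = latest_dose_if_12_days - latest_booster_if_40_days

print("Latest booster date under the 40-day assumption:", latest_booster_if_40_days)
print("Latest single-dose date under the 12-day reading:", latest_dose_if_12_days)
print("Minimum shift in days:", minimum_shift.days)
```

Under these assumptions the latest dosing date moves from mid-January to mid-February 2020, a shift of at least four weeks, which matches the "several weeks" described above.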

Scowcroft Institute Report Examines COVID-19 Brain Effects And Origins

Texas A&M research institute releases final installment of study highlighting the pandemic’s neurological impact and raising concerns about Chinese military research on coronaviruses.

The Scowcroft Institute of International Affairs at Texas A&M University has released the second and final installment of a major report completed by Dr. Robert Kadlec in 2024. The report, A Critical Review of COVID-19 Origins: “Hidden in Plain Sight,” examines the evidence on how the COVID-19 pandemic emerged and the disease’s impact on the brain.

The final installment of Kadlec’s report reaches three overarching conclusions:

Evidence suggests that the pandemic began due to a virus escaping a laboratory rather than natural spillover from infected animals. This finding was included in the first installment of the report, published in November 2024.
COVID-19 infection often has major short- and long-term effects on the brain, even in children and people with mild cases. Efforts to treat and prevent these effects should be urgent priorities.
The Chinese military may have been researching a vaccine to protect against the effects of COVID-19 prior to the pandemic. This and other Chinese military research on coronaviruses threaten U.S. national security and raise concerns about Chinese compliance with international arms control treaties.

https://bush.tamu.edu/news/scowcroft/scowcroft-institute-report-examines-covid-19-brain-effects-and-origins/

@George LOL at setting up a proton mail account because the Dean over there is tired of fielding emails about Kadlec's inane reports.

It's also funny he kept in the HVAC renovation data point after he corrected it from a $55.1M to a $550k contract. It was half a billion bucks in the original Muddy Waters report!

To say the least, the guy who has a whole team in the basement of a Senate office building including one guy who said he spoke secret Chinese, but no one in his brain trust knows what 万 means or that air conditioners don't, you know, cost half a billion dollars... that guy isn't going to crack the case now.

Eddie Holmes predicted exactly what trajectory this would take long ago:

To assign the origin of SARS-CoV-2 to the Wuhan Institute of Virology requires a set of increasingly implausible “what if?” scenarios. These eventually lead to preposterous suggestions of clandestine bioweapon research.

And now, here we are:

The report further suggests the possibility of offensive biological weapons (BW) research occurring in China with link to the origins of SARS-CoV-2. This is the report’s most provocative finding and one worth taking seriously

Come bet in my market! We are trying a new market structure that has the advantage of resolving sooner while incentivizing truthful predictions and avoiding whale manipulation.

The Covid ‘lab leak’ theory isn’t just a rightwing conspiracy – pretending that’s the case is bad for science

Jane Qiu.

https://www.theguardian.com/commentisfree/2025/jun/25/covid-lab-leak-theory-right-conspiracy-science?utm_source=substack&utm_medium=email

hold the line YES patriots

bought Ṁ750 NO at 46%

bought Ṁ500 YES from 46% to 47%

Dr Jane Qiu has gone from being very dismissive of any lab leak scenario to now penning a piece in the Guardian saying it's not a conspiracy theory and blaming the likes of Peter Daszak for damaging trust in science. Quite a turnaround. https://www.theguardian.com/commentisfree/2025/jun/25/covid-lab-leak-theory-right-conspiracy-science

bought Ṁ100 YES from 46% to 47%

@MikePa67d Until the other day, Jane Qiu was part of the conspiracy theory in lab leak world. Now that she and Peter Daszak have had some sort of falling out over a movie, we get an odd opinion piece that adds nothing to the debate and absolutely does not say that Jane Qiu thinks lab leak is likely.

But, given that there's no actual evidence to support any "lab leak" theory, the lab leak hive mind is jumping on this as vindication:

Same dude, back when Qiu was part of the global coverup conspiracy:

In the real world, this shows that there is no global coverup of "lab leak" being a likely origin. For people who thought that was somehow plausible a week ago, your inferred likelihood of lab leak should drop. For the rest of us, it's a boring spat between a journalist, a scientist, and a filmmaker spilling over in public.

That kind of spillover is a bit more common than the ones that cause pandemics.

bought Ṁ50 YES at 47%

@zcoli Aren't you the guy who gets paid to harass people who disagree with you by emailing their employers?

bought Ṁ50 YES at 47%

@Marion8w2 I sent a message (not by email) to the organization that has a book listed under "our work" on its website that says a paper I co-authored might be fraudulent, but doesn't explain why -- Bratlie has two employers and I didn't mail the irrelevant one. I asked if someone could explain what the issue is so that I could answer it; apparently and unsurprisingly, Bratlie can't back up what she writes and put on some performance on X claiming it was censorship to ask a question. If an organization employing someone says that a book is "our work" and the topic of the book is on the same topic as the work they do for the organization, I assume it's work for hire.

Ironically, the relevant part of her book is a call to remove articles, including one I co-authored, from the scientific literature.

The last thing I heard from her was posting an email of mine to X, including my email address, but cropping out the bottom of the email. Here it is -- the question she's so incapable of answering that she doesn't want her audience on X to know that it was asked.

The rest of the email can be seen here -- https://xcancel.com/sigridbratlie/status/1932786213762564129#m

Part of being a scientist is responding to criticism and that's what I tried to do there. This will be the end of responding to anonymous trolls coming over from X. I recommend taking it with a grain of salt when conspiracy theorists claim to be victims of censorship. Inevitably, claimed "censorship" is criticism that they can't respond to.

@zcoli she says a bit more than that. She appears to be calling out your co-authors like Edward Holmes and co. She also clarified her dispute with Daszak was over his continued denial of any conflicts of interest and about the nature of the research undertaken with WIV.

"Some scientists assert evidence supporting natural-origins hypotheses with excessive confidence and show little tolerance for dissenting views. They have appeared eager to shut down the debate, repeatedly and since early 2020. For instance, when their work was published in the journal Science in 2022, they proclaimed the case closed and lab-leak theories dead. Even researchers leaning towards natural origins theories, such as the virus ecologist Vincent Munster of Rocky Mountains Laboratories in Hamilton, Montana, told me they lamented that some of their colleagues defend their theories “like a religion”."

@MikePa67d Who cares?

As far as Qiu's example of excessive hubris goes, this from Holmes' article has held up quite well:

To assign the origin of SARS-CoV-2 to the Wuhan Institute of Virology requires a set of increasingly implausible “what if?” scenarios. These eventually lead to preposterous suggestions of clandestine bioweapon research.
The lab leak theory stands as an unfalsifiable allegation. If an investigation of the lab found no evidence of a leak, the scientists involved would simply be accused of hiding the relevant material. If not a conspiracy theory, it’s a theory requiring a conspiracy.

Qiu uses Filippa Lentzos as a counterexample, one of "these scholars [that] have lent scientific legitimacy to the debate." Here Lentzos is about a year after Holmes' article joining in evidence-free speculation that... clandestine bioweapon research sparked the pandemic.

I think I'm gonna score this one for Holmes.

https://www.who.int/news/item/27-06-2025-who-scientific-advisory-group-issues-report-on-origins-of-covid-19

The WHO Scientific Advisory Group for the Origins of Novel Pathogens (SAGO), a panel of 27 independent, international, multidisciplinary experts, today published its report on the origins of SARS-CoV-2, the virus responsible for the COVID-19 pandemic.

SAGO has advanced the understanding of the origins of COVID-19, but as they say in their report, much of the information needed to evaluate fully all hypotheses has not been provided.

“I thank each of the 27 members of SAGO for dedicating their time and expertise to this very important scientific undertaking over more than three years,” said Dr Tedros Adhanom Ghebreyesus, WHO Director-General. “As things stand, all hypotheses must remain on the table, including zoonotic spillover and lab leak. We continue to appeal to China and any other country that has information about the origins of COVID-19 to share that information openly, in the interests of protecting the world from future pandemics.”

@George The report found that there's the same level of support for an engineering origin of SARS-CoV-2 that there is for other Intelligent Design origin theories.

The report describes the consensus best supported theory by scientists:

While available data support that the HSM played a significant role in early transmission and amplification, it is not conclusive that the HSM was where the virus first spilled over into the human population, or if it occurred through upstream infected humans or animals at the market.

The report then talks about additional evidence that could possibly be collected that could support this theory further or support something else. The paper on market environmental samples from Crits-Christoph et al on this subject says the same thing:

Any hypothesis of COVID-19’s emergence has to explain how the virus arrived at one of only four documented live wildlife markets in a city of Wuhan’s size at a time when so few humans were infected [3]. Human introductions linked to the animal trade offer one explanation for this, and the introduction of the virus by an animal trader or farmer cannot be excluded, but these hypotheses are challenged by phylodynamic evidence for multiple spillovers [11].

When it comes to lab leak on the other hand, there is no specificity at all about how the evidence demanded could test any lab leak theory. This would be impossible, because there's no falsifiable lab leak theory presented in the report. The report can't even settle on which lab to investigate. The requested data is basically all biosafety data and occupational health data for two large organizations, plus access to open-ended interviews of everyone there. By definition everything that's requested couldn't falsify lab leak theories because the underlying assumption is that everyone who might spill the beans now has been lying for over five years as part of a perfect coverup. A failure to find lab leak evidence would be rejected by anyone who finds it plausible now that there was a lab leak and a massive cover up to suppress evidence of it.

It's telling that the access requested to vaguely investigate lab leak isn't requested for investigating wildlife origins -- because there's just no need to look for what evidence might exist were it not covered up; the evidence that's not covered up is strong enough.

But you can tell from SAGO wasting time humoring the "MA-30" theory that someone susceptible to lab leak nonsense has influence in the report. Plausibly that might be the person who spoke on behalf of a report that concluded one scenario out of four was the only one with supporting data, yet decided to lead with your "all hypotheses must remain on the table."

bought Ṁ50 NO

Can anyone point to a scientific manuscript that explains the available data and finds it more likely than not using any quantitative method that the COVID-19 pandemic originated in a lab?

It doesn't need to be peer reviewed -- anything will do. What is the best example you know of?

@zcoli Sure. I'm sorry it's a bit long but for a contentious issue with a variety of relevant evidence that's needed. I've gone over it with some very serious stats and virology folks, and its >10k readers include some intense zoonosis types, who have helped by finding some errors, now fixed. It's been stable for many months now.
https://michaelweissman.substack.com/p/an-inconvenient-probability-v57

It's got very extensive references.
Several other much shorter blogs on my substack deal with narrower parts of the question: the gross math errors in Pekar 2022, the improper Bayes methods used by Scott Alexander, etc.

@MichaelWeissman You’re pretty critical of people on one side of the issue yet cite Jesse Bloom’s deleted sequences paper extensively. What’s up with that? Omitting critical data in a paper seems a bit worse than anything anyone said on Slack that you quote.

https://academic.oup.com/mbe/article/42/6/msaf109/8158640

I don’t know how many qualified people you talked to, but I think it’d be worth taking the time to look at the primary data yourself for some of the many inaccurate things here you’re getting from others. For one example, the discussion of D614G. This didn’t quickly dominate because it happened many times — it quickly dominated because it was followed up by another important mutation to make B.1 and then another one to make B.1.1 — lineage A without D614G was more prevalent than lineage A with D614G until about April 2021.

For another example, you cite a plasmid encoding spike using CGG. It makes sense to use human codons for expressing a protein from a plasmid in human cells. It makes sense to use human coronavirus codons for engineering a human coronavirus. It makes no sense and it’s implausible that an engineer would use human codons to engineer a human coronavirus, rather than human coronavirus codons.

Your other example here is “plasmid primers” (itself a nonsensical term) in this table:

That underlined bit is an EcoRI site. The CG at the end is added to the primer because it increases efficiency of digestion to add a few nucleotides to the end. Then, they’re lost when this is digested and ligated with something else digested by EcoRI.
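To make the point about the trailing nucleotides concrete, here is a minimal sketch in Python. The primer sequence is hypothetical, since the table is not reproduced here. The only biology assumed is the standard one: EcoRI recognizes GAATTC and cuts between the G and the AATTC. The function name is made up for illustration.

```python
from typing import Tuple

ECORI_SITE = "GAATTC"  # EcoRI recognition sequence; the cut falls after the first G


def split_at_ecori_cut(primer: str) -> Tuple[str, str]:
    """Split the top strand of a primer at the first EcoRI cut position.

    Returns (lost, retained): the bases removed by digestion at that end,
    and the bases that stay on the fragment that gets ligated.
    """
    primer = primer.upper()
    site_start = primer.find(ECORI_SITE)
    if site_start == -1:
        raise ValueError("no EcoRI site found in primer")
    cut_position = site_start + 1  # cut is between G and AATTC
    return primer[:cut_position], primer[cut_position:]


# Hypothetical primer: two padding bases (CG), then the EcoRI site,
# then some made-up template-matching sequence.
hypothetical_primer = "CGGAATTCATGGCTAGC"
lost, retained = split_at_ecori_cut(hypothetical_primer)

print("Removed by digestion:", lost)
print("Kept on the fragment:", retained)
```

In this toy primer the padding bases plus the first base of the site happen to spell CGG, and all three are removed by digestion, so a text search for CGG in a primer table can hit sequence that never ends up in the final construct.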

Someone spent a whole lot of time cherry picking papers tangentially related to WIV and ctrl+F’d for CGG in the text. Someone whose familiarity with molecular cloning doesn’t extend to literally the most common technique is the person informing you on what is and isn’t evidence of engineering. Knowing what this is is typically part of undergrad bio curricula.

Fully documenting every example of this sort of thing in your document would take hours and it’s trivial to find examples. The very serious stats and virology folks are either not reading closely at all, not as serious as you say, or happy to have you make these nonsensical arguments.

@zcoli Zach- At a couple of points I think you've got the logic screwed up.
On using D614G as evidence of the recency of the FCS insert, that was explicitly to increase the likelihood of a zoonotic CGGCGG based on recent insert sequence properties. If it's not a recent insert then the CGGCGG probability falls to the typical rate for Asian coronaviruses, 0.0001, and the sequence becomes a smoking gun against zoonosis. You seem to assume that my arguments must all be against zoonosis but this one was intended to give it the fairest break possible. Your argument is backwards.

On your disputes with Bloom about the most likely MRCA, my blog already said "(These groups also suspect that the MRCA differed from A by an additional nt shared with wild relatives but not with B. There is some reason to doubt that conclusion since A differs from the main suspect by a T→C mutation, much less common at this stage than a C→T mutation, although non-reversionary mutations are much more common than reversionary ones.)". So you're arguing about a point that I explicitly don't use.

So one of your arguments is irrelevant and the other has the wrong sign of effect on the odds for your case.

Your claim that no engineer would use CGGCGG at least has the right sign for your case. Readers can compare my arguments (based on points made by people who engineer sequences) with yours and try to make their own rough estimates of the odds for that particular factor, the fourth most important of the likelihood factors used.

@MichaelWeissman I'm not discussing your quantitative argument at all. I'm demonstrating how your post is full of things you say are facts that are untrue or conclusions from unreliable sources such as Bloom's deleted sequences paper. It's relevant that you have no expertise in this and you are basing your argument on people who are at best very wrong. It makes no sense to discuss the logic applied to things that aren't facts. If it's true that it's a point that you don't use, it also makes no sense to discuss logic hidden in between irrelevant points.

If you had concluded zoonosis based on the same sort of nonsense I would be saying the same things. I don't care what direction the argument is in.

What in the world is "the typical rate for Asian coronaviruses" ? I promise you that alphacoronaviruses and betacoronaviruses sampled in Asia are less similar in every way than betacoronaviruses sampled inside and outside of Asia.

The fact is that the composition of the FCS is evidence against an engineering origin because no engineer would choose it, but natural selection doesn't care about codon usage tendencies that take hundreds of years to approach what we observe today. Might be worth a rethink on what timescale you're talking about here with "recency".

If the "plasmid primers" thing falls in the category of "points made by people who engineer sequences" then those people are lying to you in one way or another.

@zcoli Here's the relevant passage from my argument "In a broader set of relatives, the fraction of ArgArg pairs coded CGGCGG ranges from 0 outside Africa and Asia to 1/10790 in Asia to 1/5493 in Africa."
The broader set is betacoronaviruses.
https://www.preprints.org/manuscript/202110.0080/v2
I agree with your statement "I'm not discussing your quantitative argument at all." since you instead make ad hominem remarks and discuss factors that end up not being used.
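For readers trying to follow what "the fraction of ArgArg pairs coded CGGCGG" means, here is a rough sketch in Python of how such a fraction could be counted. It is not the preprint's method, which is not shown here. The toy sequence is made up, and counting overlapping adjacent codon pairs in a single reading frame is an assumption of this sketch.

```python
from typing import Tuple

# The six standard arginine codons (DNA alphabet).
ARG_CODONS = {"CGT", "CGC", "CGA", "CGG", "AGA", "AGG"}


def arg_pair_counts(cds: str) -> Tuple[int, int]:
    """Count adjacent Arg-Arg codon pairs in an in-frame coding sequence.

    Returns (total_arg_arg_pairs, pairs_coded_cgg_cgg).
    Assumption: pairs may overlap, so three arginines in a row count as two pairs.
    """
    cds = cds.upper()
    usable_length = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable_length, 3)]
    total = 0
    cgg_cgg = 0
    for first, second in zip(codons, codons[1:]):
        if first in ARG_CODONS and second in ARG_CODONS:
            total += 1
            if first == "CGG" and second == "CGG":
                cgg_cgg += 1
    return total, cgg_cgg


# Made-up toy sequence, not taken from any real virus.
toy_cds = "ATG" + "AGAAGA" + "GCT" + "CGGCGG" + "CGTAGG" + "TAA"
total_pairs, cgg_cgg_pairs = arg_pair_counts(toy_cds)
print("Arg-Arg pairs:", total_pairs, "of which CGG-CGG:", cgg_cgg_pairs)

# The fractions quoted in the comment above, as decimals.
asia_fraction = 1 / 10790
africa_fraction = 1 / 5493
print("Asia:", asia_fraction, "Africa:", africa_fraction)
```

The quoted 1/10790 works out to a little under 0.0001, which is the figure used earlier in the thread.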

@zcoli Here's your other engineering example for FCS insertion with one CGG:

In the one example of which I’m aware in which a collaborator of the WIV group added a 12nt code for an FCS to produce a viral protein via a plasmid (reminiscent of the 12nt addition in SC2) they only used CGG for one of its three Arg’s.

Let's check out the abstract; nope, this isn't correct at all. It's a plasmid for producing antibodies.

Your analysis is based on "facts" from people who are habitually wrong and/or lying (I think this one comes from Yuri?). Seriously, just slow down and pick any one thing and dig into the primary data yourself. Start with what you think is the most important factor. In this case, all it took was reading the abstract of the paper you linked or looking at any of the figures to realize this was nonsense. And it's so wildly nonsensical that whoever you heard it from should be ignored on everything else as well.

@MichaelWeissman BTW I see the paper cited for "plasmid primers" also has primers including tandem CGG-CGG. You write "these are for plasmid work and thus subject to substantially different optimization criteria" -- there's no optimization of that sequence; it's the sequence you find in bovine herpesvirus-1 isolates.

I suppose at one point this was yet another smoking gun for a synthetic origin of the FCS? It's something somehow associated with someone at WIV with an RRAR and the RR encoded by CGG-CGG?

A quick search of X showed that this was right: Yuri posted about it in Sept 2023 and you misinterpreted this as being a synthetic CGG-CGG as a choice:

It would kind of make sense as a choice for that type of virus, by the way -- CGG is one of two common arginine codons (herpesvirus also doesn't have the same codon usage as hosts).

The person Yuri credited with this had proposed another smoking gun just a couple months earlier for exactly the same thing:

How many smoking guns do people need to claim they found before you realize it's a trivial creative writing exercise? Pick any old random natural virus that's somewhat rare in nature and you can do exactly the same sort of cherry picking.

@zcoli To the limited extent that example could have affected P(CGGCGG|LL) it would have lowered it, since 1/3 is less than the rates used in the somewhat more relevant examples. You've got an unerring sense of how to find the most irrelevant tangents. For somebody interested in the passage you're criticizing, here it is

"In the one example of which I’m aware in which a collaborator of the WIV group added a 12nt code for an FCS to produce a viral protein via a plasmid (reminiscent of the 12nt addition in SC2) they only used CGG for one of its three Arg’s. Other plasmid primers from WIV use high fractions of CGG, including CGGCGG dimers, but again these are for plasmid work and thus subject to substantially different optimization criteria."

I can drop the word "viral" without changing the fact that these data are mentioned only for completeness and explicitly not used.

I think you consistently misunderstand the need for probabilistic reasoning. None of those features are used as "smoking guns" for LL. E.g. RRAR is not used as a signature of LL. The point is just that it's among the many reasonable possibilities for LL, just as it's among the many possibilities for ZW, and thus provides no reliable factor either way.
Likewise the preceding P in PRRAR could easily swing either way, so not used.

@MichaelWeissman None of the smoking guns proposed for the FCS are remotely plausible. A lot of impossible things don't add up to one possible thing. They fall into a few categories:

Misunderstanding expectations from natural selection e.g. David Baltimore and CGG-CGG, and Nicholas Wade's inane argument that inserts can only be acquired from closely related organisms.
"Underpants Gnomes" theories (look it up) missing a step without a plausible way that step could happen e.g. Sachs & Harrison ignore the P in PRRA, and Lisewski ignores the A in PRRA. Because there's no plausible explanation for either in their theories. Step 1: Notice homology and copy FCS around it, Step 2: ?, Step 3: Pandemic.
Blatantly cherry picked nonsense that's statistically insignificant e.g. "Adrian J" googling "WIV & PRRA", HIV inserts cherry picked from highly variable regions in single patients, and the Moderna patent thing that Daoyu Zhang cherry picked and some scientists, I think, stole the idea and lied about how they found it in a paper. Interesting aside: one of those scientists was once the world's youngest doctor!

That's a condensed history of FCS "smoking guns" and I'm missing some -- I think you mention Yuri arguing it's there as some sort of marker? Alina Chan once said the proline was there to introduce a restriction site. It's impossible to describe just how stupid all of this is.

So you've got a scale and on one side is natural selection and on the other side is a bunch of the worst examples of Intelligent Design that anyone's ever come up with. You reason that on balance, this means that it "provides no reliable factor either way." Gotta recalibrate your scale.

@zcoli again, you criticize many things I don't say.
E.g. I agree that Wade's claim that inserts only come from related organisms is false and therefore do not use it.
E.g. I don't use any of the "HIV inserts" stuff because there seems to be a consensus that it's BS, presumably because of the multiple comparisons issue.

So you've started with criticisms of things that I do not use and do not believe and then layered on all sorts of emotional verbiage.

@MichaelWeissman Neither of those is any more or less false than Bruttel's theory. In case anyone forgot, Bruttel responded to contradictory evidence by claiming the Chinese were monitoring his tweets and fabricating genomes to stay ahead of him. He stopped saying that when I told him it required time travel. He also thinks Ebola is a lab leak and some other lab leaks I can't keep track of.

Your article cites several of the orthogonal synthetic FCS theories that are all equally false. I forgot to include Yuri's cherry picked sick cat; that'd be in the last category I guess.

If you aren't going to do any of the work yourself to learn what's true and false, there's no point in going beyond that.

@zcoli Your "orthogonal" remark provides a sort of lead-in to a discussion of compound hypotheses. Having several possible ways something can happen under one broad hypothesis is normal. It applies particularly to natural evolution, so that gives a nice way to illustrate.

E.g. The FCS in SC2 looks like an improbable event. Early attempts to give a zoonotic account relied on point mutations and small inserts. Later attempts involved an insert in a bat. Or maybe in one of a variety of intermediate species. Or maybe in an immunocompromised person.
This is not a logical contradiction. In the limit of low probabilities for each account, the net probability here is just the sum of those. It's small, but not due to any problem with the parallel-possibility logic.
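A minimal numeric sketch of that "sum of small probabilities" point. The four probabilities below are invented placeholders, not estimates for any real account. If the accounts are mutually exclusive the plain sum is exact; if they are instead treated as independent, the union differs from the sum only by second-order terms.

```python
import math

# Hypothetical, made-up probabilities for four parallel accounts (illustration only).
account_probs = [0.002, 0.001, 0.0005, 0.0015]

# Exact if the accounts are mutually exclusive.
simple_sum = sum(account_probs)

# Probability that at least one holds if the accounts were independent instead.
union_if_independent = 1 - math.prod(1 - p for p in account_probs)

print(f"sum of accounts:       {simple_sum:.6f}")
print(f"union (independent):   {union_if_independent:.6f}")
print(f"difference:            {simple_sum - union_if_independent:.2e}")
```

Either way the combined probability stays small when every term is small; having several parallel routes adds the terms but does not make any single one larger.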

@MichaelWeissman Again, the net probability of one of a set of impossible things being possible is zero. There are countless plausible, low probability pathways via nature to get from the (recombinant) common ancestor of SARS-CoV-2 and RaTG13, BANAL-52, and MP789 to SARS-CoV-2, including acquiring an insert with or without subsequent adaptation to give the observed polybasic S1/S2 cleavage site.

I don't think Gallaher's HKU9 hypothesis is particularly plausible, by the way. Nor would I have agreed with him in 2009 that a lab origin of 2009 H1N1 was plausible. In general, anyone who says they can tell a story of where the insert came from (and if/how it subsequently adapted or expanded or shrunk) is almost certainly wrong.

For what it's worth, my guess is that the more likely mechanisms, among the plausible ones, are those that involve the repeat ahead of the S1/S2 insert, which is found in SARS-CoV-2 but so far absent in the handful of related viruses sampled (TCAGACTCAGACT vs TCAGACTCAAACT).
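The two strings quoted above can be compared directly. This is a small sketch using only the 13-nt strings as written in this comment (no genome coordinates are assumed); `mismatches` and `is_tandem_repeat` are illustrative names.

```python
sc2_like = "TCAGACTCAGACT"    # as quoted for SARS-CoV-2
relatives = "TCAGACTCAAACT"   # as quoted for the related viruses

# Positions (0-based) where the two strings differ.
mismatches = [
    (i, a, b)
    for i, (a, b) in enumerate(zip(sc2_like, relatives))
    if a != b
]


def is_tandem_repeat(seq: str, unit_len: int) -> bool:
    """True if the first two unit_len-sized blocks of seq are identical."""
    return seq[:unit_len] == seq[unit_len:2 * unit_len]


print(mismatches)                      # single-nucleotide difference
print(is_tandem_repeat(sc2_like, 6))   # TCAGAC followed by TCAGAC
print(is_tandem_repeat(relatives, 6))  # TCAGAC followed by TCAAAC
```

The strings differ at one position (G vs A), and only the first contains the exact tandem copy of TCAGAC.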

The most plausible theory of the SARS-CoV-2 S1/S2 site being an engineered insert goes like this:

1. WIV sampled something that was almost identical to SARS-CoV-2, but lacked the FCS. Experiments showed that its RBD bound hAce2. This virus was sequenced and all of the related experiments have been perfectly covered up.
2. WIV sampled something else that was very, very similar to that, but a little bit more diverged from SARS-CoV-2 so that experiments showed its RBD didn't bind hAce2; this virus was sequenced and all of the related experiments have been perfectly covered up.
3. WIV made a recombinant virus identical to the first one and has successfully covered up its existence--all of the associated sequencing and experiments and so on. It didn't grow all that well despite hAce2 binding.
4. WIV made another recombinant virus with the observed, predicted cleavage site at S1/S2 (we will ignore the problem that the R-R-A-R sequence doesn't match R-X-[R/K]-R that WIV planned to search for). The construction and existence of this was also perfectly covered up.
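The motif mismatch mentioned in the last step is easy to check mechanically. This sketch assumes R-X-[R/K]-R is read in the usual way (Arg, any residue, Arg or Lys, Arg) and translates it into a regular expression; the pattern and the `has_motif` helper are illustrative, not anyone's actual search code.

```python
import re

# R-X-[R/K]-R as a regex over one-letter amino acid codes:
# R, then any residue, then R or K, then R.
FURIN_MOTIF = re.compile(r"R.[RK]R")


def has_motif(peptide: str) -> bool:
    """Return True if the peptide contains R-X-[R/K]-R anywhere."""
    return FURIN_MOTIF.search(peptide.upper()) is not None


print(has_motif("RRAR"))    # third position is A, not R/K
print(has_motif("PRRAR"))   # adding the preceding P does not help
print(has_motif("RRKR"))    # an example that does fit the pattern
```

Under this reading, RRAR fails only at the third position, where the motif requires R or K and the observed residue is A.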

No one proposes this because it's less plausible than skipping steps 2 through 4 and simply finding SARS-CoV-2 in nature in step 1. No one proposes simply finding SARS-CoV-2 in nature because mad scientist theories are attractive to people with bad intuition who are incapable of recognizing that the supporting evidence is a 5.5-year-long creative writing exercise based on being able to Google "WIV and RRAR" for example, and elide the origins story about the immeasurable multiplicity of hypotheses you tested doing that.

You might want to check out the history of alternative proposals to explain global warming. There are many -- they're all wrong -- and there could be 100 times more equivalent examples of people writing vaguely about "What if there's a cycle lasting X years that we just lack the knowledge to explain? We need more data to be sure enough to act! Imagine the cost if we're wrong!" It wouldn't reduce the likelihood that greenhouse gas emissions are responsible. These alternative theories are all in the "what about blah blah blah?" genre and fail to explain all of the observed data. They all come hand in hand with questioning the legitimacy of published data, often arguing that there's a global conspiracy of relevant experts to suppress the truth. Sound familiar?

@zcoli The relevant content of that is the assumption that there must be full data available on the sequences available at the labs. Nobody testifying before Congress was willing to maintain that position. A few of the quotes from Daszak et al. specifically boasting about their many unpublished sequences are included in my long blog. https://michaelweissman.substack.com/publish/posts/detail/142625697?referrer=%2Fpublish%2Fposts

As for your wanderings into the sociology of science, you can read those analogies any way you want. (Fools reject expert opinion! vs. Fools deny humans can now have unprecedented effects!) but it doesn't add useful info either way.
I wrote a very short piece on two ways psychology can lead to errors of different types.
https://michaelweissman.substack.com/p/two-tales-from-my-dad

@MichaelWeissman Yes. "Full data" i.e. the sequences were available at the labs if viruses were isolated by reverse genetics and modified, etc. Not just for the sequences but also for all of the plasmids synthesized and sequenced for the different fragments and so on. Theories requiring the combination of rare elements identified in a combination of separate viruses from their sequences require more things to be covered up.

Reading between the lines of what someone said on the other side of the planet just expands the scope of the coverup you require to include Daszak and whoever knows what you imagine he knows. Back in the real world, a bunch of WIV sequences got published by accident when an embargo expired and it only confirmed that WIV wasn't lying about RaTG15. It's the sort of thing you do when a theory lacks supporting data. It's definitionally not falsifiable because any data published will just be declared to be incomplete or fake. Happened for basically every relevant viral genome published.

@zcoli Here's one example, from Eddie Holmes, not exactly a proponent of lab-leak ideas, quoted in my blog.

Proximal Origins author Holmes noted “I’m pretty sure that groups in China are sitting on more SC-2 like viruses….It’s striking to me that CCDC have published so little on this yet have supposedly sampled so many animals. This doesn’t add up. Never discount the politics.”

@MichaelWeissman That wouldn't surprise me at all since conspiracy theorists jumped on RaTG13 and ignored RaTG15 when it turned out their conspiracy theory about those 8 samples was wrong. Data from China is exclusively cherry picked for conspiracy theories or ignored if it disproves lab leak theories.

If you think lab leak is like 1000:1 likely, you won't need to depend on a perfect cover up and cite covered up evidence. You'd have actual evidence. Instead, you're pointing to an Antarctic sequence that is DEFINITELY contamination from 2020 (it contains mutations that emerged in 2020). This is the kind of "just asking questions" stuff that plagues climate change skepticism.

@zcoli As you know, I consider those Sangon traces in the Antarctic samples not usable to obtain a Bayes factor. I try to address points raised by both sides, even when they end up not being usable. It would help if you could specify which of the 14 known mutations were not present until early 2020 because then I could check if that's above statistical noise and maybe strengthen the argument as to why those data are not useful.

If there are equivalent types of data that would alter factors that I do use, that would also be helpful.

Beyond that, this tedious and probably unread conversation may have run its course. That 47% number at the top of the page shows no sign of having noticed it either way.

Related questions
