Jason Zesheng Chen

Silly proof of nonisomorphic uncountable linear orders using ordertypes of nonstandard models of arithmetic

2023-10-29T00:00:00+00:00

It is well known that all countable dense linear orders with no endpoints are isomorphic. It is natural to ask if this holds for uncountable linear orders. The answer is no, and there are many proofs of this fact in textbooks of model theory or set theory or on Math StackExchange. Here’s a silly one that uses ordertypes of nonstandard models of arithmetic.

The claim is that there exists a dense linear order with no endpoints, of the same size as the real line, that is not isomorphic to it. First we observe that if $M$ is a nonstandard model of arithmetic, then $M$ has ordertype $\mathbb{N}+\mathbb{Z}\theta$, where $\theta$ is the ordertype of a dense linear order without endpoints. In other words, every nonstandard model of arithmetic looks like the natural numbers followed by many copies of the integers, such that between any two copies there’s another such copy, and every $\mathbb{Z}$-chain is preceded and followed by other $\mathbb{Z}$-chains.

To see this, note that the standard natural numbers, $0, s0, ss0,\dots$, make up the $\mathbb{N}$ part of $M$. And every nonstandard natural number $a$ sits in a $\mathbb{Z}$-chain $\dots,a-2,a-1,a,a+1,a+2,\dots$. For no-end-points: take any $\mathbb{Z}$-chain and a nonstandard $a$ on it; $2a$ and $a/2$ (or $(a+1)/2$ if $a$ is odd) must sit on $\mathbb{Z}$-chains different from that of $a$. Why? If $a$ and $2a$ were on the same chain, then $a$ could reach $2a$ by standardly-many applications of the successor operation, contradicting the assumption that $a$ is nonstandard. And for denseness: similar idea, take nonstandard $a,b$ on different chains, then $(a+b)/2$ (or $(a+b+1)/2$ if $a+b$ is odd) must sit on a chain strictly between them, because otherwise the midpoint between $a$ and $b$ could be reached by standardly-many applications of either the successor operation from $a$ or the predecessor operation from $b$, which would put $a$ and $b$ on the same chain, contradicting the assumption that they sit on different chains.

Next, by the Löwenheim–Skolem theorem there must be nonstandard models of arithmetic of cardinality $\vert \mathbb{R}\vert$. If all dense linear orders without endpoints of this size were isomorphic, then such a model would have ordertype $\mathbb{N}+\mathbb{Z}\mathbb{R}$. We now argue that there isn’t such a model. This argument is due to Klaus Potthoff.

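Assume there is a nonstandard model of arithmetic of ordertype $\mathbb{N}+\mathbb{Z}\mathbb{R}$. Take a nonstandard natural number $a$, and consider the multiples $(na\mid 0<n\in\mathbb{N})$. Consecutive multiples differ by the nonstandard $a$, so they sit on different $\mathbb{Z}$-chains, and these chains form an increasing sequence bounded above by the chain of $a\cdot a$. Since the chains are ordered like $\mathbb{R}$, this sequence has a least upper bound; pick $b$ on that chain. Then $b-a$ sits on a strictly lower chain, so $b-a<na$ for some standard $n$, and hence $b<(n+1)a$, contradicting the fact that $b$ lies above every multiple $na$.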

Putting everything together, there must be dense linear orders without endpoints of size continuum but not isomorphic to the real line.

Being reals of a model of set theory is a special property

2023-09-26T00:00:00+00:00

(Another #lolbvious post, because it’s one of those things that’s supposed to be obvious/clear/straightforward/true by the usual argument… etc, but just makes you go lol, yeah right)

The following screenshot is taken from the notes of Jörg Brendle’s Bogotá lectures on forcing and the structure of the real line.

This post provides a proof or two of the remark that in any extension which adds reals, the ground model reals have inner measure zero.

Theorem. Assume $M$ is a model of ZFC, possibly a proper class. If there is a real number not in $M$, then the reals in $M$ have inner measure zero.

Proof using $[0,1]$. Letting $a$ denote a real not in $M$, consider the translates $A_n=\{r+\frac{a}{n}\mid r\in [0,1]^M\}$ for $n\geq 1$. The $A_n$’s are pairwise disjoint, because otherwise (say $q+\frac{a}{n}=r+\frac{a}{m}$ with $n\neq m$) $a$ would have been definable in $M$ as the unique real solution to the equation $q+\frac{x}{n}=r+\frac{x}{m}$, contradicting the assumption that $a\notin M$.
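The uniqueness claim is just linear algebra: for $n\neq m$, the equation $q+\frac{x}{n}=r+\frac{x}{m}$ rearranges to $x=\frac{nm(r-q)}{m-n}$. Here is a minimal Python sketch of that computation, using exact rationals as stand-ins for reals of $M$ (the function name `solve_for_a` is made up for illustration, and the sample values are arbitrary):

```python
from fractions import Fraction

def solve_for_a(q, r, n, m):
    """Return the unique x with q + x/n == r + x/m, assuming n != m."""
    if n == m:
        raise ValueError("need n != m; otherwise the equation does not pin down x")
    # q + x/n = r + x/m  <=>  x * (1/n - 1/m) = r - q  <=>  x = n*m*(r - q)/(m - n)
    return Fraction(n * m) * (Fraction(r) - Fraction(q)) / (m - n)

# Arbitrary sample values, chosen only to exercise the formula.
q, r, n, m = Fraction(1, 3), Fraction(1, 2), 2, 5
x = solve_for_a(q, r, n, m)
assert q + x / n == r + x / m
```

Since the formula only uses field operations on $q$, $r$, $n$, $m$, any model that contains $q$ and $r$ also contains the solution.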

But translation preserves inner measure, and $\bigcup_n A_n$ is bounded. So if $[0,1]^M$ has inner measure anything other than zero, then $\bigcup_n A_n$ would have inner measure infinity, contradicting boundedness. $\square$

Proof using $2^\omega$. Given a real $a\notin M$, consider the flip maps induced by $a$. That is, for each natural number $n$, let $F_n$ flip the $(n+k)^\text{th}$ bit of $x\in 2^\omega$ iff $a(k)=1$. In other words, $F_n(x)(n+k)=1-x(n+k)$ iff $a(k)=1$, otherwise $F_n(x)(n+k)=x(n+k)$. Now mirroring the proof above, let $A_n=F_n[(2^\omega)^M]$.
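To get a feel for the flip maps, here is a small Python sketch that applies $F_n$ to a finite prefix of a point of $2^\omega$. Working with finite lists of bits is an assumption made for illustration only (the proof itself concerns infinite sequences), and the name `flip` is hypothetical:

```python
def flip(n, a, x):
    """Apply F_n to a finite prefix x of a point of 2^omega.

    a and x are lists of bits; bit n+k of x is flipped iff a[k] == 1.
    Only the first len(x) bits are computed, so a must supply at least
    len(x) - n bits.
    """
    out = list(x)
    for k in range(len(x) - n):
        if a[k] == 1:
            out[n + k] = 1 - out[n + k]
    return out

a = [0, 1, 1, 0, 1, 0]
x = [1, 1, 0, 0, 1, 0]
print(flip(0, a, x))  # [1, 0, 1, 0, 0, 0]
print(flip(2, a, x))  # [1, 1, 0, 1, 0, 0]
```

Note that each $F_n$ leaves the first $n$ bits alone and is its own inverse, so applying `flip(n, a, ...)` twice returns the original prefix.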

First notice that if $F_n(x)=F_m(y)$ for $n\neq m$, then $x\neq y$ (This is most easily proven by looking at the contrapositive and using the fact that $a$ has at least one $1$). Next, I claim that the $A_n$’s must be disjoint. This is because if $F_n(x)=F_m(y)$ for $n\neq m$, then $a$ is definable as the unique real that makes this true (recall that the $F$’s are defined from $a$).

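To see why $a$ is unique: suppose not, then there are $a\neq a'$ witnessing the corresponding $F_n(x)=F_m(y)$ and $F'_n(x)=F'_m(y)$. Now let $k$ be the first place that $a$ differs from $a'$ and assume without loss of generality that $a(k)=0$ (so $a'(k)=1$) and $n<m$.

Observe: $F_m(y)(n+k)=F'_m(y)(n+k)$. This is because if $n+k<m$, then neither $F_m$ nor $F'_m$ touches the $(n+k)^\text{th}$ bit of $y$; and if $n+k\geq m$, then $n+k=m+j$ for some $j<k$, and $a(j)=a'(j)$ since $k$ is the first place where $a$ and $a'$ differ.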

Now to arrive at a contradiction, notice that we have:

\[\begin{align*} F_n(x)(n+k) & = F_m(y)(n+k)\\ & = F'_m(y)(n+k) \\ & = F'_n(x)(n+k) \end{align*}\]

But this cannot be true, since $a$ tells $F_n$ to keep the $(n+k)^\text{th}$ bit of $x$, whereas $a'$ tells $F'_n$ to flip it. $\square$

The lemma before the remark is meant to show that random forcing preserves outer measure. So after forcing to add a random real, the ground model reals have outer measure 1 but inner measure 0, making them non-measurable. Similarly, after adding a Cohen real, the ground model reals don’t have the property of Baire.

Three funny proofs of the existence of incomparable Turing degrees

2023-04-17T00:00:00+00:00

“People would usually spend a whole class in computability theory proving this. What they are doing is they are very carefully proving the Baire category theorem without explicitly saying it.”

I’ve recently learned of a pretty neat proof of the existence of incomparable Turing degrees. And this reminds me that I’ve actually seen quite a few funny (nuking-the-mosquito-type) proofs of this statement, so I decided to record them here.

The first proof was shown to me today by Andrew Marks. You can do this with either measure or category:

Consider the relation $R(x,y)\Leftrightarrow x\leq_T y$. First notice that this is a Borel subset of $\mathbb{R}\times \mathbb{R}$ (take your favorite interpretation of what $\mathbb{R}$ is). So it is measurable/has the Baire property. Second, observe that each section $R_y$ is countable, and so it is a null/meager subset of $\mathbb{R}$. By Fubini’s theorem/Kuratowski-Ulam theorem, $R$ is a null/meager subset of $\mathbb{R}^2$. The analogous argument works to show that $R^{-1}$ is also null/meager.

Hence, $R\cup R^{-1}$, the set containing all pairs that are Turing comparable, is null/meager. Therefore the set of pairs $(x,y)$ such that $x,y$ are Turing-incomparable is measure one/comeager.

The second proof I saw on the internet (for example here). It uses the observation that the continuum hypothesis follows from total comparability of the Turing degrees: each real computes countably many reals, so the reals ordered by $\leq_T$ will form an uncountable linear order in which every proper initial segment is countable. This implies there are at most $\omega_1$ many reals, so CH holds.

Now just force to negate CH. In the extension, $\leq_T$ is not a linear order. But the sentence “there exist two reals that are incomparable” is $\Sigma^1_1$, and hence by Mostowski absoluteness this already holds in the ground model.

The third proof is somewhat similar to the second. I came up with it when I was thinking about the question “if a real is computable from a comeager set of reals, is it computable?” The measure analogue of this is true, and this was the first interaction between computability theory and measure theory. That result was independently obtained by Sacks, and De Leeuw-Moore-Shannon-Shapiro. The answer to my question is also yes, and the argument I came up with had already appeared in Andreas Blass’s Needed reals and recursion in generic reals 20 years ago.

The proof goes like this: force to add two Cohen reals, then neither computes the other. But again it’s $\Sigma^1_1$ to say there exist two incomparable reals, and so this already holds true in the ground model.

The key fact used in the proof is that Cohen reals hold no computational power. I think this is an independently interesting fact, so I’ll end this post with a properly written proof.

Theorem. Let $M$ be a countable transitive model of enough of ZFC, and let $x$ be a real in $M$ and $c$ a Cohen real over $M$. If $x$ is computable relative to $c$, then $x$ is computable.

Proof. If $x$ is computed by the Turing program $\Phi^c_e$, then this fact also holds true in $M[c]$, and so by the forcing theorem this is forced by some condition $p$. That is,

\[p\Vdash \text{ the } \check e\text{th Turing program in the oracle }\dot c \text{ computes } \check x\]

For any $i\in\omega$ we compute $x(i)$ as follows: run $\Phi^s_e(i)$ for all the $s$ extending $p$.

Some computation does halt, because $\Phi^c_e(i)$ halts after reading only a finite part of $c$, and we may take that finite part to extend $p$. As soon as any of these computations halts, the output will be the correct value of $x(i)$. This is because: if $s_0,s_1$ are two different nodes extending $p$ and $\Phi^{s_0}_e(i)=0\neq 1=\Phi^{s_1}_e(i)$, then we can build two different $M$-generic filters $G_0$ and $G_1$ containing $s_0,s_1$ respectively. Now $M[G_0]$ and $M[G_1]$ will both think $x$ is computed by $\Phi_e^{\dot c}$ (since both filters contain $p$; note that they will interpret $\dot c$ differently, but that doesn’t matter). So $M[G_0]$ thinks that $x(i)=0$ and $M[G_1]$ thinks $x(i)=1$. But whatever $x(i)$ is, this is an absolute fact about $x\in M$, so it should be answered in the same way by all transitive models extending $M$. Contradiction! $\square$
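The search described in the proof can be sketched in Python. This is only a toy under stated assumptions: `phi(s, i)` is a hypothetical stand-in for running the $e$th oracle program on input $i$ with the finite oracle $s$, returning `None` when the computation has not halted using only the bits of $s$. A genuine implementation would also dovetail over the number of computation steps.

```python
from itertools import count, product

def compute_bit(phi, p, i):
    """Search the finite extensions s of the condition p, shortest first,
    and return the first value phi(s, i) that is defined.

    The loop does not terminate if phi(s, i) is undefined for every s.
    """
    for extra in count(0):
        for tail in product((0, 1), repeat=extra):
            s = tuple(p) + tail
            value = phi(s, i)
            if value is not None:
                return value

# Toy functional (an assumption for illustration): read oracle bits from
# position 3 onward until the first 1, then output the parity of i.
# The answer never depends on which extension of p made it halt.
def toy_phi(s, i):
    for bit in s[3:]:
        if bit == 1:
            return i % 2
    return None

p = (0, 1, 0)
print([compute_bit(toy_phi, p, i) for i in range(4)])  # [0, 1, 0, 1]
```

The point of the proof is that for a functional forced by $p$ to compute $x$, the first answer found this way cannot depend on which extension happened to halt first, so no oracle is needed.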

A perfect set of not-eventually-equal reals

2022-09-02T00:00:00+00:00

(This post is tagged #lolbvious, because it’s one of those things that’s supposed to be obvious/clear/straightforward/true by the usual argument… etc, but just makes you go lol, yeah right)

I remember being puzzled by the following passage from the chapter on Borel equivalence relations (by Greg Hjorth) in the Handbook of Set Theory.

The claim is that there is a Borel map $f:2^\omega\to 2^\omega$ that reduces identity to eventual equality. In other words, $f$ is such that

\[x=y\Leftrightarrow (f(x)(n)=f(y)(n) \text{ for all but finitely many } n)\]

It is well-known (or maybe I should say well-documented?¹) that the existence of such a map is equivalent to saying that there is a perfect set of inequivalent elements for the following equivalence relation denoted $E_0$:

\[xE_0 y \Leftrightarrow x(n)=y(n) \text{ for all but finitely many } n\]

which is to say that $x$ and $y$ are eventually equal.

Hjorth says it’s routine to prove the existence of a perfect set of mutually generic reals in the Cantor space. This is puzzling at first glance: mutual genericity is a notion in forcing, which is typically used for proving consistency results, instead of existence.

It turns out that this is one of those cases where one can prove an existence claim using forcing. The trick, of course, is to appeal to Shoenfield absoluteness.

To see this, consider the statement: there is a perfect set such that any two elements are not eventually equal. Now perfect sets are coded by perfect trees, which can in turn be coded by a single real. So this sentence is really saying that there is a real coding a perfect tree, such that any two branches (i.e., real numbers tracing these branches) are not eventually equal.

Since it is arithmetic to say two reals are not eventually equal, this makes the statement $\Sigma^1_2$. So if we can force this, this already holds true in the ground model by Shoenfield absoluteness.

But then this is easy to prove: the forcing to add a perfect set of Cohen reals² will make this true. This is because the perfect set of Cohen reals will all fail to be eventually equal with one another: if some $c_1,c_2$ are eventually equal, then from $c_1$ we can define $c_2$ by only changing $c_1$ on some finite initial segment. That will contradict the fact that $c_1$ and $c_2$ are mutually generic.

So I think this is what Hjorth means to say with mutual genericity.

Of course, since Cohen forcing is essentially a Baire-category method, I’ve committed theft over honest toil by sweeping under the Cohen rug any mention of “dense, meager, comeager”, etc. The interested reader can find an argument using Baire category notions in Su Gao’s Invariant Descriptive Set Theory, Theorem 5.3.1, a stronger theorem which is attributed to Mycielski.

¹ See, for example, Proposition 5.1.12 in Su Gao’s Invariant Descriptive Set Theory.
² See this MathOverflow answer by Joel David Hamkins.

What’s difficult about forcing?

2022-01-20T00:00:00+00:00

Note: the goal of this article is to give a sense of the kind of difficulties, both conceptual and technical, one might encounter if one decides to learn the mathematical tool of forcing. As such it is not my intention to provide an intuitive motivation for forcing, or explain the inner workings of it in a perspicuous way. Already there are excellent sources for that: for example see A Philosopher’s Guide to Forcing by Toby Meadows or A Beginner’s Guide to Forcing by Timothy Chow.

What I intend to achieve here is somewhat particular: after a first course in formal logic, set theory, or Gödel’s theorems, a motivated student might see the term “forcing” pop up on the occasional Google chase. They might be intrigued as to why this technique won Paul Cohen, its discoverer, a Fields Medal, for instance. But “what is forcing” is a question that is highly difficult to address in office hours, because there are quite a few moving pieces involved. Each of these pieces brings its own special kind of unease to the student, making the whole thing quite daunting. It is simply my hope here to record these moving pieces and their associated difficulties.

Due to this peculiar aim, the article is sprinkled with pausing remarks (click to expand) on what is difficult about the matter at hand and whether it’s conceptual or technical. My judgment is that forcing is not substantially harder than the other main theorems in graduate math textbooks. Its notoriety is simply due to the variety of concepts involved and the unfamiliarity of the tools used.

An earlier simpler draft of this post was first published in Chinese on the Q & A platform Zhihu. One may view the original version here: link to the Chinese version.

What is forcing?

Forcing is the name of a technical method to establish independence results in set theory. These are results saying that certain statements cannot be deduced from the axioms of set theory (the Zermelo-Fraenkel axioms with Choice).

To show that a statement cannot be deduced from a theory, one shows that the negation of the statement is consistent with the theory. Given a mathematical theory $T$, we write $\mathsf{Con}(T)$ for the statement that $T$ is consistent: i.e., there is no proof that starts with the axioms of $T$ and ends with a contradiction.

If for some sentence $P$ we succeed in showing $\mathsf{Con}(\mathsf{ZFC}+P)$, then we have succeeded in showing that $\mathsf{ZFC}$ cannot refute $P$. This is because if $\mathsf{ZFC}$ refutes $P$, then this means $\mathsf{ZFC}$ proves $\neg P$ (“not-P”); therefore from the theory $\mathsf{ZFC}+P$ one can deduce $P \wedge \neg P$, a contradiction, by doing the following: first deduce $\neg P$ from $\mathsf{ZFC}$ alone, and then deduce $P$.

Forcing allows one to conclude, under suitable assumptions laid out below, $\mathsf{Con}(\mathsf{ZFC}+P)$ for various choices of $P$. Different people have different ways of understanding and implementing it, but arguably the most straightforward way to make sense of it is that it is a method of building new models of ZFC from old ones.

I thought we were talking about proving things. So why models?

It turns out that talking about proofs is not the easiest thing to do. It is easier to work with models instead. A model of a theory, roughly put, is just a set of things that satisfies that theory. The connection between proofs and models was confirmed by Kurt Gödel in his doctoral thesis:

Fact. (Gödel’s Completeness Theorem): $\mathsf{Con}(T)$ if and only if $T$ has a model.

Technical Difficulties

Well-ordering of the reals is not measurable

2021-08-23T15:12:00+00:00

Certain sets of real numbers occur frequently in Mathematical Logic but lack a nice geometric presentation to make it into Real Analysis textbooks. Some of them are quite fun examples to have in one’s repository. I’d like to refer to these as “logical” subsets of the reals, for the boring reason that logic is where they’re typically found. A particularly nice example is a well-ordering of the reals, considered as a subset of the plane. A quick proof using Fubini’s Theorem demonstrates that it is not (Lebesgue) measurable, thus providing another proof that the axiom of choice implies there is a nonmeasurable set. The core idea of the proof is, to the best of my knowledge, due to Sierpiński.

The key to this proof is the following special case of Fubini’s Theorem:

Theorem. Let $A$ be a measurable subset of $\mathbb{R}^2$. For each $y\in \mathbb{R}$ we define the section of $A$ by $y$ as $A_y:=\{x\in \mathbb{R}\mid (x,y)\in A\}$. Then $A$ has measure zero iff almost no point can give $A$ non-measure-zero sections. That is, iff $\{y\in\mathbb{R}\mid A_y \text{ is not measure zero}\}$ is a measure-zero subset of $\mathbb{R}$.

Here, by not measure zero (or non-null), I mean either having positive measure or nonmeasurable.

The key to utilizing Fubini’s Theorem along with a well-ordering of the reals is that we can now identify the first place where things start to have positive measure. In other words, we find a place on the well-ordering where all the proper initial segments are either measure zero or nonmeasurable. This will end up giving us a set which satisfies Fubini’s Theorem.

Let us see this in action with a warm-up proof.

Theorem. Assume the continuum hypothesis, which says there is a well-ordering $\prec$ of $\mathbb{R}$ such that every proper initial segment is countable. Then $\prec$ is not measurable as a subset of the plane.

Proof: For each $x\in\mathbb{R}$, the initial segment $\prec_x$ determined by $x$ is countable, and hence of measure zero. It is helpful to visualize $(\mathbb{R},\prec )\times (\mathbb{R},\prec )$ as a plane. Assume towards a contradiction that the set $\prec =\{(x,y)\mid x\prec y\}$ is measurable. Then it follows from Fubini’s Theorem that it has measure zero, because each of its sections is an initial segment in the well-ordering $\prec$ and hence countable.

But then the complement of this well-ordering $\succ:=\{(x,y)\mid y\prec x \text{ or } y=x\}$ would also have measure zero, by the same argument (except now we consider the sections along the other axis). This would then mean that $\mathbb{R}^2=\prec \cup \succ$ is null. Contradiction!

So this shows that under $\mathsf{CH}$, a well-ordering of the reals is not measurable. We now observe that we used $\mathsf{CH}$ only to establish that every initial segment in the well-ordering has measure zero. This can be dispensed with, because one might simply consider a similar least position, below which everything is either measure zero or nonmeasurable. This works as follows.

Theorem. Let $\prec$ be a well-ordering of the reals. Then $\prec$ is not measurable.

Proof: Suppose it were measurable. Let $a$ be the least real such that the product $\prec_a\times\prec_a$ has positive measure ($\prec_a$ here denotes the initial segment determined by $a$ in this well-ordering). It could very well be that such a real doesn’t exist, in which case we consider again the whole space $(\mathbb{R},\prec)\times(\mathbb{R},\prec)$.

Now we restrict the well-ordering to everything $\prec$-below $a$, that is, consider $\prec\upharpoonright a :=\{(x,y)\mid x,y\prec a \text{ and }x\prec y\}$. This set must have measure zero: if a section of it by some $z$ is non-null, then $z\prec a$ would violate the minimality of $a$. So Fubini’s Theorem implies that $\prec\upharpoonright a$ has measure zero.

But the other half of the square $\prec_a\times\prec_a$ has the same measure as $\prec\upharpoonright a$. And hence the square $\prec_a\times\prec_a$ is the union of two measure zero sets, which implies that it has measure zero itself, contradicting the choice of $a$. $\square$