Parametricity and Free Theorems — getting theorems for free since 1989

🐱

WadlerCatFan

Theorem Prover

★★★★★

Posts: 2,847 Joined: 2019-03-14 Rep: +412 Purrs/day: 7.4

#1 — 2025-11-02, 14:37:09

Quote Report ⤶ Link

OK fellow type-theoretic cat girls, let's talk about one of my all-time favourite papers: Wadler's "Theorems for Free!" (1989). This paper is an absolute masterpiece and I'm still in awe of how much you can extract from a polymorphic type signature alone.

The core idea: given a most-general polymorphic type signature, you can derive a theorem that any inhabitant of that type must satisfy — completely for free, without looking at the implementation.

Here's the motivating example. Suppose you have a function:

r :: forall a. [a] -> [a]

Without knowing anything else about r, parametricity tells us it must satisfy:

\forall types A B, \forall f : A \to B: map f \circ r = r \circ map f

Why? Because r works on lists of any type a. Since it's given no operations over values of type a, all it can do is rearrange the list — it has no way to inspect or create elements. So applying f before or after rearranging gives the same result. The function commutes with map.

            ✨ Key Insight
            Given a most-general polymorphic type signature (taking as few constraints on its values as possible), we can generate for free a theorem to which any inhabitant of such a type must adhere.
          

The theoretical backbone: from the type of a polymorphic function we can derive a theorem that it satisfies. And this isn't just a party trick — Wadler described an application of parametricity to derive theorems about parametrically polymorphic functions based on their types, and parametricity is the basis for many program transformations implemented in compilers for the programming language Haskell.

I'll walk through the full technical story in follow-up posts. But first — who else here has been personally victimized by a free theorem? 😸

theorems before bedtime, every night without exception

🐈

PolyPurrfect

λ Abstractionist

★★★★☆

Posts: 1,203 Joined: 2020-07-22 Rep: +188

#2 — 2025-11-02, 14:52:41

Quote Report

Great thread OP! Let me add some historical context because I'm a huge Reynolds nerd.

The parametricity theorem was originally stated by John C. Reynolds, who called it the abstraction theorem. Reynolds published this in his landmark 1983 paper "Types, Abstraction and Parametric Polymorphism". Wadler's 1989 paper then popularized and extended it to the setting of Haskell-style polymorphism.

Reynolds' original framing was relational: Reynolds proved an Abstraction Theorem — every term in F₂ satisfies a suitable notion of logical relation — and formulated a notion of parametricity satisfied by well-behaved models.

The key technical idea: types are interpreted as relations, not just sets.

Reynolds Abstraction Theorem: For any relation R : A \leftrightarrow A', and any well-typed term t : \forallX.T, (t[A], t[A']) \in ⟦T⟧(R) i.e. related inputs yield related outputs, for ANY relation R you care to choose.

This is where the magic comes from. You're free to choose any relation whatsoever, and the theorem still holds. Choosing relations cleverly gives you all the interesting free theorems.

In programming language theory, parametricity is an abstract uniformity property enjoyed by parametrically polymorphic functions, which captures the intuition that all instances of a polymorphic function act the same way.

∀ cats. cats are cute

😸

LambdaCatgirl

Proof Assistant

★★★★★

Posts: 4,501 Joined: 2018-01-11 Rep: +703

#3 — 2025-11-02, 15:10:27 📌 Notable Post

Quote Report

Let me do the canonical walkthrough of the forall a. [a] -> [a] example properly because I've seen it explained incorrectly so many times on this forum (yes I'm looking at you, GradualTypingSkeptic).

We want to derive what any function r : forall a. [a] -> [a] must do. By the parametricity theorem, for any relation R : A ↔ B:

If xs and ys are R-related element-wise (i.e. in List(R)), then r(xs) and r(ys) are also R-related element-wise. Formally: (xs, ys) \in List(R) ⟹ (r(xs), r(ys)) \in List(R)

Now pick the graph relation of an arbitrary function f : A → B:

R = {(x, f(x)) | x ∈ A}

Then (xs, ys) ∈ List(R) means exactly ys = map f xs. Substituting:

(xs, map f xs) \in List(R) ⟹ (r(xs), r(map f xs)) \in List(R) ⟺ map f (r xs) = r (map f xs)

QED. For free. No peeking at the implementation required! 🐱

The consequences for forall a. [a] -> [a]: since any such function commutes with map, it can only select, duplicate, drop, or reorder elements from the input list — it can never conjure new elements from thin air, because it has no way to construct values of type a.

            🎯 Concrete Examples of forall a. [a] -> [a]
            Valid: id, reverse, tail, take n, sort (via comparison on a), cycle, [] (const empty)

            Invalid: any function that creates elements, peeks at values, or changes their structure.
          

The intuitive explanation of this result is that r must work on lists of x for any type x. Since r is provided with no operations on values of type x, all it can do is rearrange such lists, independent of the values contained in them.

f . g = g . f (in the right monoidal category)

🐾

ProofsArePrograms

Curry-Howard Enthusiast

★★★☆☆

Posts: 678 Joined: 2021-05-30 Rep: +91

#4 — 2025-11-02, 15:34:18

Quote Report

The forall a. [a] -> [a] example is super clean, but my personal favourite demonstration of free theorems is the identity type:

Claim: any f : \forallX. X \to X must be the identity function.

Proof via parametricity: let R = {(x, x)} (the singleton relation on an arbitrary point x). The parametricity condition says:

-- (x, x) ∈ R  ⟹  (f(x), f(x)) ∈ R
-- ⟺  f(x) = x  for all x

Since x was arbitrary, f = id. There is only one inhabitant of forall a. a -> a!

Assume M : ∀X. X → X. Let t be a type and x ∈ [[t]]. We show that [[M]] x = x. Interpreting X by the singleton relation, we obtain [[M]] x = x. This holds for any x. Hence, [[M]] is the identity function.

Similarly for forall a b. (a, b) -> (a, b) — you can prove it must be either the identity or swap. And for forall a. a -> a -> a, parametricity tells you it must be either const or flip const (i.e., it always picks the first or always picks the second argument).

⚠️ Caveat: seq and general recursion These theorems hold cleanly in a total, purely polymorphic setting. In Haskell with seq or general recursion, parametricity is not "lost" when the language includes seq — it is just weakened, in a precisely known way. Also, when the language includes just fix (general recursion), parametricity is weakened (not lost).

🐱

WadlerCatFan

Theorem Prover

★★★★★

Posts: 2,847 Joined: 2019-03-14 Rep: +412

#5 — 2025-11-02, 16:02:55

Quote Report

Good responses everyone. Now let me talk about what makes this mechanically work — the connection to naturality in category theory.

The free theorem for r : forall a. [a] -> [a]:

  map f ∘ r  =  r ∘ map f

...is exactly the naturality condition for a natural transformation r : List → List in the category of types and functions (Hask)! Free theorems are naturality conditions in disguise.

Natural transformation α : F \to G satisfies: α_B \circ F(f) = G(f) \circ α_A for all f : A \to B Parametricity gives us this for FREE just from the type.

More generally, Wadler's key insight was to interpret Reynolds' theorem not only as a way of identifying different implementations of the same type, but also as a source of free theorems for polymorphic types.

The deepest version of this connection: Reynolds' abstraction theorem shows how a typing judgement in System F can be translated into a relational statement (in second-order predicate logic) about inhabitants of the type. The relational interpretation is exactly what gives you the naturality squares!

If you want to go deep on this, look up Hermida, Reddy, Robinson — "Logical Relations and Parametricity". It makes the categorical picture completely explicit. Spoiler: it's enriched category theory and it's beautiful 🐾

every commuting diagram is a theorem for free

😺

PolyPurrfect

λ Abstractionist

★★★★☆

Posts: 1,203 Joined: 2020-07-22 Rep: +188

#6 — 2025-11-02, 16:44:13

Quote Report

WadlerCatFan wrote: ...free theorems are naturality conditions in disguise...

YES. And this is why GHC can use parametricity to justify compiler optimisations! The classic example is short-cut fusion (a.k.a. foldr/build fusion).

The key lemma underlying stream fusion / shortcut fusion is derivable from the free theorem for the foldr type. Because foldr has type:

foldr :: (a -> b -> b) -> b -> [a] -> b

...the free theorem tells you exactly how foldr must commute with functions, which enables GHC to rewrite:

map f . filter p . map g
-- ↓ via free theorems (fusion)
foldr (\x acc -> if p (g x) then f (g x) : acc else acc) []

Parametricity is the basis for many program transformations implemented in compilers for the programming language Haskell. The free theorem is literally being used to prove compiler optimisations are safe. That's not just beautiful mathematics — it's load-bearing infrastructure!

For more: Reynolds' Parametricity Theorem (also known as the Abstraction Theorem), a result concerning the model theory of the second order polymorphic typed λ-calculus (F₂), has been used by Wadler to prove some unusual and interesting properties of programs. A purely syntactic version of the Parametricity Theorem shows that it is simply an example of formal theorem proving in second order minimal logic.

GHC RULES are free theorems you can actually run

🐱

LambdaCatgirl

Proof Assistant

★★★★★

Posts: 4,501 Joined: 2018-01-11 Rep: +703

#7 — 2025-11-02, 17:22:08

Quote Report

Hot take: the most underrated free theorem is the one you get from Church-encoded data types. Consider:

type ChurchBool = forall a. a -> a -> a

-- The free theorem says: any f : ChurchBool satisfies
-- for all g : A → B, x : A, y : A:
--   g (f x y) = f (g x) (g y)

This means f must always pick the first or always pick the second argument — it corresponds exactly to True and False! Free theorems essentially prove that Church encodings are uniquely inhabited by their intended meanings.

And the Church natural numbers: type ChurchNat = forall a. (a -> a) -> a -> a — the free theorem for this type tells you that any such function must equal some n-fold composition. Free theorems justify the entire Church encoding programme!

forall a. (a \to a) \to a \to a ≅ ℕ (via free theorems) forall a. (a \to a \to a) \to a \to a ≅ ℕ (in a different way) forall a. a \to a \to a ≅ Bool

This connects back to Jean-Yves Girard and John Reynolds independently discovering the second-order polymorphic lambda calculus, F₂. Girard additionally proved a Representation Theorem: every function on natural numbers that can be proved total in second-order intuitionistic predicate logic can be represented in F₂. Reynolds additionally proved an Abstraction Theorem: every term in F₂ satisfies a suitable notion of logical relation.

✝ Girard-Reynolds duality real and certified

🐾

ProofsArePrograms

Curry-Howard Enthusiast

★★★☆☆

Posts: 678 Joined: 2021-05-30 Rep: +91

#8 — 2025-11-02, 17:59:50

Quote Report

To close the loop on the "relational parametricity" formalism for newcomers reading this thread:

In programming language theory, parametricity is an abstract uniformity property enjoyed by parametrically polymorphic functions, which captures the intuition that all instances of a polymorphic function act the same way.

The formal machine is logical relations / relational parametricity. We assign to every type τ a relation [[τ]]:

-- Base types: equality relation
[[Int]] = {(n,n) | n ∈ ℤ}

-- Function types: maps related inputs to related outputs
[[A → B]] = {(f,g) | ∀(a,b) ∈ [[A]]. (f a, g b) ∈ [[B]]}

-- Universal types: quantify over ALL relations R
[[∀X. T]] = {(f,g) | ∀A,B, ∀R : A↔B. (f[A],g[B]) ∈ [[T]](R)}

The Parametricity Theorem then states: every well-typed expression e of λF behaves the same as itself according to its type A — that is, (e,e) ∈ E⟦A⟧ρ. At first glance this is underwhelming — of course e behaves the same as itself! The powerful part is that e behaves "according to its type A".

And it is powerful enough to provide behavioral guarantees, which Wadler christened "theorems for free".

Everything in this thread follows from choosing the right relation R at the right type variable. The whole art is picking cleverly! 🐾

🐱

WadlerCatFan

Theorem Prover

★★★★★

Posts: 2,847 Joined: 2019-03-14 Rep: +412

#9 — 2025-11-02, 18:30:00

Quote Report

Great summary from ProofsArePrograms. Wrapping up the first page: one thing I want to stress is that free theorems aren't just cute party tricks. They're genuinely useful engineering tools:

✓ Compiler optimisation — GHC's rewrite rules, fusion, safe code motion
✓ API design — a polymorphic type tells users what a function can't do
✓ Reasoning about abstract types — representation independence follows from parametricity
✓ Verifying Church encodings — uniqueness of Church numerals, booleans, etc.
✓ Category theory — naturality conditions come for free

Next page I'll get into higher-rank types and whether free theorems still hold there, plus the drama around unsafeCoerce destroying parametricity. Stay tuned! 🐾

(see page 2 for continuation)

forall f. f is a natural transformation

✏️ Post a Reply