Origin of Lebesgue Integration
This article follows the steps of Henri Lebesgue as he came upon his theory of integration. The story could be started earlier, but we don't lose too much by starting with Borel, Lebesgue's adviser, at the end of the 19th century.
Borel and the measure of a set
At the end of the 19th century, Émile Borel was thinking about
the problem of measure, that is, the problem of describing the size of
things. In geometry, the measure of a solid body in 1, 2 and 3 dimensions is
referred to as the length, area or volume of the solid body, respectively.
This idea of measure is deeply intuitive; we are giving a single number to an
object, and that number is larger for larger objects. If we loosen the
restriction to solid bodies, we may extend this notion to a more general
setting. We may ask: what is the measure of an arbitrary subset in 1,
2 and 3 dimensions? Specifically, we are asking for the size of a subset of
Borel was searching for a general definition for the measure of a set.
He restricted his focus to the 1 dimension case: subsets of
If there was a way to assign such a measure to each set, what sort of characteristics would the measure have? In 1898, the 27 year old Borel published his Leçons Sur La Théorie Des Fonctions, a 130 page text covering a range of different topics. A short section from page 46-50 outlined some requirements for the measure of a set:
- A measure is always nonnegative.
- The measure of the difference of two sets (a set and a subset) is equal to the difference of their measures.
- The measure of a countable union of non-overlapping sets is the sum of their measures.
- Every countable set has measure 0.
It is worth pausing to understand these requirements and to consider if they have been chosen well.
Interlude: Leçons, original and translation
Below is a cropped photo of page 46 and 47 of Borel's Leçons Sur La Théorie Des Fonctions. The full text is available on Internet Archive.
The reader is free to skip this section without missing out on any crucial information.

The section Les ensembles mesurables, starting half way down page 46, is where Borel describes the requirements of a measure. This section is transcribed and translated below.
Les ensembles mesurables
Tous les ensembles que nous considérerons seront formés de points compris entre
0 et 1.
Lorsqu'un ensemble sera formé de tous les points compris dans une infinité
dénombrable d'intervalles n'empiétant pas les uns sur les autres et ayant une
longueur totale s, nous dirons que l'ensemble a pour mesure
Plus généralement, si l'on a une infinité dénombrable d'ensembles n'ayant
deux à deux aucun point commun et ayant respectivement pour mesure
Tout eela est une conséquence de la définition de la mesure. Voici
maintenant des définitions nouvelles : si un ensemble
Le théorème fondamental démontré pages 41-43 nous assure que ces
définitions ne seront jamais contradictoires entre elles ('); nous sommes donc
libres des les adopter; nous sommes d'ailleurs assurés aussi que la mesure d'un
ensemble ne sera jamais une quantité négative; mais un ensemble peut avoir pour
measure zéro et avoir la puissance du continu. Tel est l'ensemble
puisque
Ainsi un ensemble qui a pour mesure zéro peut être non dénombrable; mais tout ensemble dénombrable a pour mesure zéro; c'est une conséquence aisée de ce qui précède.
Measurable sets
All the sets we will consider are formed by points between 0 and 1. When a
set is formed by all the points included in countably infinite union of
non-overlapping intervals, each having a total length s, we will say that the
set has measure
More generally, if we have countably infinite sets having no common point
between them and having measure
All this is a consequence of the definition of measure. Here are some new
definitions: if a set
The fundamental theorem shown on pages 41-43 assures us that these
definitions will never be contradictory to each other ('); we are therefore
free to adopt them; we are also assured that the measure of a set will never be
a negative quantity; but a set can have zero measure and have the power of
the continuous. Such is the set
since
Thus a set which has measure zero can be uncountable; but any countable set has measure zero; this is an easy consequence of the above.
Translated with help of www.DeepL.com on 2021-08-08 (translation sharable link). Snapshot of licence at the time of translation.
Interlude over. Back to the 4 properties...
Any system of measuring sets that breaks one of these requirements would be strange enough that it wouldn't feel like a measure. This is more obvious for properties 1 and 2 than for properties 3 and 4. What makes properties 3 and 4 harder to interpret is that they both refer to a type of infinity: countable infinity. When Borel published his text, it had only been 30 years since Cantor had introduced the idea of multiple types of infinity. These ideas were slow to be accepted, so if a reader struggles to interpret them, they are in good company. The ideas of different infinities are not discussed here, but a grasp of them is probably required to make sense of the rest of this article.
Property 3
Property 3 is repeated here:
The measure of a countable union of non-overlapping sets is the sum of their measures.
Property 3 expresses the idea that if multiple sets do not overlap, and we form a new set from their union, then the measure of the result equals sum of the measures of the individual sets.
But why is a countably infinite union specified? Why not a finite union, or an uncountably infinite union? Consider these three alternatives in turn.
Finite union
We can word property 3 for finite unions as follows.
Let
Countably infinite union
Set theory allows for an infinite union. Reading the axiom of union might make this more clear. Below, property 3 in laid out explicitly for the case of countable union.
Let's first construct a set
Property 3 for a countable union then says:
If all sets
This countable union includes the finite union as a special case, just let all but a finite number of the sets be empty.
This idea pushes against the boundaries of intuition. Why is it acceptable to consider the measure of an infinite union? What is interesting is that if we try to extend this to uncountably infinite unions, we fail.
Uncountably infinite union
We cannot allow an arbitrary union of sets to have summable measure without the concept of measure becoming useless. An example explains this.
Consider a single point, such as
None of these points overlap, so if we were to allow an arbitrary union to
preserve summation of measure, then we would be forced to accept that
Property 4
Property 4 states:
Every countable set has measure 0.
Consider first a sub-case of this property: every finite set has measure
zero. Consider a single point set, such as
No consider the set of all rationals within
Is it possible to satisfy all properties
With the 4 properties in mind, two questions raise themselves:
- Does there actually exist a measure that satisfies all 4?
- If we do find such a measure, is it unique?
The 4 properties might seem more obvious than these two questions. In some sense, these questions are quite deep. A lot of mathematicians present their work as a painstaking construction of a mathematical object and then showing that it has some nice intuitive properties. Here we have the reverse process: we have some nice intuitive properties in mind and we are trying to search for some mathematical object that fits them all. In addition, we are asking: to what extent might such an object be unique? Uniqueness would allow the requirements to become a definition: "the single object that satisfies our requirements". A definition of an object by stating the properties it must have is often called a descriptive definition. Both Borel and Lebesgue began their theories with descriptive definitions.
It's worth pondering the second question for a little while now. The metric
system and imperial system assign different numbers to the lengths, areas and
volumes of things, yet both seem to "work" in the sense of the 4 requirements.
So it seems intuitive that systems that differ by a constant multiple could be
valid measurement systems, in other words, if we have some function
Borel tackled question 1 (he ignored question 2). Borel attempted to
answer question 1 by proposing a measuring system for subsets of
The reader may have noticed some deficiencies with the above argument.
Indeed, Borel did not show that requirement (1) is always satisfied (it is
conceivable that some set difference might arrive at a negative measure).
Furthermore, sets might be built up from intervals in multiple ways, and it's
not clear that our measure system will assign a consistent measure. For
example, consider an open set
and
we must have
and
But Borel did not show that both of these sums will be equal. This problem is usually phrased as: we must show that our measure system is well defined. Borel's ideas are a preliminary argument. Borel demonstrates that by following a descriptive definition of measure, we can start to narrow in on how our measure system must behave. The issues that Borel didn't address were later addressed rigorously by Lebesgue.
We leave the idea of measure for the moment and move to the problem of integration.
Note: for the rest of the article, the 1-D case—measuring
subsets of
Integration
A function maps elements of a set to elements of another set; a measure maps sets to a real number, and an integral takes in two inputs—a function and a subset of the function's domain—and maps to a real number. In this sense, they are all functions, they just differ in their inputs and outputs. Using function notation, the comparison is as follows:
- A function
maps an element of to an element of , denoted as . - A measure function
maps a subset to a real number . - An integral function
maps a function and a subset to a real number . refers to the set of all functions from .
With this perspective, the integral written as:
is just elaborate syntax for:
which describes function
In terms of meaning, while a measure assigns a size to a set,
an integral can be thought of as assigning a size to the space between the
function and its domain. For both measure and integration, the sets that get
assigned these sizes are all subsets of the reals,
The signature of the integration function—its domain and codomain— are clear; however, there are many functions with this signature, and it's not as clear which of these functions best represents the meaning of integration. Neither is it clear that there is just one such function—there could be many! Many mathematicians unsatisfied with existing integration functions searched for good integration functions, Lebesgue being one of them.
Integration before 1900
By the year 1900, the search for a good integration function had already resulted in various different formulations. Often, mathematicians tackled the subject in passing as they worked on some other topic that required integration, for which the existing ideas of integration somehow fell short.
Here we will just mention some names of Lebesgue's predecessors and move on:
- Cauchy (1823)
- Dirichlet (never published, but Lipschitz documented his lectures in 1864)
- Riemann (1867)
- Harnack (1883)
- Holder (1884)
- Darboux (1875)
- De la Vallee-Poussin (1994)
- Stieltjes (1895)
The integrals these people proposed were named eponymously, such as the Riemann integral. We can understand Lebesgue integration without understanding the details of these integrals; it is sufficient to appreciate that in 1900, the search for a better integration definition was still ongoing.
Lebesgue part I: characterizing the integral
Lebesgue wrote his doctoral thesis, Intégrale, longueur, aire in 1902 (French for Integral, length, area). The full text is available on Internet Archive. Lebesgue's advisor was none other than Émile Borel, (Borel being 4 years older). For his thesis, Lebesgue set out to identify and condense the properties common to all the previous integrals. His realization was that there were properties common to all integrals, and that it was worth investigating whether these properties could be used to actually define integration. Instead of the properties being a consequence, Lebesgue inverted the problem and considered the properties as the starting point. In other words, if these properties are to hold, what must be implied about the nature of integration and its formulation? Might this approach lead us to a better constructive definition of an integral? It was the case that a better definition was needed—the mathematicians listed above found themselves tackling integration due to issues with the definitions available to them.
This inversion of the problem is in the same spirit as Borel's investigation of measure. This is not the only connection between Borel's work on measure and Lebesgue's work on integration; indeed, we arrive at a much more concrete and important connection later on. But for now, let's proceed by listing the properties Lebesgue identified as common to all integration theories.
Lebesgue had written about his integration theory before his thesis; he briefly introduced some of his ideas in the 132nd volume of French journal Comptes rendus, published in 1901. After his thesis, he presents his theory in a more comprehensive form in Leçons sur l’intégration et la recherche des fonctions primitives. The quotations that appear below come from this text.
The 6 properties
Here are Lebesgue's original words, translated by Pezin & Kotz:
"It is our purpose to associate with every bounded function which is defined in a finite interval
—positive, negative, or equal to zero—a certain finite number which we will call the integral of on and which satisfies the following conditions:
For any
, and , we have: For any
, , we have: - If
and , then also: - If
tends increasingly to , then the integral of tends to the integral of . The significance, necessity, and corollaries of the first five conditions of this problem of integration are more or less evident..."
It's hard to imagine removing one of the 1-5 conditions and ending up with a meaningful sense of integration. The importance of the 6th condition should eventually become clear. In a later publication, Lebesgue reformulates the properties by removing the 6th and reworking the 2nd (the 6th becomes a consequence of the rewritten 2nd).
Out of the 6 properties, only (4) assigns a specific value to an integral.
So, at this point, there is only a single function,
Lebesgue part II: searching for conclusions
Lebesgue proceeded by searching for interesting conclusions that might follow once the 6 properties are assumed. If some integration procedure has the 6 properties above, what else can we say about it? Lebesgue found quite a lot to say.
4 conclusions
The following 4 statements are true if one assumes that properties 1-5 above are true.
-
-
if
and then
What Lebesgue has shown is that it's impossible to design an integration procedure that breaks any of these 4 properties without also breaking one or more of the first 5 properties decided to be necessary for a reasonable integration procedure. The 6th property comes into play soon.
Before moving on, it might be interesting to notice how result (4)
gives us a way to calculate any one of the infinite number of
integrals of the form
5th conclusion
Lebesgue continued his search for conclusions that follow from properties 1-6. He reported the following finding.
First we need some definitions. Let
This set is all values within
In other words, each
We are also going to briefly refer to the function
With the required definitions in place, we can continue.
Consider one of the subsets
Denote the two sums of
and because each
We can show that
[proof]
So far in this section only standard properties of function have been used, such as function addition and convergence of functions. Next the 6th property chosen by Lebesgue will be employed to arrive at Lebesgue's next conclusion:
and also:
These two statements follow from
The importance of the 5th conclusion
In the statements
Lebesgue part III: step functions and the return to measure
Let
The integral
This matches Borel's belief that the measure of
"Here is the problem which is to be solved: Our purpose is to associate with every bounded set
consisting of points on the axis a certain nonnegative number, , which will be called the measure of , and which satisfies the following conditions:
- Two congruent sets have the same measure.
- A set which is the sum of a finite or countable number of pairwise disjoint sets has a measure equal to the sum of the measures of the summands.
- The measure of the set of all points of the interval
equals 1."
Where did these conditions come from? They are similar in nature but distinctly different than those proposed by Borel. Lebesgue chose conditions (1')-(3') so that the 6 properties of integration would be satisfied for characteristic functions. The correspondence for characteristic functions is as follows:
- (1') implies (1)
- (3') implies (5)
- (2') implies (2), (3), (4) and (6)
[proof]
Lebesgue part IV: solving the measure problem
Lebesgue, equipped with the 3 measurement properties, begins again the process of searching for conclusions: what can be said about measurement systems that have these 3 properties? Can enough be said to construct a measurement system? Lebesgue is asking the same questions about measure that Borel did.
Out of Lebesgue's 3 measurement properties, only (3') assigns a
measure to a set, specifically,
Open sets
Lebesgue's search focused on the behaviour of open sets. Why?
Because by definition open sets are a union of intervals, such as
Inner measure
[todo]Outer measure
[todo]Measure: a constructive definition
[todo]Lebesgue part V: constructing integration
[todo]TBC
Welcome to my unfinished article! One day, it will be done.