Analysis
Table of Contents
Defintions
General
The support of a real-valued function
is given by
Lp space
spaces are function spaces defined using a generalization of the p-norm for a finite-dimensional vector space.
p-norm
Let
be a real number. Then p-norm (also called the
-norm) of vectors
, i.e. over a finite-dimensional vector space, is
Banach space
A Banach space is a vector space with a metric that:
- Allows computation of vector length and distance between vectors (due to the metric imposed)
- Is complete in the sense that a Cauchy sequence of vectors always converge to a well-defined limit that is within the space
Sequences
Sequences of real numbers
Definition of a convergent sequence
such that
such that
Bounded sequences
A bounded sequence
such that
Cauchy Sequence
A sequence of points
is said to be Cauchy (in
) if and only if for every
there is an
such that
For a real sequence this is equivalent of convergence.
TODO Uniform convergence
Series of functions
Pointwise convergence
Let
be a nonempty subset of
. A sequence of functions
is said to converge pointwise on
if and only if
exists for each
.
We use the following notation to express pointwise convergence :
Remarks
- The pointwise limit of continuous (respectively, differentiable) functions is not necessarily continuous (respectively, differentiable).
- The pointwise limit of integrable functions is not necessarily integrable.
- There exist differentiable functions
and
such that
pointwise on $[0, 1], but
- There exist continuous functions
and
such that
pointwise on $[0, 1] but
Uniform convergence
Let
be a nonempty subset of
. A sequence of functions
is said to converge uniformly on
to a function
if and only if for every
.
Continuity
Lipschitz continuity
Given two metric spaces
and
, where
denotes the metric on
, and same for
and
, a function
is called Lipschitz continuous if there exists a real constant
such that
Where the constant
is referred to as the Lipschitz constant for the function
.
If the Lipschitz constant equals one,
, we say
is a short-map.
If the Lipschitz constant
is
we call the map
a contraction or contraction mapping.
A fixed point (or invariant point )
of a function
is an element of the function's domain which is mapped onto itself, i.e.
Observe that if a function
crosses the line
, then it does indeed have a fixed point.
The mean-value theorem can be incredibly useful for checking if a mapping is a contraction mapping, since it states that
for some
.
Therefore, if there exists
such that
for all
then we clearly know that
hence it's a contraction.
Hölder continuity
A real- or complex-valued function
on a d-dimensional Euclidean space is said to satisfy a Hölder condition or is Hölder continuous if there exists non-negative
and
such that
for all
in the domain of
.
This definition can easily be generalized to mappings between two different metric spaces.
Observe that if
, we have Lipschitz continuity.
Càdlàg function
Let
be a metric space, and let
.
A function
is called a càdlàg function if, for very
:
The left limit
exists.
The right limit
exists and equals
That is,
is right-continuous with left limits.
Affine space
An affine space generalizes the properties of Euclidean spaces in such a way that these are independent of concepts of distance and measure of angles, keeping only properties related to parallelism and ratio of lengths for parallel line segments.
In an affine space, there is no distinguished point that serves as an origin. Hence, no vector has a fixed origin and no vector can be uniquely associated to a point. We instead work with displacement vectors, also called translation vectors or simply translations, between two points of the space.
More formally, it's a set
to which is associated a vector space
and a transistive and free action
of the additive group of
. Explicitly, the definition above means that there is a map, generally denoted as an addition
which has the following properties:
- Right identity: $∀ a ∈ A, a + 0 = 0 $
Associativity:
- Free and transistive action: for every
, the restriction of the group action to
, the induced mapping
is a bijection. - Existence of one-to-one translations: For all
, the restriction of the group action to
, the induced mapping
is a bijection.
This very rigorous definition might seem very confusing, especially I remember finding
to be quite confusing.
"How can you map from some set
to itself when clearly the LHS contains an element from the vector space
!?"
I think it all becomes apparent by considering the following example:
and
. Then, the map
Simply means that we're using the structure of the vector space
to map an element from
to
!
Could have written
to make it a bit more apparent (but of course, a set does not have any inherit struct, e.g. addition).
Schwartz space
The Schwartz space
is the space of all
functions
on
s.t.
for all
.
Here if
then
and
An element of the Schwartz space is called a Schwartz function.
Covering and packing
Let
be a normed space
We say a set of points
is an
of
if
or, equivalently,
The packing number of
as
Theorems
Cauchy's Theorem
This theorem gives us another way of telling if a sequence of real numbers is Cauchy.
Let
be a sequence of real numbers. Then
is Cauchy if and only if
converges (to some point
in
).
Suppose that
is Cauchy. Given
, choose
such that
By the Triangle Inequality,
Therefore,
is bounded by
.
By the Bolzano-Weierstrass Theorem
Telescoping series
Bolzano-Weierstrass Theorem
A sequence
of sets is said to be nested if
If
is a nested subsequence of nonempty bounded intervals, then
is non-empty (i.e. contains at least one number).
Moreover, if
then
contains exactly one number (by non-emptiness of
).
Each bounded sequence in
has a convergent subsequence.
Assume that
is the lower and
the upper bound of the given sequence. Let
.
Divide
into two halves,
and
:
Since
at least one of these intervals contain
for infinitively many values of
. We denote the interval with this property
. Let
be such that
.
We proceed by induction. Divide the interval
into two halves (like we did with
). At least one of the two halves will contain infinitively many
, which we denote
. We choose
such that
.
Observe that
is a nested subsequence of boudned and closed intervals, hence there exists
that belongs to every interval
.
By the Squeeze Theorem
as
.
Triangle Inequality
Mean Value Theorem
If a function
is continuous on the closed interval
, and differentiable on the open interval
, then there exists a point
in
such that:
Rolle's Theorem
Suppose
with
. If
is continuous on
, differentiable on
and
then
for some
.
Intermediate Value Theorem
Consider an interval
on
and a continuous function
. If
is a number between
and
, then
M-test
Let
be a nonempty subset of
and
and suppose
(i.e. series is bounded ).
If
for
, then
converges absolutely and uniformly on
.
Fixed Point Theory
Banach Fixed Point Theorem
Let
be a be non-empty complete metric space with a contraction mapping
. Then
admits a unique fixed-point
in
.
Furthermore,
can be found as follows:
Start with an arbitrary element
in
and define a sequence
by
, then
When using this theorem in practice, apparently the most difficult part is to define the domain
such that
.
Fundamental Contraction Inequality
By the triangle inequality we have
Where we're just using the fact that for any two different
and
,
is at least less than
by assumption of
being a contraction mapping.
Solving for
we get
Measure
Definition
A measure on a set is a systematic way of defining a number to each subset of that set, intuitively interpreted as size.
In this sense, a measure is a generalization of the concepts of length, area, volume, etc.
Motivation
The motivation behind defining such a thing is related to the Banach-Tarski paradox, which says that it is possible to decompose the 3-dimensional solid unit ball into finitely many pieces and, using only rotations and translations, reassemble the pieces into two solid balls each with the same volume as the original. The pieces in the decomposition, constructed using the axiom of choice, are non-measurable sets.
Informally, the axiom of choice, says that given a collecions of bins, each containing at least one object, it's possible to make a selection of exactly one object from each bin.
Measure space
If
is a set with the sigma-algebra
and the measure
, then we have a measure space .
Sigma-algebra
Let
be some set, and let
be its power set. Then the subset
is a called a σ-algebra on
if it satisfies the following three properties:

is closed under complement: if 
is closed under countable unions: if 
These properties also imply the following:

is closed under countable intersections: if 
A measure
on a measure space
is said to be sigma-finite if
can be written as a countable union of measurable sets of finite measure.
Borel sigma-algebra
Any set in a topological space that can be formed from the open sets through the operations of:
- countable union
- countable intersection
- complement
is called a Borel set.
Thus, for some topological space
, the collection of all Borel sets on
forms a σ-algebra, called the Borel algebra or Borel σ-algebra .
Borel sets are important in measure theory, since any measure defined on the open sets of a space, or on the closed sets of a space, must also be defined on all Borel sets of that space. Any measure defined on the Borel sets is called a Borel measure.
Lebesgue sigma-algebra
Basically the same as the Borel sigma-algebra but the Lebesgue sigma-algebra forms a complete measure.
- Note to self
Suppose we have a Lebesgue mesaure on the real line, with measure space
.
Suppose that
is non-measurable subset of the real line, such as the Vitali set. Then the
measure of
is not defined, but
and this larger set (
) does have
measure zero, i.e. it's not complete !
- Motivation
Suppose we have constructed Lebesgue measure on the real line: denote this measure space by
. We now wish to construct some two-dimensional Lebesgue measure
on the plane
as a product measure.
Naïvely, we could take the sigma-algebra on
to be
, the smallest sigma-algebra containing all measureable "rectangles"
for
.
While this approach does define a measure space, it has a flaw: since every singleton set has one-dimensional Lebesgue measure zero,
for any subset of
.
What follows is the important part!
However, suppose that
is non-measureable subset of the real line, such as the Vitali set. Then the
measure of
is not defined (since we just supposed that
is non-measurable), but
and this larger set (
) does have
measure zero, i.e. it's not complete !
- Construction
Given a (possible incomplete) measure space
, there is an extension
of this measure space that is complete .
The smallest such extension (i.e. the smallest sigma-algebra
) is called the completion of the measure space.
It can be constructed as follows:
- Let
be the set of all
measure zero subsets of
(intuitively, those elements of
that are not already in
are the ones preventing completeness from holding true) - Let
be the sigma-algebra generated by
and
(i.e. the smallest sigma-algreba that contains every element of
and of
)
has an extension to
(which is unique if
is sigma-finite), called the outer measure of
, given by the infimum
Then
is a complete measure space, and is the completion of
.
What we're saying here is:
- For the "multi-dimensional" case we need to take into account the zero-elements in the resulting sigma-algebra due the product between the 1D zero-element and some element NOT in our original sigma-algebra
- The above point means that we do NOT necessarily get completeness, despite the sigma-algebras defined on the sets individually prior to taking the Cartesian product being complete
- To "fix" this, we construct a outer measure
on the sigma-algebra where we have included all those zero-elements which are "missed" by the naïve approach, 
- Let
Product measure
Given two measurable spaces and measures on them, one can obtain a product measurable space and a product measure on that space.
A product measure
is defined to be a measure on the measurable space
, where we've let
be the algebra on the Cartesian product
. This sigma-algebra is called the tensor-product sigma-algebra on the product space.
A product measure
is defined to be a measure on the measurable space
satisfying the property
Complete measure
A complete measure (or, more precisely, a complete measure space ) is a measure space in which every subset of every null set is measurable (having measure zero).
More formally,
is complete if and only if
Lebesgue measure
Given a subset
, with the length of a closed interval
given by
, the Lebesgue outer measure
is defined as
The Lebesgue measure is then defined on the Lebesgue sigma-algebra, which is the collection of all the sets
which satisfy the condition that, for every
For any set in the Lebesgue sigma-algrebra, its Lebesgue measure is given by its Lebesgue outer measure
.
IMPORTANT!!! This is not necessarily related to the Lebesgue integral! It CAN be be, but the integral is more general than JUST over some Lesgue measure.
Intuition
- First part of definition states that the subset
is reduced to its outer measure by coverage by sets of closed intervals - Each set of intervals
covers
in the sense that when the intervals are combined together by union, they contain 
- Total length of any covering interval set can easily overestimate the measure of
, because
is a subset of the union of the intervals, and so the intervals include points which are not in 
Lebesgue outer measure emerges as the greatest lower bound (infimum) of the lengths from among all possible such sets. Intuitively, it is the total length of those interval sets which fit
most tightly and do not overlap.
In my own words: Lebesgue outer measure is smallest sum of the lengths of subintervals
s.t. the union of these subintervals
completely "covers" (i.e. are equivalent to)
.
If you take an a real interval
, then the Lebesge outer measure is simply
.
Lebesgue Integral
The Lebesgue integral of a function
over a measure space
is written
which means we're taking the integral wrt. the measure
.
Special case: non-negative real-valued function
Suppose that
is a non-negative real-valued function.
Using the "partitioning of range of
" philosophy, the integral of
should be the sum over
of the elementary area contained in the thin horizontal strip between
and
, which is just
Letting
The Lebesgue integral of
is then defined by
where the integral on the right is an ordinary improper Riemann integral. For the set of measurable functions, this defines the Lebesgue integral.
Measurable function
Let
and
be measurable spaces.
A function
is said to be measurable if the preimage of
under
is in
for every
, i.e.
Radon measure
- Hard to find a good notion of measure on a topological space that is compatible with the topology in some sense
- One way is to define a measure on the Borel set of the topological space
Let
be a measure on the sigma-algebra of Borel sets of a Hausdorff topological space
.
is called inner regular or tight if, for any Borel set
,
is the supremum of
over all compact subsets of
of 
is called outer regular if, for any Borel set
,
is the infimum of
over all open sets
containing 
is called locally finite if every point of
has a neighborhood
for which
is finite (if
is locally finite, then it follows that
is finite on compact sets)
The measure
is called a Radon measure if it is inner regular and locally finite.
Suppose
and
are two
measures on a measures space
and that
is absolutely continuous wrt.
.
Then there exists a non-negative, measurable function
on
such that
The function
is called the density or Radon-Nikodym derivative of
wrt.
.
Continuity of measure
Suppose
and
are two sigma-finite measures on a measure space
.
Then we say that
is absolutely continuous wrt.
if
We say that
and
are equivalent if each measure is absolutely continuous wrt. to the other.
Density
Suppose
and
are two sigma-finite measures on a measure space
and that
is absolutely continuous wrt.
. Then there exists a non-negative, measurable function
on
such that
Measure-preserving transformation
is a measure-preserving transformation is a transformation on the measure-space
if
Sobolev space
Notation
is an open subset of 
denotes a infinitively differentiable function
with compact support
is a multi-index of order
, i.e.
Definition
Vector space of functions equipped with a norm that is a combination of
norms of the function itself and its derivatives to a given order.
Intuitively, a Sobolev space is a space of functions with sufficiently many derivatives for some application domain, e.g. PDEs, and equipped with a norm that measures both size and regularity of a function.
The Sobolev space spaces
combine the concepts of weak differentiability and Lebesgue norms (i.e.
spaces).
For a proper definition for different cases of dimension of the space
, have a look at Wikipedia.
Motivation
Integration by parst yields that for every
where
, and for all infinitively differentiable functions with compact support
:
Observe that LHS only makes sense if we assume
to be locally integrable. If there exists a locally integrable function
, such that
we call
the weak
-th partial derivative of
. If this exists, then it is uniquely defined almost everywhere, and thus it is uniquely determined as an element of a Lebesgue space (i.e.
function space).
On the other hand, if
, then the classical and the weak derivative coincide!
Thus, if
, we may denote it by
.
Example
is not continuous at zero, and not differentiable at −1, 0, or 1. Yet the function
satisfies the definition of being the weak derivative of
, which then qualifies as being in the Sobolev space
(for any allowed
).
Ergodic Theory
Let
be a measure-preserving transformation on a measure space
with
, i.e. it's a probability space.
Then
is ergodic if for every
we have
Limits of sequences
Infinite Series of Real Numbers
Theorems
Abel's formula
Let
and
be real sequences, and for each pair of integers
set
Then
for all integeres
.
Since
for
and
, we have
Infinite Series of Functions
Uniform Convergence
Theorems
Cauchy criterion
Let
be a nonempty subset of
, and let
be a sequence of functions.
Then
converges uniformly on
if and only if for every
there is an
such that
for all
.
Generally about uniform convergence
Let
be a nonempty subset of
and let
be a sequence of real functions defined on
i) Suppose that
and that each
is continuous at
. If
converges uniformly on
, then
is continuous at
ii) [Term-by-term integration] Suppose that
and that each
is integrable on
. If
converges uniformly on
, then
is integrable on
and
iii) [Term-by-term differentiation] Suppose that
is a bounded, open interval and that each
is differentiable on
. If
converges at some
, and
converges uniformly on
, then
converges uniformly on
,
is differentiable on
, and
Suppose that
uniformly on a closed interval
. If each
is integrable on
, then so is
and
In fact,
uniformly on
.
Problems
7.2.4
Let
- Show that the series converges on
![$[0, \frac{\pi}{2}]$](../../assets/latex/analysis_2ab3b0e0ae3632b24a23b25b4ec68039f3ab2cad.png)
- We can integrate series term by term
Start by bounding the terms in the sum:
And since the series
converges, the series in question converges.
Further,
Here we note that the numerator will only take on the values
, and in the non-zero cases the denominator will be as in the claim.
TODO 7.2.5
converges pointwise on
and uniformly on each bounded interval in
to a differentiable function
which satisfies
for all
- Pointwise convergence on

- Uniform convergence
with 
For 1. we observe that
where the last step is due to the sum being a Telescoping series, which equals 1.
We then use the M-test, hence we get convergence in uniform on
.
Now that we know that the series converges, we need to establish that the function satisfies the boundaries.
Where
becomes
if we can prove that RHS converges.
TODO 7.2.6
- Look at the more general case
- Look also at

Workshop 2
- 6
- Question
Let
for
. Prove that
converges pointwise on
and find the limit function.
Is the convergence uniform on
? Is the convergence uniform on
with
?
- Answer
First observe that for
and
we have
And for
Therefore the limiting function is
Is the convergence uniform on
? No!
By Thm. 7.10 in introduction_to_analysis we know that if
uniformly then
but in this case
Hence we have a proof by contradiction.
Is the convergence uniform on
? Yes!
and
hence
uniformly on
for
.
- Question
- 7
- Question
Let
be a sequence of continuous functions which converge uniformly to a function
. Let
be a sequence of real numbers which converges to
. Show that
.
- Answer
Observe that
for some
and
. We know
and for all
Further, by Theorem 7.10 introduction_to_analysis,
which implies
Therefore, for
, let
and
then
as wanted.
- Question
Uniform Continuity
Theorems
Suppose
is continuous. Then it is uniformly continuous.
Problems
Workshop 3
- 5
- Question
Let
be an open interval in
. Suppose
is differentiable and its derivative
is bounded on
. Prove that
is uniformly continuous on
.
- Answer
Suppose
for
. Then by Mean Value theorem we have
Therefore,
Thus, let
Then
Hence
is uniformly continuous.
- Question
Power series
Definitions
Let
be a sequence of real numbers, and
. A power series is a series of the form
The numbers
are called coefficients and the constant
is called the center.
Suppose we have the power series
then the radius of convergence
is given by
unless
is bounded for all
, in which case we declare
I.e.
is the smallest number such that all series with
is bounded.
Analytic functions
We say a function is analytic if it can be expressed as a power-series .
More precisely,
is analytic on
if there is a power series which converges to
on
.
Theorems
This holds in general , thus is basically another way of defining the radius of convergence .
provided this limit exists.
Converges to a continous function
Assume that
. Suppose that
. Then the power series converges uniformly and absolutely on
to a continuous function
. That is,
Taylor's Theorem
Suppose the radius of convergence is
. Then the function
is infinitely differentiable on
, and for such x,
and the series converges absolutely, and also uniformly on
for any $ r < R$. Moreover
Consider the series
which has radius of convergence
and so converges uniformly on
for any
.
Since
and
at least converges at one point, then by Theorem 7.14 in Wade's we know that
Further, we have
and
, which we can keep on doing and end up with
Problems
Finding radius of convergence
The power series
has a radius of convergence
, and converges absolutely on the interval
.
One can easily see that the series convergences on the the interval
, and so we consider the endpoints of this interval.
convergences
, which is known as the harmonic series and is known to diverge
7.3.1
a)
The series
converges on the interval
, and has radius of convergence
.
Letting
we write
Using the Ratio test, gives us
Thus we have the endpoints
and
.
For
we get the series
which converges.
For
we get the series
for which we can use the alternating series test to show that it converges.
Thus, we have the series converging on the interval
.
b)
We observe that
and let
, writing the series as
and using the root test, we get
where we let the above equal
for the sake of convenience.
Then we use the lim-inf definition of radius of convergence
c)
7.3.2
a)
Look at solution to 7.3.1. a)
b)
c)
Integrability on R
Definitions
Partition
Let
with
i) A partition of the interval
is a set of points
such that
ii) The norm of a partition
is the number
iii) A refinement of a partition
is a partition
of
which satisfies
. In this case we say that
is finer than
.
Riemann sum
Let
with
, let
be a partition of the interval
, set
for
and suppose that
is bounded.
i) The upper Riemann/Darboux sum of
over
is the number
where
ii) The lower Riemann/Darboux sum of
over
is the number
where
Since we assumed
to be bounded, the numbers
and
exist and are finite.
Riemann integrable
Let
with
. A function
is said to be Riemann integrable on
if and only if
is bounded on
, and for every
I.e. the upper and lower Riemann / Darboux sums has to converge to the same value.
Riemann integral
Let
with
, and
be bounded.
i) The upper integral of
on
is the number
ii) The lower integral of
on
is the number
iii) If the upper and lower integral are equal, we define the integral to be this number
The following definition of the Riemann sum can be proven to be equivalent of the upper and lower integrals using introduction_to_analysis.
Let
i) A Riemann sum of
wrt. a partition
of
generated by samples
is a sum
ii) The Riemann sums of
are said to converge to
as
if and only if given
there is a partition
of
such that
for all choices of
. In this case we shall use the notation
1
is just some arbitrary number in the given interval, e.g. one could set
.
Theorems
Suppose
with
. If
is continuous on the interval
, then
is integrable on
.
Telescoping
If
, then
This is more of a remark which is very useful for proofs involving Riemann sums, since we can write
This allows us to write the following inequality for the upper and lower Riemann / Darbaux:
in which case all we need to prove for
to be Riemann integrable is that this goes to zero as
, or equiv. as we get a finer partition.
Mean Value Theorem for Integrals
Suppose that
and
are integrable on
with
for all
. If
then
In particular, if
is continuous on
, then
Suppose that
are integrable on
, that
is nonnegative on
, and that
Then there is an
s.t.
In particular, if
is also nonnegative on
, then there is an
which satisifies
Let
be a real-valued function which is continuous on the closed interval
,
must attain its maximum and minimum at least once. That is,
Fundamental Theorem of Calculus
Let
be non-degenerate and suppose that
.
i) If
is continuous on
and
, then
and
ii) If
is differentiable on
and
is integrable on
then
Non-degenerate interval means that
.
Integrability on R (alternative)
Let
. Then we define the characteristic function
Let
be a bounded interval, then we define the integral of
as
Reminds you about Lebesgue-integral, dunnit?
We say
is a step function if there exist real numbers
such that
for
and 
is constant on 
We will use the phrase "
is a step function wrt.
" to describe this situation.
In other words,
is step function wrt.
iff there exists
such that
for
.
If
is a step function wrt.
which takes the value
on
, then
Notice that the values
have no effect on the value of
, as one would expect.
Let
. We say that
is Riemann integrable if for every
there exist step functions
and
such that
and
A function
is Riemann integrable if and only if
A function
is Riemann integrable if and only if there exist sequences of step functions
and
such that
and
If
and
are any sequences of step functions satisfying the above, then
as
.
Suppose
and
are Riemann integrable, and
. Then
is Riemann-integrable and
If
then
. Further,
is Riemann integrable and
and
are Riemann integrable
is Riemann integrable
Integrals and uniform limits of sequences and series of functions
Suppose
is a sequence of Riemann integrable functions which
uniformly.
Suppose that
and
are zero outside some common interval
. Then
is Riemann integrable and
Suppose
is a non-negative sequence of numbers and
is a function that
For some
and
For
we have
Then
for some
.
Problems
Workshop 5
5
- Question
- Answer
is Riemann integrable, then there exists step-functions
and
such that
Or rather, for all
, there exists
such that
Since
is integrable on a bounded and closed interval, then
is bounded and has bounded support. That is
Therefore, by the Mean Value theorem, we have
Therefore the "integral sum" for
satisfy the following inequality
Therefore, for
, we choose
which gives us
That is,
is Riemann integrable implies
is Riemann integrable.
We've left out the
in all the expressions above for brevity, but they ought to be included in a proper treatment.
Metric spaces
Definitions
Metric
A metric space is a set
together with a function
(called the metric of
) which satisfies the following properties for all
:
2
Metric space
A metric space is a set
together with a function
(called the metric of
) which satisfies the following properties for all
:
3
Balls
Let
and
. The open ball (in
) with center
and radius
is the set
Let
and
. The closed ball (in
) with center
and radius
is the set
Equivalence of metrics
We say two metrics
and
on a set
are strongly equivalent if and only if
We say two metrics
and
on a set
are equivalent if and only if for every
and every
there exists
such that
Closedness and openness
Closure and interior
For
is the interior of
; it is the largest subset of
which is open.
Or equivalently, the interior of a subset
of points of a topological space
consists of all points of
that do not belong to the boundary of
.
A point that is in the interior of
is an interior point of
.
For
is the closure of
; it is the smallest set containing
which is closed .
Or, equivalently, the closure of
is the union of
and all its limit points (points "arbitrarily close to
"):
For
is the boundary of
.
Convergence, Cauchy sequences and completeness
Most theorems and definitions used for sequences are readily generalized to metric spaces.
We say a metric space is complete if and only if every Cauchy sequence in
converges.
In a metric space
, a sequence
with
is bounded if there exists some ball
such that
for all
.
In a metric space
, a sequence
with
is a Cauchy sequence iff for every
,
Let
be a metric space, then
is said to satisfy the Bolzano-Weierstrass Property iff every bounded sequence
has a convergent subsequence .
Closedness, limit points, cluster points and completeness
is a limit point for
if and only if there is a sequence
such that
as
.
is a cluster point for
if and only if every open ball centred at
contains infinitely many points of
.
The following statements are equivalent:
s a cluster point for 
- for all
,
contains a point of 
, with
for all
, s.t.
as 
Every cluster point for
is a limit point for
. But
can be a limit point for
without being a cluster point .
A closed subset of a complete metric space is complete
A complete subset of any metric space is closed.
Every convergent sequence is Cauchy, but the opposite is not necessarily true.
Compactness
Let
be a collection of subsets of a metric space
and suppose that
is a subset of
.
is said to cover
if and only if
is said to be an open covering of
iff
covers
and each
is open.
Let
be a covering of
.
is said to a finite (respectively, countable ) subcovering iff there is a finite (respectively, countable) subset
of
s.t.
covers
.
Let
be a metric space.
A subset
is compact iff for every open cover
of
, there is a finite subcover
of
.
I often find myself wondering "what's so cool about this compactness?! It shows up everywhere, but why?"
Well, mainly it's just a smallest-denominator of a lot of nice properties we can deduce about a metric space. Also, one could imagine compactness being important since the basis building blocks of Topology is in fact open sets, and so by saying that any open cover has a finite open subcover, we're saying it can be described using "finite topological constructs". But honestly, I'm still not sure about all of this :)
One very interesting theorem which relies on compactness is Stone-Weierstrass theorem, which allows us to show that for example polynomials are dense in the space of continous functions! Suuuuper-important when we want to create an approximating function.
Let
be a metric space and let
. Then
is said to be dense in
if for every
and for every
we have that
i.e., every open ball in
contains a point of
.
Or, alternatively, as described in thm:dense-iff-closure-eq-superset:
A metric space
is said to be separable iff it contains a countable dense subset.
Where with countable dense subset we simply mean a dense subset which is countable.
We say the metric space
is a precompact metric space if for every
there is a cover of
by finitely many closed balls of the form
Let
be a complete metric space and precompact, then
is compact.
Let
be a metric space. Then
is said to be sequentially compact if and only if every sequence in
has a convergent subsequence.
Let
be a topological space, and
a subspace.
Then
is compact (as a topological space with subspace topology) if and only if every cover of
by open subsets of
has a finite subcover.
If
for
open in
(subspace topology), then
open in
s.t.
.
Therefore
: Choose finite subcover
. Then
is a finite subcover of
.
: Let
, then
open in
. So we let
so there exists finite
with
Every space with cofinite topology is compact.
Let
. Take some
so that
is finite.
Then
, there exists
in cover with
. Therefore
is a finite cover.
Idea: take away one cover → left with finitely many points → we good.
Motivation
Let
. For which
must
be bounded?
finite- If
of opens with
bounded then
is bounded. Any continuous
is locally bounded
with
bounded, e.g.
If there exists finitely many
as above, with
then
is bounded.
Compactness NOT equivalent to:
is a cover of 
covers
but not finite.
cover
→ clearly not finite mate
- Same as 2, but take finitely many

- Follows from 2 by taking subcover to be the whole cover.
always covers
and has finite subcover (e.g.
)
Examples
- Non-compact
, so
has no finite subcover, so
not compact- Infinite discrete space is not compact. Consider
which is an open cover, but has no finite subcover.
- Compact
indiscrete so
. Only open covers are
and
, and
is a finite subcover, hence
is compact.- Any finite space is compact (for any topology)
Some specific spaces
Banach space
See Banach space
Hilbert space
A Hilbert space
is a vector space equipped by an inner product such that the norm induced by the inner product
turns
into a complete metric space.
A Hilbert space is thus an instance of a Banach space where we specifically define the metric as the square-root of the inner product.
Reproducing Kernel Hilbert Space
Here we only discuss the construction of Reproducing Kernel Hilbert Spaces on the reals, but the results can easily be extended to complex-valued too.
Let
be an arbitrary set and
a Hilbert space of real-valued functions on
. The evaluation functional over the Hilbert space of functions
is a linear functional that evaluates each function at a point
,
We say that
is reproducing kernel Hilbert space if, for all
in
,
is continuous at any
in
, or, equiavelently, if
is a bounded operator on
, i.e. there exists some
such that
While this property for
ensure both the existence of an inner product and the evaluation of every function in
at every point in the domain. It does not lend itself to easy application in practice.
A more intuitive definition of the RKHS can be obtained by observing that this property guarantees that the evaluation functional
can be represented by taking the inner product of
with a function
in
. This function is the so-called reproducing kernel of the Hilbert space
from which the RKHS takes its name.
The Riesz representation theorem implies that for all
there exists a unique element
of
with the reproducing property
Since
is itself a function in
, it holds that for every
in
there exists a
s.t.
This allows us to define the reproducing kernel of
as a function
by
From this definition it is easy to see that
is both symmetric and positive definite, i.e.
for any
,
and some
.
- RKHS in statistcal learning theory
The representer theorem states that every function in an RKHS that minimizes an empirical risk function can be written as a linear combination of the kernel function evaluated at the training points.
Let
be a nonempty set and
a postive-definite real-valued kernel on
with corresponding RKHS
.
Given a training sample
, a strictly monotonically increasing real-valued fuction
, and a arbitrary emipirical risk function
, then for any
satisfying
admits a representation of the form
where
for all
and, as stated before,
is the kernel on the RKHS
.
Let
(so that
is itself a map
)
Since
is a reproducing kernel, then
where
is the inner product on
.
Given any
, one can use the orthogonal projection to decompose any
into a sum of two functions, one lying in the
and the other lying in the orthogonal complement:
where
for all
.
The above orthogonal decomposition and the reproducing property of
show that applying
to any training point
produces
which we observe is independent of
. Consequently, the value of the empirical risk
in defined in the representer theorem above is likewise independent of
.
For the second term (the regularization term), since
is orthogonal to summand-term and
is strictly monotonic, we have
Therefore setting
does not affect the first term of the empirical risk minimization, while it strictly decreasing the second term.
Consequently, any minimizer
of the empirical risk must have
, i.e., it must be of the form
which is the desired result.
This dramatically simplifies the regularized empirical risk minimization problem. Usually the search domain
for the minimization function will be an infinte-dimensional subspace of
(square-integrable functions).
But, by the representer theorem, we know that the representation of
reduces the original (infinite-dimensional) minimization problem to a search for the optimal n-dimensional vector of coefficients
for the kernel for each data-point.
Theorems
Let
, then
Let
. Consider the following polynomial
where we've used the fact that
for
.
Since it's nonnegative, it has at most one real root for
, hence its discrimant is less than or equal to zero. That is,
Hence,
as claimed.
Let
be a metric space.
- A sequence in
can have at most one limit. - If
converges to
and
is any subsequence of
, then
converges to
as 
- Every convergence sequence in
is bounded - Every convergence sequence in
is Cauchy
Let
. Then
as
if and only if
Let
. Then
is closed if and only if the limit of every convergent sequence
satisfies
A set is open iff it equals its interior ; a set is closed iff it equals its closure .
Let
be a metric space and let
. Then,
is dense if and only if
.
Any compact set must be closed and bounded.
The converse is not necessarily true (Heine-Borel Theorem addresses when this is true).
Bounded:
is compact implies that
s.t.
where
.
Let
be a separable metric space which satisfies the Bolzano-Weierstrass Property and
. Then
is compact if and only if it is closed and bounded.
Observe that
is closed and bounded.
is compact if and only if
.
Suppose
. Idea is cto construct
of unit vectors s.t.
Example: continuous functions
is a Banach space.
What are the compact sets
?:
is compact if and only if
is closed, bounded AND "something something" (what is it?)
Continuity and limits of functions
Let
where
and
are metric spaces.
Then
if and only if for every
we have
Or equivalently, if
is continuous on
,
Connected sets
Let
be a metric space.
- A pair of nonempty open sets
is said to separate
if and only if
and 
is said to be connected if and only if
cannot be separated by any pair of open sets 
Loosely speaking, a connected space cannot be broken into smaller, nonempty, open pieces which do not share any common points.
Let
be a metric space, and
.
are said to separate
if and only if:
(non-empty)

is connected if and only if it cannot be separated by any
.
A subset
is connected if and only if
is an interval
A subset
of a metric space
is path-connected if for every
there is a continuous function (path)
such that
Let
be a metric space, and
.
If
is path-connected , then
is connected.
Stone-Weierstrass Theorem
Notation
is a metric space
denotes a algebra in 


Goal
The goal of this section is to answer the following question:
Can one use polynomials to approximate continuous functions on an interval
?
Stuff
Let
be a metric space.
A set
is a said to be a (real function) algebra in
if and only if

- If
, then
and
both belong to 
- If
and
, then 
A subset
of
is said to be (uniformly) closed if and only if for each sequence
that satisfies
as
, the limit function
belongs to
.
A subset
of
is said to be uniformly dense in
if and only if given
and
there is a function
such that
.
A subset
of
separates points of
if and only if given
with
Stone-Weierstrass Theorem
Suppose that
is a compact metric space.
If
is an algebra in
that separates points of
and contains the constant functions, then
is uniformly dense in
.
This is HUGE. It basically says that on any compact metric space, we can approximate any continuous function arbitrarily well using only the constants functions and some functions which separates points in the metric space!
You know what space satisfies this? Space of all polynomials!
Q & A
DONE Alternative definition of compact; consequences?
What's the difference between the definition of compactness and the following:
A set
is compact if for every subset
there exists a finite covering
.
Is there any difference; and if so, what are the consequences?
Answer
Yes, there is a difference.
In our new definition we're only saying that the EXISTS some finite covering, while the proper definition is sort of saying that all coverings of
does in fact contain a finite covering, a sort of "the lowest common denominator covering has to be finite, and each of these coverings do in fact have this".
Fixed Point Theory
Differential Equations
Notation
Stuff
In this section we're considering the large class of ODEs of the form
there will exists a unique solution
for
sufficiently small.
To ensure the existence of a unique solution, we need to consider the following:
Suppose
and
. Also suppose
and
and
is continuous.
Further, suppose that for all
and
there exists
such that
Due to Mean Value Theorem, if
exists and is continuous on
then the above is satisfied.
Suppose
satisfies a Lipschitz condition as above. Then there exists an
such that the ODE
has a unique solution
for
.
Exercises
From the notes
A contraction is continuous
A contraction
is continuous.
Let
, and let
be a contraction. Now, for the sake of contradiction, suppose
is discontinuous at some point
.
Due to
being a contraction, then for any two points
we have
where
. Since
is arbitrary we can let
be such that
Clearly,
Thus,
Which implies,
But this is only true if and only if
is continuous; hence we have our contradiction.
TODO Exercise 1
Suppose
is a contraction mapping.
If
then any fixed point will be unique, whether or not
is complete.
Further, show that if
is not complete, then a fixed-point does not necessarily exist.
Let
be a metric space.
For the first part of our claim, suppose the mapping
satisfies
Then clearly
is a contraction, since we can always choose
such that
due to the interval
being dense in
.
Now, for the sake of contradiction suppose there exists two different fixed-points,
. Then, from the property above of
, we have
But, since
and
are fixed-points, we have
Hence, the above inequality implies
Which clearly is a contraction, hence if there exists a fixed-point of
, then that is a unique fixed-point.
Now, for the second part of the claim,
PROBABLY EXIST SOME COUNTER-EXAMPLE THAT I CAN'T THINK OF. SOME
SUCH THAT THIS IS NOT THE CASE.
TODO Exercise 2
Let
be a metric space which is not complete. Then there exists contractions with no fixed point.
Probably some counter-example I can't think of.
TODO Exercise 3
Note taken on
Regarding the previous note, could we not have
which for even
we would still have
be a contraction mapping, but for odd
it would not be! Therefore I believe it's reasonable to assume that they mean for any
.
- Note taken on
But
alone does not imply that
since
, could be
, hence there needs to be something else which ensures this implication. That is, if the claim is supposed to be true for any
, then yeah, this implication would definitively hold, but I'm not sure that is what they mean - Note taken on
being a contraction in a complete metric space =>
is a contraction in a complete metric space =>
has a unique fixed point in this space: NOPE! We can have fixed points without the function
being a contraction, also
being a contraction does not in fact imply that
is a contraction! See Exercise 4 for a counter-example.
Let
be a complete metric space, and suppose
is such that
is a contraction. Then
has a unique fixed point.
As proven in the first part of Exercise 1 we know that any contraction
has a unique fixed point when
is a complete metric space. Thus, we know that there exists some
such that
If we assume the claim is supposed to hold for any
, and not a specific arbitrary
, then the above implies
Further, if
was not unique, then
for some
, but this implies that
has another fixed point, which we know cannot be.
Hence, if
is a contraction, then
has a unique fixed point.
TODO Exercise 4
Note taken on
Maaaybe you can build some argument by creating two linear functions
and
which intersect
at a single point
, and such that
Basically, consider two functions for which we can easily compute the effect of
for two points
and
on the distance between them
which we can clearly tell has the property that
, buuut I'm not going to bother spending time on this now.
Note taken on
But, if we can show that
and then
we're good!
Note taken on
I was wondering if we can use some function
on the interval
which is defined such that
which also has the property that
for
, and then we potentially compute the distance of the difference between these functions to obtain some upper-bound on
which related to
.
Doesn't seem to work very well though
- Note taken on
Technically we only need to prove that
is a contraction mapping on the open interval
, since for
not in this interval,
will lie in the interval specified before.
is not a contraction, but
is a contraction.
Further, this implies that there is a unique solution in
to the equation
Clearly
is not a contraction, since
which implies
and
Hence,
NOW PROVE THAT
IS A CONTRACTION MAPPING YAH FOOL
Fourier Series
Notation
Definition
Let
be integrable on
.
The Fourier coefficients of
are numbers
and
Let
be integrable on
and let
be a nonnegative integer.
We define the Fourier series of
as the trigonometric series
and the partial sum of
of order
to be the trigonometric polynomial defined
Kernels
Dirichlet kernel
Let
be nonnegative integer.
The Dirichlet kernel of order
is the function defined
with the special case of
.
It turns out it can also be written as
for all
and
.
That is, we can write the N-th order Fourier partial sum as a convolution between
and the Dirichlet kernel.
where we've brought the intergrals together and used the trigonometric identity
Finally remembering that
We see that the above expression is simply
as claimed.
Suppose that
is integrable and that
uniformly on
. Then,
as
uniformly in
.
We know that
such that
Then we have,
I.e. we can make the difference between the coefficients as small as we'd like, hence
The very same argument holds for
.
Fejér kernel
The Fejér kernel of order
is the function defined
with the special case of
.
Functional Analysis
Notation
is the fields we'll be using
"Small Lp" space:
Let
be a measure space and
, then
denotes the set of all measurable functions
on
such that
and the values of
are real numbers, except possibly on a set of measure
.
Lp space:
where
Or more accurately,
where
denotes the equivalence classes of the equivalence relation
- NLS means normed linear spaces
Theorems
Inequalities
Let
such that
. Then
For any two sequences
and
of nonnegative numbers, we have
for any
where
is the conjugate exponent of
, i.e.
Observe that this aslo includes
and
!
For
and
:
From Hölders inequality we have
where
. Finally, letting
and
, we get
And taking both sides to the power of
:
as wanted.
For any two elements
and
of
, we have
for any
.
Mercer's theorem
Let
be a symmetric continuous function, often called a kernel.
is said to be non-negative definite (or positive semi-definite) if and only if
for all fininte sequences of points
and all choices of real numbers
.
We associate with
a linear operator
by
The theorem then states that there is an orthonormal basis
of
consisting for eigenfunctions of
such that the corresponding sequence of eigenvalues
is nonnegative.
The eigenfunctions corresponding to non-zero eigenvalues are continuous on
and
has the representation
where the convergence is absolute and uniform.
There are also more general versions of Mercer's thm which establishes the same result for measurable kernels, i.e.
on any compact Hausdorff space
.
Banach spaces
A norm on a vector space
over
(
or
) is a map
with the following properties:
- For all
,
, with equality if and only if 
- For all
and
, we have 
For all
, we have
If
is a norm on
, then we can define a metric
on
by setting
.
A normed vector space is said to be a Banach space if it is complete wrt. the associated metric.
A Banach space is said to be separable if it contains a countable dense subset.
Let
be a normed space and
a Banach space.
Suppose
is a dense subset of
and that
is a bounded linear operator.
Then there exists a unique bounded linear map
such that
where
denotes the restriction of
to the subspace
.
Furthermore, the norm of
equals the norm of
.
Suppose that
is a Banach space and
a normed vector space.
For any linear map
, let
denote the set of pairs
in
such that
.
If the graph of
is a closed subset of
, then
is bounded.
Examples
Equip
with the norm
Then,
Metric space structure on NLS
Let
is a NLS of
is a subspace.
If
is open, then
.
Clearly
, hence
there exists
Observe that
if and only if
for some
.
by definition of a subspace, and reverse is
which implies that
.
Cosnider
since
. Then,
So basically, since a linear space is closed under scalar multiplication, any open subspace
must contain any scaling of
, hence it must contain the entire space.
Let
be a NLS with
.
Then
is closed.
Completion of NLS
Notation
denotes a NLS which is not necessarily complete
denotes the completion of 
Set of Cauchy sequences in
:
Stuff
- Formal procedure to "fill holes" in an NLS, i.e. making non-complete NLS (NLS for which Cauchy sequences does not necessarily converge), well, complete!
- Observe that
is dense in the completion of
, i.e. 
- Let
.
Let
on
be the equivalence relation such that
Let
be a Banach space, and let
be a subspace.
Then
is a Banach space if and only if
is closed.
Suppose
is closed. Let
be Cauchy. Then
is Cauchy in
.
And since
is closed,
contains all its limit points, hence all convergent sequences in
converges to a point in
, i.e.
is Banach.
. Suppose
is a Banach space.
Let
s.t.
for some
.
Note that
is Cauchy, and since
is Banach, there exists an
s.t.
as
. Then
Hence,
, and
is closed.
Let
and
be two normal linear spaces.
- A linear map
is said to be an isometry if
for all
. - We say
and
are isometrically isomorphic if there exists an isometry
from
to
.
- Note that
is automatically a surjective isometry.
- Note that
- The Banach space completion of
is a pair, consisting of a Banach space
and an isometry
s.t.
is a dense subspace of
.
Let
and
be two completions of
.
Then
and
are isometrically isomorphic.
Let
be an NLS.
Then there exists a unique a Banach space completion of
.
Let
Equivalence relation
on
Let
, and define
Observe that
i.e. all sequences which converges to
.
Equip
a vector space structure, i.e. addition
and scalar multiplication
Equip
with a norm
Observe that
By completion of the underlying field
,
is Cauchy and thus converges. Hence the above norm is defined
Need to check that this is well-defined:
is a Banach spaceThen
is a linear map, then
Finally, one can show that
is dense in
, hence we have our Banach space completion.
Example
Let
be a subspace.
, i.e.
is closed in
.
Let
, thus
. Then
for some
.
We then need to show that
or
as
. or
Given
. Then
s.t.
.
But
implies
So for
,
So basically, show that a sequence of sequences
converge to some sequence
, thus
contains all it's limit points, hence
is closed.
Consider the NLS
for
.
Let
Then
is not closed, but
is dense in
, i.e.
Let
Then
, therefore
.
But
, but
, hence
does not contain all of its limit points, i.e. is not closed, proving our first claim.
Now suppose that
. We now want to show that
such that
and
.
TO THIS.
Equivalence of norms
- We'll be referring to norms as equivalent if the induced norms are strongly equivalent metrics
Two norms
and
on
are said to be equivalent if there is a constant
s.t.
This does in fact define an equivalence relation on norms.
Hilbert spaces
Let
be a
and
its dual space.
Then the map
Then
is isometric isomorphism.
If
is a bounded linear functional, then there exists a unique
such that
Furthermore, the operator norm of
as a linear functional is equal to the norm of
as an element of
.
and
s.t.
Assume
and consider
is closed (
cont.) and subspace (
linear) of
. Then
since
.
- But
implies there exists nonzero
. Take
and check
Basis in Hilbert spaces
Orthogonal decomposition
If
, then
For any seminorm
defined by a semi-inner product and any
,
For any inner product space
, and any Hilbert subspace
, and
there is a unique representation
with
and
.
Idea:
- Prove existence of orthogonal decomposition
- Proof by contradiction
- Consider a lower-bound on the distance between
for
, and then show that this is violated if
, i.e. the "rest" of
is not in
.
- Prove uniqueness
Let
Then let
such that
and
.
Then, by lemma:parallelogram-law we have
thus,
By completeness of
, we know that
. Further, observe that
, thus by def. of
,
Thus, as
, we have
for some
.
Now let
. Then
by continuity of
and the fact that
and
.
Further, suppose that
for some
. Let
and
.
Then we observe that
Substituting back in our epxression for
:
The last term is simply
, thus
For sufficiently small
, the
term dominates which implies that
for
.
This clearly contradicts our definition of
, hence we have the proof of decomposition existence by contradiction.
For uniqueness of the decomposition, we simply observe that if also
for some
and
, then
thus
which implies
and
.
Many functions in
of Lebesgue measure, being unbounded, cannot be integrated with the classical Riemann integral. Therefore spaces of Riemann integrable functions would not be complete in the
norm, and the orthogonal decomposition would not apply to them.
Another victory for good ole' Lebesgue!
Orthonormal sets and bases
A set
in a semi-inner product space
is called orthonormal if and only if
For any orthonormal set
and
,
If
is an orthonormal set in
, and
, where
and
Then
and
for all
.
Furthermore,
For any Hilbert space
, any orthonormal set
and any
, then
: Follows from Bessel's inequality and Parseval-Bessel equality, since we have
where the first equality is from Bessel's inequality and the second from Parseval-Bessel equality.
: For each
choose a finite set
such that
increases with
and
Then
is a Cauchy sequence, hence converges to some
since
is complete.
Then the net of all partial sums converges to the same limit, concluding our proof.
Let
be any inner product space and
be any linearly independent sequenec in
.
Then there is an orthonormal sequence
in
s.t. for each
,
and
have the same linear span.
- Side-note on why Hilbert space > Banach space when talking about bases
In any vector space
, a Hamel basis is a set
such that every
can be written uniquely as
with only finitely many
.
So, Hamel basis is an algebraic notion, which does not relate to any topology on
.
In a separable Banach space
, a Schauder basis is a sequence
such that for every
,
It is possible to find a Schauder basis in the "most useful" separable Banach spaces, but Schauder bases may not be conditional bases, and in general it may be very hard to find unconditional bases.
- Orthonormal basis for Hilbert space
Coming up with a basis for an infinite-dimensional space comes down to constructing a sequence of orthonormal vectors
by taking some vector for which the projection
, i.e. all of
does not lie in
.
Then we prove that this gives us a space which is dense in the "parent" space.
More concretely, let
and
be defined by choosing some
such that
, i.e.
, and
Every Hilbert space has an orthonormal basis.
If a collection of orthonormal sets is linearly ordered by inclusion (a chain), then their union is clearly an orthonormal set.
Thus by Zorn's lemma, let
be the maximal orthonormal set.
Take any
. Let
where the sum converges by Bessel's inequality and Riesz-Ficher theorem.
If
, we are done. Otherwise, then
for all
, so we can adjoin a new element
contradicting the maximality of the orthonormal set.
Every Hilbert space is isometric to a space
for some set
.
Let
be an orthonormal basis for
.
Then
takes
into
by Bessel's inequality.
This function preserves inner products by the Parseval-Bessel equality. It is onto
by the Riesz-Fischer theorem, concluding our proof.
For any inner product space
, an orthonormal set
is an orthonormal basis of
if and only if its linear span
is dense in
.
Let
be a orthonormal sequence in a Hilbert space
.
The following are equivalent:
If
s.t.
for all
, then
In other words, the sequence
is a maximal orthonormal family of vectors.
Span of
is dense in
Unique convergence
then
Inner product on basis
The norm
If one of these statements hold (and thus all of them hold), we say
is an orthonormal basis of
.
: Idea is that
implies
as a subspace is simply
, hence the closure is dense in
.
Suppose
.
Then since
, we have
and so
s.t.
and
for all
. But this is a contradiction wrt.
, hence we have our proof.
: Let
and since
.
Set
We make the following observations:
and
.
as
which tells us that a particular subsequence converges.
we have
,
as
, since
. Which tells us that all possible subsequences converge.
Therefore
.
:
were we have used the interchanging of limits on multiple occasions, and in the final equality used the orthogonality of the
.
: Apply 4 with
.
: Otherwise we would have extra terms for the norm, rather than just the "Fourier" coefficients → contradiction.
Bounded Linear Operators
Notation
be linear operator and
and
are normed linear spaces
be vector space of bounded linear operators from
to
.- $T ∈
Stuff
Let
be linear operator and
and
are normed linear spaces.
Then the following are equivalent
is continuous on all of 
is continuous at 
is bounded
is seen by observing that for a sequence
we have
and so
and
are equivalent.
: Suppose
is continuous at
. Thus
Let
, and let
and
So
which gives us
If
, then all linear maps
are continuous.
Hence
.
T is onto: let
Then
is a closed subspace of
.
Suppose
, then we can decompose
, then this implies there exists some nonzero
.
Let
and consider
Then observe that
which implies that
which implies
So for
Therefore,
or equiv,
and 
Let
is the subspace of finite rank operators.
Then for
and
, there exists
, i.e. constants which depend on
(functions yo!) s.t.
Then:
is linear
are linear for all 
bounded / cont.
are bounded / cont. for all 
If
then
, i.e. the dual space of
!
If
is a Banach space, then
is a Banach space.
Examples with
, i.e. 
for 
Fix
and define
Then
by Hölder's inequality. So
is bounded,
(remember we fixed
).
Hence
and
.
Letting
Then we attain the UB, and so we have equality.
We can then isometrically embed
into
. That is, we can show that for any
, there exists
s.t.
Hilbert-Schmidt operators
Let
where
are separable Hilbert spaces.
We say that
is a Hilbert-Schmidt operator if
The space of all such operators are denoted
with the norm defined
is a vector space and
but
is not a closed subspace of
since
- Suppose
be a ONB for
and
ONB for
. Let
then
and
Thus,
(switching sums we can always when all the terms are nonnegative).
- Hence, the definition of a Hilbert-Schmidt operator (also the norm
) is independent of the choice of ONB
Let
Then
is a FR operator.
as
then
since then
which would imply that
contain the limit points of
.
since
i.e.
Example: kernel operators on ![$X = C([0, 1])$](../../assets/latex/analysis_8fe674cfb6c700fd8b70de8404a7c4713bc0d852.png)
Let
be what's called a kernel on
and let
called a integral operator of
.
Then
which implies that
That is,
So
is bounded and
.
Let
be the completion of
.
Consider the ONB defined by the Fourier coefficients
.
where
Summing over
Compact operators
Notation
denotes the space of compact operators
, i.e. denotes the set of all convergent sequences
Stuff
Let
and
be normed spaces and let
.
Then
is said to be a compact operator if
is a compact subset of
.
That is,
is compact if the image of the unit closed ball is compact.
Recall that in finite-dimensional space, by Heine Borel, closed and bounded subsets are compact.
Hence finite-rank operators are always compact!
Suppose that
is a normed space,
is a Banach space and that
is a compact operator.
Suppose there is a
s.t.
as
. Then
is compact.
is a subspace of 
- If
is a Banach space, then
is closed (i.e. Theorem thm:limit-of-compact-operators-is-compact since it contains all its limit points)
- Supose
are separable Hilbert spaces (thus there exist bases) We have
- Though we will only prove

- Though we will only prove
- Let
linear and
be a ONB in
, and assume
since otherwise "containments" would be equality for all in the above. Furthermore, suppose
iff
and
, since
iff
and 
iff 
iff 
iff 
From this we get the above sequence of containments, since
Spectral Theorem for Bounded Self-Adjoint Operators
Notation
is the separable complex Hilbert spaceOperator norm of
on
is
is finite.
- Banach space of bounded operators on
, wrt. operator norm is denoted
.
denotes the resolvent set of 
denotes the spectrum of 
denotes the projection-valued measure associated with the operator self-adjoint 
For any projection-vauled measure
and
, we have an ordinary (positive) real-valued measure
given by
is a map defined by
Spectral subspace for each Borel set
of
defines a simultanouesly orthonormal basis for a family
of separable Hilbert spaces
Properties of Bounded Operators
Linear operator
on
is said to be bounded if the operator norm of
is finite.
Space of bounded operators on
forms a Banach space under the operator norm, and we have the inequality
for all bounded operators on
and
.
For
, the resolvent set of
, denoted
is the set of all
such that the operator
has a bounded inverse.
The spectrum of
, denoted by
, is the complement in
of the resolvent set.
For
in the resolvent set of
, the operator
is called the resolvent of
at
.
Alternatively, the resolvent set of
can be described as the set of
for which
is one-to-one and onto.
For all
, the following results hold.
- The spectrum
of
is closed, bounded and nonempty subset of
. - If
, then
is in the resolvent set of 
Point 2 in proposition:hall13-quant-7.5 establishes that
is bounded if
is bounded.
Suppose
satisfies
.
Then the operator
is invertible, with the inverse given by the following convergent series in
:
For all
, we have
Spectral Theorem for Bounded Self-Adjoint Operators
Given a bounded self-adjoint operator
, we hope to associate with each Borel set
a closed subspace
of
, where we think intuitively that
is the closed span of the generalized eigenvectors for
with eigenvalues in
.
We would expect the following properties of these subspaces:
and
- Captures idea that generalized eigenvectors should span

- Captures idea that generalized eigenvectors should span
- If
and
are disjoint, then
- Generalized eigenvectors ought to have some sort of orthogonality for distinct eigenvalues (even if not actually in
)
- Generalized eigenvectors ought to have some sort of orthogonality for distinct eigenvalues (even if not actually in
- For any
and
, 
If
are disjoint and
, then
- For any
,
is invariant under
. If
and
, then
Projection-Valued measures
For any closed subspace
, there exists a unique bounded operator
such that
where
is the orthogonal complement.
This operator is called the orthogonal projection onto
and it satisfies
i.e. it's self-adjoint.
One also has the properties
or equivalently,
Conversely, if
is any bounded operator on
satisfying
and
, then
is the orthogonal projection onto a closed subspace
, where
- Convenient ot describe closed subspaces of
in terms of associated orthogonal projection operators - Projection operator expresses the first four properties of the spectral subspaces; those properties are similar to those of a measures, so we use the term projection-valued measure
Let
be a set and
an
in
.
A map
is called a projection-valued measure if the following properties are satisfied:
- For each
,
is an orthogonal projection
and 
If
are disjoint, then for all
, we have
where the convergence of the sum is in the norm-topology on
.
- For all
, we have 
Properties 2 and 4 in of a projection-valued measure tells us that if
and
are disjoint, then
from which it follows that the range of
and the range of
are perpendicular.
Let
be a
in a set
and let
be a projection-valued measure.
Then there exists a unique linear map, denoted
from the space of bounded, measurable, complex-valued functions on
into
with the property that
for all
and all
.
This integral has the following properties:
For all
, we have
In particular, the integral of the constant function
is
.
For all
, we have
Integration is multiplicative: For all
and
, we have
For all
, we have
In particular, if
is real-valued, then
is self-adjoint.
By Property 1 and linearity, integration wrt.
has the expected behavior on simple functions. It then follows from Property 2 that the integral of an arbitrary bounded measurable function
can be comptued as follows:
- Take sequence
of simple functions converging uniformly to 
- The integral of
is then the limit, in the norm-topology, of the integral of the
.
A quadratic form on a Hilbert space
is a map
with the following properties:
for all
and 
the map
defined by
is a sesquilinear form.
A quadratic form
is bounded if there eixsts a constant
such that
The smallest such constant
is the norm of
.
If
is a bounded quadratic form on
, there is a unique
such that
If
belongs to
for all
, then the operator
is self-adjoint.
Spectral Theorem for Bounded Self-Adjoint Operators: direct integral approach
Notation
is a
measure on a
of sets in 
- For each
we have a separable Hilbert space
with inner product 
- Elements of the direct integral are called sections

Stuff
There are several benefits to this approach compared to the simpler "multiplication operator" approach.
- The set
and the function
become canonical:
- The direct integral carries with it a notion of generalized eigenvectors / kets, since the space
can be thought of as the space of generalized eigenvectors with eigenvalue
. - A simple way to classify self-adjoint operators up to unitary equivalence: two self-adjoint operators are unitarily equivalent if and only if their direct integral representations are equivalent in a natural sense.
Elements of the direct integral are called sections
, which are functions on
with values in the union of the
, with property
We define the norm of a section
by the formula
provided that the integral on the RHS is finite.
The inner product between two sections
and
(with finite norm) should then be given by the formula
Seems very much like the differential geometry section we know of.
is the fibre at each point
in the mfd.
is the mfd.
First we slightly alter the concept of an orthonormal basis. We say a family
of vectors is an orthonormal basis for a Hilbert space
if
and
This just means that we allow some of the vectors in our basis to be zero.
We define a simultanouesly orthonormal basis for a family
of separable Hilbert spaces to be a collection
of sections with the property that
Provided that the function
is a measurable function from
into
, it is possible to choose a simultaneous orthonormal basis
such that
is measurable for all
and
.
Choosing a simultaneous orthonormal basis with the property that the function
is a measurable function from
into
, we can define a section to be measurable if the function
is a measurable complex-valued function for each
. This also means that the
are also measurable sections.
We refer to such a choice of simultaneous orthonormal basis as a measurability structure on the collection
.
Given two measurable sections
and
, the function
is also measurable.
Suppose the following structures are given:
- a
measure space 
- a collection
of separable Hilbert spaces for which the dimension function is measurable - a measurability structure on

Then the direct integral of
wrt.
, denoted
is the space of equivalence classes of almost-everywhere-equal measurable sections
for which
The inner product
of two sections
and
is given by the formula
If
is self-adjoint, then there exists a σ-finite measure
on
, a direct integral
and a unitary map
between
and the direct integral such that
for all sections
.
Proofs
Notation
with spectral radius
Stage 1: Continuous Functional Calculus
Stage 2: An Operator-Valued Riesz Representation Theorem
Let
be a compact metric space and let
denote the space of continuous, real-valued functions on
.
Suppose
is a linear functional with the property that
is non-negative if
is non-negative.
Then there exists a unique (real-valued, positive) measure
on the Borel sigma-algebra in
for which
Observe that
is a finite measure, with
where
is the constant function.
Continuous one-parameter groups
semigroup or strongly continuous one-parameter semigroup
A strongly continuous one-parameter semigroup on a Banach space
is a map
such that
(i.e.
identity operator on 
we have
we have
The first two axioms are algebraic, and state
is a representation of a semigroup
, and the last axiom is topological and states that
is continuous in the strong operator topology.
Infinitesimal generator
The infinitesimal generator
of a strongly continuous semigroup
defined by
whenever the limit exists.
The domain of
, denoted
, is the set of
for which the limit does exist;
is a linera subspace and
is linear on this domain.
The operator
is closed, although not necessarily bounded, and the domain
is dense in
.
The storngly continuous semigroup
with generator
is often denoted by the symbol
, which is compatible with the notation for matrix exponentials.
Reproducing Kernel Hilbert Spaces (RKHSs)
Definitions
Let
be a non-empty set, sometimes reffered to as the index set.
A symmetric function
is called a positive-definite kernel of
if
holds for any
,
,
.
Or more generally, for some field real or complex field
, a function
is called a kernel on
if there exists a
and a map
such that for all
we have
In machine learning context, you'll often see
be called a feature map and
a feature space of
.
The definition of a positive-definite kernel is equivalent to the following: A symmetric function
is positive-definite if the matrix
with elements
is positive semi-definite for any finite set
of any size
.
This matrix
is often referred to as the kernel matrix or Gram matrix.
Note the swapping of order in the kernel and inner product above; this is only necessary in the case of a complex Hilbert space, where the inner product is sesquilinear:
Examples of kernels
Let
Then a Gaussian RBF kernel or square exponential kernel is defined
Let
The Matérn kernel is defined
where
is the gamma-function
is the modified Bessel function of the second kind of order 
determines the scale
determines the smoothness of the functions in the associated RKHS
- As
increases, the functions get smoother
- As
If
can be written as
for
, then the expression for the Matérn kernel reduces to a product of the exponentiated function and a polynomial of degree
, which can be computed easily rasmussen2006gaussian:
For example
results in whats know as the Laplace or exponentiated kernel:
Gaussian RBF kernels can be obtained as limits of Matérn kernels for
, i.e. for a Matérn kernel
with
being fixed, we have
Let
Then a polynomial kernel is defined
TODO Kernel embeddings
Exam prep
May 2017
1
c
If
uniformly continuous, and
uniformly then
is also uniformly continuous.
Given
, find
such that
In particular
,
is uniformly continuous, implies
Compactness
,
is closed and
is compact, then
is also compact.
Let
be any open cover of
.
might not cover whole
.
is open, therefore
is an open cover of
. Thus, due to
being compact, there exists a finite subcover of the above cover,
covers
, then clearly
is a finite cover of
, hence
is compact, as claimed.
Proving connectedness of a set
- Best way to prove a set
is connected is to check if it's path connected - Best way to prove a set
is not connected (disconnected), we use the definition of connectedness:
Find 2 open sets
such that
i.e. two sets such that the union covers
but they share no elements.
as the common value
outside of
is said to be open if and only if for every
there is an
is contained in
is
.
.









