The MAST Course on Relativity: Chapter Two

George E. Hrabovsky

MAST

Introduction

This chapter introduces several mathematical notions. I assume you have seen some set theory before this. If you are new to the ideas of set theory, I recommend Halmos’s excellent book called Naive Set Theory.

Manifolds

Last chapter we spoke of manifolds. We stated, almost as a definition, that manifolds were the arena where relativity lives. In point of fact, it is the place where any geometry occurs.

We can think of a manifold as some space having the local smoothness of Euclidean space, l2_1.png. Many authors use l2_2.png as it is the space of the set of n-tuples of real numbers, but it has a fixed origin. Since we need to speak of local frames of reference, we want to be able to establish new origins as we need them.

We don't really know what smoothness means, but maybe we can find out. How do we proceed without knowing what we mean? To understand this we will have to examine those properties of l2_3.png that produce local smoothness—whatever that is.

Recall that a set is a collection of objects, called its elements. If every element of a set, U, is also an element of another set, M, then we say that U is a subset of M, and we write this symbolically U⊂M. I do not propose that we delve into set theory, but it is the language used in mathematics so we will cover the bare necessities to proceed. The set of no elements is called the empty set, also called the null set; it is denoted Ø={}. The empty set is a subset of every set. We can also establish a universe of discourse, or just the universal set, U, that contains every possible element we could ever be interested in. All sets are subsets of the universal set.

Recall that if we have two sets, A and B, we can establish a special set made up of the elements of both sets combined, called the union. This is written A∪B={x|x∈A or x∈B}. We can also make a new set out of the elements that they have in common, called the intersection. This is written A∩B={x|x∈A and x∈B}. We can also create a new set by listing all elements of A that are not in B; this is called the difference and is denoted A-B={x|x∈A and x∉B}, where ∉ means not an element of. If we have the difference U-B, this is called the complement of B; this can also be written l2_4.png.
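These operations can be tried out on small finite sets. The following is a minimal sketch; the particular elements, and the ten-element stand-in for the universal set, are illustrative assumptions rather than anything taken from the chapter.

```python
# Finite stand-ins for the sets in the text (illustrative values only).
universe = set(range(10))      # plays the role of the universal set U
A = {1, 2, 3, 4}
B = {3, 4, 5, 6}

union = A | B                  # {x | x in A or x in B}
intersection = A & B           # {x | x in A and x in B}
difference = A - B             # {x | x in A and x not in B}
complement_B = universe - B    # U - B, the complement of B

print(union, intersection, difference, complement_B)
```

Python's built-in set type is used here because its operators mirror the set-builder definitions one for one.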

The set of points in a space is called, reasonably enough, a point set.

We can state that there is some rule, φ, that connects two sets, A and B, so that an element of A is converted into a unique element of B. Such a rule is called a mapping, or function. We then can write φ:A→B. A mapping where no two elements of A are mapped to the same point in B is said to be one-to-one or equivalently injective. Such a mapping could be called an injection. We say that the image of φ is φ[A]={φ(x)|x∈A}.
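For a finite set the image and the injectivity test can be computed directly. This is only a sketch: the helper names image and is_injective, and the two sample mappings, are hypothetical choices made for illustration.

```python
def image(phi, A):
    """Return phi[A] = {phi(x) | x in A} for a finite set A."""
    return {phi(x) for x in A}

def is_injective(phi, A):
    """True when no two distinct elements of A share an image."""
    return len(image(phi, A)) == len(A)

A = {-2, -1, 0, 1, 2}

def square(x):
    return x * x       # sends both -1 and 1 to 1, so it is not injective on A

def shift(x):
    return x + 10      # distinct inputs give distinct outputs

print(image(square, A), is_injective(square, A), is_injective(shift, A))
```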

A mapping from a subset of l2_5.png to l2_6.png is a fancy way of saying that you now have n functions of n real variables. If all of the partial derivatives of these n functions, of all orders, exist and are continuous, we say that such a mapping is l2_7.png or smooth. So now we know what smoothness means. To apply this idea to what we have already written about manifolds, it tells us that a manifold is some space whose local structure is represented by n functions of n variables whose partial derivatives of all orders exist and are continuous on l2_8.png. The spacetime manifold of chapter one is one such example of this.

Now we explore a little bit of topology. Let’s say that between any two points, say x and y, we can define the distance using the standard Euclidean distance in l2_9.png, d(x,y). A point set, O, such that for every point x∈O there exists a number ε>0 for which y∈O whenever d(x,y)<ε, is called an open set. Simple examples of open sets are the open intervals of elementary calculus whose distance function is the absolute value.
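For an open interval a suitable ε can be written down explicitly: the distance from x to the nearer endpoint. The sketch below assumes the interval (0, 1) and a hypothetical helper name epsilon_for; it spot-checks a few points rather than proving anything.

```python
def epsilon_for(x, a, b):
    """Return an epsilon > 0 whose neighborhood of x stays inside (a, b)."""
    if not a < x < b:
        raise ValueError("x must lie in the open interval (a, b)")
    # The distance to the nearer endpoint works: |x - y| < epsilon forces a < y < b.
    return min(x - a, b - x)

x = 0.25
eps = epsilon_for(x, 0.0, 1.0)
# Sample a few points within distance eps of x and confirm they stay in (0, 1).
samples = [x + eps * t for t in (-0.99, -0.5, 0.0, 0.5, 0.99)]
print(eps, all(0.0 < y < 1.0 for y in samples))
```

An endpoint such as x = 0 has no such ε inside a closed interval, which is why the function rejects it.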

To begin our hunt for what a manifold is, we begin by assuming that we have a set M.

l2_10.gif

We will also assume that we have a subset of M, U.

l2_11.gif

We will then establish a mapping l2_12.png.

l2_13.gif

We will state our first two rules,

C1: We will require that φ is injective.

C2: φ[U]=O is open, l2_14.png.

l2_15.gif

These structures can be combined, (U,φ), where we call such combinations n-charts or just charts. Another phrase used for a chart is a coordinate patch.

It seems that we might be able to use these charts to somehow make sure that our set M has this local smoothness structure we have been talking about. Why? Such a chart sets up a correspondence between the point set U⊂M and the open point set l2_16.png. In other words a chart defines n real-valued functions on U. These functions have all relevant partial derivatives and those are l2_17.png. Such a set of functions is what we call the smoothness structure. The points of U can be labeled by the values of these n functions. When charts overlap we will further require that they share the same smoothness structure. A collection of charts on a manifold is called an atlas. We will require that we are able to establish an atlas on M.

Above we introduced the notion of an injective mapping. A mapping μ:A→B where, for every element of B, there is at least one element of A that is mapped to it is said to be onto, or a surjection. A mapping that is both injective and surjective is called a one-to-one correspondence or, in less of a mouthful, a bijection. Such a mapping is said to be bijective.
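A bijection is exactly a mapping that can be inverted. The sketch below represents finite mappings as Python dictionaries; the function names and the sample sets are hypothetical and serve only to illustrate the definitions.

```python
def is_surjective(mu, A, B):
    """True when every element of B is the image of at least one element of A."""
    return set(B) <= {mu[x] for x in A}

def is_injective(mu, A):
    """True when distinct elements of A have distinct images."""
    return len({mu[x] for x in A}) == len(set(A))

def inverse(mu, A, B):
    """Return the inverse mapping B -> A, which exists only for a bijection."""
    if not (is_injective(mu, A) and is_surjective(mu, A, B)):
        raise ValueError("mapping is not a bijection")
    return {mu[x]: x for x in A}

A = {1, 2, 3}
B = {"a", "b", "c"}
mu = {1: "a", 2: "b", 3: "c"}         # a bijection
collapse = {1: "a", 2: "a", 3: "b"}   # neither injective nor onto B

print(inverse(mu, A, B))
```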

Say that we have two charts on M, (U,φ), and (U',φ'). If U and U' intersect in M, then their intersection, U∩U', will induce two smoothness structures on M. One smoothness structure comes from φ and defines a bijection between the intersection U∩U' and the image of that intersection in l2_18.png,  l2_19.png. Another smoothness structure comes from φ' and looks a lot like the previous one; it defines a bijection between U∩U' and l2_20.png.

l2_21.gif

We can compose these mappings, l2_22.png.

l2_23.gif

We can also take its inverse l2_24.png.

l2_25.gif

Together these compositions form a bijection between φ[U∩U'] and φ'[U∩U']. It is this bijection that we can use to compare the smoothness structures of our proposed manifold. We now state two more rules:

Co1: φ[U∩U'] and φ'[U∩U'] are open subsets of l2_26.png.

Co2: The mappings l2_27.png and l2_28.png are l2_29.png.

If these conditions are met for a pair of charts, those charts are said to be compatible. For charts to be compatible we only require that they share the same smoothness property. In this way you can isolate a single smoothness structure. It is important to realize that (U,φ) and (U',φ') are automatically compatible if U and U' do not intersect, since the two conditions then have nothing to check.

We can have collections of sets, but we don’t want to call them sets of sets. Instead we call them families of sets. If we have a family of ten sets, each labeled M, we can index them using a subscript, l2_30.png. We can define a set i={1,2,…,10} and call this our index set. We can then define our family over the index set i by writing l2_31.png. As an example, we say that the third set of our family, according to our index set, is l2_32.png.

Let’s say we have a set, M, and we have a family of n-charts over an index set i on M, l2_33.png. If the following four conditions are met, then we call M an n-dimensional manifold.

M1: Any two charts of M are compatible. Another way of saying this is that if two charts induce a smoothness structure in the same region of M then those structures must agree.

M2: The charts cover M. This is another way of saying that all of M has an induced smoothness structure.

M3: Any n-chart compatible with the charts in the collection of charts C of M, is itself a member of C. This condition makes sure we are not overburdened with structure by limiting the number of charts we can put on M. By this condition we allow all compatible charts and remove all other structures.

M4: If we have distinct points in M, say p∈M and q∈M, then there will exist charts, l2_34.png and l2_35.png such that l2_36.png and l2_37.png where l2_38.png.

When we speak of manifolds, we write the label we have assigned and the charts are implied. There are different, but equivalent names for an n-dimensional manifold: l2_39.png-manifold (smooth manifold), Hausdorff manifold, and (this last is a mouthful) an n-dimensional manifold without boundary that is not necessarily paracompact or connected. In fact, the condition M4 prevents non-Hausdorff manifolds.

Given any set M with a set of n-charts C, such that C satisfies M1, M2, and M4, then M with C is a manifold.

Exercise 2.1: Show this last sentence to be true. If true we can always reduce the structure to that of a manifold by including more charts.

For example, say we have M as the set of n-tuples of real numbers. We can consider M a point set, so that we write l2_40.png. Let U be any subset of l2_41.png that is open. Let l2_42.png be the identity mapping. (U,φ) is a chart (check C1 and C2 to make sure). We see that this satisfies M1, M2, and M4, and that makes this a manifold. M3 is a lot harder to implement, but fortunately we do not need it.

As another example assume that M is the set of (n+1)-tuples of real numbers that also satisfy the equation l2_43.png. Define a point set l2_44.png having l2_45.png. Define the mapping l2_46.png as acting on the point l2_47.png, transforming to l2_48.png. Then we define another point set l2_49.png having l2_50.png. Define the next mapping l2_51.png acting on the point l2_52.png, this transforms to l2_53.pngl2_54.png. Define a point set l2_55.png having l2_56.png. We define the next mapping l2_57.png that acts on the point l2_58.pngl2_59.png, becoming l2_60.png l2_61.png. If we continue in this way we will find (2n+2) charts, l2_62.png and l2_63.png, where i=1,2,…,(n+1). These 2n+2 charts are all compatible. This produces an n-dimensional manifold called the n-sphere, l2_64.png.
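The sphere charts can be explored numerically. The sketch below assumes the standard construction: the sphere is the unit sphere (the squares of the coordinates sum to 1), and each chart is defined on the open hemisphere where one coordinate is positive and simply drops that coordinate. Whether this matches the chapter's formulas exactly is an assumption, and the function names are hypothetical. Only the "plus" charts are shown; the "minus" charts would use the negative square root.

```python
import math

def chart_plus(i, point):
    """Chart on the hemisphere where coordinate i is positive: drop coordinate i."""
    if point[i] <= 0:
        raise ValueError("point is outside the domain of this chart")
    return tuple(c for k, c in enumerate(point) if k != i)

def chart_plus_inverse(i, coords):
    """Inverse chart: restore the dropped coordinate from the sphere equation."""
    s = sum(c * c for c in coords)
    if s >= 1:
        raise ValueError("coordinates are outside the open unit ball")
    missing = math.sqrt(1.0 - s)
    return coords[:i] + (missing,) + coords[i:]

def transition(i, j, coords):
    """Chart j composed with the inverse of chart i, on the overlap of their domains."""
    return chart_plus(j, chart_plus_inverse(i, coords))

# A point on the 2-sphere: 0.36 + 0.2304 + 0.4096 = 1.
p = (0.6, 0.48, 0.64)
coords = chart_plus(0, p)            # drops the first coordinate
print(coords, transition(0, 2, coords))
```

The transition function involves only a square root of a positive quantity on the overlap, which is why charts of this kind are smooth there.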

Exercise 2.2: Show that l2_65.png is indeed a manifold by this definition.

Assume we have two manifolds, M and N of dimensions m and n respectively. We can define a new manifold, P, called the product of M and N where we write, P=M×N. The dimension of P will be m+n. P is a point set made up of the pairs (p,q) where p∈M and q∈N. How do we introduce the required charts for this to be a manifold? Since M is a manifold we have a chart l2_66.png where l2_67.png and l2_68.png. Similarly for N, we have a chart l2_69.png where l2_70.png and l2_71.png. Since P=M×N, then we have the open subset l2_72.png. Now we need to establish our mapping. We use the points we defined above. We have l2_73.png which maps l2_74.png to l2_75.png. We also have l2_76.png which maps l2_77.png to l2_78.png. This leads us to map l2_79.png to l2_80.png, thus establishing l2_81.png. Thus we have l2_82.png. This gives us a chart on P, thus P is a manifold.

For example, we can take the product of the manifolds l2_83.png. We can see this as extending the 1-sphere, l2_84.png, along the set of real 1-tuples, l2_85.png. The resulting manifold is called the cylinder manifold.
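The product-chart construction amounts to concatenating coordinates. A minimal sketch for the cylinder follows, assuming an angle chart on the half of the unit circle with x > 0 and the identity chart on the real line; both charts and all function names are illustrative assumptions.

```python
import math

def product_chart(phi, psi):
    """Build a chart on M x N from a chart phi on M and a chart psi on N."""
    def chart(pair):
        p, q = pair
        # Concatenate the m coordinates of p with the n coordinates of q.
        return tuple(phi(p)) + tuple(psi(q))
    return chart

def angle_chart(point):
    """Assumed chart on the part of the unit circle with x > 0: the polar angle."""
    x, y = point
    if x <= 0:
        raise ValueError("point is outside the domain of this chart")
    return (math.atan2(y, x),)

def line_chart(t):
    """Identity chart on the real line, returned as a 1-tuple."""
    return (t,)

cylinder_chart = product_chart(angle_chart, line_chart)
print(cylinder_chart(((1.0, 0.0), 2.5)))
```

The result has 1+1=2 coordinates, matching the rule that the dimension of a product is the sum of the dimensions.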

Exercise 2.3: Draw a diagram of l2_86.png and satisfy yourself that it is a cylinder.

As another example, we can take the product of the manifold l2_87.png. Here we see that we are extending the 1-sphere, l2_88.png, along another 1-sphere, l2_89.png. The resulting manifold is called the torus manifold.

Exercise 2.4: Draw a diagram of l2_90.png and satisfy yourself that it is a torus.

We have previously defined an open set. We can extend this definition to cases where we have a chart. A set O is open in M if for any point p∈O, there is a chart (U,φ) where p∈U⊂O. A set is closed if its complement is open.

We can also construct new manifolds by cutting holes in existing manifolds, but only certain hole-cutting is allowed. Assume we have the manifold M and a subset of it, S⊂M. Here we state that S is open in M. Let C be closed and C⊂M. We now define N=M-C, with the charts (U,φ) on M where U∩C=Ø.

Exercise 2.5: Is N a manifold?

Before we move on, let me say that for all of the fuss we have made about how to establish a manifold, they are completely boring objects. They are a blank canvas having a smoothness structure. They only become interesting when we put something on the canvas. Of course, linking back to our spacetime manifold this smoothness structure is what establishes the geometry of spacetime. If we make the assumption that spacetime is the world we live in, then the manifold contains the entirety of the dynamics of our world in its geometry.

Smooth Functions and Differentiable Mappings

Let’s say we have a manifold M, and we have some real-valued function, f, on M. If we have a chart on M, say (U,φ), then we can say that f:U→ℝ, in other words f restricted to U is a real-valued function on U. We can state that l2_91.png is defined on l2_92.png.

Exercise 2.6: Why can we state this?

A less formal way of saying this is that l2_93.png is a real-valued function of n-variables.

Exercise 2.7: Let f be a function on M. Also let l2_94.png be l2_95.png for a collection of charts, C, satisfying M2 above. Show that f is a smooth function on M.

This is a standard technique for defining something on a manifold. You use a chart to describe that something in terms of l2_96.png. We will see why by the end of this chapter.

The collection of smooth functions on M is something that we will denote by the gothic F, F.

Exercise 2.8: Let l2_97.png be a l2_98.png function of m variables. Let l2_99.png represent a set of smooth functions on M, where the index set goes from 1 to m. Show that l2_100.png is a smooth function on M.

As stated in the last section, smooth functions on a manifold, M, characterize the manifold structure of M.

Let M be a set. Let C and C' be two sets of charts on M satisfying M1-M4 from the previous section. If every smooth function on (M,C) is also a smooth function on (M,C'), then C=C'.

Exercise 2.9: Prove this last sentence to be true.

Exercise 2.10: Attempt to redefine a smooth mapping in terms of charts.

It is often the case in mathematics that when we introduce some class of objects we also introduce some structure-preserving mappings between objects of the same class. Such structure-preserving mappings are called morphisms. We have mappings between sets, we have linear mappings (also called linear transformations) on vector spaces, and so on. The morphisms I want to present below are called differentiable mappings.

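Let M and N be manifolds. Define the mapping μ:M→N. Assume there is a smooth function, f, on N. Then the composition f◦μ is a function on M. If f◦μ is a smooth function on M for every smooth function f on N, we call μ a smooth mapping, also called a differentiable mapping.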

Morphisms all share the property that a composition of morphisms is itself a morphism.

Theorem 2-1: Given the manifolds M, N, and O with smooth mappings l2_101.png and l2_102.png. Then the composition l2_103.png is also a smooth mapping.

Proof: Let g be a smooth function on O. We will show that the composition l2_104.png is a smooth function on M. Since g is a smooth function on O and l2_105.png is a smooth mapping, then l2_106.png is a smooth function on N. Since l2_107.png is a smooth function on N and l2_108.png is a smooth mapping, then l2_109.png is a smooth function on M. QED.
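The proof rests on the associativity of composition. Written as a worked equation, with μ:M→N and ν:N→O as assumed names for the two smooth mappings (the chapter's own symbols may differ) and F(·) denoting the smooth functions on a manifold, the chain of reasoning is roughly:

```latex
g \circ (\nu \circ \mu) = (g \circ \nu) \circ \mu,
\qquad
g \in \mathcal{F}(O)
\;\Longrightarrow\;
g \circ \nu \in \mathcal{F}(N)
\;\Longrightarrow\;
(g \circ \nu) \circ \mu \in \mathcal{F}(M).
```

Since g was an arbitrary smooth function on O, the composite mapping pulls every smooth function on O back to a smooth function on M.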

Two objects that are identical with regard to the definition of a structure being considered are called isomorphic. We say that two such objects share an isomorphism. We have already seen an example: isomorphisms of sets are what we have called bijections. Isomorphisms of vector spaces are simply called isomorphisms. Isomorphisms of manifolds are called diffeomorphisms. Let’s say we have two manifolds, M and N. Say that we have μ:M→N. If μ is a smooth bijection and l2_110.png is also a smooth mapping, then we call μ a diffeomorphism. In such a case M and N are said to be diffeomorphic.

The composition of diffeomorphisms is a diffeomorphism.

Exercise 2.11: Prove this last sentence to be true.

Exercise 2.12: Prove that if two manifolds are diffeomorphic, they have the same number of dimensions.

Theorem 2-2: Let p∈M. There is an open set, O, containing p with the following property: given any point q∈O there will exist a diffeomorphism μ:M→M with μ(p)=q.

Proof: Let (U,φ) be a chart, and assume p∈U. Define l2_111.png. Now choose ε>0, such that l2_112.png and when d(x,z)<ε then x∈φ[U].

l2_113.gif

The subset l2_114.png is our candidate for the open subset of M. Here V is the collection of all points l2_115.png with d(x,z)<ε.

l2_116.gif

Define a point q∈O such that l2_117.png where l2_118.png.

l2_119.gif

We can further make ε'=d(z,y)<ε.

We introduce the set [d(x,r)] and a function f(r) having the properties:

r1: f(r) is l2_120.png.

r2: There exists l2_121.png such that f(r)=1 when l2_122.png.

r3: There exists l2_123.png such that f(r)=0 when l2_124.png.

r4: l2_125.png.

We can see a prototypical example of this,

l2_126.gif

Exercise 2.13: Prove r1, r2, r3, and r4.

Since ε'<ε the function f(r) exists.
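One standard way to build a function that is smooth, equal to 1 on an inner region, and equal to 0 beyond an outer radius uses exp(-1/t). The sketch below is an assumption about the kind of function intended, not a transcription of the chapter's example; the names h and cutoff and the radii 0.25 and 0.75 are hypothetical.

```python
import math

def h(t):
    """Smooth on the whole real line: exp(-1/t) for t > 0 and 0 otherwise."""
    return math.exp(-1.0 / t) if t > 0 else 0.0

def cutoff(r, r_inner, r_outer):
    """Smooth step: 1 for r <= r_inner, 0 for r >= r_outer, in between otherwise.

    Assumes r_inner < r_outer.
    """
    # Rescale so the transition region becomes the unit interval 0 < s < 1.
    s = (r - r_inner) / (r_outer - r_inner)
    up = h(1.0 - s)     # vanishes once r >= r_outer
    down = h(s)         # vanishes while r <= r_inner
    return up / (up + down)

for r in (0.1, 0.5, 0.9):
    print(r, cutoff(r, 0.25, 0.75))
```

The denominator never vanishes because at least one of s and 1-s is at least one half, so at least one of the two terms is strictly positive.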

We now define a mapping, Λ:V→V such that,

l2_127.png

This mapping is smooth by conditions r1 and r2. We can see by r2 that Λ(z)=y. By all four conditions we can see that Λ is the identity near the edge of V. We can also see that l2_128.png exists and is smooth.

Exercise 2.14: Show that the assertions of the previous paragraph are true. If necessary get hold of a book on Advanced Calculus and follow through a proof of the inverse function theorem and adapt it to this version of the theorem.

We can then define μ:M→M. If we say that s∈M we have two cases, s∈O or s∉O. If we accept the first case we write l2_129.png. If we accept the second case then we write μ(s)=s. It seems that μ is a smooth bijection, and that μ(p)=q. QED

Exercise 2.15: Prove that μ is a diffeomorphism.

This theorem seems to allow us to use charts to pull some specific property of l2_130.png into a property of the manifold M.

What we have done in this section is develop the ability to make connections from our manifold to archetypal sets that maintain the geometry of the manifold in the form of diffeomorphisms. This sets the stage for our ability to place things on our blank canvas.

Tangent Vectors in a Manifold

While in all generality we will be discussing geometrical objects throughout a manifold, it is convenient to begin by discussing objects at a point. To that end, let’s say we have a tangent vector located at a point, p, in l2_131.png denoted as l2_132.png. By establishing a local origin in l2_133.png we can treat it as if it were a local copy of l2_134.png, and this allows us to establish n-tuples (ordered sets of numbers having n elements), l2_135.png. We can thus write the n-tuple for the vector at the point, p, as l2_136.png. Each element of the vector n-tuple is a component of the vector in l2_137.png. In this way the vector can be represented by its components with respect to a set of coordinate axes (a chart) in l2_138.png, l2_139.png. Another way of saying this is that a tangent vector in l2_140.png is equivalent to a list of n real numbers. This is a completely natural representation of a vector in l2_141.png with a local origin. It is not necessarily so natural in an arbitrary manifold, where there need be no “natural” axes.

Assume that we are in l2_142.png. The collection of all smooth functions on l2_143.png will be denoted as l2_144.png. Thus an element of l2_145.png will be a l2_146.png real-valued function of n-variables, l2_147.png. Given a vector x having the set of components l2_148.png, and a smooth function f on l2_149.png, we can define

l2_150.png

(2.1)

This is called a directional derivative of f in the direction represented by l2_151.png.

We can see from the elementary properties of derivatives that our directional derivative  ξ(f) will satisfy three conditions:

DD1: The Sum Rule: ξ(f+g)=ξ(f)+ξ(g)

DD2: The Product Rule: ξ(f g)=f(x)ξ(g)+g(x)ξ(f)

DD3: The Constant Function Rule: If f is a constant function, then ξ(f)=0.
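These three rules can be spot-checked numerically. The sketch below assumes that (2.1) is the usual formula, the sum over i of the i-th component times the partial derivative of f with respect to the i-th coordinate at the point, and approximates the partial derivatives by central differences. The function name, the sample functions, and the sample point are all hypothetical.

```python
def directional_derivative(xi, f, x, step=1e-6):
    """Approximate sum_i xi[i] * (df/dx^i)(x) using central differences."""
    total = 0.0
    for i, component in enumerate(xi):
        forward = list(x)
        forward[i] += step
        backward = list(x)
        backward[i] -= step
        total += component * (f(forward) - f(backward)) / (2 * step)
    return total

def f(v):
    return v[0] ** 2 * v[1]          # f(x, y) = x^2 y

def g(v):
    return v[0] + 3 * v[1]           # g(x, y) = x + 3y

point = (1.0, 2.0)
xi = (3.0, -1.0)

xi_f = directional_derivative(xi, f, point)
xi_g = directional_derivative(xi, g, point)
xi_sum = directional_derivative(xi, lambda v: f(v) + g(v), point)
xi_prod = directional_derivative(xi, lambda v: f(v) * g(v), point)
xi_const = directional_derivative(xi, lambda v: 5.0, point)

print(xi_sum, xi_f + xi_g)                          # DD1
print(xi_prod, f(point) * xi_g + g(point) * xi_f)   # DD2
print(xi_const)                                     # DD3
```

A numerical check of this kind illustrates the rules at one point; it is not a substitute for the proof from the properties of derivatives.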

We can see how this works with manifolds. Say we have our manifold, M. On this manifold we have a collection of smooth functions, F(M). By Exercise 2.8, DD1, and DD2 we can see that the pointwise sum and product of elements of F(M) are also elements of F(M). For those of you who have studied abstract algebra this gives F(M) the structure of a ring.

Choosing a point, p, of M, then a tangent vector at p is a mapping l2_152.png, and this satisfies DD1, DD2, and DD3, where x is replaced by p. The collection of all tangent vectors in the manifold M located at p is denoted F(p). Our goal now is to show that F(p) is a vector space. To do that we begin by stating that ξ and η are in F(p). If we further state that we have some real number, say m, and an element of F, say f, then we can write (ξ + m η)(f)=ξ(f)+m η(f), and with this we have accomplished our goal of establishing a vector space. We have shown how to add elements of the space and how to multiply elements by a real number.

Theorem 2-3: Equation 2.1 defines a bijection between the n-tuples l2_153.png and mappings from l2_154.png to l2_155.png. This bijection satisfies DD1, DD2, and DD3.

Proof: There are two things that we need to show to prove this theorem. The first is that equivalent directional derivatives have equivalent components. The second is that mappings from l2_156.png to l2_157.png satisfy DD1, DD2, and DD3.

To prove the first part let ξ(f)=η(f) for all f∈F. We set l2_158.png. Then by (2.1) we have l2_159.png. Since we can do this for all l2_160.png we have proved the first part.

To prove the second part we state that there is a mapping, l2_161.png and that μ(f) satisfies DD1, DD2, and DD3. We can define a set of n values l2_162.png. This allows us to refine what we are trying to do. We now need to show that for this set of values (2.1) holds for all f. To accomplish this let l2_163.png, we then rewrite f,

l2_164.png

(2.2)

and where the set l2_165.png. To proceed we need to introduce a Lemma,

Lemma 2-4: If f is in l2_166.png then it can be written as (2.2) where p is some fixed point of l2_167.png for some l2_168.png.

Proof: We can write,

l2_169.png

l2_170.png

l2_171.png

We can then write l2_172.png.

QED.

Continuing from this point, we apply DD1 and DD2 to get

l2_173.png

l2_174.png

(2.3)

Since p is a fixed point we can use DD3 to state that μ(f(p))=0.

l2_175.png

l2_176.png

By using both DD1 and DD3 l2_177.png, so

l2_178.png

By (2.2) l2_179.png, so

l2_180.png

We can see that l2_181.png, so

l2_182.png

This proves the second part. QED

Tangent vectors can also be represented in terms of charts. Given a chart on M, (U,φ), and a smooth function f, the composition l2_183.png is a smooth function on the image φ[U]. If our point p is in U, then we can write p=φ(p). We now introduce a set of n functions of F such that in an open subset of l2_184.png containing our point p we get l2_185.png. In this way, if ξ is a tangent vector in our manifold at a point, then the set of numbers l2_186.png are called the components of the tangent vector with respect to the chart (U,φ).

Exercise 2.16: What is the purpose of the h functions?

Theorem 2-5: For a tangent vector, the choice of components is independent of the choice of h functions. In fact, given a set of components there exists a unique tangent vector in a manifold at a given point with those specific components.

Proof: Say that f is an element of the collection of smooth functions, F. We also take the functions l2_187.png, s, and t to be in F, with s(p)=0. We can write f,

l2_188.png

(2.4)

Exercise 2.17: Construct (2.4). Choose the set of functions l2_189.png so that l2_190.png on some open set containing p. Choose some s that is almost always positive, except that it vanishes at p. Choose t so that it satisfies (2.4).

Using an argument similar to that when we proved Theorem 2-3, and the fact that l2_191.png, we get

l2_192.png

(2.5)

This proves our first statement, that the components are independent of the h functions.

The second statement of the theorem is proven by first noting that if we have a set of numbers l2_193.png, then (2.5) defines a tangent vector in M at the point p with the set of numbers as components. If M is an n-dimensional manifold, and p is a point in M, then the collection F(p) of tangent vectors at p is an n-dimensional vector space.

The components of a vector depend on our choice of chart. We derive the well-known formula for this dependence. Let (U,φ) and (U',φ') be two charts, both containing the point p of M. Then U∩U' is handled in two ways, once by φ and once by φ'. That is to say we have a smooth bijection φ'◦φ⁻¹ from φ[U∩U'] to φ'[U∩U'], both subsets of l2_194.png. This gives us a set of n functions of n variables. We can write this set of functions, l2_195.png. Now let f be a smooth function on M. We can write l2_196.png and l2_197.png. Let ξ be a tangent vector in M at p, and let l2_198.png and l2_199.png be its components with respect to our two charts (U,φ) and (U',φ') respectively. We can use (2.5) to conclude that

l2_200.png

(2.6)

Since this must hold for all smooth functions,

l2_201.png

(2.7)
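The change-of-chart rule can be illustrated with a concrete pair of charts on the plane. The sketch assumes that (2.7) is the familiar rule in which the new components are obtained by applying the matrix of partial derivatives of the new coordinates with respect to the old ones; the Cartesian and polar charts, the finite-difference Jacobian, and all function names are illustrative assumptions.

```python
import math

def to_polar(point):
    """Assumed transition function from Cartesian (x, y) to polar (r, theta)."""
    x, y = point
    return (math.hypot(x, y), math.atan2(y, x))

def jacobian(transition, point, step=1e-6):
    """Approximate the matrix d x'^i / d x^j by central differences."""
    columns = []
    for j in range(len(point)):
        forward = list(point)
        forward[j] += step
        backward = list(point)
        backward[j] -= step
        plus = transition(forward)
        minus = transition(backward)
        columns.append([(a - b) / (2 * step) for a, b in zip(plus, minus)])
    # columns[j][i] holds d x'^i / d x^j; transpose so that rows are indexed by i.
    return [list(row) for row in zip(*columns)]

def transform_components(transition, point, xi):
    """New components: xi'^i = sum_j (d x'^i / d x^j) * xi^j."""
    jac = jacobian(transition, point)
    return [sum(row[j] * xi[j] for j in range(len(xi))) for row in jac]

# Vector with Cartesian components (1, 2) at the point (3, 4).
print(transform_components(to_polar, (3.0, 4.0), (1.0, 2.0)))
```

At the point (3, 4) the exact partial derivatives are 0.6, 0.8, -0.16, and 0.12, so the polar components come out close to 2.2 and 0.08.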

Doing this Stuff in Mathematica

While we can translate the formulas to perform calculations into Mathematica, most of the contents of this chapter are purely mathematical. In other words we are making mathematical definitions, and stating then proving conjectures. Mathematica has some theorem proving capabilities, but this requires listing all relevant axioms, definitions, and supporting theorems. This would require a book in itself. Perhaps an interested reader will write one; alternatively, some crotchety old author might be convinced to add this to an ever-growing list of things to do. Here is an example of this kind of thing for group theory (some of which is now built-in to Mathematica). We begin by stating some group theory axioms (or gta) (establishing associativity, the right-identity, and the right-inverse; note that closure is implicitly understood).

l2_206.png

l2_207.png

l2_208.png

l2_209.png

l2_210.png

l2_211.png

l2_212.png

We can now write a proof for the left-identity theorem. We begin by stating the conjecture (by rights it should not be called a theorem until it is proved).

l2_213.png

l2_214.png

l2_215.png

l2_216.gif

So, somewhere in Mathematica is an 11 step proof of this theorem. We can now write it out in a nicely formatted form.

l2_217.png

l2_218.png

In the future we may add leftident to gta as we have proven it.
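For comparison, the left-identity result can also be derived by hand from the three axioms. The derivation below is a sketch and is not a transcription of the machine-generated proof; the symbols e for the right identity and a⁻¹ for the right inverse are assumed notation.

```latex
\begin{aligned}
&\text{Axioms: } (x \cdot y) \cdot z = x \cdot (y \cdot z), \quad x \cdot e = x, \quad x \cdot x^{-1} = e. \\
&\text{Let } b = a^{-1} \text{ and } c = b^{-1}. \\
b \cdot a &= (b \cdot a) \cdot e = (b \cdot a) \cdot (b \cdot c) = ((b \cdot a) \cdot b) \cdot c \\
          &= (b \cdot (a \cdot b)) \cdot c = (b \cdot e) \cdot c = b \cdot c = e, \\
e \cdot a &= (a \cdot b) \cdot a = a \cdot (b \cdot a) = a \cdot e = a.
\end{aligned}
```

The first chain shows that a right inverse is also a left inverse, and the second uses that fact to show that the right identity is also a left identity.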

A systematic application of this would involve beginning with the rules of logic, then the rules of set theory, then advanced calculus, and then abstract algebra.  Then you could proceed step-by-step to prove each theorem in differential geometry by listing any new axioms and including all proven statements as new statements to be used in proofs.

Created with the Wolfram Language