• No results found

The Geometry behind the Special Relativity Theory

N/A
N/A
Protected

Academic year: 2022

Share "The Geometry behind the Special Relativity Theory"

Copied!
56
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

The Geometry behind the Special Relativity Theory

Paul Anthony Front´eri June 4, 2012

Bachelor Project in Mathematics

(2)

Abstract

This project is an attempt of approaching known physical concepts from a mathematical point of view, or more precisely, from a geometrical framework. In particular, we are studying con- cepts from special relativity (Lorentz transformation), electromagnetism (Skew-symmetry), and quantum mechanics/fields (Spinors).

Most of the material of the project is based on the book ”The Geometry of Minkowski Space. An introduction to the Mathematics of the Special Relativity Theory” by Gregory L.

Naber [1]. But other literature have also been used, such as ”Semi-Riemannian Geometry”

by Barrett O’Neill [2], ”Introduction to 2-Spinors in General Relativity” by Peter O’Donnell [3] and ”Riemannian manifolds” by John M. Lee [4] to complement the mathematical under- standing. My education in physics allowed me to understand the content and motivation of the G. L. Naber book, but from my point of view there is a lack of mathematical logic and care. One of the aims of the project was to complement the physically interesting material of the book written by Gregory L. Naber by a mathematical understanding of the geometry and algebraic structures behind the physical processes.

The first Chapter starts with the motivation for studying the special relativity theory from a mathematical point of view. The second Chapter consists of an account of the geometry of the Minkowski space and the Lorentz transformation in this space. Sections 2.1 to 2.4 concern the basic theory behind, and in Section 2.5 some practical examples of the usage of the Lorentz transformation are presented in order to connect the mathematical theory and physics. The third Chapter goes further and introduces particles and charged particle, which naturally introduces the electric and magnetics fields (electromagnetism). As in the second Chapter, Sections 3.1 and 3.2 deal with the mathematical framework, meanwhile Section 3.3 gives the physical application of this framework. The last Chapter includes one additional structure that posses our particles, namely spin. Sections 4.1 - 4.4 are devoted to the presentation of the theory, and Section 4.5 gives a clear image for the ”double-value” property of the spinors by the null flag.

(3)

Acknowledgements

I which to thank my supervisor on this project, Irina Markina. Thank you for giving me a interesting, exciting and sometimes very hard project. And also, just as importantly, for having the patient and believing in me.

Paul Anthony Front´eri 09 May, 2012

(4)

Contents

1 Our world as a space-time 2

2 Geometry of Minkowski space and the Lorentz transformations 3

2.1 Definitions . . . 3

2.2 Minkowski Space . . . 4

2.3 Lorentz Group . . . 7

2.4 Timelike Vectors and Curves . . . 10

2.4.1 Some properties of the world velocity and acceleration . . . 13

2.5 Spacelike Vectors . . . 14

2.6 Some Classical Example . . . 14

2.6.1 Time dilation and relativity of simultaneity . . . 14

2.6.2 Special Lorentz Transformation and Boots . . . 16

3 Particles and Electromagnetic Fields 19 3.1 Particles . . . 19

3.2 Charged particles and Electromagnetic Fields . . . 20

3.3 Interactions of particles and Charged particles in Electromagnetic Fields . . . 23

3.3.1 Relativistic Doppler and Compton effect . . . 23

3.3.2 Constant fields . . . 25

3.3.3 Variable fields and Maxwell’s Equations . . . 26

4 Spin Transformations and Spinors 28 4.1 Spin Transformation and Spinor map . . . 28

4.2 Representations of matrix groups . . . 31

4.3 Spin Space and spinor of valence . . . 32

4.4 Spinors and World Vectors . . . 35

4.5 Bivectors and Null Flag . . . 37

(5)

1 Our world as a space-time

Our world, the universe, the space-time or the reality, is a four-dimensional space endowed with a special kind of metric called Minkowski metric. These are different names for one common concept, namely, for the place we live and breathe in. Nevertheless, the reality of our habitat has not always been thought of this form. Some time ago one though that the world as flat, a disk where the world states and where sailors on the far seas have to be careful not to fall into the deep of nothing.

Following to the geometry of Euclid, Descartes assigned to each observerO a coordinate system(x1, x2, x3), which one could identify with three space dimensions.

Galileo Galilei discovered that laws of mechanics are the same for all observers. So, given an observation by one observerO, one could translate the observation to an other observer ˜O by the Galilean transformation. Time was disconnected from space itself. The ancient Greeks thought that time was not real, it was just an illusion of the mind, of the flow of the world.

Newton made it more concrete by declaring it absolute.

The first who imagined the universe in more than three dimension were mathematicians:

Gauss, Riemann, M¨obius, Lorentz, Poincare and Minkowski, just to mention a few of them.

They assumed that the reality should not only have more dimensions, but also should be endowed with a curvature. The curvature could be positive, like of the sphereSn, or negative like of the hyperbolic spaceHn. But what does the spaces of the mathematical imagination, to do with the real world we live in? It turned out that they much more related than one could expect. The starting point in the understanding of this connection belongs to the physicist Albert Einstein, who invented theories of special and general relativity.

As the first step, our three dimensional space and time were not anymore thought of as two separated concepts, but they became a part of a bigger space. Namely, they formed the space- time, or a four-dimensional space which was furnished with special tools of measurement.

Each event in the universe, was described as one point in this space-time. The birth and death of Einstein, are two different points in the space-time, connected by the worldline of Einstein himself: the journey of his life in this universe of ours. However, binding these two concepts of space and time was not all. They became also interconnected, space and time were no longer absolute, as it was according Newton’s theory, but relative to each observers viewpoint. Thereby, the namerelativity theory comes.

The distance between two points in the space-time was no longer given by the Euclidean distance, or its metric, but it became a byproduct of the Minkowski metric. The transforma- tion between two observers was changed from the Galilean transformation to the Lorentzian one in order to preserve the new type of metric in the universe.

(6)

2 Geometry of Minkowski space and the Lorentz transforma- tions

This chapter goes through the general geometric structures of the Minkowski space, such as inner product, null cone, future directedness and causality. It is also defined the Lorentz transformation and the Lorentz group. Timelike and spacelike vector properties are discussed, while a physical meaning is postponed to the end of the Chapter.

2.1 Definitions

LetV be an arbitrary vector space of dimension n≥1 over the real numbers R and v, ω be elements (vectors) of this vector space.

Definition 2.1(Bilinear form). A bilinear form on V is a map g∶V×V →Rthat is linear in each variable, i.e. such thatg(a1v1+a2v2, ω)=a1g(v1, ω)+a2g(v2, ω) andg(v, a1ω1+a2ω2)= a1g(v, ω1)+a2g(v, ω2), where a∈R,∀v, ω∈V

A bilinear form g is symmetric if g(v, ω) = g(ω, v), ∀v, ω ∈ V and non-degenerate if g(v, ω)=0 for∀ω∈V impliesv=0.

Definition 2.2 (Scalar product). A non-degenerate, symmetric, bilinear form g is called an scalar product and the imageg(v, ω) is written as v⋅ω.

Note that Naber [1] use the term inner product for the scalar product. In general is a scalar product g for which g(v, v) > 0 if v ≠0 called positive definite (or inner product), negative definite for whichg(v, v)<0 ifv≠0, and indefinite if it is neither positive or negative definite. A vector spaceV, with a scalar product g,(V, g) is called a scalar space.

Definition 2.3(Orthogonality). If (V, g) is an n-dimensional scalar space then, two vectors v, ω∈V are said to be g-orthogonal (or just orthogonal) ifg(v, ω)=0.

Definition 2.4 (Orthogonal complement). If W is a subspace of V, then the orthogonal complementW of W in V is defined by W={v∈V�g(v, ω)=0, ∀ω∈W}.

Definition 2.5(Quadratic forms). The quadratic forms associated with a scalar productgon an n-dimensional vector spaceV is the mapQ∶V →R defined byQ(v)=g(v, v)=v2, v∈V. Definition 2.6 (Index of bilinear form [2]). The index of a bilinear form Q defined on a vector space V is the maximal dimension of the subspace of V where the form Q is negative definite.

Theorem 2.7 ([1] page 8,[2]). Let V be an n-dimensional real vector space on which a non- degenerate, symmetric, bilinear form g ∶ V ×V → R is defined. Then there exists a basis {e1, . . . , en} for V such that g(ei, ej) = 0 if i ≠ j and Q(ei) = ±1 for each i = 1, . . . , n.

Moreover, the number of basis vectorsei for which Q(ei)=−1 is the same for any such basis and coincides with the index ofg. Such a basis is called for a orthonormal basis.

(7)

2.2 Minkowski Space

All the definitions in this section has been taken from Naber [1] pages 9-18 and 64-66.

The Minkowski spacetime is a 4-dimensional real vector spaceMon which a non-degenerate, symmetric, bilinear formgof index 1 is defined. The points ofMare called events in physics andg is referred to as a Lorentz scalar product onM.

One make the use of theEinstein summation. To illustrate the usage of the summation convention do one have the indicesaand bthat range over the set {1,2,3,4}, then

xaea=�4

a=1xaea=x1e1+x2e2+x3e3+x4e4, Λabxb=�4

b=1Λabxba1x1a2x2a3x3a4x4,

ηabvaωb11v1ω112v1ω213v1ω314v1ω421v2ω1+. . .+η44v4ω4.

An eventx∈Mexpressed in the orthonormal basis {e1, e2, e3, e4}={ea} of Mis written asx=xaea=x1e1+x2e2+x3e3+x4e4. Here(x1, x2, x3, x4)are called coordinates ofxrelative to the basis {ea}, with the spatial (x1, x2, x3) and time (x4) coordinates. Choosing two elementsv, ω inMrelative to same basis ofM, one hasv=vaea,ω=ωaea, and

g(v, ω)=v1ω1+v2ω2+v3ω3−v4ω4. The metric written in matrix form is

η=

����

����

1 0 0 0

0 1 0 0

0 0 1 0

0 0 0 −1

����

����

� with entriesηab. Thus

ηab=����

�����

1 a=b=1,2,3

−1 a=b=4 0 otherwise.

One can then writeg(ea, eb)=ηab, andg(v, ω)=ηabvaωb. The inverse matrix to η={ηab} is denoted byη−1={ηab}=η.

Since the Lorentz scalar product is not positive definite, there exist nonzero vectorsv∈M such thatg(v, v)=Q(v)=0. Such vectors are called null (or lightlike) vectors.

Theorem 2.8 ([1] page 10,[2]). Two nonzero null vectors v and ω in M are orthogonal if and only if they are parallel, i.e. if there ist∈R such thatv=tω.

Consider two distinct eventsx0 and x for which the displacement vector v=x−x0 from x0 toxis null, i.e., Q(v)=Q(x−x0)=0. Expressing this in relation to an orthonormal basis {ea}, do one have that x=xaea andx0=xa0ea, such that

(x1−x10)2+(x2−x20)2+(x3−x30)2+(x4−x40)2 =0.

This equation describes a cone in the four dimensional space R4 with a vertex at the point (x10, x20, x30, x40).

(8)

Definition 2.9 (Null cone, null line). The null cone (or light cone) CN(x0) at x0 in M is defined by

CN(x0)={x∈M�Q(x−x0)=0}.

The null line (or physically, a light ray) Rx0,x containing x0 and x is defined by Rx0,x={x0+t(x−x0)�t∈R}.

Notice that: Rx0,x=Rx,x0.

Theorem 2.10 ([1] page 12). Let x0 and x be two distinct events with Q(x−x0)=0.Then Rx0,x=CN(x0)∩CN(x).

A vector v in M is said to be timelike if Q(v)< 0 and spacelike if Q(v) >0, or if it is the zero vectorv=0. Further if v is the timelike displacement vector v=x−x0, then v will be inside the null coneCN(x0). If it is spacelike, then it is outside the cone. A timelike line T={x0+t(x−x0)�t∈R} is a line inM, wherex−x0 is a timelike vector.

Theorem 2.11([1] page 16). Suppose that v is timelike and ω is either timelike or null. Let {ea} be a orthonormal basis forM withv=vaea and ωaea. Then either

(a) v4ω4>0, in which case g(v, ω)<0, or (b) v4ω4<0, in which case g(v, ω)>0.

Corollary 2.12 ([1] page 16). If a nonzero vector in M is orthogonal to a timelike vector, then it must be spacelike.

We have found now some basic building blocks for the Minkowski space, but we would like to have some sort of time orientation inM. We define then the future and past direction.

Definition 2.13 (Future- and Past-directed Equivalence Relation). By taking the collection of all timelike vectors inMcalled τ, we define an equivalence relation ∼onτ: if v andω are in τ, then v∼ω if and only if g(v, ω)<0.

Note that in the collection τ, the coordinates v4 and ω4 will have the same sign in any orthonormal basis ofMby Theorem (2.11). This leads us to that the collectionτ is the union of two distinct subsets,τ+ for positive value ofv4 (orω4) andτ for negative value ofv4 (or ω4). They have the following properties: v ∼ ω for all v, ω ∈τ+, v ∼ω for all v, ω∈ τ and v∼�ω if one of the vectors v or ω is in τ+ and the other one belongs toτ. The elements of τ+are called future directed and the elements inτ arepast-directed.

Definition 2.14 (Time cone, future time cone and past time cone). For each x0 in M we define the time coneCT(x0), future time cone CT+(x0) and past time cone CT(x0) at x0 by

CT(x0)={x∈M�Q(x−x0)<0}, CT+(x0)={x∈M�x−x0∈τ+}=CT(x0)∩τ+, CT(x0)={x∈M�x−x0∈τ}=CT(x0)∩τ.

The time cone CT(x0) is a open solid cone which boundary is the null coneCN(x0) and it is the disjoint union ofC+(x ) and C(x ).

(9)

Definition 2.15 (Future- and Past-directed nonzero null vectors). A nonzero null vector n is future-directed ifn⋅v<0,∀v∈τ+ and it is past-directed ifn⋅v>0,∀v∈τ+.

Note that one uses only τ+ to define the direction of the null vector. This is due to the sign of the productn⋅v that is the same for all v∈τ+.

Definition 2.16 (Future- and Past null cone at x0). For any x0 ∈ M we define the future null cone at x0 by

CN+(x0)={x∈CN(x0)�x−x0 is f uture directed}, and past null cone atx0 by

CN(x0)={x∈CN(x0)�x−x0 is past directed}.

Illustration of the future and past cone for a point. Time is the vertical axis, and space the others. As we see, we can define a plane of simultaneity, where all these events happen at

the same time.

With the basic building blocks of the Minkowski space and time orientation one can define a causal structure onM.

Definition 2.17(Chronologic, causal relations). There exist two order relations onMcalled causality relations onM. For two eventsx and y in M we say that

1) the event x chronologically precedes y, and write x�y, if the displacement vector y−x is timelike and future-directed.

2) the event x causally precedesy, and write x<y, if the displacement vector y−x is null and future-directed.

Definition 2.18 (Causal Automorphism). A map F ∶M→M is said to be a causal auto- morphism if it is bijective, and bothF andF1 preserves the causal order <, i.e.,x<y if and only ifF(x)<F(y). Note that, in particular, F is not assumed to be linear or continuous.

Definition 2.19 (Translation). A continuous map T ∶M→Mis said to be a translation if T(v)=v+v0 for some fixed v0∈M. Geometrically is it a function that moves every point a constant distance in a the direction ofv.

(10)

Definition 2.20 (Dilation). A linear transformation K ∶M→M is said to be a dilation if K(v)=kvfor some positive real numberk. Geometrically is it a function that scales, enlarges or increases the vectors by a factork.

Definition 2.21 (Orthochronous). A orthogonal transformation L ∶M→ M is said to be orthochronous ifx⋅Lx<0 for all timelike or null vectors x.

In the following theorem the structure of orthochronous transformations is presented.

They are essentially translations, dilations and the orthogonal transformations. In this sense the orthochronous transformations of the Minkowski space are analogous of the M¨obius trans- formations of the Euclidean space.

Theorem 2.22(Zeeman Theorem [1] page 66). LetF ∶M→Mbe a causal automorphism of M. Then there exists an orthochronous orthogonal transformation L∶M→M, a translation T∶M→M, and a dilation K∶M→Msuch that F =T○K○L.

2.3 Lorentz Group

One have now seen how the causal structure came from the Lorentz metric in the Minkowski space and the definitions of null-, space-, and timelike vectors in Minkowski space. However, we need to relate two different observers O and ˆO in the Minkowski space, ”interpret” dif- ferent events. One must introduce a transformation rule between two observers, relating an event viewed from the first observer respective to the second one. In the Minkowski space this transformation is called the Lorentz group. We shall now look on some of its properties.

All the definitions in this section has been taken from Naber [1] pages 18-22.

Definition 2.23 (Linear and orthogonal transformations). If {ea} and {ˆea} are two or- thonormal bases for M then there is a unique linear transformation L∶M→M such that L(ea) = ˆea for each a = 1,2,3,4. The linear transformation L ∶M → M is said to be an orthogonal transformation of M if g(Lx, Ly) = g(x, y) for ∀x, y ∈ M, so, it preserves the length of vectors.

Lemma 2.24([1] page 13,[2]). LetL∶M→Mbe a linear transformation. Then the following are equivalent:

(a) L is an orthogonal transformation.

(b) L preserves the quadratic form of M, i.e., Q(Lx)=Q(x) for ∀x∈M

(c) L carries any orthonormal basis of Monto another orthonormal basis of M.

We define the matrix Λ= �Λaba,b=1,2,3,4 associated to the orthogonal transformation L and the orthonormal basis{ea}by

Λ=

����

����

Λ11 Λ12 Λ13 Λ14 Λ21 Λ22 Λ23 Λ24 Λ31 Λ32 Λ33 Λ34 Λ41 Λ42 Λ43 Λ44

����

����

� ,

(11)

meaning that for two orthonormal bases {ea} and {ˆea} for M, each element of {ea} can be expressed as a linear combination of the ˆea:

eu1uˆe12u23u34u4. The orthogonality conditionsg(ec, ed)=ηcd, c, d=1,2,3,4,is written as

Λ1cΛ1d2cΛ2d3cΛ3d−Λ4cΛ4dcd, or, with the summation convention,

ΛacΛbdηabcd and ΛacΛbdηcdab. (1) Having this matrix we can write the transformation of one event written in basis of {ea} to itself written in another basis{eˆa} by:

ˆ

xaabxb, a=1,2,3,4, (2) or more detailed

ˆ

x111x112x213x314x4 ˆ

x221x122x223x324x4 ˆ

x331x132x233x334x4 ˆ

x441x142x243x344x4.

Definition 2.25(General (homogeneous) Lorentz transformation). Any4×4matrixΛwhich satisfies

ΛTηΛ=η (3)

where T is the transposed, is called an general (homogeneous) Lorentz transformation.

Note that the eq. (3) is equivalent to eq. (1). Moreover one can find the inverse of Λ by Λ−1=ηΛTη.

With this one can write the entries of the inverse matrix Λ−1 =�Λaba,b=1,2,3,4 relative to the orthonormal basis{ea} by,

Λ−1=

����

����

Λ11 Λ21 Λ31 Λ41 Λ12 Λ22 Λ32 Λ42 Λ13 Λ23 Λ33 Λ43 Λ14 Λ24 Λ34 Λ44

����

����

=

����

����

Λ11 Λ21 Λ31 −Λ41 Λ12 Λ22 Λ32 −Λ42 Λ13 Λ23 Λ33 −Λ43

−Λ14 −Λ24 −Λ34 Λ44

����

����

. (4)

Definition 2.26(General Lorentz GroupLGH). The set of all general (homogeneous) Lorentz transformations form a group under matrix multiplication. This group is called the general (homogeneous) Lorentz group and is denoted byLGH.

Note that physically not all transformations fromLGH are interesting. The most impor- tant those which preserve the time orientations and do the same for the spacelike part of the Minkowski space. Now we are going to define these special transformations inside ofLGH.

Settingc=d=4 in Eq. (1) one obtains(Λ44)2=1+(Λ14)2+(Λ24)2+(Λ24)2, that implies (Λ44)2≥1. Consequently,

Λ44≥1 or Λ44 ≤−1.

(12)

Definition 2.27. Any element Λ of LGH is said to be orthochronous if Λ44 ≥ 1 or it is non-orthochronous if Λ44≤−1.

Theorem 2.28([1] page 18,[2]). LetΛ=�Λaba,b=1,2,3,4be an element ofLGH and{ea}a=1,2,3,4

an orthonormal basis forM. Then the following are equivalent:

(a) Λ is orthochronous,

(b) Λ preserves the time orientation of all null vectors, i.e., ifv=vaeais a null vector, then the numbers v4 and ˆv44bvb have the same sign,

(c) Λ preserves the time orientation of all timelike vectors.

Notice that if Λ nonorthochronous, then it reverses the time orientation of all timelike and nonzero vectors.

The determinant of Λ is found from taking the determinant of both side of Eq. (3):

det�ΛT�det(η)det(Λ)=det(η) det�ΛT�=det(Λ) (det(Λ))2=1, leading to det(Λ)=1 or det(Λ)=−1.

Definition 2.29 (Proper and improper Lorentz transformation). We call a Lorentz trans- formation proper ifdet Λ=1 and improper if det Λ=−1.

An improper orthochronous Lorentz transformation will reverse the spatial orientation from left-handed to right-handed, or right-handed to left-handed system. All in all, the orthochronous transformations (time preserving elements of LGH) form a subgroup of LGH and this subgroup has two components that we called proper and improper transformation.

Now we are ready to the following important definition.

Definition 2.30 (Lorentz Group L). The Lorentz group, L, is the subgroup of the general Lorentz group,LGH, of proper and orthochronous Lorentz transformation.

As we mentioned before, the elements of the Lorentz group are linear transformations ofM that preserve the length of vectors, preserves the time orientation and the space orientation.

Notice that in the literature the general (homogeneous) group often refers as a Lorentz group.

Further one can define an admissible basis, or a reference frame for an observer, where the Lorentz group will act.

Definition 2.31 (Admissible basis and reference frame). We define an admissible basis for Mto be an orthonormal basis {e1, e2, e3, e4} with e4 timelike and future-directed and {ea}= {e1, e2, e3} spacelike and “right-handed”, i.e., satisfying (e1×e2)⋅e3=1.

An admissible basis called as an admissible frame of reference. Any two such bases (frames) are related by a Lorentz transformation from the Lorentz groupL.

(13)

The Lorentz group L has an important subgroup R, consisting of matrices R=�Rab� of the form

R=

����

����

�Rij� 00 0

0 0 0 1

����

����

� ,

where �Riji,j=1,2,3 is a unimodular orthogonal matrix, i.e., satisfying det�Rij� = 1 and

�RijT =�Rij−1. The coordinate transformation associated with R corresponds physically corresponds to a rotation of the spatial coordinate axes within a given frame of reference [1]

page 21,[5]. For this reason R is called the rotational subgroup of L and its elements are called rotations inL.

Lemma 2.32 ([1] page 21[5]). Let Λ = �Λaba,b=1,2,3,4 be a proper, orthochronous Lorentz transformation. Then the following are equivalent:

(a) Λ is a rotation.

(b) Λ142434=0 (c) Λ414243=0 (d) Λ44=1.

2.4 Timelike Vectors and Curves

In two next sections we would like to investigate how a voyage of an observer O in the Minkowski space, represented by a curve in the Minkowski space, behaves and what properties it has. First, we look on properties of timelike vectors itself, and then go ahead to study curves through the behavior of its velocity vector.

This section with theory has been taken, with some rewriting, from Naber [1] pages 46-156.

Definition 2.33(Durationτ(v)). For any timelike vector inMwe define the duration τ(v) of v by τ(v)=�

−Q(v).

Ifvis defined as the displacement vector between two eventsx0 andx, i.e. v=x−x0, then τ(x−x0) is to be interpreted physically as the time separation ofx0 and x in any admissible frame of reference in which both events occur at the same spatial location. The duration τ(x−x0) is also a lower bound for the temporal separation of x0 and x, and it is called the proper time separation ofx0 and x.

Definition 2.34 (Time axis). A null line in the Minkowski space which passes trough the origin is called a time axis.

In general ifT is a time axis, then there exists an admissible basis {eˆa} forMsuch that the subspace ofM spanned by ˆe4 isT. Then the space Span{eˆ4} is timelike and Span{eˆ4} is spacelike by Cor. (2.12).

(14)

Theorem 2.35(Reversed Schwartz Inequality [1] page 48,[2]). Ifvandω are timelike vectors in M, then

(v⋅ω)2 ≥v2ω2

and equality holds if and only ifv and ω are linearly dependent.

Theorem 2.36(Reversed Triangle Inequality [1] page 49,[2]). Let vandωbe timelike vectors with the same time orientation (i.e. v⋅ω<0).Then

τ(v+ω)≥τ(v)+τ(ω) and equality holds if and only ifv and ω are linearly dependent.

Lemma 2.37 ([1] page 50,[2]). The sum of any finite number of vectors in M all of which are timelike or null and all future-directed (respectively, past-directed) is timelike and future- directed (respectively, past-directed) except when all of the vectors are null and parallel, in which case the sum is null and future-directed (respectively, past-directed).

Corollary 2.38 ([1] page 50). Let v1, . . . , vn be timelike vectors, all with the same time orientation. Then

τ(v1+v2+ � +vn)≥τ(v1)+τ(v2)+ � +τ(vn) and equality holds if and only ifv1, v2, . . . , vn are all parallel.

Corollary 2.39 ([1] page 51,[2]). Let v and ω be two non-parallel null vectors. Then v and ω have the same time orientation if and only if v⋅ω<0.

Definition 2.40 (Curve, Worldline). Let I ⊆ R be an open interval. A continuous map α∶I→Mis a curve in M. A curve in the Minkowski space is often called a worldline.

Illustration of curve, or worldline, in Minkowski space.

(15)

Relative to any admissible basis {ea} forM, one can write α(t)=xa(t)ea for each t∈I.

The curveα is smooth if each component function xa(t)is infinitely differentiable and ifα’s velocity vector

α(t)=dxa dt ea is nonzero for eacht∈I.

A curve α ∶I →M is said to be spacelike, timelike or null, if α(t)⋅α(t) is >0, <0 or

=0 respectively for each t. A timelike or null curve α is future-directed (respectively past- directed) if α(t) is future-directed (respectively past-directed) for each t. This notion can also be extended to intervals that can contain one or both of it endpoints.

Definition 2.41 (Reparametrization of a curve). If α ∶ I → M is a curve and J ⊆ R is another interval and h∶J →I, t=h(s) is an infinitely differentiable function with h(s)>0 for eachs∈J, then the curveβ =α○h∶J →M is called a reparametrization of α

Definition 2.42 (Proper time length). If α∶[a, b]→M is a timelike curve in M, then we define the proper time length ofα by

L(α)=�ab�α(t)⋅α(t)�1�2dt=�ab

−ηabdxa dt

dxb dt dt.

The proper timeL(α) is interpreted, by theClock Hypothesis, as the time lapse between the eventsα(a)andα(b)as measured by the ideal standard clock carried along by a observer whose travel in the Minkowski space is represented by the curveα.

Theorem 2.43([1] page 53,[2]). Let pand q be two points in M. Then p−q is timelike and future-directed if and only if there exists a smooth, future-directed timelike curveα∶[a, b]→M such thatα(a)=q andα(b)=p.

This leads us to the conclusion that the acceleration has no effect on the rate of the ideal standard clock, i.e., that theinstantaneous rateof such clock depends only on its instantaneous speed and not the rate at which the speed is changing.

In analogue to the proper time separation between two events one can define the proper time function (arclength) on a curve.

Definition 2.44 (Proper time function). Let α ∶I →M be a timelike smooth curve. The proper time function τ(t) onI is defined by

τ =τ(t)=�0t�α(u)⋅α(u)�12du.

Thus, dt =�α(t)⋅α(t)�12 is positive and infinitely differentiable since α is smooth and timelike. The inverset=h(τ) therefore exists and dh =�dt−1>0. We then conclude that τ is a legitimate parameter alongα. Notation: α(τ)=xa(τ)ea.

Using the proper time function, we can reparametrize our curve and define an analogue to the velocity vector, namely the world velocity, and world acceleration.

Definition 2.45 (World velocity, 4-velocity). The vector α = dxaea of α is called the world velocity (or 4-velocity) of α and denoted U(τ)=Uaea.

Definition 2.46(World acceleration, 4-acceleration). The second proper time derivativeα′′ =

d2xa

2 ea of α is called the world acceleration (or 4-acceleration) ofα and denotedA(τ)=Aaea.

(16)

2.4.1 Some properties of the world velocity and acceleration

This example has been taken, with some rewriting, from Naber [1] pages 56 to 58.

Having a world velocityU along a curve the Lorentz dot product with itself and the world acceleration is:

U⋅U =−1 and U⋅A=0, (5)

on each point along the curveα.

For a given curve an admissible observer would probably prefer to parametrize the curve by his timex4 than by the proper time τ. This will give us

dx4 =�α(x4)⋅α(x4)�1�2 =

��

��

�1−��

�����dx1 dx4

2

+�dx2 dx4

2

+�dx3 dx4

2��

����=�

1−β2(x4)=γ(x4)=γ, whereβ(x4) is the usual instantaneous speed of the observer whose curve isα relative to the frameS(x1, x2, x3, x4). We get then

Ui =dxi dτ = dxi

dx4 dx4

dτ =γdxi

dx4, i=1,2,3, and

U4=γ Thus

U =Uaea=γdx1

dx4e1+γdx2

dx4e2+γdx3

dx4e3+γe4 or

U =(U1, U2, U3, U4)=γ�dx1 dx4,dx2

dx4,dx3 dx4,1�, Similarly, we get the world acceleration

Ai=γ d

dx4 �γdxi

dx4�, i=1,2,3, and

A4=γ d dx4(γ), So

U =(A1, A2, A3, A4)=γ d

dx4 �γdx1 dx4, γdx2

dx4, γdx3 dx4, γ�.

(17)

At each fixed point α(τ0) along a timelike curveα, the world velocity U(τ0), is a future- directed unit timelike vector. It is often used as the timelike vector e4 in some admissible basis forM. Relative to such a basis, U(τ0)=(0,0,0,1). Letting x40=x40) we find

�dxi dx4

x4=x40 =0, i=1,2,3,

and soβ(x40)=0 andγ(x40)=1. One say that the frame of reference corresponds to a basis that is ”momentarily at rest”(x4 =x40). Any such frame of references is called an instantaneous rest frame for α at α(τ0), which is of importance in physics. In a instantaneous rest frame one will then have that g(A, A)=��a�2, wherea�=u�˙ is the 3-acceleration in S defined by the ordinary derivative of the 3-velocity in Eq. (6).

2.5 Spacelike Vectors

This part with theory has been taken, with some rewriting, from Naber [1] pages 61-63.

Spacelike separationsx−x0, i.e., two eventsx andx0 for whichQ(x−x0)>0, liesoutside of the null cone at x0. There does not exist an admissible basis in which the separation of two events is zero, i.e., there is no admissible observer who can experience both events. One then has to travel faster then the speed of light.

Choosing an admissible frameS for an observer for which the separation ∆x4 ofxand x0 isr∈R, an observer in another admissible frame ˜S, in general, will not agree on the temperal order of xand x0.

Definition 2.47 (Proper Spatial Separation S). For any two events x and x0 for which Q(x−x0)>0 (spacelike vector) one define the proper spatial separation S(x−x0) of x and x0 by

S(x−x0)=�

Q(x−x0).

Suppose that the spacelike displacement vectorx−x0 is orthogonal to the timelike lineT (containingx1,x2, and x). Then the proper spatial separation would be

S(x−x0)= 1

2(τ(x0−x1)+τ(x2−x1)).

Physically one can interpreted this as the distance of an eventxmeasured by an admissible observerO, between the emission and reception of light signals connecting O withx.

Suppose now that v and ω are nonzero vectors inM with v⋅ω=0. If v and ω are null, then they must be parallel. If on the other hand v is timelike then ω must be spacelike.

And ifv and ω are both spacelike then their proper spatial lengths satisfy the Pythagorean TheoremS2(v+ω)=S2(v)+S2(ω). This we proved in the Appendix, Exercise 1.5.3.

(18)

2.6 Some Classical Example

2.6.1 Time dilation and relativity of simultaneity

This example has been taken, with some rewriting, from Naber [1] pages 22 to 25.

Consider two admissible frames of referenceS and ˆS with two admissible bases{ea} and {ˆea}, respectively. Two events on a curve in ˆS being spatially (physically) at rest will satisfy the equations ∆ˆx1 = ∆ˆx2 = ∆ˆx3=0. The value ∆ˆe4 is the time difference between these two events. The coordinate difference inS then will be:

∆xbab∆ˆxa4b∆ˆx4.

From this and the fact that Λ44 and Λ44 are nonzero it follows that the ratio

∆xi

∆x4 = Λ4i

Λ44 =−Λ4i

Λ44, i=1,2,3,

is constant and independent on the particular point at rest in ˆS. Physically, these ratios are interpreted as the components of the ordinary velocity 3-vector of ˆS relative to S:

u=u1e1+u2e2+u3e3, where ui= Λ4i

Λ44 =−Λ4i

Λ44, i=1,2,3. (6) Similarly, the velocity 3-vector ofS relative to ˆS is:

�ˆ

u=uˆ1e1+uˆ2e2+uˆ3e3, where uˆi= Λi4

Λ44 =−Λi4

Λ44, i=1,2,3.

Next we observe that ∑3i=1∆x∆x4i2 = �Λ4423i=1�Λ4i2 = �Λ442��Λ442−1�. Similarly,

3i=1∆ˆ∆ˆxx4i2=�Λ44−2��Λ442−1�.Physically, we interpret these equalities as asserting that the velocity of ˆS relative to S and the velocity of S relative to ˆS have the same constant magnitude which we shall denote byβ. Thus β2=1−�Λ44−2, in particular, 0≤β2 <1, and β=0 if and only if Λ is a rotation. Solving for Λ44 (assuming orthochronous) yields

Λ4444=�1−β2−1�2−1�2. (7) Definition 2.48 (Direction 3-vector, direction cosine). Assuming that Λ is not a rotation one can define the direction 3-vectord�of Sˆrelative to S by:

u=βd�=β(d1e1+d2e2+d3e3), di=ui�β (8) where di is the direction cosines of the direction line segment along which the observer in S sees moving. Similarly the direction 4-vector (and direction cosines) of S relative to Sˆ is defined by:

�ˆ

u=βd�ˆ=β(dˆ11+dˆ2ˆe2+dˆ33), dˆi =uˆi�β.

(19)

Comparing (6) and (8) and using (7) we obtain

Λ4i=−Λ4i=β(1−β2)12di, i=1,2,3 (9) and similarly

Λi4=−Λi4=β(1−β2)12i, i=1,2,3 (10) Equations (7), (9) and (10) give the last row and column of Λ in terms of physically measurable quantities. We obtain from (2)

∆ˆx4=−βγ(d1∆x1+d2∆x2+d3∆x3)+γ∆x4

for any two events. A special case of two events on the curve of a point in rest in S ∆x1 =

∆x2 =∆x3=0) gives

∆ˆx4=γ∆x4 = 1

�1−β2∆x4.

In particular, ∆ˆx4=∆x4 if and only if Λ is a rotation. Any relative motion of S and ˆS gives rise to atime dilation effect according to the relation ∆ˆx4>∆x4.

Another special case is also interesting, namely when two events are simultaneous in S, i.e., ∆x4 =0. Then

∆ˆx4=−βγ(d1∆x1+d2∆x2+d3∆x3).

Assuming that β ≠ 0 gives, in general, that ∆ˆx4 ≠ 0, meaning that two events are not simultaneous in ˆS. The only way they will agree on the simultaneity is if and only if the spatial locations of the events have a very special relation in the direction along which ˆS is moving, namely,

d1∆x1+d2∆x2+d3∆x3=0.

It is called therelativity of simultaneity.

2.6.2 Special Lorentz Transformation and Boots

This example has been taken, with some rewriting, from Naber [1] pages 26 to 30.

Looking at a subgroup of the Lorentz group L, which direction cosines are given by d1 = 1,dˆ1 =−1 and d2 =dˆ2 = d3 =dˆ3 =0, the direction vectors be d�=e1 and d�ˆ=−eˆ1. This corresponds to the situation where the observerS sees ˆS moving in the positive x1-direction, and ˆS seesS moving in the negative ˆx1-direction. The origin of both systems will coincide at x4=xˆ4=0, and two of three spatial coordinates will be the same in both frames of reference.

Now, from the eq. (7), (9) and (10) we find that this Lorentz transformation matrix Λ must have the form

Λ=

����

����

Λ11 Λ12 Λ13 −βγ Λ21 Λ22 Λ23 0 Λ31 Λ32 Λ33 0

−βγ 0 0 γ

����

����

� ,

(20)

and with the use of the orthogonality conditions (1), Λ must take the form

Λ=

����

����

γ 0 0 −βγ

0 Λ22 Λ23 0 0 Λ32 Λ33 0

−βγ 0 0 γ

����

����

� ,

where�Λiji,j=2,3 is a(2×2)-matrix, is a rotation on the planeR2.

Definition 2.49 (Special Lorentz Transformation). Any Lorentz transformation with Λ24= Λ444243=0and�Λiji,j=2,3 equal to a(2×2)identity matrix is called a special Lorentz transformation. In matrix form it is written as

Λ=

����

����

γ 0 0 −βγ

0 1 0 0

0 0 1 0

−βγ 0 0 γ

����

����

, (11)

with the associated coordinate transformation, ˆ

x1=(1−β2)12x1−β(1−β)12x4, ˆ

x2=x2, ˆ x3=x3, ˆ

x4=−β(1−β)−1�2x1+(1−β)−1�2x4.

(12)

The inverse is

Λ−1 =

����

����

γ 0 0 βγ

0 1 0 0

0 0 1 0

βγ 0 0 γ

����

����

� .

and the corresponding coordinate transformation is

x1=(1−β2)−1�21+β(1−β)−1�24, x2=x2,

x3=x3,

x4=β(1−β)121+(1−β)124.

The special Lorentz transformation allows −1< β <1. By choosing β >0 when Λ14 <0 andβ<0 when Λ14>0 all special Lorentz transformations can be written in the form of (11).

Definition 2.50 (Boost). For each real number β with −1 < β < 1 we define γ = γ(β) = (1−β2)−1�2 and

Λ(β)=

����

����

γ 0 0 −βγ

0 1 0 0

0 0 1 0

−βγ 0 0 γ

����

����

� .

The matrixΛ(β) is called a boost in the x1-direction.

(21)

The composition of two boosts in the x1-direction is another boost in the x1-direction.

Since Λ−1(β)=Λ(−β)the collection of such special Lorentz transformations (boosts) forms a subgroup of the Lorentz groupL.

Remark that the composition of two boost in two different directions, is in general, not equivalent to a single boost in any direction.

Suppose now that −1<β1≤β2<1, then

� β12

1+β1β2�<1, (13)

and

Λ(β1)Λ(β2)=Λ� β12

1+β1β2�. (14)

Both Eq. (13) and Eq. (14) are proven in the Appendix, Exercise 1.3.14.

The physical interpretation is the following: if the speed of ˆS relative to S is β1 and the speed ofSˆˆrelative to ˆS isβ2, then the speed of Sˆˆrelative toS is not β12, but rather

β12 1+β1β2,

which is always less then β12, unless β1β2 = 0. Equation (14) is called the relativistic addition of velocities formula[1] page 29,[2].

Even if the velocities are not additive directly, one can define anvelocity parameter θthat is additive. Such that if the speed of ˆS relative to S is θ1 and the speed of Sˆˆrelative to ˆS is θ2, then the speed of Sˆˆrelative toS isθ12. The parameterβ is then a one-to-one relation withθ,β=f(θ). Additivity and (14) require that f satisfies the functional equation

f(θ12)=f� f(θ1)+f θ2) 1+f(θ1)f(θ2)�.

This formula reminiscent of the sum formula for the hyperbolic tangent, making the change of variable

β=tanh(θ) or θ=tanh−1β.

The hyperbolic form of the Lorentz transformation Λ(β) [1] page 30,[2] (proven in the Ap- pendix, Exercise 1.3.16) is

L(θ)=

����

����

cosh(θ) 0 0 −sinh(θ)

0 1 0 0

0 0 1 0

−sinh(θ) 0 0 cosh(θ)

����

����

� .

The boots and the rotations as a subgroups are buildings blocks of the Lorentz group, that is expressed in the following theorem.

(22)

Theorem 2.51 ([1] page 30). Let Λ = �Λaba,b=1,2,3,4 be a proper, orthochronous Lorentz transformation. Then there exist a real number θ and two rotations R1, R2 ∈ R such that Λ=R1L(θ)R2.

The physical interpretation of Theorem (2.51) is that a Lorentz transformation from S to ˆS can be accomplished by (1) rotating the relative motion of S with ˆS, (such that the positivex1-directions coincide), (2) boosting to corresponding speed as ˆS (relative to S), (3) rotating the spatial axes until it coincides with those of ˆS.

3 Particles and Electromagnetic Fields

In this chapter we shall investigate how particles and electromagnetic fields are described in the Minkowski space. The first section defines general properties of particles. In the second section we see how a charge of the particle, defines a electromagnetic and a magnetic field in the Minkowski space.

3.1 Particles

All the definitions has been taken from Naber [1] pages 87-91.

Definition 3.1(Material particle, proper mass). A material particle inM is a pair (α, m), where α ∶I →M is a timelike curve parametrized by proper time τ and m is a positive real number called the particle proper mass.

One interprets the curve α as the trajectory of the particle.

Definition 3.2(Free material particle). A free material particle is a material particle where α has the formα(τ)=x0+τ U for some fixed event x0 and unit timelike world velocity vector U(τ).

Definition 3.3(World Momentum). The world momentum (or the energy-momentum) of a material particle(α, m) is denoted by P and is defined by

P =P(τ)=mU(τ). Notice thatP⋅P =−m2.

Definition 3.4 (Relative 3-momentum). The world momentum in an arbitrary admissible basis {ea} is P =Paea, or in other notation

P =(P1, P2, P3, P4)=mγ(�µ,1)=(�p, mγ),

wherep�=(P1, P2, P3)is called the relative 3-momentum. The magnitude of the 3-momentum is given by its Euclidean norm ��p�2=�P12+�P22+�P32.

The quantitymγ= 1mβ2 is sometimes referred as the ”relativistic mass” of(α, m)relative to{ea}.

Referanser

RELATERTE DOKUMENTER

As part of enhancing the EU’s role in both civilian and military crisis management operations, the EU therefore elaborated on the CMCO concept as an internal measure for

The dense gas atmospheric dispersion model SLAB predicts a higher initial chlorine concentration using the instantaneous or short duration pool option, compared to evaporation from

In April 2016, Ukraine’s President Petro Poroshenko, summing up the war experience thus far, said that the volunteer battalions had taken part in approximately 600 military

This report documents the experiences and lessons from the deployment of operational analysts to Afghanistan with the Norwegian Armed Forces, with regard to the concept, the main

Based on the above-mentioned tensions, a recommendation for further research is to examine whether young people who have participated in the TP influence their parents and peers in

Azzam’s own involvement in the Afghan cause illustrates the role of the in- ternational Muslim Brotherhood and the Muslim World League in the early mobilization. Azzam was a West

There had been an innovative report prepared by Lord Dawson in 1920 for the Minister of Health’s Consultative Council on Medical and Allied Services, in which he used his

The ideas launched by the Beveridge Commission in 1942 set the pace for major reforms in post-war Britain, and inspired Norwegian welfare programmes as well, with gradual