Difference between revisions of "Special relativity, an introduction" - New World Encyclopedia

From New World Encyclopedia
(Add footer and category)
Line 1: Line 1:
''A simple introduction to this subject is provided in [[Special relativity for beginners]]''
+
{{Claimed}}
 +
:''This article is intended as a generally accessible introduction to the subject.''
 +
[[Image:Einstein1921 by F Schmutzer 2.jpg|right|thumb|200px|Albert Einstein during a lecture in Vienna in 1921.]]
  
'''Special relativity (SR)''' or the '''special theory of relativity''' is the [[physics|physical]] theory [http://www.fourmilab.ch/etexts/einstein/specrel/www/ published] in [[1905]] by [[Albert Einstein]]. It replaced [[Newtonian physics|Newtonian notions of space and time]] and incorporated [[electromagnetism]] as represented by [[Maxwell's equations]]. The theory is called "special" because it applies the [[principle of relativity]] only to the "restricted" or "special" case of inertial reference frames in [[flat spacetime]], where the effects of [[gravity]] can be ignored. Ten years later, Einstein published his [[general theory of relativity]] (general relativity, "GR") which incorporated these effects.
+
'''Special relativity''' is a fundamental [[physics]] [[theory]] about [[space]] and [[time]] that was developed by [[Albert Einstein]] in 1905<ref>"On the Electrodynamics of Moving Bodies." (fourmilab.ch web site): [http://www.fourmilab.ch/etexts/einstein/specrel/www/ Translation from the German article]: "Zur Elektrodynamik bewegter Körper,"  ''Annalen der Physik''. '''17''':891-921. (June 30, 1905) </ref> as a modification of [[Newtonian physics]]. It was created to deal with some pressing theoretical and experimental issues in the physics of the time involving [[light]] and [[electrodynamics]] (see [[History of special relativity]] for a detailed account). The predictions of special relativity [[correspondence principle|correspond closely]] to those of Newtonian physics at speeds which are low in comparison to that of light, but diverge rapidly for speeds which are a significant fraction of the [[speed of light]]. Special relativity has been experimentally tested on numerous occasions since its inception, and its predictions have been verified by those tests.
  
== History and motivation ==
+
Einstein postulated that the speed of light is the same for all observers, irrespective of their motion relative to the light source. This was in total contradiction to classical mechanics, which had been accepted for centuries. Einstein's approach was based on [[thought experiments]] and calculations. In 1908, [[Hermann Minkowski]] reformulated the theory based on different postulates of a more geometrical nature <ref>[[Hermann Minkowski]], [http://de.wikisource.org/wiki/Raum_und_Zeit_(Minkowski) "Raum und Zeit"], 80. Versammlung Deutscher Naturforscher (Köln, 1908).  Published in Physikalische Zeitschrift '''10''' 104-111 (1909) and Jahresbericht der Deutschen Mathematiker-Vereinigung '''18''' 75-88 (1909).  For an English translation, see Lorentz et al. (1952).</ref>. His approach depended on the existence of certain inter-relations between space and time, which were considered completely separate in classical physics. This reformulation set the stage for further developments of [[physics]].
  
''Main article: [[History of special relativity]]''
+
Special relativity makes numerous predictions which are incompatible with Newtonian physics (and everyday intuition).  The first such prediction described by Einstein is called the [[relativity of simultaneity]], under which observers who are in motion with respect to each other may disagree on whether two events occurred at the same time or one occurred before the other. The other major predictions of special relativity are [[time dilation]] (under which a moving clock ticks more slowly than when it is at rest with respect the observer), [[length contraction]] (under which a moving rod may be found to be shorter than when it is at rest with respect to the observer), and the [[Mass–energy equivalence|equivalence of mass and energy]] (written as ''E''=''mc''<sup>2</sup>).  Special relativity predicts a [[non-linear]] [[velocity addition formula]] which prevents speeds greater than that of light from being observed. Special relativity also explains why [[Maxwell's equations]] of [[electromagnetism]] are correct in any [[frame of reference]], and how [[electric field]] and [[magnetic field]] are two aspects of the same thing.
 +
 +
Special relativity has been [http://www.math.ucr.edu/home/baez/physics/Relativity/SR/experiments.html tested] [http://www2.corepower.com:8080/~relfaq/experiments.html in many ways], and so far has <!-- "passed every such test" : this is incorrect, SR is only an approximation to GR. —>been proved to be far more accurate than Newtonian mechanics.  The most famous of these tests is the [[Michelson-Morley experiment]], the failure of which was one of the things which motivated the creation of special relativity.  Other significant tests are the [[Fizeau experiment]] (which was first done decades before special relativity was proposed), the detection of the [[transverse Doppler effect]], and the [[Haefele-Keating experiment]].  Today, scientists are so comfortable with the idea that the speed of light is always the same that the [[meter]] is now defined as being the distance traveled by light in 1/299,792,458<sup>th</sup> of a [[second]].  This means that the speed of light is now defined as being 299,792,458 m/s.
  
The [[principle of relativity]] was introduced by [[Galileo Galilei|Galileo]].  Overturning the old absolutist views of [[Aristotle]], it held that motion, or at least uniform motion in a straight line, only had meaning relative to something else, and that there was no absolute reference frame by which all things could be measured. Galileo also assumed a set of transformations called the [[Galilean transformations]], which seem like common sense today.  Galileo produced five laws of motion. 
+
==Reference frames and Galilean relativity: a classical prelude==
  
In contrast, Newton inferred from the effects of rotation the existence of an "absolute space" - an absolute reference frame - on which he based his theory. Thus he constructed an improved set of equations containing only three laws of motion. Still, he kept the principle of relativity for what can be observed&mdash; uniform motion could not detect absolute space.
+
A [[reference frame]] is simply a selection of what constitutes stationary objects. Once the velocity of a certain object is arbitrarily defined to be zero, the velocity of everything else in the universe can be measured relative to it<ref>There exists a more technical but mathematically convenient description of reference frames. A reference frame may be considered to be an identification of points in space at different times. That is, it is the identification of space points at different times as being the same point. This concept, particularly useful in making the transition to relativistic spacetime, is described in the language of [[affine space]] by VI Arnold in Mathematical Methods in Classical Mechanics, and in the language of [[fibre bundle]]s by [[Roger Penrose]] in [[The Road to Reality]].</ref>. When a train is moving at a constant velocity past a platform, one may either say that the platform is at rest and the train is moving or that the train is at rest and the platform is moving past it. These two descriptions correspond to two different reference frames. They are respectively called the rest frame of the platform and the rest frame of the train (sometimes simply the platform frame and the train frame).
  
The principle of relativity seemed to work well for everyday phenomena involving solid objects, but light was still problematic. At the end of the 19th century light was understood in terms of [[Maxwell's equations]], which related the known facts that changes in magnetic fields causes changes in electric fields and vice versa, in such a way that propagating electro-magnetic waves could be set up. Since these waves were calculated to travel at the speed of light, these waves were identified to be 'light'. Since mechanical waves travel in a medium, so it was assumed for light, and Maxwell believed that light travelled within a medium at a constant speed. This hypothetical medium was called the "[[luminiferous aether]]."  Thus, the idea of an aether seemed to reintroduce the idea of a detectable absolute frame of reference, one that is stationary with respect to the aether.
+
The question naturally arises, can different reference frames be physically differentiated? In other words, can we conduct some experiments to claim that "we are now in an absolutely stationary reference frame?" Aristotle thought that all objects tend to cease moving and become at rest if there were no forces acting on them. Galileo challenged this idea and argued that the concept of absolute motion was unreal. All motion was relative. An observer who couldn't refer to some isolated object (if, say, he was imprisoned inside a closed spaceship) could never distinguish whether according to some external observer he was at rest or moving with constant velocity. Any experiment he could conduct would give the same result in both cases. However, accelerated reference frames are experimentally distinguishable. For example, if an astronaut moving in free space saw that the tea in his tea-cup was slanted rather than horizontal, he would be able to infer that his spaceship was accelerated. Thus not all reference frames are equivalent, but we have a class of reference frames, all moving at uniform velocity with respect to each other, in all of which Newton's first law holds. These are called the [[inertial reference frames]] and are fundamental to both classical mechanics and SR. Galilean relativity thus states that the laws of physics can not depend on absolute velocity, they must stay the same in any inertial reference frame. Galilean relativity is thus a fundamental principle in classical physics.
  
However, after [[James_Clerk_Maxwell|Maxwell]]'s very successful unification of light, electricity, and magnetism, experimental evidence such as [[Michelson-Morley experiment]] began to show that all was not well. A consensus began to emerge that the speed of light does not vary with the speed of the observer, and the speed of light must be ''[[invariant]]'' (the same) for all observers.  This seemed to further conflict with the principle of relativity. [[Hendrik Lorentz]] proposed a resolution by postulating an aether theory in which objects and observers travelling with respect to a stationary aether underwent a physical shortening (''[[Lorentz-Fitzgerald contraction hypothesis|Lorentz-Fitzgerald contraction]]'') and a change in temporal rate (''[[time dilation]]''). [[Henri Poincaré|Poincaré]]'s version of the relativity principle (1904) went: "The laws of physical phenomenons must be the same, whether for a fixed observer, as also for one dragged in a motion of uniform translation, so that we do not and cannot have any mean to discern whether or not we are dragged in a such motion."
+
Mathematically, it says that if we transform all velocities to a different reference frame, the laws of physics must be unchanged. What is this transformation that must be applied to the velocities? Galileo gave the common-sense 'formula' for adding velocities: if
 +
#particle P is moving at velocity v with respect to reference frame A and
 +
#reference frame A is moving at velocity u with respect to reference frame B, then
 +
#the velocity of P with respect to B is given by v + u.
 +
The formula for transforming coordinates between different reference frames is called the [[Galilean transformation]]. The principle of Galilean relativity then demands that laws of physics be unchanged if the Galilean transformation is applied to them. Laws of classical mechanics, like Newton's second law, obey this principle because they have the same form after applying the transformation. As Newton's law involves the derivative of velocity, any constant velocity added in a Galilean transformation to a different reference frame contributes nothing (the derivative of a constant is zero). Addition of a time-varying velocity (corresponding to an accelerated reference frame) will however change the formula (see [[pseudo force]]), since Galilean relativity only applies to non-accelerated inertial reference frames.
  
Einstein's contribution was, among other things, to ''derive'' Lorentz's equations from a more fundamental principle without assuming the presence of an aether. Under Special Relativity, the seemingly complex transformations of Lorentz and Fitzgerald derived cleanly from simple geometry—including the [[Pythagorean theorem]].  The original title for his theory was (translated from German) "On the Electrodynamics of Moving Bodies".  [[Max Planck]] first suggested the term "relativity" to highlight the notion of transforming the laws of physics between observers moving ''relative'' to one another.
+
Time is the same in all reference frames because it is absolute in classical mechanics. All observers measure exactly the same intervals of time and there is such a thing as an absolutely correct clock.
  
In effect since the laws of physics are identical for all inertial observers, Maxwell's equations are held to be literally true for each and every observer, resulting in a constant speed of light, and no modification is required to refer to any fictitious aether.
+
==Invariance of length: the Euclidean picture==
  
== Postulates ==
+
[[Image:Rel1.GIF|thumb|left|Pythagoras theorem]][[Image:Rel3.GIF|thumb|right|The length of an object is constant on the plane during rotations on the plane but not during rotations out of the plane]] In special relativity, space and time are joined into a unified four-dimensional continuum called [[spacetime]].  To gain a sense of what spacetime is like, we must first look at the [[Euclidean space]] of Newtonian physics.
  
''Main article: [[Postulates of special relativity]]''
+
This approach to the theory of special relativity begins with the concept of "[[length]]."  In everyday experience, it seems that the length of objects remains the same no matter how they are rotated or moved from place to place; as a result the simple length of an object doesn't appear to change or is "[[invariant (physics)|invariant]]."  However, as is shown in the illustrations below, what is actually being suggested is that length seems to be invariant in a three-dimensional coordinate system.
  
1. First [[postulate]] ([[principle of relativity]])
+
The length of a line in a two-dimensional [[Cartesian coordinate system]] is given by [[Pythagorean theorem|Pythagoras' theorem]]:
: The laws of electrodynamics and optics will be valid for all frames in which the laws of mechanics hold good.
 
 
 
: Every physical theory should look the same mathematically to every inertial observer.
 
  
: The laws of physics are independent of location space or time.
+
:<math>h^2 = x^2 + y^2. \,</math>
  
2. Second postulate (invariance of ''c'')
+
One of the basic theorems of vector algebra is that the length of a vector does not change when it is rotated. However, a closer inspection tells us that this is only true if we consider rotations confined to the plane. If we introduce rotation in the third dimension, then we can tilt the line out of the plane. In this case the projection of the line on the plane will get shorter. Does this mean length is not invariant? Obviously not. The world is three-dimensional and in a 3D Cartesian coordinate system the length is given by the three-dimensional version of Pythagoras's theorem:
: The [[speed of light]] in [[vacuum]], commonly denoted ''c'', is the same to all inertial observers, is the same in all directions, and does not depend on the velocity of the object emitting the light. When combined with the First Postulate, this Second Postulate is equivalent to stating that light does not require any medium (such as "aether") in which to propagate.
 
  
== Status ==
+
:<math> k^2 = x^2 + y^2 + z^2. \,</math>
 +
{{clear}}
 +
[[Image:Rel4.GIF|thumb|right|500px|'''Invariance in a 3D coordinate system:''' Pythagoras theorem gives <math>k^2 = h^2 + z^2</math> but <math>h^2 = x^2 + y^2</math> therefore <math> k^2 = x^2 + y^2 + z^2</math>.  The length of an object is constant whether it is rotated or moved from one place to another in a 3D coordinate system]]
 +
This is invariant under all rotations. The apparent violation of invariance of length only happened because we were 'missing' a dimension. It seems that, provided all the directions in which an object can be tilted or arranged are represented within a coordinate system, the length of an object does not change under rotations. A 3-dimensional coordinate system is enough in classical mechanics because time is assumed absolute and independent of space in that context. It can be considered separately.
  
''Main article: [[Status of special relativity]]''
+
Note that invariance of length is not ordinarily considered a dynamic principle, not even a theorem. It is simply a statement about the fundamental nature of space itself. Space as we ordinarily conceive it is called a three-dimensional [[Euclidean space]], because its geometrical structure is described by the principles of [[Euclidean geometry]]. The formula for distance between two points is a fundamental property of an Euclidean space, it is called the Euclidean metric tensor (or simply the Euclidean metric). In general, distance formulas are called metric tensors.
  
Special relativity is only accurate when gravitational effects are [[negligible]] or very weak, otherwise it must be replaced by [[general relativity]].  At very small scales, such as at the [[Planck length]] and below, it is also possible that special relativity breaks down, due to the effects of [[quantum gravity]]. However, at macroscopic scales and in the absence of strong gravitational fields, special relativity is now universally accepted by the physics community and experimental results which appear to contradict it are widely believed to be due to unreproducible experimental error.
+
Note that rotations are fundamentally related to the concept of length. In fact, one may define length or distance to be that which stays the same (is invariant) under rotations, or define rotations to be that which keep the length invariant. Given any one, it is possible to find the other. If we know the distance formula, we can find out the formula for transforming coordinates in a rotation. If, on the other hand, we have the formula for rotations then we can find out the distance formula.
  
Because of the freedom one has to select how one defines units of length and time in physics, it is possible to make one of the two postulates of relativity a [[tautology|tautological]] consequence of the definitions, but one cannot do this for both postulates simultaneously, as when combined they have consequences which are independent of one's choice of definition of length and time.
+
== The Minkowski formulation: introduction of spacetime==
 +
{{Main|Spacetime}}
 +
[[Image:Hermann Minkowski.jpg|thumbnail|left|120px|Hermann Minkowski]]
 +
After Einstein derived special relativity formally from the counterintuitive proposition that the speed of light is the same to all observers, the need was felt for a more satisfactory formulation.{{Fact|date=October 2007}} Minkowski, building on mathematical approaches used in non-euclidean geometry<ref>Walter, S.(1999) The non-Euclidean style of Minkowskian relativity. The Symbolic Universe, J. Gray (ed.), Oxford University Press, 1999 http://www.univ-nancy2.fr/DepPhilo/walter/papers/nes.pdf</ref> and the mathematical work of Lorentz and Poincaré, realised that a geometric approach was the key. Minkowski showed in 1908 that Einstein's new theory could be explained in a natural way if the concept of separate space and time is replaced with one four-dimensional continuum called spacetime. This was a groundbreaking concept, and [[Roger Penrose]] has said that relativity was not truly complete until Minkowski reformulated Einstein's work.
  
Special relativity is mathematically self-consistent, and is also compatible with all modern physical theories, most notably [[quantum field theory]], [[string theory]], and general relativity (in the limiting case of negligible gravitational fields). However special relativity is incompatible with several earlier theories, most notably Newtonian mechanics. See [[Status of special relativity]] for a more detailed discussion.
+
The concept of a four-dimensional space is hard to visualise. It may help at the beginning to think simply in terms of coordinates. In three-dimensional space, one needs three real numbers to refer to a point. In the [[Minkowski space]], one needs four real numbers (three space coordinates and one time coordinate) to refer to a point at a particular instant of time. This point at a particular instant of time, specified by the four coordinates, is called an event. The distance between two different events is called the spacetime interval.
  
A number of experiments have been conducted to test special relativity against rival theories. These include:
+
A path through the four-dimensional spacetime, usually called Minkowski space, is called a [[world line]]. Since it specifies both position and time, a particle having a known world line has a completely determined trajectory and velocity. This is just like graphing the displacement of a particle moving in a straight line against the time elapsed. The curve contains the complete motional information of the particle.
  
* The [[Michelson-Morley experiment]] disproved the possibility of ether drift, and tested the directional invariance of the speed of light
+
[[Image:Rel5.GIF|thumb|right|500px|The spacetime interval]]
* [[Hamar experiment]] - obstruction of ether flow
 
* [[Trouton-Noble experiment]] - torque on a capacitor
 
* [[Kennedy-Thorndike experiment]] - time contraction
 
* [[Rossi-Hall experiment]] - spacetime contraction's effects on a fast-moving particle's half-life
 
* Experiments to test [[emitter theory]] demonstrated that the speed of light is independent of the speed of the emitter.
 
  
== Consequences ==
+
In the same way as the measurement of distance in 3D space needed all three coordinates we must include time as well as the three space coordinates when calculating the distance in Minkowski space (henceforth called M). In a sense, the spacetime interval provides a combined estimate of how far two events occur in space as well as the time that elapses between their occurrence.
  
''Main article: [[Consequences of Special Relativity]]''
+
But there is a problem. Time is related to the space coordinates, but they are not equivalent. Pythagoras's theorem treats all coordinates on an equal footing (see [[Euclidean space]] for more details). We can exchange two space coordinates without changing the length, but we can not simply exchange a space coordinate with time, they are fundamentally different. It is an entirely different thing for two events to be separated in space and to be separated in time. Minkowski proposed that the formula for distance needed a change. He found that the correct formula was actually quite simple, differing only by a sign from Pythagoras's theorem:
  
Special relativity leads to different physical predictions than Galilean relativity
+
:<math> s^2 = x^2 + y^2 + z^2 - (ct)^2 \,</math>
when relative velocities become comparable to the speed of light. The speed of
 
light is so much larger than anything humans encounter that some of the effects
 
predicted by relativity are initially counter intuitive. 
 
  
* The time lapse between two events is not invariant from one observer to another, but is dependent on the relative speeds of the observers' reference frames. (See [[Lorentz transformation equations]])
+
where ''c'' is a constant and ''t'' is the time coordinate <ref>Originally Minkowski tried to make his formula look like Pythagoras's theorem by introducing the concept of imaginary time and writing -1 as i2. But Wilson, Gilbert, Borel and others proposed that this was unnecessary and introduced real time with the assumption that, when comparing coordinate systems, the change of spatial displacements with displacements in time can be negative. This assumption is expressed in differential geometry using a [[metric tensor]] that has a negative coefficient. The different signature of the Minkowski metric means that the Minkowski space has hyperbolic rather than Euclidean geometry.
* Two events that occur simultaneously in different places in one frame of reference may occur at different times in another frame of reference (lack of [[Absolute simultaneity|absolute simultaneity]]).
+
</ref>. Multiplication by c, which has the dimension <math>ms^{-1}</math>, converts the time to units of length and this constant has the same value as the [[speed of light]]. So the spacetime interval between two distinct events is given by
* The dimensions (e.g. length) of an object as measured by one observer may differ from the results of measurements of the same object made by another observer. (See [[Lorentz transformation equations]])
 
* The [[twin paradox]] concerns a twin who flies off in a spaceship travelling near the speed of light. When he returns he discovers that his twin has aged much more rapidly than he has (or he aged more slowly).
 
* The [[ladder paradox]] involves a long ladder travelling near the speed of light and being contained within a smaller garage.
 
  
== Lack of an absolute reference frame ==
+
:<math> s^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 - c^2 (t_2 - t_1)^2. \,</math>
  
Special Relativity rejects the idea of any observable absolute ('unique' or 'special') frame of reference; rather physics must look the same to '''''all''''' observers travelling at a constant velocity (inertial frame). This 'principle of relativity' dates back to Galileo, and is incorporated into Newtonian Physics. In the late 19<sup>th</sup> Century, some physicists suggested that the universe was filled with a substance known as "[[aether]]" which transmitted Electromagnetic waves. Aether constituted an absolute reference frame against which speeds could be measured. Aether had some wonderful properties: it was sufficiently elastic that it could support electromagnetic waves, those waves could interact with matter, yet it offered no resistance to bodies passing through it. The results of various experiments, including the [[Michelson-Morley experiment]], suggested that the Earth was always 'stationary' relative to the Aether - something that is difficult to explain. The most elegant solution was to discard the notion of Aether and an absolute frame, and to adopt Einstein's postulates.
+
There are two major points to be noted. Firstly, time is being measured in the same units as length by multiplying it by a constant conversion factor. Secondly, and more importantly, the time-coordinate has a different sign than the space coordinates. This means that in the four-dimensional spacetime, one coordinate is different from the others and influences the distance differently. This new 'distance' may be zero or even negative. This new distance formula, called the [[Metric (mathematics)|metric]] of the spacetime, is at the heart of relativity. This distance formula is called the [[metric tensor]] of M. This minus sign means that a lot of our intuition about distances can not be directly carried over into spacetime intervals. For example, the spacetime interval between two events separated both in time and space may be zero (see below). From now on, the terms distance formula and metric tensor will be used interchangeably, as will be the terms Minkowski metric and spacetime interval.
  
== Space, time, and velocity ==
+
In Minkowski spacetime the spacetime interval is the invariant length, the ordinary 3D length is not required to be invariant. The spacetime interval must stay the same under rotations, but ordinary lengths can change. Just like before, we were missing a dimension. Note that everything this far are merely definitions. We define a four-dimensional mathematical construct which has a special formula for distance, where distance means that which stays the same under rotations (alternatively, one may define a rotation to be that which keeps the distance unchanged).
[[image:Lorentz transform of world line.gif|right|framed|Changing views of spacetime along the [[world line]] of a rapidly accelerating observer.<br><br>In this animation, the dashed line is the world line of a particle whose view of spacetime is being illustrated.  The balls are placed at regular intervals of [[proper time]] along the world line.  The solid diagonal lines are the [[light cone]]s for the observer's current event, and intersect at that event. The small dots are other arbitrary events in the spacetime.  For the observer's current instantaneous inertial frame of reference, the vertical direction is temporal and the horizontal direction is spatial.<br><br>The slope of the world line (deviation from being vertical) is the velocity of the particle on that section of the world line.  So at a bend in the world line the particle is being accelerated. Note how the view of spacetime changes as current event passes through the accelerations, changing the instantaneous inertial frame of reference.  These changes are governed by the Lorentz transformationsAlso note that:<br>
 
&bull; the balls on the world line before/after future/past accelerations are more spaced out due to time dilation.<br>
 
&bull; events which were simultaneous before an acceleration are at different times afterwards (due to the [[relativity of simulataneity]]),<br>
 
&bull; events pass through the light cone lines due to the progression of proper time, but not due to the change of views caused by the accelerations,and<br>
 
&bull; the world line always remains within the future and past light cones of the current event.]]
 
Main article: [[Lorentz transformation]]
 
  
An event is an occurrence that can be assigned a single unique time and location in space: It is a "point" in [[Spacetime|space-time]]. For example, the explosion of a firecracker is an "event" to a good approximation. We can completely specify an event by its four space-time coordinates: The time of occurrence, and its 3-dimensional spatial location. Suppose we have two systems <math>S</math> and <math>S'</math>, whose spatial axes are co-aligned and are moving at a constant velocity (<math>v</math>) with respect to each other along their <math>x</math> axes. If an [[event]] has space-time coordinates <math>(t, x, y, z)</math> in system <math>S</math> and <math>(t', x', y', z')</math> in <math>S'</math>, and their origins coincide (in other words (0,0,0,0) in <math>S</math> coincides with (0,0,0,0) in <math>S'</math>), then the Lorentz transformation specifies that these coordinates are related in the following way:
+
Now comes the physical part. Rotations in Minkowski space have a different interpretation than ordinary rotations. These rotations correspond to transformations of reference frames. Passing from one reference frame to another corresponds to rotating the Minkowski space. An intuitive justification for this is given below, but mathematically this is a dynamical postulate just like assuming that physical laws must stay the same under Galilean transformations (which seems so intuitive that we don't usually recognise it to be a postulate).
: <math>t' = \gamma \left(t - \frac{v x}{c^{2}} \right)</math>
 
: <math>x' = \gamma (x - v t)\,</math>
 
: <math>y' = y\,</math>
 
: <math>z' = z\,</math>
 
where <math>\gamma \equiv \frac{1}{\sqrt{1 - v^2/c^2}}</math>  is called the [[Lorentz factor]] and <math>c</math> is the [[speed of light]] in a vacuum.
 
  
If the observer in <math>S</math> sees an object moving along the <math>x</math> axis at velocity <math>w</math> then the observer in the <math>S'</math> system will see the object moving with velocity <math>w'</math> where
+
Since by definition rotations must keep the distance same, passing to a different reference frame must keep the spacetime interval between two events unchanged. This requirement can be used to derive an explicit mathematical form for the transformation that must be applied to the laws of physics (compare with the application of Galilean transformations to classical laws) when shifting reference frames. These transformations are called the [[Lorentz transformations]]. Just like the [[Galilean transformations]] are the mathematical statement of the principle of Galilean relativity in classical mechanics, the Lorentz transformations are the mathematical form of Einstein's principle of relativity. Laws of physics must stay the same under Lorentz transformations. [[Maxwell's equations]] and [[Dirac's equation]] satisfy this property, and hence they are relativistically correct laws (but classically incorrect, since they don't transform correctly under Galilean transformations).
  
:<math>w'=\frac{w-v}{1-wv/c^2}</math>.
+
With the statement of the Minkowski metric, the common name for the distance formula given above, the theoretical foundation of special relativity is complete. The entire basis for special relativity can be summed up by the geometric statement "changes of reference frame correspond to rotations in the 4D Minkowski spacetime, which is defined to have the distance formula given above." The unique dynamical predictions of SR stem from this geometrical property of spacetime. Special relativity may be said to be the physics of Minkowski spacetime<ref>Einstein(1916) {{cite book
 +
| last = Einstein
 +
| first = A.
 +
| editor =
 +
| others = Tr. Lawson, R.W.
 +
| title = Relativity. The special and general theory.
 +
| year = 1916
 +
| publisher = Routledge classics 2001
 +
| location = London
 +
| id =
 +
| doi =
 +
}}: "It appears therefore more natural to think of physical reality as a four dimensional existence, instead of, as hitherto, the ''evolution'' of a three dimensional existence."
  
This equation can be derived from the space and time transformations above. Notice that if the object is moving at the speed of light in the <math>S</math> system (i.e. <math>w=c</math>), then it will also be moving at the speed of light in the <math>S'</math> system. Also, if both <math>w</math> and <math>v</math> are small with respect to the speed of light, we will recover the intuitive Galilean transformation of velocities: <math>w'=w-v</math>.
+
Roger Penrose (1998), {{cite book
 +
| last = Feynman
 +
| first = Richard
 +
| editor =
 +
| others = Introduction by R. Penrose
 +
| title = Six not so easy pieces.
 +
| year = 1998
 +
| publisher = Penguin Books
 +
| location = England
 +
| id =
 +
| doi =
 +
}}: "The idea that the history of the universe should be viewed, physically, as a ''four''-dimensional spacetime, rather than as a three dimensional space evolving with time is indeed fundamental to modern physics."
  
== Mass, momentum, and energy ==
+
Hermann Weyl(1918), {{cite book
 +
| last = Weyl
 +
| first = Hermann
 +
| editor =
 +
| others =  
 +
| title = Space, time, matter.
 +
| year = 1918
 +
| publisher = Dover Books edition 1952
 +
| location = New York
 +
| id =  
 +
| doi =
 +
}}: "The adequate mathematical formulation of Einstein's discovery was first given by Minkowski: to him we are indebted for the idea of four dimensional world-geometry, on which we based our argument from the outset."
  
In addition to modifying notions of space and time, special relativity forces one to reconsider the concepts of [[mass]], [[momentum]], and [[energy]], all of which are important constructs in [[Newtonian mechanics]]. Special relativity shows, in fact, that these concepts are all different aspects of the same physical quantity in much the same way that it shows space and time to be interrelated.
+
Kip Thorne and Roger Blandford in their [http://www.pma.caltech.edu/Courses/ph136/yr2004/0401.1.K.pdf Caltec physics notes] say: "Special relativity is the limit of general relativity in the complete absence of gravity; its arena is flat, 4-dimensional Minkowski spacetime."
  
There are a couple of (equivalent) ways to define momentum and energy in SR. One method uses [[conservation law]]s. If these laws are to remain valid in SR they must be true in every possible reference frame. However, if one does some simple [[thought experiment]]s using the Newtonian definitions of momentum and energy one sees that these quantities are not conserved in SR. One can rescue the idea of conservation by making some small modifications to the definitions to account for relativistic velocities. It is these new definitions which are taken as the correct ones for momentum and energy in SR.
+
Sean Carroll [http://www.pma.caltech.edu/Courses/ph136/yr2004/0401.1.K.pdf says]: "..it makes sense to think of SR as a theory of 4-dimensional spacetime, known as Minkowski space."</ref>. In this case of spacetime, there are six independent rotations to be considered. Three of them are the standard rotations on a plane in two directions of space.  The other three are rotations in a plane of both space and time:  These rotations correspond to a change of velocity, and are described by the traditional Lorentz transformations.
  
Given an object of [[invariant mass]] ''m_0'' traveling at velocity ''u'' the energy and momentum are given by
+
As has been mentioned before, one can replace distance formulas with rotation formulas. Instead of starting with the invariance of the Minkowski metric as the fundamental property of spacetime, one may state (as was done in classical physics with Galilean relativity) the mathematical form of the Lorentz transformations and require that physical laws be invariant under these transformations.  This makes no reference to the geometry of spacetime, but will produce the same result. This was in fact the traditional approach to SR, used originally by Einstein himself. However, this approach is often considered to offer less insight and be more cumbersome than the more natural Minkowski formalism.
  
:<math>E = \gamma m_0 c^2 \,\!</math>
+
==Reference frames and Lorentz transformations: relativity revisited==
  
:<math> p = \gamma m_0 u \,\!</math>
+
We have already discussed that in classical mechanics coordinate frame changes correspond to Galilean transfomations of the coordinates. Is this adequate in the relativistic Minkowski picture?
  
where ''&gamma;'' (the [[Lorentz factor]]) is given by
+
Suppose there are two people, Bill and John, on separate planets that are moving away from each other. Bill and John are on separate planets so they both think that they are stationary. John draws a graph of Bill's motion through space and time and this is shown in the illustration below:
  
:<math>\gamma = \frac{1}{\sqrt{1 - u^2/c^2}} \,\!</math>
+
[[Image:Rel6.GIF|frame|John's view of Bill and Bill's view of himself]]
  
and ''c'' is the speed of light. The term &gamma; occurs frequently in relativity, and comes from the [[Lorentz transformation equations]]. The energy and momentum can be related through the formula
+
John sees that Bill is moving through space as well as time but Bill thinks he is moving through time alone.  Bill would draw the same conclusion about John's motion. In fact, these two views, which would be classically considered a difference in reference frames, are related simply by a coordinate transformation in M. Bill's view of his own world line and John's view of Bill's world line are related to each other simply by a rotation of coordinates. One can be transformed into the other by a rotation of the time axis. Minkowski geometry handles transformations of reference frames in a very natural way.
  
:<math> E^2 - (p c)^2 = (m_0 c^2)^2 \,\!</math>
+
Changes in reference frame, represented by velocity transformations in classical mechanics, are represented by rotations in Minkowski space. These rotations are called [[Lorentz transformations]]. They are different from the Galilean transformations because of the unique form of the Minkowski metric. The Lorentz transformations are the relativistic equivalent of Galilean transformations. Laws of physics, in order to be relativistically correct, must stay the same under Lorentz transformations. The physical statement that they must be same in all inertial reference frames remains unchanged, but the mathematical transformation between different reference frames changes. Newton's laws of motion are invariant under Galilean rather than Lorentz transformations, so they are immediately recognisable as non-relativistic laws and must be discarded in relativistic physics. Schrödinger's equation is also non-relativistic.
  
which is referred to as the ''relativistic energy-momentum equation''. These equations can be more succinctly stated using
+
Maxwell's equations are trickier. They are written using vectors and at first glance appear to transform correctly under Galilean transformations. But on closer inspection, several questions are apparent that can not be satisfactorily resolved within classical mechanics (see [[History of special relativity]]). They are indeed invariant under Lorentz transformations and are relativistic, even though they were formulated before the discovery of special relativity. Classical electrodynamics can be said to be the first relativistic theory in physics. To make the relativistic character of equations apparent, they are written using 4-component vector like quantities called 4-vectors. 4-Vectors transform correctly under Lorentz transformations. Equations written using 4-vectors are automatically relativistic. This is called the manifestly covariant form of equations. 4-Vectors form a very important part of the formalism of special relativity.
the [[four-momentum]] <math>P^a</math> and the [[four-velocity]] <math>U^a</math> as
 
  
:<math>P^a = m_0 U^a</math>
+
== Einstein's postulate: the constancy of the speed of light ==
  
which can be viewed as a relativistic analogue of [[Newton's laws of motion|Newton's second law]].
+
Einstein's postulate that the speed of light is a constant comes out as a natural consequence of the Minkowski formulation<ref name="Einstein2">{{cite book
 +
| last = Einstein
 +
| first = A.
 +
| editor =
 +
| others = Tr. Lawson, R.W.
 +
| title = Relativity. The special and general theory.
 +
| year = 1916
 +
| publisher = Routledge classics 2001
 +
| location = London
 +
| id =
 +
| doi =
 +
}}</ref>.
  
For velocities much smaller than those of light &gamma; can be approximated
+
Proposition 1:
using a [[Taylor series|Taylor series expansion]] and one finds that
+
:When an object is travelling at ''c'' in a certain [[Frame of reference|reference frame]], the spacetime interval is '''zero'''.
 +
Proof:
 +
: The spacetime interval between the origin-event ''(0,0,0,0)'' and an event ''(x, y,z, t)'' is
 +
:: <math> s^2 = x^2 + y^2 + z^2 - (ct)^2 .\,</math>
 +
: The distance travelled by an object moving at velocity ''v'' for ''t'' seconds is: 
 +
:: <math> \sqrt{x^2 + y^2 + z^2} = vt \,</math>
 +
: giving
 +
:: <math> s^2 = (vt)^2 - (ct)^2 .\,</math>
 +
: Since the velocity ''v'' equals ''c'' we have
 +
:: <math> s^2 = (ct)^2 - (ct)^2 .\,</math>
 +
: Hence the spacetime interval between the events of departure and arrival is given by
 +
:: <math> s^2 = 0 \,</math>
  
:<math> E \approx m_0 c^2 + \begin{matrix} \frac{1}{2} \end{matrix} m_0 u^2 \,\!</math>
+
Proposition 2:
 +
:An object travelling at ''c'' in one reference frame is travelling at ''c'' in all reference frames.
 +
Proof:
 +
: Let the object move with velocity v when observed from a different reference frame. A change in reference frame corresponds to a rotation in M. Since the spacetime interval must be conserved under rotation, the spacetime interval must be the same in all reference frames. In proposition 1 we showed it to be zero in one reference frame, hence it must be zero in all other reference frames. We get that
 +
:: <math> (vt)^2 - (ct)^2 = 0 \,</math>
 +
: which implies
 +
:: <math> |v| = c .\,</math>
  
:<math> p \approx m_0 u \,\!</math>
+
The paths of light rays have a zero spacetime interval, and hence all observers will obtain the same value for the speed of light. Therefore, when assuming that the universe has four dimensions that are related by Minkowski's formula, the speed of light appears as a constant, and does not need to be assumed (postulated) to be constant as in Einstein's original approach to special relativity.
  
Barring the first term in the energy expression (discussed below), these formulas agree exactly with the standard definitions of Newtonian [[kinetic energy]] and momentum. This is as it should be, for special relativity must agree with Newtonian mechanics at low velocities.
+
==Clock delays and rod contractions: more on Lorentz transformations==
  
Looking at the above formulas for energy, one sees that when an object is at rest ('''''u''''' = 0 and &gamma; = 1) there is a non-zero energy remaining:
+
Another consequence of the invariance of the spacetime interval is that clocks will appear to go slower on objects that are moving relative to you. This is very similar to how the 2D projection of a line rotated into the third-dimension appears to get shorter. Length is not conserved simply because we are ignoring one of the dimensions. Let us return to the example of John and Bill.
  
:<math>E = m_0 c^2 \,\!</math>
+
John observes the length of Bill's spacetime interval as:  
  
This energy is referred to as ''rest energy''. The rest energy does not cause any conflict with the Newtonian theory because it is a constant and, as far as kinetic energy is concerned, it is only differences in energy which are meaningful.
+
: <math>s^2 = (vt)^2 - (ct)^2 \,</math>
  
Taking this formula at face value, we see that in relativity, <em>mass is simply another form of energy</em>. This formula becomes important when one measures the masses of different atomic
+
whereas Bill doesn't think he has traveled in space, so writes:
nuclei.  By looking at the difference in masses, one can predict which nuclei have extra stored
 
energy which can be released by [[nuclear reaction]]s, providing important information which
 
was useful in the development of the [[nuclear bomb]].  The implications of this formula on [[20th century]] life have made it one of the most famous equations in all of science.
 
  
=== On mass ===
+
: <math>s^2 = (0)^2 - (cT)^2 \,</math>
  
Introductory physics courses and some older textbooks on special relativity sometimes define a so-called <em>relativistic mass</em>
+
The spacetime interval, s<sup>2</sup>, is invariant. It has the same value for all observers, no matter who measures it or how they are moving in a straight line. This means that Bill's spacetime interval equals John's observation of Bill's spacetime interval so:
which may lead to the impression that special relativity implies the mass of a body increases as its velocity increases. According to many, this is technically incorrect from a theoretical standpoint since the first postulate was later used to construct a theory in which the properties of an object are independent of any inertial frame, i.e., <em>invariant</em>, while some others disagree because special relativity only imposes properties such as mass, length and resonance frequency to be invariant as measured at rest in any inertial frame.
 
  
Defining such a quantity may sometimes be <em>useful</em> in that
+
: <math>(0)^2 - (cT)^2 = (vt)^2 - (ct)^2 \,</math>  
doing so simplifies a calculation by restricting it to a specific frame. For
 
example, consider a body with an invariant mass <math>m_0 \!</math> moving at some velocity relative to an observer's reference system. That observer defines the <em>relativistic mass</em> of that body as:
 
  
:<math>m = \gamma m_0\!</math>
+
and
  
Note that the body does <em>not</em> actually become more massive in its proper frame, since the relativistic mass would be different for an observer in a different
+
: <math>-(cT)^2 = (vt)^2 - (ct)^2 \,</math>
frame. The <em>only</em> mass which is observer independent is the invariant
 
mass. When using the relativistic mass, one must always specify a velocity
 
relative to a particular observer. Physics textbooks sometimes use the
 
relativistic mass as it allows the students to utilize their knowledge
 
of Newtonian physics to gain some intuitive grasp of relativity in the frame
 
they find most interesting: their own! "Relativistic mass" is also consistent with the concepts "time dilation" and "length contraction".
 
  
== Simultaneity and causality ==
+
hence
  
Special relativity holds that events that are [[simultaneity|simultaneous]] in one frame of reference need not be simultaneous in another frame of reference.
+
: <math>t = \frac{T}{\sqrt{1 - \frac{v^2}{c^2}}} \,</math>.
  
 +
So, if John sees a clock that is at rest in Bill's frame record one second, John will find that his own clock measures between these same ticks an interval ''t'', called '''coordinate time''', which is greater than one second. It is said that clocks in motion slow down, relative to those on observers at rest. This is known as "relativistic [[time dilation]] of a moving clock." The time that is measured in the rest frame of the clock (in Bill's frame) is called the [[proper time]] of the clock.
  
[[Image:light_cone.png|thumb|right|light cone]]
+
In special relativity, therefore, changes in reference frame affect time also. Time is no longer absolute. There is no universally correct clock, time runs at different rates for different observers.
  
The interval AB in the diagram to the right is 'time-like'. I.e. there is a frame of reference in which event A and event B occur at the same location in space, separated only by occurring at different times. If A precedes B in that frame, then A precedes B in all frames. It is hypothetically possible for matter (or information) to travel from A to B, so there can be a causal relationship (with A the cause and B the effect).
+
Similarly it can be shown that John will also observe measuring rods at rest on Bill's planet to be shorter in the direction of motion than his own measuring rods<ref>It should also be made clear that the length contraction result only applies to rods aligned in the direction of motion. At right angles to the direction of motion, there is no contraction.</ref>. This is a prediction known as "relativistic [[length contraction]] of a moving rod." If the length of a rod at rest on Bill's planet is <math>X</math>, then we call this quantity the [[proper length]] of the rod. The length <math>x</math> of that same rod as measured on John's planet, is called '''coordinate length''', and given by
  
The interval AC in the diagram is 'space-like'. I.e. there is a frame of reference in which event A and event C occur simultaneously, separated only in space. However there are also frames in which A precedes C (as shown) and frames in which C precedes A.
+
:<math>x = X \sqrt{1 - \frac{v^2}{c^2}} \,</math>.
Barring some way of traveling [[faster than light]], it is not possible for any matter (or information) to travel from A to C or from C to A. Thus there is no causal connection between A and C.
 
<!-- a pair of diagrams, with x-t and x'-t' coordinates would help here —>
 
  
 +
[[Image:Rel7.GIF|thumb|300px|right|How Bill's coordinates appear to John at the instant they pass each other]]
 +
These two equations can be combined to obtain the general form of the Lorentz transformation in one spatial dimension:
  
 +
::<math>\begin{align}T &= \gamma \left( t - \frac{v x}{c^{2}} \right)  \\
 +
X &= \gamma \left( x - v t \right)\\\end{align}</math>
 +
or equivalently:
 +
::<math>\begin{align}t &= \gamma \left( T + \frac{v X}{c^{2}} \right)  \\
 +
x &= \gamma \left( X + v T \right)\\\end{align}</math>
 +
where the [[Lorentz factor]] is given by <math>\gamma = { 1 \over \sqrt{1 - v^2/c^2} } </math>
  
== The geometry of space-time ==
+
The above formulas for clock delays and length contractions are special cases of the general transformation.
  
SR uses a 'flat' 4-dimensional [[Minkowski space]], which is an example of a [[space-time]].  This space, however, is very similar to the standard 3 dimensional Euclidean space, and fortunately by that fact, very easy to work with.
+
Alternatively, these equations for time dilation and length contraction (here obtained from the invariance of the spacetime interval), can be ''obtained'' directly ''from'' the Lorentz transformation by setting X = 0 for time dilation, meaning that the clock is at rest in Bill's frame, or by setting t = 0 for length contraction, meaning that John must measure the distances to the end points of the moving rod at the same time.
  
The [[differential (mathematics)|differential]] of distance(''ds'') in cartesian 3D space is defined as:
+
A consequence of the Lorentz transformations is the modified [[Velocity-addition formula#Special Theory of Relativity|velocity-addition formula]]:
  
:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 </math>
+
:<math> s = {v+u \over 1+(v/c)(u/c)}. </math>
  
where <math>(dx_1,dx_2,dx_3)</math> are the differentials of the three spatial dimensions. In the geometry of special relativity, a fourth dimension, time, is added, with units of [[speed of light|c]], so that the equation for the differential of distance becomes:
+
==Simultaneity and clock desynchronisation==
 +
{{Expand|date=October 2007}}
 +
The last consequence of Minkowski's spacetime is that clocks will appear to be out of phase with each other along the length of a moving object.  This means that if one observer sets up a line of clocks that are all synchronised so they all read the same time, then another observer who is moving along the line at high speed will see the clocks all reading different times.  This means that observers who are moving relative to each other see different events as simultaneous.  This effect is known as "Relativistic Phase" or the "Relativity of Simultaneity." Relativistic phase is often overlooked by students of special relativity, but if it is understood, then phenomena such as the [[twin paradox]] are easier to understand.
  
:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 - c^2 dt^2 </math>
+
[[Image:Relsim2.GIF|thumb|left|300px|The "plane of simultaneity" or "surface of simultaneity" contains all those events that happen at the same instant for a given observer. Events that are simultaneous for one observer are not simultaneous for another observer in relative motion.]]Observers have a set of simultaneous events around them that they regard as composing the present instant. The relativity of simultaneity results in observers who are moving relative to each other having different sets of events in their present instant. 
  
In many situations it may be convenient to treat time as [[imaginary number|imaginary]] (e.g. it may simplify equations), in which case <math>t</math> in the above equation is replaced by <math>i.t'</math>, and the metric becomes
+
The net effect of the four-dimensional universe is that observers who are in motion relative to you seem to have time coordinates that lean over in the direction of motion, and consider things to be simultaneous that are not simultaneous for you. Spatial lengths in the direction of travel are shortened, because they tip upwards and downwards, relative to the time axis in the direction of travel, akin to a rotation out of three-dimensional space.
  
:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 + c^2(dt')^2 </math>
+
Great care is needed when interpreting spacetime diagrams.  Diagrams present data in two dimensions, and cannot show faithfully how, for instance, a zero length spacetime interval appears.
 +
{{clear}}
  
If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space
+
==General relativity: a peek forward==
  
:<math> ds^2 = dx_1^2 + dx_2^2 - c^2 dt^2 </math>
+
Unlike Newton's laws of motion, relativity is not based upon dynamical postulates. It does not assume anything about motion or forces. Rather, it deals with the fundamental nature of spacetime. It is concerned with describing the geometry of the backdrop on which all dynamical phenomena take place. In a sense therefore, it is a meta-theory, a theory that lays out a structure that all other theories must follow. In truth, Special relativity is only a special case. It assumes that spacetime is flat. That is, it assumes that the structure of Minkowski space and the Minkowski metric tensor is constant throughout. In [[General relativity]], Einstein showed that this is not true. The structure of spacetime is modified by the presence of matter. Specifically, the distance formula given above is no longer generally valid except in space free from mass. However, just like a curved surface can be considered flat in the infinitesimal limit of calculus, a curved spacetime can be considered flat at a small scale. This means that the Minkowski metric written in the differential form is generally valid.
  
We see that the [[null geodesic|null]] [[geodesic]]s lie along a dual-cone:
+
: <math>ds^2 = dx^2 + dy^2 + dz^2 - c^2 dt^2 \,</math>
  
[[image:sr1.jpg]]
+
One says that the Minkowski metric is valid '''locally''', but it fails to give a measure of distance over extended distances. It is not valid '''globally'''. In fact, in general relativity the global metric itself becomes dependent on the mass distribution and varies through space. The central problem of general relativity is to solve the famous [[Einstein field equations]] for a given mass distribution and find the distance formula that applies in that particular case. Minkowski's spacetime formulation was the conceptual stepping stone to general relativity. His fundamentally new outlook allowed not only the development of general relativity, but also to some extent [[quantum field theories]].
  
defined by the equation
+
==Mass-energy equivalence: sunlight and atom bombs==
 +
<<This section needs expansion.>>
  
:<math> ds^2 = 0 = dx_1^2 + dx_2^2 - c^2 dt^2 </math>
+
Einstein showed that mass is simply another form of energy. The energy equivalent of rest mass m is m*c^2. This equivalence implies that mass should be interconvertible with other forms of energy. This is the basic principle behind atom bombs and production of energy in nuclear reactors and stars (like Sun).
  
or
+
==Applications==
 +
<<This section needs expansion.>>
  
:<math> dx_1^2 + dx_2^2 = c^2 dt^2 </math>
+
There is a common perception that relativistic physics is not needed in everyday life. This is not true. Many technologies are critically dependent on relativistic physics:
 +
* [[Cathode ray tube]]s {{Fact|date=October 2007}},
 +
* [[Particle accelerator]]s,
 +
* [[Global Positioning System]] (GPS) - although this really requires the full theory of [[general relativity]]
  
Which is the equation of a circle with ''r=c*dt''.
+
==The postulates of Special Relativity==
If we extend this to three spatial dimensions, the null geodesics are
 
continuous concentric spheres, with radius = distance = c*(+ or -)time.
 
  
[[image:sr3.jpg]]
+
Einstein developed Special Relativity on the basis of two postulates:
  
:<math> ds^2 = 0 = dx_1^2 + dx_2^2 + dx_3^2 - c^2 dt^2 </math>
+
* First [[postulate]] - '''Special [[principle of relativity]]''' - The laws of [[physics]] are the same in all [[inertial frames of reference]]. In other words, there are no privileged inertial frames of reference.
 +
* Second postulate - '''Invariance of ''c''''' -  The speed of light in a [[vacuum]] is independent of the motion of the [[light]] source.
  
:<math> dx_1^2 + dx_2^2 + dx_3^2 = c^2 dt^2 </math>
+
Special Relativity can be derived from these postulates, as was done by Einstein in 1905.  Einstein's postulates are still applicable in the modern theory but the origin of the postulates is more explicit. It was shown above how the existence of a universally constant velocity (the speed of light) is a consequence of modeling the universe as a particular four dimensional space having certain specific properties. The principle of relativity is a result of Minkowski structure being preserved under Lorentz transformations, which are ''postulated'' to be the physical transformations of inertial reference frames.
  
This null dual-cone represents the "line of sight" of a point in space.  That is, when we look at the stars and say "The light from that star which I am receiving is X years old.", we are looking down this line of sight: a null geodesic.  We are looking at an event <math>d = \sqrt{x_1^2+x_2^2+x_3^2} </math> meters away and ''d/c'' seconds in the past. For this reason the null dual cone is also known as the 'light cone'. (The point in the lower left of the picture below represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".)
+
== See also==
  
[[image:sr1.jpg]]
+
* [[Albert Einstein]]
 +
* [[Special relativity]]
 +
* [[History of special relativity]]
 +
* [[Light]]
  
The cone in the ''-t'' region is the information that the point is 'receiving', while the cone in the ''+t'' section is the information that the point is 'sending'.
+
== Notes ==
 +
<references/>
  
The geometry of Minkowski space can be depicted using [[Minkowski diagram]]s, which are also useful in understanding many of the thought-experiments in special relativity.
+
* The mass of objects and systems of objects has a complex interpretation in special relativity, see [[relativistic mass]].
 +
* "Minkowski also shared Poincaré's view of the Lorentz transformation as a rotation in a four-dimensional space with one imaginary coordinate, and his five four-vector expressions." (Walter 1999).
  
== Related topics ==
+
== References ==
 +
<<Needs at least 3 reliable references, properly formatted.>>
  
:'''People''': [[Arthur Eddington]] | [[Albert Einstein]] | [[Hendrik Lorentz]] | [[Hermann Minkowski]] | [[Bernhard Riemann]] | [[Henri Poincaré]] | [[Alexander MacFarlane]] | [[Harry Bateman]] | [[Robert S. Shankland]]
+
== External links ==
:'''Relativity''': [[Theory of relativity]] | [[principle of relativity]] | [[general relativity]] | [[frame of reference]] | [[inertial frame of reference]] | [[Lorentz transformations]]
+
===Special relativity for a general audience (no math knowledge required)===
:'''Physics''':  [[Newtonian Mechanics]] | [[spacetime]] | [[speed of light]] | [[simultaneity]] | [[cosmology]] | [[Doppler effect]] | [[relativistic Euler equations]] | [[Aether drag hypothesis]]
 
:'''Math''': [[Minkowski space]] | [[four-vector]] | [[world line]] | [[light cone]] | [[Lorentz group]] | [[Poincaré group]] | [[geometry]] | [[tensors]] | [[split-complex number]]
 
:'''Philosophy''': [[actualism]] | [[convensionalism]] | [[formalism]]
 
  
==External links==
+
*[http://www.phys.unsw.edu.au/einsteinlight Einstein Light] An [http://www.sciam.com/article.cfm?chanID=sa004&articleID=0005CFF9-524F-1340-924F83414B7F0000 award]-winning, non-technical introduction (film clips and demonstrations) supported by dozens of pages of further explanations and animations, at levels with or without mathematics.
{{Wikibookspar|Wikiversity|Special Relativity}}
+
*[http://www.einstein-online.info/en/elementary/index.html Einstein Online] Introduction to relativity theory, from the Max Planck Institute for Gravitational Physics.
{{Wikibookspar|Modern Physics|Special Relativity}}
 
*[http://www.everythingimportant.org/relativity/special.pdf Beneath the Foundations of Spacetime] Special relativity can be derived with moving rulers in such a way that the astonishing connection between space and time can be clearly understood.
 
*[http://www.magen.co.uk/calculator.html  Relativity calculator] Geometric calculations of relativistic problems such as the addition of velocities.  Note that it is Java-based and can take several minutes to load using a 56k modem.
 
*[http://www-gap.dcs.st-and.ac.uk/~history/HistTopics/Special_relativity.html Relativity in its Historical Context] The discovery of special relativity was inevitable, given the momentous discoveries that preceded it.
 
*[http://arxiv.org/PS_cache/physics/pdf/0302/0302045.pdf Nothing but Relativity] There are many ways to derive the Lorentz transformation without invoking Einstein's constancy of light postulate. The path preferred in this paper restates a simple, established approach.
 
*[http://www.mathpages.com/rr/rrtoc.htm Reflections on Relativity] A complete online book on relativity with an extensive bibliography.
 
*[http://www.phys.vt.edu/~takeuchi/relativity/notes Special Relativity Lecture Notes] is a standard introduction to special relativity containing illustrative explanations based on drawings and spacetime diagrams from Virginia Polytechnic Institute and State University.
 
*[http://www.mathpreprints.com/math/Preprint/paultrr/20040119/1/Evaluation_of_Brane_World_Mach_Principles.pdf Brane World Mach Principles and the Michelson-Morley experiment]
 
*[http://fr.wikipedia.org/wiki/Relativit%C3%A9_restreinte#Petites_exp.C3.A9riences_de_pens.C3.A9e Petites expériences de pensée] : five interesting ''thought experiments'' about special relativity quoted in the [[French Language Wikipedia]].
 
*[http://spoirier.lautre.net/en/relativity.htm Special relativity theory made intuitive] : a new approach to explain the theoretical meaning of Special Relativity from an intuitive geometrical viewpoint
 
*[http://www2.slac.stanford.edu/vvc/theory/relativity.html Special Relativity] Stanford University, Helen Quinn, 2003
 
*{{gutenberg|no=5001|name=Relativity: the Special and General Theory}}, by [[Albert Einstein]]
 
*[http://www.motionmountain.net/C-2-CLSC.pdf Special Relativity] This is chapter two of Christoph Schiller's 1000 page walk through the whole of physics, from classical mechanics to relativity, electrodynamics, thermodynamics, quantum theory, nuclear physics and unification. 61 pages.
 
*[http://www.guardian.co.uk/life/science/story/0,12996,1456747,00.html "Why Einstein may have got it wrong"] by David Adam, ''The Guardian'', April 11, 2005.
 
*[http://www.anu.edu.au/Physics/Savage/TEE/ Through Einstein's Eyes] The Australian National University. Relativistic visual effects explained with movies and images.
 
*[http://gregegan.customer.netspace.net.au/FOUNDATIONS/01/found01.html Greg Egan's ''Foundations''].
 
*[http://3quarksdaily.blogs.com/3quarksdaily/2005/06/monday_musing_s.html Short Essay Explaining Special Relativity] by S. Abbas Raza of [http://3quarksdaily.com ''3 Quarks Daily'']
 
*[http://www.phys.unsw.edu.au/einsteinlight Einstein Light tutorial]
 
*[http://www.cell-action.com/einstein/index.html Enlightening Ideas] a humoristic animation about the special relativity for the general public, Yannick Mahé, 2005
 
  
==References==
+
===Special relativity explained (using simple or more advanced math)===
===Textbooks===
 
* Tipler, Paul; Llewellyn, Ralph (2002). ''Modern Physics'' (4th ed.). W. H. Freeman Company. ISBN 0716743450
 
* Schutz, Bernard F. ''A First Course in General Relativity'', Cambridge University Press. ISBN 0521277035
 
* Taylor, Edwin, and [[John Archibald Wheeler|Wheeler, John]] (1992). ''Spacetime physics'' (2nd ed.). W.H. Freeman and Company. ISBN 0716723271
 
* Einstein, Albert (1996).  ''The Meaning of Relativity''. Fine Communications. ISBN 1567311369
 
 
 
===Journal articles===
 
*[http://www.fourmilab.ch/etexts/einstein/specrel/www/ On the Electrodynamics of Moving Bodies], A. Einstein, Annalen der Physik, 17:891, June 30, 1905 (in English translation)
 
* Wolf, Peter and Gerard, Petit. "Satellite test of Special Relativity using the Global Positioning System," ''Physics Review A'' 56 (6), 4405-4409 (1997).
 
* Will, Clifford M. "Clock synchronization and isotropy of the one-way speed of light," ''Physics Review D'' 45, 403-411 (1992).
 
* Alvager et al., "Test of the Second Postulate of Special Relativity in the GeV region," ''Physics Letters'' 12, 260 (1964).
 
 
 
{{Physics-footer}}
 
{{Natural sciences-footer}}
 
  
 +
* [http://www.bartleby.com/173/ Albert Einstein.  Relativity: The Special and General Theory. New York: Henry Holt 1920. BARTLEBY.COM, 2000]
 +
* [http://www.math.ucr.edu/home/baez/physics/index.html Usenet Physics FAQ]
 +
* [http://nedwww.ipac.caltech.edu/level5/March01/Carroll3/Carroll_contents.html Sean Carroll's online ''Lecture Notes on General Relativity'']
 +
* [http://hyperphysics.phy-astr.gsu.edu/hbase/relativ/tdil.html#c2 Hyperphysics Time Dilation]
 +
* [http://hyperphysics.phy-astr.gsu.edu/hbase/relativ/tdil.html#c1 Hyperphysics Length Contraction]
 +
* [http://gregegan.customer.netspace.net.au/FOUNDATIONS/01/found01.html Greg Egan's ''Foundations'']
 +
* [http://www.adamauton.com/warp/ Special Relativity Simulation]
 +
* [http://www.black-holes.org/relativity1.html Caltech Relativity Tutorial] A basic introduction to concepts of Special and General Relativity, requiring only a knowledge of basic geometry.
 +
* [http://www.phys.unsw.edu.au/einsteinlight/ Special Relativity in film clips and animations] from the University of New South Wales.
 +
* [http://www.relativitycalculator.com/ Relativity Calculator - Learn Special Relativity Mathematics]  Mathematics of special relativity presented in as simple and comprehensive manner possible within philosophical and historical contexts.
 +
* [http://insti.physics.sunysb.edu/~siegel/sr.html Special relativity made stupid ☺].
  
 
[[Category:Physical sciences]]
 
[[Category:Physical sciences]]
 
[[Category:Physics]]
 
[[Category:Physics]]
[[Category:Special relativity|*]]
 
<!-- WIKIPEIDIA POLICY NOTE:  This is the main article for Category:Special_relativity.  Additional categorizations should be done for the category, not this article. —>
 
 
{{Link FA|ar}}
 
{{Link FA|sk}}
 
 
[[ar:نظرية النسبية الخاصة]]
 
[[cs:Speciální teorie relativity]]
 
[[da:Speciel relativitetsteori]]
 
[[de:Spezielle Relativitätstheorie]]
 
[[el:Ειδική Θεωρία Σχετικότητας]]
 
[[es:Teoría de la Relatividad Especial]]
 
[[fr:Relativité restreinte]]
 
[[ko:특수 상대성 이론]]
 
[[hr:Posebna teorija relativnosti]]
 
[[ia:Relativitate special]]
 
[[it:Relatività ristretta]]
 
[[he:תורת היחסות הפרטית]]
 
[[la:Relativitas specialis]]
 
[[lt:Specialioji reliatyvumo teorija]]
 
[[hu:Speciális relativitáselmélet]]
 
[[nl:Speciale relativiteitstheorie]]
 
[[ja:特殊相対性理論]]
 
[[pl:Szczególna teoria względności]]
 
[[pt:Relatividade restrita]]
 
[[ru:Специальная теория относительности]]
 
[[sk:Špeciálna teória relativity]]
 
[[sl:Posebna teorija relativnosti]]
 
[[fi:Erityinen suhteellisuusteoria]]
 
[[sv:Speciella relativitetsteorin]]
 
[[tl:Espesyal na relativity]]
 
[[tt:Maxsus çağıştırmalılıq teorise]]
 
[[vi:Lý thuyết tương đối hẹp]]
 
[[uk:Спеціальна теорія відносності]]
 
[[zh:狭义相对论]]
 
 
  
{{Credit|25307680}}
+
{{credits|Introduction_to_special_relativity|177778946}}

Revision as of 23:47, 16 December 2007

This article is intended as a generally accessible introduction to the subject.
Albert Einstein during a lecture in Vienna in 1921.

Special relativity is a fundamental physics theory about space and time that was developed by Albert Einstein in 1905[1] as a modification of Newtonian physics. It was created to deal with some pressing theoretical and experimental issues in the physics of the time involving light and electrodynamics (see History of special relativity for a detailed account). The predictions of special relativity correspond closely to those of Newtonian physics at speeds which are low in comparison to that of light, but diverge rapidly for speeds which are a significant fraction of the speed of light. Special relativity has been experimentally tested on numerous occasions since its inception, and its predictions have been verified by those tests.

Einstein postulated that the speed of light is the same for all observers, irrespective of their motion relative to the light source. This was in total contradiction to classical mechanics, which had been accepted for centuries. Einstein's approach was based on thought experiments and calculations. In 1908, Hermann Minkowski reformulated the theory based on different postulates of a more geometrical nature [2]. His approach depended on the existence of certain inter-relations between space and time, which were considered completely separate in classical physics. This reformulation set the stage for further developments of physics.

Special relativity makes numerous predictions which are incompatible with Newtonian physics (and everyday intuition). The first such prediction described by Einstein is called the relativity of simultaneity, under which observers who are in motion with respect to each other may disagree on whether two events occurred at the same time or one occurred before the other. The other major predictions of special relativity are time dilation (under which a moving clock ticks more slowly than when it is at rest with respect the observer), length contraction (under which a moving rod may be found to be shorter than when it is at rest with respect to the observer), and the equivalence of mass and energy (written as E=mc2). Special relativity predicts a non-linear velocity addition formula which prevents speeds greater than that of light from being observed. Special relativity also explains why Maxwell's equations of electromagnetism are correct in any frame of reference, and how electric field and magnetic field are two aspects of the same thing.

Special relativity has been tested in many ways, and so far has been proved to be far more accurate than Newtonian mechanics. The most famous of these tests is the Michelson-Morley experiment, the failure of which was one of the things which motivated the creation of special relativity. Other significant tests are the Fizeau experiment (which was first done decades before special relativity was proposed), the detection of the transverse Doppler effect, and the Haefele-Keating experiment. Today, scientists are so comfortable with the idea that the speed of light is always the same that the meter is now defined as being the distance traveled by light in 1/299,792,458th of a second. This means that the speed of light is now defined as being 299,792,458 m/s.

Reference frames and Galilean relativity: a classical prelude

A reference frame is simply a selection of what constitutes stationary objects. Once the velocity of a certain object is arbitrarily defined to be zero, the velocity of everything else in the universe can be measured relative to it[3]. When a train is moving at a constant velocity past a platform, one may either say that the platform is at rest and the train is moving or that the train is at rest and the platform is moving past it. These two descriptions correspond to two different reference frames. They are respectively called the rest frame of the platform and the rest frame of the train (sometimes simply the platform frame and the train frame).

The question naturally arises, can different reference frames be physically differentiated? In other words, can we conduct some experiments to claim that "we are now in an absolutely stationary reference frame?" Aristotle thought that all objects tend to cease moving and become at rest if there were no forces acting on them. Galileo challenged this idea and argued that the concept of absolute motion was unreal. All motion was relative. An observer who couldn't refer to some isolated object (if, say, he was imprisoned inside a closed spaceship) could never distinguish whether according to some external observer he was at rest or moving with constant velocity. Any experiment he could conduct would give the same result in both cases. However, accelerated reference frames are experimentally distinguishable. For example, if an astronaut moving in free space saw that the tea in his tea-cup was slanted rather than horizontal, he would be able to infer that his spaceship was accelerated. Thus not all reference frames are equivalent, but we have a class of reference frames, all moving at uniform velocity with respect to each other, in all of which Newton's first law holds. These are called the inertial reference frames and are fundamental to both classical mechanics and SR. Galilean relativity thus states that the laws of physics can not depend on absolute velocity, they must stay the same in any inertial reference frame. Galilean relativity is thus a fundamental principle in classical physics.

Mathematically, it says that if we transform all velocities to a different reference frame, the laws of physics must be unchanged. What is this transformation that must be applied to the velocities? Galileo gave the common-sense 'formula' for adding velocities: if

  1. particle P is moving at velocity v with respect to reference frame A and
  2. reference frame A is moving at velocity u with respect to reference frame B, then
  3. the velocity of P with respect to B is given by v + u.

The formula for transforming coordinates between different reference frames is called the Galilean transformation. The principle of Galilean relativity then demands that laws of physics be unchanged if the Galilean transformation is applied to them. Laws of classical mechanics, like Newton's second law, obey this principle because they have the same form after applying the transformation. As Newton's law involves the derivative of velocity, any constant velocity added in a Galilean transformation to a different reference frame contributes nothing (the derivative of a constant is zero). Addition of a time-varying velocity (corresponding to an accelerated reference frame) will however change the formula (see pseudo force), since Galilean relativity only applies to non-accelerated inertial reference frames.

Time is the same in all reference frames because it is absolute in classical mechanics. All observers measure exactly the same intervals of time and there is such a thing as an absolutely correct clock.

Invariance of length: the Euclidean picture

File:Rel1.GIF
Pythagoras theorem
File:Rel3.GIF
The length of an object is constant on the plane during rotations on the plane but not during rotations out of the plane

In special relativity, space and time are joined into a unified four-dimensional continuum called spacetime. To gain a sense of what spacetime is like, we must first look at the Euclidean space of Newtonian physics.

This approach to the theory of special relativity begins with the concept of "length." In everyday experience, it seems that the length of objects remains the same no matter how they are rotated or moved from place to place; as a result the simple length of an object doesn't appear to change or is "invariant." However, as is shown in the illustrations below, what is actually being suggested is that length seems to be invariant in a three-dimensional coordinate system.

The length of a line in a two-dimensional Cartesian coordinate system is given by Pythagoras' theorem:

One of the basic theorems of vector algebra is that the length of a vector does not change when it is rotated. However, a closer inspection tells us that this is only true if we consider rotations confined to the plane. If we introduce rotation in the third dimension, then we can tilt the line out of the plane. In this case the projection of the line on the plane will get shorter. Does this mean length is not invariant? Obviously not. The world is three-dimensional and in a 3D Cartesian coordinate system the length is given by the three-dimensional version of Pythagoras's theorem:

File:Rel4.GIF
Invariance in a 3D coordinate system: Pythagoras theorem gives but therefore . The length of an object is constant whether it is rotated or moved from one place to another in a 3D coordinate system

This is invariant under all rotations. The apparent violation of invariance of length only happened because we were 'missing' a dimension. It seems that, provided all the directions in which an object can be tilted or arranged are represented within a coordinate system, the length of an object does not change under rotations. A 3-dimensional coordinate system is enough in classical mechanics because time is assumed absolute and independent of space in that context. It can be considered separately.

Note that invariance of length is not ordinarily considered a dynamic principle, not even a theorem. It is simply a statement about the fundamental nature of space itself. Space as we ordinarily conceive it is called a three-dimensional Euclidean space, because its geometrical structure is described by the principles of Euclidean geometry. The formula for distance between two points is a fundamental property of an Euclidean space, it is called the Euclidean metric tensor (or simply the Euclidean metric). In general, distance formulas are called metric tensors.

Note that rotations are fundamentally related to the concept of length. In fact, one may define length or distance to be that which stays the same (is invariant) under rotations, or define rotations to be that which keep the length invariant. Given any one, it is possible to find the other. If we know the distance formula, we can find out the formula for transforming coordinates in a rotation. If, on the other hand, we have the formula for rotations then we can find out the distance formula.

The Minkowski formulation: introduction of spacetime

Main article: Spacetime
Hermann Minkowski

After Einstein derived special relativity formally from the counterintuitive proposition that the speed of light is the same to all observers, the need was felt for a more satisfactory formulation.[citation needed] Minkowski, building on mathematical approaches used in non-euclidean geometry[4] and the mathematical work of Lorentz and Poincaré, realised that a geometric approach was the key. Minkowski showed in 1908 that Einstein's new theory could be explained in a natural way if the concept of separate space and time is replaced with one four-dimensional continuum called spacetime. This was a groundbreaking concept, and Roger Penrose has said that relativity was not truly complete until Minkowski reformulated Einstein's work.

The concept of a four-dimensional space is hard to visualise. It may help at the beginning to think simply in terms of coordinates. In three-dimensional space, one needs three real numbers to refer to a point. In the Minkowski space, one needs four real numbers (three space coordinates and one time coordinate) to refer to a point at a particular instant of time. This point at a particular instant of time, specified by the four coordinates, is called an event. The distance between two different events is called the spacetime interval.

A path through the four-dimensional spacetime, usually called Minkowski space, is called a world line. Since it specifies both position and time, a particle having a known world line has a completely determined trajectory and velocity. This is just like graphing the displacement of a particle moving in a straight line against the time elapsed. The curve contains the complete motional information of the particle.

File:Rel5.GIF
The spacetime interval

In the same way as the measurement of distance in 3D space needed all three coordinates we must include time as well as the three space coordinates when calculating the distance in Minkowski space (henceforth called M). In a sense, the spacetime interval provides a combined estimate of how far two events occur in space as well as the time that elapses between their occurrence.

But there is a problem. Time is related to the space coordinates, but they are not equivalent. Pythagoras's theorem treats all coordinates on an equal footing (see Euclidean space for more details). We can exchange two space coordinates without changing the length, but we can not simply exchange a space coordinate with time, they are fundamentally different. It is an entirely different thing for two events to be separated in space and to be separated in time. Minkowski proposed that the formula for distance needed a change. He found that the correct formula was actually quite simple, differing only by a sign from Pythagoras's theorem:

where c is a constant and t is the time coordinate [5]. Multiplication by c, which has the dimension , converts the time to units of length and this constant has the same value as the speed of light. So the spacetime interval between two distinct events is given by

There are two major points to be noted. Firstly, time is being measured in the same units as length by multiplying it by a constant conversion factor. Secondly, and more importantly, the time-coordinate has a different sign than the space coordinates. This means that in the four-dimensional spacetime, one coordinate is different from the others and influences the distance differently. This new 'distance' may be zero or even negative. This new distance formula, called the metric of the spacetime, is at the heart of relativity. This distance formula is called the metric tensor of M. This minus sign means that a lot of our intuition about distances can not be directly carried over into spacetime intervals. For example, the spacetime interval between two events separated both in time and space may be zero (see below). From now on, the terms distance formula and metric tensor will be used interchangeably, as will be the terms Minkowski metric and spacetime interval.

In Minkowski spacetime the spacetime interval is the invariant length, the ordinary 3D length is not required to be invariant. The spacetime interval must stay the same under rotations, but ordinary lengths can change. Just like before, we were missing a dimension. Note that everything this far are merely definitions. We define a four-dimensional mathematical construct which has a special formula for distance, where distance means that which stays the same under rotations (alternatively, one may define a rotation to be that which keeps the distance unchanged).

Now comes the physical part. Rotations in Minkowski space have a different interpretation than ordinary rotations. These rotations correspond to transformations of reference frames. Passing from one reference frame to another corresponds to rotating the Minkowski space. An intuitive justification for this is given below, but mathematically this is a dynamical postulate just like assuming that physical laws must stay the same under Galilean transformations (which seems so intuitive that we don't usually recognise it to be a postulate).

Since by definition rotations must keep the distance same, passing to a different reference frame must keep the spacetime interval between two events unchanged. This requirement can be used to derive an explicit mathematical form for the transformation that must be applied to the laws of physics (compare with the application of Galilean transformations to classical laws) when shifting reference frames. These transformations are called the Lorentz transformations. Just like the Galilean transformations are the mathematical statement of the principle of Galilean relativity in classical mechanics, the Lorentz transformations are the mathematical form of Einstein's principle of relativity. Laws of physics must stay the same under Lorentz transformations. Maxwell's equations and Dirac's equation satisfy this property, and hence they are relativistically correct laws (but classically incorrect, since they don't transform correctly under Galilean transformations).

With the statement of the Minkowski metric, the common name for the distance formula given above, the theoretical foundation of special relativity is complete. The entire basis for special relativity can be summed up by the geometric statement "changes of reference frame correspond to rotations in the 4D Minkowski spacetime, which is defined to have the distance formula given above." The unique dynamical predictions of SR stem from this geometrical property of spacetime. Special relativity may be said to be the physics of Minkowski spacetime[6]. In this case of spacetime, there are six independent rotations to be considered. Three of them are the standard rotations on a plane in two directions of space. The other three are rotations in a plane of both space and time: These rotations correspond to a change of velocity, and are described by the traditional Lorentz transformations.

As has been mentioned before, one can replace distance formulas with rotation formulas. Instead of starting with the invariance of the Minkowski metric as the fundamental property of spacetime, one may state (as was done in classical physics with Galilean relativity) the mathematical form of the Lorentz transformations and require that physical laws be invariant under these transformations. This makes no reference to the geometry of spacetime, but will produce the same result. This was in fact the traditional approach to SR, used originally by Einstein himself. However, this approach is often considered to offer less insight and be more cumbersome than the more natural Minkowski formalism.

Reference frames and Lorentz transformations: relativity revisited

We have already discussed that in classical mechanics coordinate frame changes correspond to Galilean transfomations of the coordinates. Is this adequate in the relativistic Minkowski picture?

Suppose there are two people, Bill and John, on separate planets that are moving away from each other. Bill and John are on separate planets so they both think that they are stationary. John draws a graph of Bill's motion through space and time and this is shown in the illustration below:

File:Rel6.GIF
John's view of Bill and Bill's view of himself

John sees that Bill is moving through space as well as time but Bill thinks he is moving through time alone. Bill would draw the same conclusion about John's motion. In fact, these two views, which would be classically considered a difference in reference frames, are related simply by a coordinate transformation in M. Bill's view of his own world line and John's view of Bill's world line are related to each other simply by a rotation of coordinates. One can be transformed into the other by a rotation of the time axis. Minkowski geometry handles transformations of reference frames in a very natural way.

Changes in reference frame, represented by velocity transformations in classical mechanics, are represented by rotations in Minkowski space. These rotations are called Lorentz transformations. They are different from the Galilean transformations because of the unique form of the Minkowski metric. The Lorentz transformations are the relativistic equivalent of Galilean transformations. Laws of physics, in order to be relativistically correct, must stay the same under Lorentz transformations. The physical statement that they must be same in all inertial reference frames remains unchanged, but the mathematical transformation between different reference frames changes. Newton's laws of motion are invariant under Galilean rather than Lorentz transformations, so they are immediately recognisable as non-relativistic laws and must be discarded in relativistic physics. Schrödinger's equation is also non-relativistic.

Maxwell's equations are trickier. They are written using vectors and at first glance appear to transform correctly under Galilean transformations. But on closer inspection, several questions are apparent that can not be satisfactorily resolved within classical mechanics (see History of special relativity). They are indeed invariant under Lorentz transformations and are relativistic, even though they were formulated before the discovery of special relativity. Classical electrodynamics can be said to be the first relativistic theory in physics. To make the relativistic character of equations apparent, they are written using 4-component vector like quantities called 4-vectors. 4-Vectors transform correctly under Lorentz transformations. Equations written using 4-vectors are automatically relativistic. This is called the manifestly covariant form of equations. 4-Vectors form a very important part of the formalism of special relativity.

Einstein's postulate: the constancy of the speed of light

Einstein's postulate that the speed of light is a constant comes out as a natural consequence of the Minkowski formulation[7].

Proposition 1:

When an object is travelling at c in a certain reference frame, the spacetime interval is zero.

Proof:

The spacetime interval between the origin-event (0,0,0,0) and an event (x, y,z, t) is
The distance travelled by an object moving at velocity v for t seconds is:
giving
Since the velocity v equals c we have
Hence the spacetime interval between the events of departure and arrival is given by

Proposition 2:

An object travelling at c in one reference frame is travelling at c in all reference frames.

Proof:

Let the object move with velocity v when observed from a different reference frame. A change in reference frame corresponds to a rotation in M. Since the spacetime interval must be conserved under rotation, the spacetime interval must be the same in all reference frames. In proposition 1 we showed it to be zero in one reference frame, hence it must be zero in all other reference frames. We get that
which implies

The paths of light rays have a zero spacetime interval, and hence all observers will obtain the same value for the speed of light. Therefore, when assuming that the universe has four dimensions that are related by Minkowski's formula, the speed of light appears as a constant, and does not need to be assumed (postulated) to be constant as in Einstein's original approach to special relativity.

Clock delays and rod contractions: more on Lorentz transformations

Another consequence of the invariance of the spacetime interval is that clocks will appear to go slower on objects that are moving relative to you. This is very similar to how the 2D projection of a line rotated into the third-dimension appears to get shorter. Length is not conserved simply because we are ignoring one of the dimensions. Let us return to the example of John and Bill.

John observes the length of Bill's spacetime interval as:

whereas Bill doesn't think he has traveled in space, so writes:

The spacetime interval, s2, is invariant. It has the same value for all observers, no matter who measures it or how they are moving in a straight line. This means that Bill's spacetime interval equals John's observation of Bill's spacetime interval so:

and

hence

.

So, if John sees a clock that is at rest in Bill's frame record one second, John will find that his own clock measures between these same ticks an interval t, called coordinate time, which is greater than one second. It is said that clocks in motion slow down, relative to those on observers at rest. This is known as "relativistic time dilation of a moving clock." The time that is measured in the rest frame of the clock (in Bill's frame) is called the proper time of the clock.

In special relativity, therefore, changes in reference frame affect time also. Time is no longer absolute. There is no universally correct clock, time runs at different rates for different observers.

Similarly it can be shown that John will also observe measuring rods at rest on Bill's planet to be shorter in the direction of motion than his own measuring rods[8]. This is a prediction known as "relativistic length contraction of a moving rod." If the length of a rod at rest on Bill's planet is , then we call this quantity the proper length of the rod. The length of that same rod as measured on John's planet, is called coordinate length, and given by

.
How Bill's coordinates appear to John at the instant they pass each other

These two equations can be combined to obtain the general form of the Lorentz transformation in one spatial dimension:

or equivalently:

where the Lorentz factor is given by

The above formulas for clock delays and length contractions are special cases of the general transformation.

Alternatively, these equations for time dilation and length contraction (here obtained from the invariance of the spacetime interval), can be obtained directly from the Lorentz transformation by setting X = 0 for time dilation, meaning that the clock is at rest in Bill's frame, or by setting t = 0 for length contraction, meaning that John must measure the distances to the end points of the moving rod at the same time.

A consequence of the Lorentz transformations is the modified velocity-addition formula:

Simultaneity and clock desynchronisation