Lorentz transformation

Author: www.NiNa.Az
Feb 08, 2025 / 19:45

In physics the Lorentz transformations are a six parameter family of linear transformations from a coordinate frame in s

Lorentz transformation
Lorentz transformation
Lorentz transformation

In physics, the Lorentz transformations are a six-parameter family of linear transformations from a coordinate frame in spacetime to another frame that moves at a constant velocity relative to the former. The respective inverse transformation is then parameterized by the negative of this velocity. The transformations are named after the Dutch physicist Hendrik Lorentz.

The most common form of the transformation, parametrized by the real constant representing a velocity confined to the x-direction, is expressed as where (t, x, y, z) and (t′, x′, y′, z′) are the coordinates of an event in two frames with the spatial origins coinciding at t=t=0, where the primed frame is seen from the unprimed frame as moving with speed v along the x-axis, where c is the speed of light, and is the Lorentz factor. When speed v is much smaller than c, the Lorentz factor is negligibly different from 1, but as v approaches c, grows without bound. The value of v must be smaller than c for the transformation to make sense.

Expressing the speed as a fraction of the speed of light, an equivalent form of the transformation is

Frames of reference can be divided into two groups: inertial (relative motion with constant velocity) and non-inertial (accelerating, moving in curved paths, rotational motion with constant angular velocity, etc.). The term "Lorentz transformations" only refers to transformations between inertial frames, usually in the context of special relativity.

In each reference frame, an observer can use a local coordinate system (usually Cartesian coordinates in this context) to measure lengths, and a clock to measure time intervals. An event is something that happens at a point in space at an instant of time, or more formally a point in spacetime. The transformations connect the space and time coordinates of an event as measured by an observer in each frame.

They supersede the Galilean transformation of Newtonian physics, which assumes an absolute space and time (see Galilean relativity). The Galilean transformation is a good approximation only at relative speeds much less than the speed of light. Lorentz transformations have a number of unintuitive features that do not appear in Galilean transformations. For example, they reflect the fact that observers moving at different velocities may measure different distances, elapsed times, and even different orderings of events, but always such that the speed of light is the same in all inertial reference frames. The invariance of light speed is one of the postulates of special relativity.

Historically, the transformations were the result of attempts by Lorentz and others to explain how the speed of light was observed to be independent of the reference frame, and to understand the symmetries of the laws of electromagnetism. The transformations later became a cornerstone for special relativity.

The Lorentz transformation is a linear transformation. It may include a rotation of space; a rotation-free Lorentz transformation is called a Lorentz boost. In Minkowski space—the mathematical model of spacetime in special relativity—the Lorentz transformations preserve the spacetime interval between any two events. They describe only the transformations in which the spacetime event at the origin is left fixed. They can be considered as a hyperbolic rotation of Minkowski space. The more general set of transformations that also includes translations is known as the Poincaré group.

History

Many physicists—including Woldemar Voigt, George FitzGerald, Joseph Larmor, and Hendrik Lorentz himself—had been discussing the physics implied by these equations since 1887. Early in 1889, Oliver Heaviside had shown from Maxwell's equations that the electric field surrounding a spherical distribution of charge should cease to have spherical symmetry once the charge is in motion relative to the luminiferous aether. FitzGerald then conjectured that Heaviside's distortion result might be applied to a theory of intermolecular forces. Some months later, FitzGerald published the conjecture that bodies in motion are being contracted, in order to explain the baffling outcome of the 1887 aether-wind experiment of Michelson and Morley. In 1892, Lorentz independently presented the same idea in a more detailed manner, which was subsequently called FitzGerald–Lorentz contraction hypothesis. Their explanation was widely known before 1905.

Lorentz (1892–1904) and Larmor (1897–1900), who believed the luminiferous aether hypothesis, also looked for the transformation under which Maxwell's equations are invariant when transformed from the aether to a moving frame. They extended the FitzGerald–Lorentz contraction hypothesis and found out that the time coordinate has to be modified as well ("local time"). Henri Poincaré gave a physical interpretation to local time (to first order in v/c, the relative velocity of the two reference frames normalized to the speed of light) as the consequence of clock synchronization, under the assumption that the speed of light is constant in moving frames. Larmor is credited to have been the first to understand the crucial time dilation property inherent in his equations.

In 1905, Poincaré was the first to recognize that the transformation has the properties of a mathematical group, and he named it after Lorentz. Later in the same year Albert Einstein published what is now called special relativity, by deriving the Lorentz transformation under the assumptions of the principle of relativity and the constancy of the speed of light in any inertial reference frame, and by abandoning the mechanistic aether as unnecessary.

Derivation of the group of Lorentz transformations

An event is something that happens at a certain point in spacetime, or more generally, the point in spacetime itself. In any inertial frame an event is specified by a time coordinate ct and a set of Cartesian coordinates x, y, z to specify position in space in that frame. Subscripts label individual events.

From Einstein's second postulate of relativity (invariance of c) it follows that:

in all inertial frames for events connected by light signals. The quantity on the left is called the spacetime interval between events a1 = (t1, x1, y1, z1) and a2 = (t2, x2, y2, z2). The interval between any two events, not necessarily separated by light signals, is in fact invariant, i.e., independent of the state of relative motion of observers in different inertial frames, as is shown using homogeneity and isotropy of space. The transformation sought after thus must possess the property that:

where (t, x, y, z) are the spacetime coordinates used to define events in one frame, and (t′, x′, y′, z′) are the coordinates in another frame. First one observes that (D2) is satisfied if an arbitrary 4-tuple b of numbers are added to events a1 and a2. Such transformations are called spacetime translations and are not dealt with further here. Then one observes that a linear solution preserving the origin of the simpler problem solves the general problem too:

(a solution satisfying the first formula automatically satisfies the second one as well; see polarization identity). Finding the solution to the simpler problem is just a matter of look-up in the theory of classical groups that preserve bilinear forms of various signature. First equation in (D3) can be written more compactly as:

where (·, ·) refers to the bilinear form of signature (1, 3) on R4 exposed by the right hand side formula in (D3). The alternative notation defined on the right is referred to as the relativistic dot product. Spacetime mathematically viewed as R4 endowed with this bilinear form is known as Minkowski space M. The Lorentz transformation is thus an element of the group O(1, 3), the Lorentz group or, for those that prefer the other metric signature, O(3, 1) (also called the Lorentz group). One has:

which is precisely preservation of the bilinear form (D3) which implies (by linearity of Λ and bilinearity of the form) that (D2) is satisfied. The elements of the Lorentz group are rotations and boosts and mixes thereof. If the spacetime translations are included, then one obtains the inhomogeneous Lorentz group or the Poincaré group.

Generalities

The relations between the primed and unprimed spacetime coordinates are the Lorentz transformations, each coordinate in one frame is a linear function of all the coordinates in the other frame, and the inverse functions are the inverse transformation. Depending on how the frames move relative to each other, and how they are oriented in space relative to each other, other parameters that describe direction, speed, and orientation enter the transformation equations.

Transformations describing relative motion with constant (uniform) velocity and without rotation of the space coordinate axes are called Lorentz boosts or simply boosts, and the relative velocity between the frames is the parameter of the transformation. The other basic type of Lorentz transformation is rotation in the spatial coordinates only, these like boosts are inertial transformations since there is no relative motion, the frames are simply tilted (and not continuously rotating), and in this case quantities defining the rotation are the parameters of the transformation (e.g., axis–angle representation, or Euler angles, etc.). A combination of a rotation and boost is a homogeneous transformation, which transforms the origin back to the origin.

The full Lorentz group O(3, 1) also contains special transformations that are neither rotations nor boosts, but rather reflections in a plane through the origin. Two of these can be singled out; spatial inversion in which the spatial coordinates of all events are reversed in sign and temporal inversion in which the time coordinate for each event gets its sign reversed.

Boosts should not be conflated with mere displacements in spacetime; in this case, the coordinate systems are simply shifted and there is no relative motion. However, these also count as symmetries forced by special relativity since they leave the spacetime interval invariant. A combination of a rotation with a boost, followed by a shift in spacetime, is an inhomogeneous Lorentz transformation, an element of the Poincaré group, which is also called the inhomogeneous Lorentz group.

Physical formulation of Lorentz boosts

Coordinate transformation

image
The spacetime coordinates of an event, as measured by each observer in their inertial reference frame (in standard configuration) are shown in the speech bubbles.
Top: frame F moves at velocity v along the x-axis of frame F.
Bottom: frame F moves at velocity −v along the x-axis of frame F.

A "stationary" observer in frame F defines events with coordinates t, x, y, z. Another frame F moves with velocity v relative to F, and an observer in this "moving" frame F defines events using the coordinates t′, x′, y′, z.

The coordinate axes in each frame are parallel (the x and x axes are parallel, the y and y axes are parallel, and the z and z axes are parallel), remain mutually perpendicular, and relative motion is along the coincident xx′ axes. At t = t′ = 0, the origins of both coordinate systems are the same, (x, y, z) = (x′, y′, z′) = (0, 0, 0). In other words, the times and positions are coincident at this event. If all these hold, then the coordinate systems are said to be in standard configuration, or synchronized.

If an observer in F records an event t, x, y, z, then an observer in F records the same event with coordinates

Lorentz boost (x direction)

image

where v is the relative velocity between frames in the x-direction, c is the speed of light, and image (lowercase gamma) is the Lorentz factor.

Here, v is the parameter of the transformation, for a given boost it is a constant number, but can take a continuous range of values. In the setup used here, positive relative velocity v > 0 is motion along the positive directions of the xx axes, zero relative velocity v = 0 is no relative motion, while negative relative velocity v < 0 is relative motion along the negative directions of the xx axes. The magnitude of relative velocity v cannot equal or exceed c, so only subluminal speeds c < v < c are allowed. The corresponding range of γ is 1 ≤ γ < ∞.

The transformations are not defined if v is outside these limits. At the speed of light (v = c) γ is infinite, and faster than light (v > c) γ is a complex number, each of which make the transformations unphysical. The space and time coordinates are measurable quantities and numerically must be real numbers.

As an active transformation, an observer in F′ notices the coordinates of the event to be "boosted" in the negative directions of the xx axes, because of the v in the transformations. This has the equivalent effect of the coordinate system F′ boosted in the positive directions of the xx axes, while the event does not change and is simply represented in another coordinate system, a passive transformation.

The inverse relations (t, x, y, z in terms of t′, x′, y′, z) can be found by algebraically solving the original set of equations. A more efficient way is to use physical principles. Here F is the "stationary" frame while F is the "moving" frame. According to the principle of relativity, there is no privileged frame of reference, so the transformations from F to F must take exactly the same form as the transformations from F to F. The only difference is F moves with velocity v relative to F (i.e., the relative velocity has the same magnitude but is oppositely directed). Thus if an observer in F notes an event t′, x′, y′, z, then an observer in F notes the same event with coordinates

Inverse Lorentz boost (x direction)

image

and the value of γ remains unchanged. This "trick" of simply reversing the direction of relative velocity while preserving its magnitude, and exchanging primed and unprimed variables, always applies to finding the inverse transformation of every boost in any direction.

Sometimes it is more convenient to use β = v/c (lowercase beta) instead of v, so that image which shows much more clearly the symmetry in the transformation. From the allowed ranges of v and the definition of β, it follows −1 < β < 1. The use of β and γ is standard throughout the literature.

When the boost velocity image is in an arbitrary vector direction with the boost vector image, then the transformation from an unprimed spacetime coordinate system to a primed coordinate system is given by,[1]

image

where the Lorentz factor is image. The determinant of the transformation matrix is +1 and its trace is image. The inverse of the transformation is given by reversing the sign of image. The quantity image is invariant under the transformation.

The Lorentz transformations can also be derived in a way that resembles circular rotations in 3d space using the hyperbolic functions. For the boost in the x direction, the results are

Lorentz boost (x direction with rapidity ζ)

image

where ζ (lowercase zeta) is a parameter called rapidity (many other symbols are used, including θ, ϕ, φ, η, ψ, ξ). Given the strong resemblance to rotations of spatial coordinates in 3d space in the Cartesian xy, yz, and zx planes, a Lorentz boost can be thought of as a hyperbolic rotation of spacetime coordinates in the xt, yt, and zt Cartesian-time planes of 4d Minkowski space. The parameter ζ is the hyperbolic angle of rotation, analogous to the ordinary angle for circular rotations. This transformation can be illustrated with a Minkowski diagram.

The hyperbolic functions arise from the difference between the squares of the time and spatial coordinates in the spacetime interval, rather than a sum. The geometric significance of the hyperbolic functions can be visualized by taking x = 0 or ct = 0 in the transformations. Squaring and subtracting the results, one can derive hyperbolic curves of constant coordinate values but varying ζ, which parametrizes the curves according to the identity image

Conversely the ct and x axes can be constructed for varying coordinates but constant ζ. The definition image provides the link between a constant value of rapidity, and the slope of the ct axis in spacetime. A consequence these two hyperbolic formulae is an identity that matches the Lorentz factor image

Comparing the Lorentz transformations in terms of the relative velocity and rapidity, or using the above formulae, the connections between β, γ, and ζ are image

Taking the inverse hyperbolic tangent gives the rapidity image

Since −1 < β < 1, it follows −∞ < ζ < ∞. From the relation between ζ and β, positive rapidity ζ > 0 is motion along the positive directions of the xx axes, zero rapidity ζ = 0 is no relative motion, while negative rapidity ζ < 0 is relative motion along the negative directions of the xx axes.

The inverse transformations are obtained by exchanging primed and unprimed quantities to switch the coordinate frames, and negating rapidity ζ → −ζ since this is equivalent to negating the relative velocity. Therefore,

Inverse Lorentz boost (x direction with rapidity ζ)

image

The inverse transformations can be similarly visualized by considering the cases when x′ = 0 and ct′ = 0.

So far the Lorentz transformations have been applied to one event. If there are two events, there is a spatial separation and time interval between them. It follows from the linearity of the Lorentz transformations that two values of space and time coordinates can be chosen, the Lorentz transformations can be applied to each, then subtracted to get the Lorentz transformations of the differences;

image with inverse relations image

where Δ (uppercase delta) indicates a difference of quantities; e.g., Δx = x2x1 for two values of x coordinates, and so on.

These transformations on differences rather than spatial points or instants of time are useful for a number of reasons:

  • in calculations and experiments, it is lengths between two points or time intervals that are measured or of interest (e.g., the length of a moving vehicle, or time duration it takes to travel from one place to another),
  • the transformations of velocity can be readily derived by making the difference infinitesimally small and dividing the equations, and the process repeated for the transformation of acceleration,
  • if the coordinate systems are never coincident (i.e., not in standard configuration), and if both observers can agree on an event t0, x0, y0, z0 in F and t0′, x0′, y0′, z0 in F, then they can use that event as the origin, and the spacetime coordinate differences are the differences between their coordinates and this origin, e.g., Δx = xx0, Δx′ = x′ − x0, etc.

Physical implications

A critical requirement of the Lorentz transformations is the invariance of the speed of light, a fact used in their derivation, and contained in the transformations themselves. If in F the equation for a pulse of light along the x direction is x = ct, then in F the Lorentz transformations give x′ = ct, and vice versa, for any c < v < c.

For relative speeds much less than the speed of light, the Lorentz transformations reduce to the Galilean transformation:image in accordance with the correspondence principle. It is sometimes said that nonrelativistic physics is a physics of "instantaneous action at a distance".


Three counterintuitive, but correct, predictions of the transformations are:

Relativity of simultaneity
Suppose two events occur along the x axis simultaneously (Δt = 0) in F, but separated by a nonzero displacement Δx. Then in F, we find that image, so the events are no longer simultaneous according to a moving observer.
Time dilation
Suppose there is a clock at rest in F. If a time interval is measured at the same point in that frame, so that Δx = 0, then the transformations give this interval in F by Δt′ = γΔt. Conversely, suppose there is a clock at rest in F. If an interval is measured at the same point in that frame, so that Δx′ = 0, then the transformations give this interval in F by Δt = γΔt. Either way, each observer measures the time interval between ticks of a moving clock to be longer by a factor γ than the time interval between ticks of his own clock.
Length contraction
Suppose there is a rod at rest in F aligned along the x axis, with length Δx. In F, the rod moves with velocity -v, so its length must be measured by taking two simultaneous (Δt′ = 0) measurements at opposite ends. Under these conditions, the inverse Lorentz transform shows that Δx = γΔx. In F the two measurements are no longer simultaneous, but this does not matter because the rod is at rest in F. So each observer measures the distance between the end points of a moving rod to be shorter by a factor 1/γ than the end points of an identical rod at rest in his own frame. Length contraction affects any geometric quantity related to lengths, so from the perspective of a moving observer, areas and volumes will also appear to shrink along the direction of motion.

Vector transformations

image
An observer in frame F observes F to move with velocity v, while F observes F to move with velocity v. The coordinate axes of each frame are still parallel[according to whom?] and orthogonal. The position vector as measured in each frame is split into components parallel and perpendicular to the relative velocity vector v.
Left: Standard configuration. Right: Inverse configuration.

The use of vectors allows positions and velocities to be expressed in arbitrary directions compactly. A single boost in any direction depends on the full relative velocity vector v with a magnitude |v| = v that cannot equal or exceed c, so that 0 ≤ v < c.

Only time and the coordinates parallel to the direction of relative motion change, while those coordinates perpendicular do not. With this in mind, split the spatial position vector r as measured in F, and r as measured in F′, each into components perpendicular (⊥) and parallel ( ‖ ) to v, image then the transformations are image where · is the dot product. The Lorentz factor γ retains its definition for a boost in any direction, since it depends only on the magnitude of the relative velocity. The definition β = v/c with magnitude 0 ≤ β < 1 is also used by some authors.

Introducing a unit vector n = v/v = β/β in the direction of relative motion, the relative velocity is v = vn with magnitude v and direction n, and vector projection and rejection give respectively image

Accumulating the results gives the full transformations,

Lorentz boost (in direction n with magnitude v)

image

The projection and rejection also applies to r. For the inverse transformations, exchange r and r to switch observed coordinates, and negate the relative velocity v → −v (or simply the unit vector n → −n since the magnitude v is always positive) to obtain

Inverse Lorentz boost (in direction n with magnitude v)

image

The unit vector has the advantage of simplifying equations for a single boost, allows either v or β to be reinstated when convenient, and the rapidity parametrization is immediately obtained by replacing β and βγ. It is not convenient for multiple boosts.

The vectorial relation between relative velocity and rapidity isimage and the "rapidity vector" can be defined as image each of which serves as a useful abbreviation in some contexts. The magnitude of ζ is the absolute value of the rapidity scalar confined to 0 ≤ ζ < ∞, which agrees with the range 0 ≤ β < 1.

Transformation of velocities

image
The transformation of velocities provides the definition relativistic velocity addition , the ordering of vectors is chosen to reflect the ordering of the addition of velocities; first v (the velocity of F′ relative to F) then u (the velocity of X relative to F′) to obtain u = vu (the velocity of X relative to F).

Defining the coordinate velocities and Lorentz factor by

image

taking the differentials in the coordinates and time of the vector transformations, then dividing equations, leads to

image

The velocities u and u are the velocity of some massive object. They can also be for a third inertial frame (say F′′), in which case they must be constant. Denote either entity by X. Then X moves with velocity u relative to F, or equivalently with velocity u relative to F′, in turn F′ moves with velocity v relative to F. The inverse transformations can be obtained in a similar way, or as with position coordinates exchange u and u, and change v to v.

The transformation of velocity is useful in stellar aberration, the Fizeau experiment, and the relativistic Doppler effect.

The Lorentz transformations of acceleration can be similarly obtained by taking differentials in the velocity vectors, and dividing these by the time differential.

Transformation of other quantities

In general, given four quantities A and Z = (Zx, Zy, Zz) and their Lorentz-boosted counterparts A and Z′ = (Zx, Zy, Zz), a relation of the form image implies the quantities transform under Lorentz transformations similar to the transformation of spacetime coordinates; image

The decomposition of Z (and Z) into components perpendicular and parallel to v is exactly the same as for the position vector, as is the process of obtaining the inverse transformations (exchange (A, Z) and (A′, Z′) to switch observed quantities, and reverse the direction of relative motion by the substitution n ↦ −n).

The quantities (A, Z) collectively make up a four-vector, where A is the "timelike component", and Z the "spacelike component". Examples of A and Z are the following:

Four-vector A Z
Position four-vector Time (multiplied by c), ct Position vector, r
Four-momentum Energy (divided by c), E/c Momentum, p
Four-wave vector angular frequency (divided by c), ω/c wave vector, k
Four-spin (No name), st Spin, s
Four-current Charge density (multiplied by c), ρc Current density, j
Electromagnetic four-potential Electric potential (divided by c), φ/c Magnetic vector potential, A

For a given object (e.g., particle, fluid, field, material), if A or Z correspond to properties specific to the object like its charge density, mass density, spin, etc., its properties can be fixed in the rest frame of that object. Then the Lorentz transformations give the corresponding properties in a frame moving relative to the object with constant velocity. This breaks some notions taken for granted in non-relativistic physics. For example, the energy E of an object is a scalar in non-relativistic mechanics, but not in relativistic mechanics because energy changes under Lorentz transformations; its value is different for various inertial frames. In the rest frame of an object, it has a rest energy and zero momentum. In a boosted frame its energy is different and it appears to have a momentum. Similarly, in non-relativistic quantum mechanics the spin of a particle is a constant vector, but in relativistic quantum mechanics spin s depends on relative motion. In the rest frame of the particle, the spin pseudovector can be fixed to be its ordinary non-relativistic spin with a zero timelike quantity st, however a boosted observer will perceive a nonzero timelike component and an altered spin.

Not all quantities are invariant in the form as shown above, for example orbital angular momentum L does not have a timelike quantity, and neither does the electric field E nor the magnetic field B. The definition of angular momentum is L = r × p, and in a boosted frame the altered angular momentum is L′ = r′ × p. Applying this definition using the transformations of coordinates and momentum leads to the transformation of angular momentum. It turns out L transforms with another vector quantity N = (E/c2)rtp related to boosts, see relativistic angular momentum for details. For the case of the E and B fields, the transformations cannot be obtained as directly using vector algebra. The Lorentz force is the definition of these fields, and in F it is F = q(E + v × B) while in F it is F′ = q(E′ + v′ × B′). A method of deriving the EM field transformations in an efficient way which also illustrates the unit of the electromagnetic field uses tensor algebra, given below.

Mathematical formulation

Throughout, italic non-bold capital letters are 4×4 matrices, while non-italic bold letters are 3×3 matrices.

Homogeneous Lorentz group

Writing the coordinates in column vectors and the Minkowski metric η as a square matrix image the spacetime interval takes the form (superscript T denotes transpose) image and is invariant under a Lorentz transformation image where Λ is a square matrix which can depend on parameters.

The set of all Lorentz transformations image in this article is denoted image. This set together with matrix multiplication forms a group, in this context known as the Lorentz group. Also, the above expression X·X is a quadratic form of signature (3,1) on spacetime, and the group of transformations which leaves this quadratic form invariant is the indefinite orthogonal group O(3,1), a Lie group. In other words, the Lorentz group is O(3,1). As presented in this article, any Lie groups mentioned are matrix Lie groups. In this context the operation of composition amounts to matrix multiplication.

From the invariance of the spacetime interval it follows image and this matrix equation contains the general conditions on the Lorentz transformation to ensure invariance of the spacetime interval. Taking the determinant of the equation using the product rule gives immediately image

Writing the Minkowski metric as a block matrix, and the Lorentz transformation in the most general form, image carrying out the block matrix multiplications obtains general conditions on Γ, a, b, M to ensure relativistic invariance. Not much information can be directly extracted from all the conditions, however one of the results image is useful; bTb ≥ 0 always so it follows that image

The negative inequality may be unexpected, because Γ multiplies the time coordinate and this has an effect on time symmetry. If the positive equality holds, then Γ is the Lorentz factor.

The determinant and inequality provide four ways to classify Lorentz Transformations (herein LTs for brevity). Any particular LT has only one determinant sign and only one inequality. There are four sets which include every possible pair given by the intersections ("n"-shaped symbol meaning "and") of these classifying sets.

Intersection, ∩ Antichronous (or non-orthochronous) LTs
image
Orthochronous LTs
image
Proper LTs
image
Proper antichronous LTs
image
Proper orthochronous LTs
image
Improper LTs
image
Improper antichronous LTs
image
Improper orthochronous LTs
image

where "+" and "−" indicate the determinant sign, while "↑" for ≥ and "↓" for ≤ denote the inequalities.

The full Lorentz group splits into the union ("u"-shaped symbol meaning "or") of four disjoint sets image

A subgroup of a group must be closed under the same operation of the group (here matrix multiplication). In other words, for two Lorentz transformations Λ and L from a particular subgroup, the composite Lorentz transformations ΛL and LΛ must be in the same subgroup as Λ and L. This is not always the case: the composition of two antichronous Lorentz transformations is orthochronous, and the composition of two improper Lorentz transformations is proper. In other words, while the sets image, image, image, and image all form subgroups, the sets containing improper and/or antichronous transformations without enough proper orthochronous transformations (e.g. image, image, image) do not form subgroups.

Proper transformations

If a Lorentz covariant 4-vector is measured in one inertial frame with result image, and the same measurement made in another inertial frame (with the same orientation and origin) gives result image, the two results will be related by image where the boost matrix image represents the rotation-free Lorentz transformation between the unprimed and primed frames and image is the velocity of the primed frame as seen from the unprimed frame. The matrix is given byimage

where image is the magnitude of the velocity and image is the Lorentz factor. This formula represents a passive transformation, as it describes how the coordinates of the measured quantity changes from the unprimed frame to the primed frame. The active transformation is given by image.

If a frame F is boosted with velocity u relative to frame F, and another frame F′′ is boosted with velocity v relative to F, the separate boosts are image and the composition of the two boosts connects the coordinates in F′′ and F, image Successive transformations act on the left. If u and v are collinear (parallel or antiparallel along the same line of relative motion), the boost matrices commute: B(v)B(u) = B(u)B(v). This composite transformation happens to be another boost, B(w), where w is collinear with u and v.

If u and v are not collinear but in different directions, the situation is considerably more complicated. Lorentz boosts along different directions do not commute: B(v)B(u) and B(u)B(v) are not equal. Although each of these compositions is not a single boost, each composition is still a Lorentz transformation as it preserves the spacetime interval. It turns out the composition of any two Lorentz boosts is equivalent to a boost followed or preceded by a rotation on the spatial coordinates, in the form of R(ρ)B(w) or B(w)R(ρ). The w and w are composite velocities, while ρ and ρ are rotation parameters (e.g. axis-angle variables, Euler angles, etc.). The rotation in block matrix form is simply image where R(ρ) is a 3d rotation matrix, which rotates any 3d vector in one sense (active transformation), or equivalently the coordinate frame in the opposite sense (passive transformation). It is not simple to connect w and ρ (or w and ρ) to the original boost parameters u and v. In a composition of boosts, the R matrix is named the Wigner rotation, and gives rise to the Thomas precession. These articles give the explicit formulae for the composite transformation matrices, including expressions for w, ρ, w, ρ.

In this article the axis-angle representation is used for ρ. The rotation is about an axis in the direction of a unit vector e, through angle θ (positive anticlockwise, negative clockwise, according to the right-hand rule). The "axis-angle vector" image will serve as a useful abbreviation.

Spatial rotations alone are also Lorentz transformations since they leave the spacetime interval invariant. Like boosts, successive rotations about different axes do not commute. Unlike boosts, the composition of any two rotations is equivalent to a single rotation. Some other similarities and differences between the boost and rotation matrices include:

  • inverses: B(v)−1 = B(−v) (relative motion in the opposite direction), and R(θ)−1 = R(−θ) (rotation in the opposite sense about the same axis)
  • identity transformation for no relative motion/rotation: B(0) = R(0) = I
  • unit determinant: det(B) = det(R) = +1. This property makes them proper transformations.
  • matrix symmetry: B is symmetric (equals transpose), while R is nonsymmetric but orthogonal (transpose equals inverse, RT = R−1).

The most general proper Lorentz transformation Λ(v, θ) includes a boost and rotation together, and is a nonsymmetric matrix. As special cases, Λ(0, θ) = R(θ) and Λ(v, 0) = B(v). An explicit form of the general Lorentz transformation is cumbersome to write down and will not be given here. Nevertheless, closed form expressions for the transformation matrices will be given below using group theoretical arguments. It will be easier to use the rapidity parametrization for boosts, in which case one writes Λ(ζ, θ) and B(ζ).

The Lie group SO+(3,1)

The set of transformations image with matrix multiplication as the operation of composition forms a group, called the "restricted Lorentz group", and is the special indefinite orthogonal group SO+(3,1). (The plus sign indicates that it preserves the orientation of the temporal dimension).

For simplicity, look at the infinitesimal Lorentz boost in the x direction (examining a boost in any other direction, or rotation about any axis, follows an identical procedure). The infinitesimal boost is a small boost away from the identity, obtained by the Taylor expansion of the boost matrix to first order about ζ = 0, image where the higher order terms not shown are negligible because ζ is small, and Bx is simply the boost matrix in the x direction. The derivative of the matrix is the matrix of derivatives (of the entries, with respect to the same variable), and it is understood the derivatives are found first then evaluated at ζ = 0, image

For now, Kx is defined by this result (its significance will be explained shortly). In the limit of an infinite number of infinitely small steps, the finite boost transformation in the form of a matrix exponential is obtained image where the limit definition of the exponential has been used (see also characterizations of the exponential function). More generally

image

The axis-angle vector θ and rapidity vector ζ are altogether six continuous variables which make up the group parameters (in this particular representation), and the generators of the group are K = (Kx, Ky, Kz) and J = (Jx, Jy, Jz), each vectors of matrices with the explicit forms

image

These are all defined in an analogous way to Kx above, although the minus signs in the boost generators are conventional. Physically, the generators of the Lorentz group correspond to important symmetries in spacetime: J are the rotation generators which correspond to angular momentum, and K are the boost generators which correspond to the motion of the system in spacetime. The derivative of any smooth curve C(t) with C(0) = I in the group depending on some group parameter t with respect to that group parameter, evaluated at t = 0, serves as a definition of a corresponding group generator G, and this reflects an infinitesimal transformation away from the identity. The smooth curve can always be taken as an exponential as the exponential will always map G smoothly back into the group via t → exp(tG) for all t; this curve will yield G again when differentiated at t = 0.

Expanding the exponentials in their Taylor series obtains image image which compactly reproduce the boost and rotation matrices as given in the previous section.

It has been stated that the general proper Lorentz transformation is a product of a boost and rotation. At the infinitesimal level the product image is commutative because only linear terms are required (products like (θ·J)(ζ·K) and (ζ·K)(θ·J) count as higher order terms and are negligible). Taking the limit as before leads to the finite transformation in the form of an exponential image

The converse is also true, but the decomposition of a finite general Lorentz transformation into such factors is nontrivial. In particular, image because the generators do not commute. For a description of how to find the factors of a general Lorentz transformation in terms of a boost and a rotation in principle (this usually does not yield an intelligible expression in terms of generators J and K), see Wigner rotation. If, on the other hand, the decomposition is given in terms of the generators, and one wants to find the product in terms of the generators, then the Baker–Campbell–Hausdorff formula applies.

The Lie algebra so(3,1)

Lorentz generators can be added together, or multiplied by real numbers, to obtain more Lorentz generators. In other words, the set of all Lorentz generators image together with the operations of ordinary matrix addition and multiplication of a matrix by a number, forms a vector space over the real numbers. The generators Jx, Jy, Jz, Kx, Ky, Kz form a basis set of V, and the components of the axis-angle and rapidity vectors, θx, θy, θz, ζx, ζy, ζz, are the coordinates of a Lorentz generator with respect to this basis.

Three of the commutation relations of the Lorentz generators are image where the bracket [A, B] = ABBA is known as the commutator, and the other relations can be found by taking cyclic permutations of x, y, z components (i.e. change x to y, y to z, and z to x, repeat).

These commutation relations, and the vector space of generators, fulfill the definition of the Lie algebra image. In summary, a Lie algebra is defined as a vector space V over a field of numbers, and with a binary operation [ , ] (called a Lie bracket in this context) on the elements of the vector space, satisfying the axioms of bilinearity, alternatization, and the Jacobi identity. Here the operation [ , ] is the commutator which satisfies all of these axioms, the vector space is the set of Lorentz generators V as given previously, and the field is the set of real numbers.

Linking terminology used in mathematics and physics: A group generator is any element of the Lie algebra. A group parameter is a component of a coordinate vector representing an arbitrary element of the Lie algebra with respect to some basis. A basis, then, is a set of generators being a basis of the Lie algebra in the usual vector space sense.

The exponential map from the Lie algebra to the Lie group, image provides a one-to-one correspondence between small enough neighborhoods of the origin of the Lie algebra and neighborhoods of the identity element of the Lie group. In the case of the Lorentz group, the exponential map is just the matrix exponential. Globally, the exponential map is not one-to-one, but in the case of the Lorentz group, it is surjective (onto). Hence any group element in the connected component of the identity can be expressed as an exponential of an element of the Lie algebra.

Improper transformations

Lorentz transformations also include parity inversion image which negates all the spatial coordinates only, and time reversal image which negates the time coordinate only, because these transformations leave the spacetime interval invariant. Here I is the 3d identity matrix. These are both symmetric, they are their own inverses (see involution (mathematics)), and each have determinant −1. This latter property makes them improper transformations.

If Λ is a proper orthochronous Lorentz transformation, then TΛ is improper antichronous, PΛ is improper orthochronous, and TPΛ = PTΛ is proper antichronous.

Inhomogeneous Lorentz group

Two other spacetime symmetries have not been accounted for. In order for the spacetime interval to be invariant, it can be shown that it is necessary and sufficient for the coordinate transformation to be of the form image where C is a constant column containing translations in time and space. If C ≠ 0, this is an inhomogeneous Lorentz transformation or Poincaré transformation. If C = 0, this is a homogeneous Lorentz transformation. Poincaré transformations are not dealt further in this article.

Tensor formulation

Contravariant vectors

Writing the general matrix transformation of coordinates as the matrix equation image allows the transformation of other physical quantities that cannot be expressed as four-vectors; e.g., tensors or spinors of any order in 4d spacetime, to be defined. In the corresponding tensor index notation, the above matrix expression is image

where lower and upper indices label covariant and contravariant components respectively, and the summation convention is applied. It is a standard convention to use Greek indices that take the value 0 for time components, and 1, 2, 3 for space components, while Latin indices simply take the values 1, 2, 3, for spatial components (the opposite for Landau and Lifshitz). Note that the first index (reading left to right) corresponds in the matrix notation to a row index. The second index corresponds to the column index.

The transformation matrix is universal for all four-vectors, not just 4-dimensional spacetime coordinates. If A is any four-vector, then in tensor index notation image

Alternatively, one writes image in which the primed indices denote the indices of A in the primed frame. For a general n-component object one may write image where Π is the appropriate representation of the Lorentz group, an n×n matrix for every Λ. In this case, the indices should not be thought of as spacetime indices (sometimes called Lorentz indices), and they run from 1 to n. E.g., if X is a bispinor, then the indices are called Dirac indices.

Covariant vectors

There are also vector quantities with covariant indices. They are generally obtained from their corresponding objects with contravariant indices by the operation of lowering an index; e.g., image where η is the metric tensor. (The linked article also provides more information about what the operation of raising and lowering indices really is mathematically.) The inverse of this transformation is given by image where, when viewed as matrices, ημν is the inverse of ημν. As it happens, ημν = ημν. This is referred to as raising an index. To transform a covariant vector Aμ, first raise its index, then transform it according to the same rule as for contravariant 4-vectors, then finally lower the index; image

But image

That is, it is the (μ, ν)-component of the inverse Lorentz transformation. One defines (as a matter of notation), image and may in this notation write image

Now for a subtlety. The implied summation on the right hand side of image is running over a row index of the matrix representing Λ−1. Thus, in terms of matrices, this transformation should be thought of as the inverse transpose of Λ acting on the column vector Aμ. That is, in pure matrix notation, image

This means exactly that covariant vectors (thought of as column matrices) transform according to the dual representation of the standard representation of the Lorentz group. This notion generalizes to general representations, simply replace Λ with Π(Λ).

Tensors

If A and B are linear operators on vector spaces U and V, then a linear operator AB may be defined on the tensor product of U and V, denoted UV according to

image               (T1)

From this it is immediately clear that if u and v are a four-vectors in V, then uvT2VVV transforms as

image               (T2)

The second step uses the bilinearity of the tensor product and the last step defines a 2-tensor on component form, or rather, it just renames the tensor uv.

These observations generalize in an obvious way to more factors, and using the fact that a general tensor on a vector space V can be written as a sum of a coefficient (component!) times tensor products of basis vectors and basis covectors, one arrives at the transformation law for any tensor quantity T. It is given by

image               (T3)

where Λχ′ψ is defined above. This form can generally be reduced to the form for general n-component objects given above with a single matrix (Π(Λ)) operating on column vectors. This latter form is sometimes preferred; e.g., for the electromagnetic field tensor.

Transformation of the electromagnetic field

image
Lorentz boost of an electric charge; the charge is at rest in one frame or the other.

Lorentz transformations can also be used to illustrate that the magnetic field B and electric field E are simply different aspects of the same force — the electromagnetic force, as a consequence of relative motion between electric charges and observers. The fact that the electromagnetic field shows relativistic effects becomes clear by carrying out a simple thought experiment.

  • An observer measures a charge at rest in frame F. The observer will detect a static electric field. As the charge is stationary in this frame, there is no electric current, so the observer does not observe any magnetic field.
  • The other observer in frame F′ moves at velocity v relative to F and the charge. This observer sees a different electric field because the charge moves at velocity v in their rest frame. The motion of the charge corresponds to an electric current, and thus the observer in frame F′ also sees a magnetic field.

The electric and magnetic fields transform differently from space and time, but exactly the same way as relativistic angular momentum and the boost vector.

The electromagnetic field strength tensor is given by image in SI units. In relativity, the Gaussian system of units is often preferred over SI units, even in texts whose main choice of units is SI units, because in it the electric field E and the magnetic induction B have the same units making the appearance of the electromagnetic field tensor more natural. Consider a Lorentz boost in the x-direction. It is given byimage where the field tensor is displayed side by side for easiest possible reference in the manipulations below.

The general transformation law (T3) becomes image

For the magnetic field one obtains

In physics the Lorentz transformations are a six parameter family of linear transformations from a coordinate frame in spacetime to another frame that moves at a constant velocity relative to the former The respective inverse transformation is then parameterized by the negative of this velocity The transformations are named after the Dutch physicist Hendrik Lorentz The most common form of the transformation parametrized by the real constant v displaystyle v representing a velocity confined to the x direction is expressed ast g t vxc2 x g x vt y yz z displaystyle begin aligned t amp gamma left t frac vx c 2 right x amp gamma left x vt right y amp y z amp z end aligned where t x y z and t x y z are the coordinates of an event in two frames with the spatial origins coinciding at t t 0 where the primed frame is seen from the unprimed frame as moving with speed v along the x axis where c is the speed of light and g 11 v2 c2 displaystyle gamma frac 1 sqrt 1 v 2 c 2 is the Lorentz factor When speed v is much smaller than c the Lorentz factor is negligibly different from 1 but as v approaches c g displaystyle gamma grows without bound The value of v must be smaller than c for the transformation to make sense Expressing the speed as a fraction of the speed of light b v c textstyle beta v c an equivalent form of the transformation isct g ct bx x g x bct y yz z displaystyle begin aligned ct amp gamma left ct beta x right x amp gamma left x beta ct right y amp y z amp z end aligned Frames of reference can be divided into two groups inertial relative motion with constant velocity and non inertial accelerating moving in curved paths rotational motion with constant angular velocity etc The term Lorentz transformations only refers to transformations between inertial frames usually in the context of special relativity In each reference frame an observer can use a local coordinate system usually Cartesian coordinates in this context to measure lengths and a clock to measure time intervals An event is something that happens at a point in space at an instant of time or more formally a point in spacetime The transformations connect the space and time coordinates of an event as measured by an observer in each frame They supersede the Galilean transformation of Newtonian physics which assumes an absolute space and time see Galilean relativity The Galilean transformation is a good approximation only at relative speeds much less than the speed of light Lorentz transformations have a number of unintuitive features that do not appear in Galilean transformations For example they reflect the fact that observers moving at different velocities may measure different distances elapsed times and even different orderings of events but always such that the speed of light is the same in all inertial reference frames The invariance of light speed is one of the postulates of special relativity Historically the transformations were the result of attempts by Lorentz and others to explain how the speed of light was observed to be independent of the reference frame and to understand the symmetries of the laws of electromagnetism The transformations later became a cornerstone for special relativity The Lorentz transformation is a linear transformation It may include a rotation of space a rotation free Lorentz transformation is called a Lorentz boost In Minkowski space the mathematical model of spacetime in special relativity the Lorentz transformations preserve the spacetime interval between any two events They describe only the transformations in which the spacetime event at the origin is left fixed They can be considered as a hyperbolic rotation of Minkowski space The more general set of transformations that also includes translations is known as the Poincare group HistoryMany physicists including Woldemar Voigt George FitzGerald Joseph Larmor and Hendrik Lorentz himself had been discussing the physics implied by these equations since 1887 Early in 1889 Oliver Heaviside had shown from Maxwell s equations that the electric field surrounding a spherical distribution of charge should cease to have spherical symmetry once the charge is in motion relative to the luminiferous aether FitzGerald then conjectured that Heaviside s distortion result might be applied to a theory of intermolecular forces Some months later FitzGerald published the conjecture that bodies in motion are being contracted in order to explain the baffling outcome of the 1887 aether wind experiment of Michelson and Morley In 1892 Lorentz independently presented the same idea in a more detailed manner which was subsequently called FitzGerald Lorentz contraction hypothesis Their explanation was widely known before 1905 Lorentz 1892 1904 and Larmor 1897 1900 who believed the luminiferous aether hypothesis also looked for the transformation under which Maxwell s equations are invariant when transformed from the aether to a moving frame They extended the FitzGerald Lorentz contraction hypothesis and found out that the time coordinate has to be modified as well local time Henri Poincare gave a physical interpretation to local time to first order in v c the relative velocity of the two reference frames normalized to the speed of light as the consequence of clock synchronization under the assumption that the speed of light is constant in moving frames Larmor is credited to have been the first to understand the crucial time dilation property inherent in his equations In 1905 Poincare was the first to recognize that the transformation has the properties of a mathematical group and he named it after Lorentz Later in the same year Albert Einstein published what is now called special relativity by deriving the Lorentz transformation under the assumptions of the principle of relativity and the constancy of the speed of light in any inertial reference frame and by abandoning the mechanistic aether as unnecessary Derivation of the group of Lorentz transformationsAn event is something that happens at a certain point in spacetime or more generally the point in spacetime itself In any inertial frame an event is specified by a time coordinate ct and a set of Cartesian coordinates x y z to specify position in space in that frame Subscripts label individual events From Einstein s second postulate of relativity invariance of c it follows that c2 t2 t1 2 x2 x1 2 y2 y1 2 z2 z1 2 0 lightlike separated events 1 2 displaystyle c 2 t 2 t 1 2 x 2 x 1 2 y 2 y 1 2 z 2 z 1 2 0 quad text lightlike separated events 1 2 D1 in all inertial frames for events connected by light signals The quantity on the left is called the spacetime interval between events a1 t1 x1 y1 z1 and a2 t2 x2 y2 z2 The interval between any two events not necessarily separated by light signals is in fact invariant i e independent of the state of relative motion of observers in different inertial frames as is shown using homogeneity and isotropy of space The transformation sought after thus must possess the property that c2 t2 t1 2 x2 x1 2 y2 y1 2 z2 z1 2 c2 t2 t1 2 x2 x1 2 y2 y1 2 z2 z1 2 all events 1 2 displaystyle begin aligned amp c 2 t 2 t 1 2 x 2 x 1 2 y 2 y 1 2 z 2 z 1 2 6pt amp c 2 t 2 t 1 2 x 2 x 1 2 y 2 y 1 2 z 2 z 1 2 quad text all events 1 2 end aligned D2 where t x y z are the spacetime coordinates used to define events in one frame and t x y z are the coordinates in another frame First one observes that D2 is satisfied if an arbitrary 4 tuple b of numbers are added to events a1 and a2 Such transformations are called spacetime translations and are not dealt with further here Then one observes that a linear solution preserving the origin of the simpler problem solves the general problem too c2t2 x2 y2 z2 c2t 2 x 2 y 2 z 2orc2t1t2 x1x2 y1y2 z1z2 c2t1 t2 x1 x2 y1 y2 z1 z2 displaystyle begin aligned amp c 2 t 2 x 2 y 2 z 2 c 2 t 2 x 2 y 2 z 2 6pt text or quad amp c 2 t 1 t 2 x 1 x 2 y 1 y 2 z 1 z 2 c 2 t 1 t 2 x 1 x 2 y 1 y 2 z 1 z 2 end aligned D3 a solution satisfying the first formula automatically satisfies the second one as well see polarization identity Finding the solution to the simpler problem is just a matter of look up in the theory of classical groups that preserve bilinear forms of various signature First equation in D3 can be written more compactly as a a a a ora a a a displaystyle a a a a quad text or quad a cdot a a cdot a D4 where refers to the bilinear form of signature 1 3 on R4 exposed by the right hand side formula in D3 The alternative notation defined on the right is referred to as the relativistic dot product Spacetime mathematically viewed as R4 endowed with this bilinear form is known as Minkowski space M The Lorentz transformation is thus an element of the group O 1 3 the Lorentz group or for those that prefer the other metric signature O 3 1 also called the Lorentz group One has a a La La a a L O 1 3 a a M displaystyle a a Lambda a Lambda a a a quad Lambda in mathrm O 1 3 quad a a in M D5 which is precisely preservation of the bilinear form D3 which implies by linearity of L and bilinearity of the form that D2 is satisfied The elements of the Lorentz group are rotations and boosts and mixes thereof If the spacetime translations are included then one obtains the inhomogeneous Lorentz group or the Poincare group GeneralitiesThe relations between the primed and unprimed spacetime coordinates are the Lorentz transformations each coordinate in one frame is a linear function of all the coordinates in the other frame and the inverse functions are the inverse transformation Depending on how the frames move relative to each other and how they are oriented in space relative to each other other parameters that describe direction speed and orientation enter the transformation equations Transformations describing relative motion with constant uniform velocity and without rotation of the space coordinate axes are called Lorentz boosts or simply boosts and the relative velocity between the frames is the parameter of the transformation The other basic type of Lorentz transformation is rotation in the spatial coordinates only these like boosts are inertial transformations since there is no relative motion the frames are simply tilted and not continuously rotating and in this case quantities defining the rotation are the parameters of the transformation e g axis angle representation or Euler angles etc A combination of a rotation and boost is a homogeneous transformation which transforms the origin back to the origin The full Lorentz group O 3 1 also contains special transformations that are neither rotations nor boosts but rather reflections in a plane through the origin Two of these can be singled out spatial inversion in which the spatial coordinates of all events are reversed in sign and temporal inversion in which the time coordinate for each event gets its sign reversed Boosts should not be conflated with mere displacements in spacetime in this case the coordinate systems are simply shifted and there is no relative motion However these also count as symmetries forced by special relativity since they leave the spacetime interval invariant A combination of a rotation with a boost followed by a shift in spacetime is an inhomogeneous Lorentz transformation an element of the Poincare group which is also called the inhomogeneous Lorentz group Physical formulation of Lorentz boostsCoordinate transformation The spacetime coordinates of an event as measured by each observer in their inertial reference frame in standard configuration are shown in the speech bubbles Top frame F moves at velocity v along the x axis of frame F Bottom frame F moves at velocity v along the x axis of frame F A stationary observer in frame F defines events with coordinates t x y z Another frame F moves with velocity v relative to F and an observer in this moving frame F defines events using the coordinates t x y z The coordinate axes in each frame are parallel the x and x axes are parallel the y and y axes are parallel and the z and z axes are parallel remain mutually perpendicular and relative motion is along the coincident xx axes At t t 0 the origins of both coordinate systems are the same x y z x y z 0 0 0 In other words the times and positions are coincident at this event If all these hold then the coordinate systems are said to be in standard configuration or synchronized If an observer in F records an event t x y z then an observer in F records the same event with coordinates Lorentz boost x direction t g t vxc2 x g x vt y yz z displaystyle begin aligned t amp gamma left t frac vx c 2 right x amp gamma left x vt right y amp y z amp z end aligned where v is the relative velocity between frames in the x direction c is the speed of light and g 11 v2c2 displaystyle gamma frac 1 sqrt 1 frac v 2 c 2 lowercase gamma is the Lorentz factor Here v is the parameter of the transformation for a given boost it is a constant number but can take a continuous range of values In the setup used here positive relative velocity v gt 0 is motion along the positive directions of the xx axes zero relative velocity v 0 is no relative motion while negative relative velocity v lt 0 is relative motion along the negative directions of the xx axes The magnitude of relative velocity v cannot equal or exceed c so only subluminal speeds c lt v lt c are allowed The corresponding range of g is 1 g lt The transformations are not defined if v is outside these limits At the speed of light v c g is infinite and faster than light v gt c g is a complex number each of which make the transformations unphysical The space and time coordinates are measurable quantities and numerically must be real numbers As an active transformation an observer in F notices the coordinates of the event to be boosted in the negative directions of the xx axes because of the v in the transformations This has the equivalent effect of the coordinate system F boosted in the positive directions of the xx axes while the event does not change and is simply represented in another coordinate system a passive transformation The inverse relations t x y z in terms of t x y z can be found by algebraically solving the original set of equations A more efficient way is to use physical principles Here F is the stationary frame while F is the moving frame According to the principle of relativity there is no privileged frame of reference so the transformations from F to F must take exactly the same form as the transformations from F to F The only difference is F moves with velocity v relative to F i e the relative velocity has the same magnitude but is oppositely directed Thus if an observer in F notes an event t x y z then an observer in F notes the same event with coordinates Inverse Lorentz boost x direction t g t vx c2 x g x vt y y z z displaystyle begin aligned t amp gamma left t frac vx c 2 right x amp gamma left x vt right y amp y z amp z end aligned and the value of g remains unchanged This trick of simply reversing the direction of relative velocity while preserving its magnitude and exchanging primed and unprimed variables always applies to finding the inverse transformation of every boost in any direction Sometimes it is more convenient to use b v c lowercase beta instead of v so that ct g ct bx x g x bct displaystyle begin aligned ct amp gamma left ct beta x right x amp gamma left x beta ct right end aligned which shows much more clearly the symmetry in the transformation From the allowed ranges of v and the definition of b it follows 1 lt b lt 1 The use of b and g is standard throughout the literature When the boost velocity v displaystyle boldsymbol v is in an arbitrary vector direction with the boost vector b v c displaystyle boldsymbol beta boldsymbol v c then the transformation from an unprimed spacetime coordinate system to a primed coordinate system is given by 1 ct gbxx 1 g21 gbx2y g21 gbxbyz g21 gbybz g gbx gby gbz gbx1 g21 gbx2g21 gbxbyg21 gbxbz gbyg21 gbxby1 g21 gby2g21 gbybz gbzg21 gbxbzg21 gbybz1 g21 gbz2 ct gbxx1 g21 gbx2yg21 gbxbyzg21 gbybz displaystyle begin bmatrix ct vphantom gamma beta x x vphantom 1 frac gamma 2 1 gamma beta x 2 y vphantom frac gamma 2 1 gamma beta x beta y z vphantom frac gamma 2 1 gamma beta y beta z end bmatrix begin bmatrix gamma amp gamma beta x amp gamma beta y amp gamma beta z gamma beta x amp 1 frac gamma 2 1 gamma beta x 2 amp frac gamma 2 1 gamma beta x beta y amp frac gamma 2 1 gamma beta x beta z gamma beta y amp frac gamma 2 1 gamma beta x beta y amp 1 frac gamma 2 1 gamma beta y 2 amp frac gamma 2 1 gamma beta y beta z gamma beta z amp frac gamma 2 1 gamma beta x beta z amp frac gamma 2 1 gamma beta y beta z amp 1 frac gamma 2 1 gamma beta z 2 end bmatrix begin bmatrix ct vphantom gamma beta x x vphantom 1 frac gamma 2 1 gamma beta x 2 y vphantom frac gamma 2 1 gamma beta x beta y z vphantom frac gamma 2 1 gamma beta y beta z end bmatrix where the Lorentz factor is g 1 1 b2 displaystyle gamma 1 sqrt 1 boldsymbol beta 2 The determinant of the transformation matrix is 1 and its trace is 2 1 g displaystyle 2 1 gamma The inverse of the transformation is given by reversing the sign of b displaystyle boldsymbol beta The quantity c2t2 x2 y2 z2 displaystyle c 2 t 2 x 2 y 2 z 2 is invariant under the transformation The Lorentz transformations can also be derived in a way that resembles circular rotations in 3d space using the hyperbolic functions For the boost in the x direction the results are Lorentz boost x direction with rapidity z ct ctcosh z xsinh zx xcosh z ctsinh zy yz z displaystyle begin aligned ct amp ct cosh zeta x sinh zeta x amp x cosh zeta ct sinh zeta y amp y z amp z end aligned where z lowercase zeta is a parameter called rapidity many other symbols are used including 8 ϕ f h ps 3 Given the strong resemblance to rotations of spatial coordinates in 3d space in the Cartesian xy yz and zx planes a Lorentz boost can be thought of as a hyperbolic rotation of spacetime coordinates in the xt yt and zt Cartesian time planes of 4d Minkowski space The parameter z is the hyperbolic angle of rotation analogous to the ordinary angle for circular rotations This transformation can be illustrated with a Minkowski diagram The hyperbolic functions arise from the difference between the squares of the time and spatial coordinates in the spacetime interval rather than a sum The geometric significance of the hyperbolic functions can be visualized by taking x 0 or ct 0 in the transformations Squaring and subtracting the results one can derive hyperbolic curves of constant coordinate values but varying z which parametrizes the curves according to the identity cosh2 z sinh2 z 1 displaystyle cosh 2 zeta sinh 2 zeta 1 Conversely the ct and x axes can be constructed for varying coordinates but constant z The definition tanh z sinh zcosh z displaystyle tanh zeta frac sinh zeta cosh zeta provides the link between a constant value of rapidity and the slope of the ct axis in spacetime A consequence these two hyperbolic formulae is an identity that matches the Lorentz factor cosh z 11 tanh2 z displaystyle cosh zeta frac 1 sqrt 1 tanh 2 zeta Comparing the Lorentz transformations in terms of the relative velocity and rapidity or using the above formulae the connections between b g and z are b tanh z g cosh z bg sinh z displaystyle begin aligned beta amp tanh zeta gamma amp cosh zeta beta gamma amp sinh zeta end aligned Taking the inverse hyperbolic tangent gives the rapidity z tanh 1 b displaystyle zeta tanh 1 beta Since 1 lt b lt 1 it follows lt z lt From the relation between z and b positive rapidity z gt 0 is motion along the positive directions of the xx axes zero rapidity z 0 is no relative motion while negative rapidity z lt 0 is relative motion along the negative directions of the xx axes The inverse transformations are obtained by exchanging primed and unprimed quantities to switch the coordinate frames and negating rapidity z z since this is equivalent to negating the relative velocity Therefore Inverse Lorentz boost x direction with rapidity z ct ct cosh z x sinh zx x cosh z ct sinh zy y z z displaystyle begin aligned ct amp ct cosh zeta x sinh zeta x amp x cosh zeta ct sinh zeta y amp y z amp z end aligned The inverse transformations can be similarly visualized by considering the cases when x 0 and ct 0 So far the Lorentz transformations have been applied to one event If there are two events there is a spatial separation and time interval between them It follows from the linearity of the Lorentz transformations that two values of space and time coordinates can be chosen the Lorentz transformations can be applied to each then subtracted to get the Lorentz transformations of the differences Dt g Dt vDxc2 Dx g Dx vDt displaystyle begin aligned Delta t amp gamma left Delta t frac v Delta x c 2 right Delta x amp gamma left Delta x v Delta t right end aligned with inverse relations Dt g Dt vDx c2 Dx g Dx vDt displaystyle begin aligned Delta t amp gamma left Delta t frac v Delta x c 2 right Delta x amp gamma left Delta x v Delta t right end aligned where D uppercase delta indicates a difference of quantities e g Dx x2 x1 for two values of x coordinates and so on These transformations on differences rather than spatial points or instants of time are useful for a number of reasons in calculations and experiments it is lengths between two points or time intervals that are measured or of interest e g the length of a moving vehicle or time duration it takes to travel from one place to another the transformations of velocity can be readily derived by making the difference infinitesimally small and dividing the equations and the process repeated for the transformation of acceleration if the coordinate systems are never coincident i e not in standard configuration and if both observers can agree on an event t0 x0 y0 z0 in F and t0 x0 y0 z0 in F then they can use that event as the origin and the spacetime coordinate differences are the differences between their coordinates and this origin e g Dx x x0 Dx x x0 etc Physical implications A critical requirement of the Lorentz transformations is the invariance of the speed of light a fact used in their derivation and contained in the transformations themselves If in F the equation for a pulse of light along the x direction is x ct then in F the Lorentz transformations give x ct and vice versa for any c lt v lt c For relative speeds much less than the speed of light the Lorentz transformations reduce to the Galilean transformation t tx x vt displaystyle begin aligned t amp approx t x amp approx x vt end aligned in accordance with the correspondence principle It is sometimes said that nonrelativistic physics is a physics of instantaneous action at a distance Three counterintuitive but correct predictions of the transformations are Relativity of simultaneity Suppose two events occur along the x axis simultaneously Dt 0 in F but separated by a nonzero displacement Dx Then in F we find that Dt g vDxc2 displaystyle Delta t gamma frac v Delta x c 2 so the events are no longer simultaneous according to a moving observer Time dilation Suppose there is a clock at rest in F If a time interval is measured at the same point in that frame so that Dx 0 then the transformations give this interval in F by Dt gDt Conversely suppose there is a clock at rest in F If an interval is measured at the same point in that frame so that Dx 0 then the transformations give this interval in F by Dt gDt Either way each observer measures the time interval between ticks of a moving clock to be longer by a factor g than the time interval between ticks of his own clock Length contraction Suppose there is a rod at rest in F aligned along the x axis with length Dx In F the rod moves with velocity v so its length must be measured by taking two simultaneous Dt 0 measurements at opposite ends Under these conditions the inverse Lorentz transform shows that Dx gDx In F the two measurements are no longer simultaneous but this does not matter because the rod is at rest in F So each observer measures the distance between the end points of a moving rod to be shorter by a factor 1 g than the end points of an identical rod at rest in his own frame Length contraction affects any geometric quantity related to lengths so from the perspective of a moving observer areas and volumes will also appear to shrink along the direction of motion Vector transformations An observer in frame F observes F to move with velocity v while F observes F to move with velocity v The coordinate axes of each frame are still parallel according to whom and orthogonal The position vector as measured in each frame is split into components parallel and perpendicular to the relative velocity vector v Left Standard configuration Right Inverse configuration The use of vectors allows positions and velocities to be expressed in arbitrary directions compactly A single boost in any direction depends on the full relative velocity vector v with a magnitude v v that cannot equal or exceed c so that 0 v lt c Only time and the coordinates parallel to the direction of relative motion change while those coordinates perpendicular do not With this in mind split the spatial position vector r as measured in F and r as measured in F each into components perpendicular and parallel to v r r r r r r displaystyle mathbf r mathbf r perp mathbf r quad mathbf r mathbf r perp mathbf r then the transformations are t g t r vc2 r g r vt r r displaystyle begin aligned t amp gamma left t frac mathbf r parallel cdot mathbf v c 2 right mathbf r amp gamma mathbf r mathbf v t mathbf r perp amp mathbf r perp end aligned where is the dot product The Lorentz factor g retains its definition for a boost in any direction since it depends only on the magnitude of the relative velocity The definition b v c with magnitude 0 b lt 1 is also used by some authors Introducing a unit vector n v v b b in the direction of relative motion the relative velocity is v vn with magnitude v and direction n and vector projection and rejection give respectively r r n n r r r n n displaystyle mathbf r parallel mathbf r cdot mathbf n mathbf n quad mathbf r perp mathbf r mathbf r cdot mathbf n mathbf n Accumulating the results gives the full transformations Lorentz boost in direction n with magnitude v t g t vn rc2 r r g 1 r n n gtvn displaystyle begin aligned t amp gamma left t frac v mathbf n cdot mathbf r c 2 right mathbf r amp mathbf r gamma 1 mathbf r cdot mathbf n mathbf n gamma tv mathbf n end aligned The projection and rejection also applies to r For the inverse transformations exchange r and r to switch observed coordinates and negate the relative velocity v v or simply the unit vector n n since the magnitude v is always positive to obtain Inverse Lorentz boost in direction n with magnitude v t g t r vnc2 r r g 1 r n n gt vn displaystyle begin aligned t amp gamma left t frac mathbf r cdot v mathbf n c 2 right mathbf r amp mathbf r gamma 1 mathbf r cdot mathbf n mathbf n gamma t v mathbf n end aligned The unit vector has the advantage of simplifying equations for a single boost allows either v or b to be reinstated when convenient and the rapidity parametrization is immediately obtained by replacing b and bg It is not convenient for multiple boosts The vectorial relation between relative velocity and rapidity isb bn ntanh z displaystyle boldsymbol beta beta mathbf n mathbf n tanh zeta and the rapidity vector can be defined as z zn ntanh 1 b displaystyle boldsymbol zeta zeta mathbf n mathbf n tanh 1 beta each of which serves as a useful abbreviation in some contexts The magnitude of z is the absolute value of the rapidity scalar confined to 0 z lt which agrees with the range 0 b lt 1 Transformation of velocities The transformation of velocities provides the definition relativistic velocity addition the ordering of vectors is chosen to reflect the ordering of the addition of velocities first v the velocity of F relative to F then u the velocity of X relative to F to obtain u v u the velocity of X relative to F Defining the coordinate velocities and Lorentz factor by u drdt u dr dt gv 11 v vc2 displaystyle mathbf u frac d mathbf r dt quad mathbf u frac d mathbf r dt quad gamma mathbf v frac 1 sqrt 1 dfrac mathbf v cdot mathbf v c 2 taking the differentials in the coordinates and time of the vector transformations then dividing equations leads to u 11 v uc2 ugv v 1c2gvgv 1 u v v displaystyle mathbf u frac 1 1 frac mathbf v cdot mathbf u c 2 left frac mathbf u gamma mathbf v mathbf v frac 1 c 2 frac gamma mathbf v gamma mathbf v 1 left mathbf u cdot mathbf v right mathbf v right The velocities u and u are the velocity of some massive object They can also be for a third inertial frame say F in which case they must be constant Denote either entity by X Then X moves with velocity u relative to F or equivalently with velocity u relative to F in turn F moves with velocity v relative to F The inverse transformations can be obtained in a similar way or as with position coordinates exchange u and u and change v to v The transformation of velocity is useful in stellar aberration the Fizeau experiment and the relativistic Doppler effect The Lorentz transformations of acceleration can be similarly obtained by taking differentials in the velocity vectors and dividing these by the time differential Transformation of other quantities In general given four quantities A and Z Zx Zy Zz and their Lorentz boosted counterparts A and Z Z x Z y Z z a relation of the form A2 Z Z A 2 Z Z displaystyle A 2 mathbf Z cdot mathbf Z A 2 mathbf Z cdot mathbf Z implies the quantities transform under Lorentz transformations similar to the transformation of spacetime coordinates A g A vn Zc Z Z g 1 Z n n gAvnc displaystyle begin aligned A amp gamma left A frac v mathbf n cdot mathbf Z c right mathbf Z amp mathbf Z gamma 1 mathbf Z cdot mathbf n mathbf n frac gamma Av mathbf n c end aligned The decomposition of Z and Z into components perpendicular and parallel to v is exactly the same as for the position vector as is the process of obtaining the inverse transformations exchange A Z and A Z to switch observed quantities and reverse the direction of relative motion by the substitution n n The quantities A Z collectively make up a four vector where A is the timelike component and Z the spacelike component Examples of A and Z are the following Four vector A ZPosition four vector Time multiplied by c ct Position vector rFour momentum Energy divided by c E c Momentum pFour wave vector angular frequency divided by c w c wave vector kFour spin No name st Spin sFour current Charge density multiplied by c rc Current density jElectromagnetic four potential Electric potential divided by c f c Magnetic vector potential A For a given object e g particle fluid field material if A or Z correspond to properties specific to the object like its charge density mass density spin etc its properties can be fixed in the rest frame of that object Then the Lorentz transformations give the corresponding properties in a frame moving relative to the object with constant velocity This breaks some notions taken for granted in non relativistic physics For example the energy E of an object is a scalar in non relativistic mechanics but not in relativistic mechanics because energy changes under Lorentz transformations its value is different for various inertial frames In the rest frame of an object it has a rest energy and zero momentum In a boosted frame its energy is different and it appears to have a momentum Similarly in non relativistic quantum mechanics the spin of a particle is a constant vector but in relativistic quantum mechanics spin s depends on relative motion In the rest frame of the particle the spin pseudovector can be fixed to be its ordinary non relativistic spin with a zero timelike quantity st however a boosted observer will perceive a nonzero timelike component and an altered spin Not all quantities are invariant in the form as shown above for example orbital angular momentum L does not have a timelike quantity and neither does the electric field E nor the magnetic field B The definition of angular momentum is L r p and in a boosted frame the altered angular momentum is L r p Applying this definition using the transformations of coordinates and momentum leads to the transformation of angular momentum It turns out L transforms with another vector quantity N E c2 r tp related to boosts see relativistic angular momentum for details For the case of the E and B fields the transformations cannot be obtained as directly using vector algebra The Lorentz force is the definition of these fields and in F it is F q E v B while in F it is F q E v B A method of deriving the EM field transformations in an efficient way which also illustrates the unit of the electromagnetic field uses tensor algebra given below Mathematical formulationThroughout italic non bold capital letters are 4 4 matrices while non italic bold letters are 3 3 matrices Homogeneous Lorentz group Writing the coordinates in column vectors and the Minkowski metric h as a square matrix X ct x y z h 1000010000100001 X ctxyz displaystyle X begin bmatrix c t x y z end bmatrix quad eta begin bmatrix 1 amp 0 amp 0 amp 0 0 amp 1 amp 0 amp 0 0 amp 0 amp 1 amp 0 0 amp 0 amp 0 amp 1 end bmatrix quad X begin bmatrix c t x y z end bmatrix the spacetime interval takes the form superscript T denotes transpose X X XThX X ThX displaystyle X cdot X X mathrm T eta X X mathrm T eta X and is invariant under a Lorentz transformation X LX displaystyle X Lambda X where L is a square matrix which can depend on parameters The set of all Lorentz transformations L displaystyle Lambda in this article is denoted L displaystyle mathcal L This set together with matrix multiplication forms a group in this context known as the Lorentz group Also the above expression X X is a quadratic form of signature 3 1 on spacetime and the group of transformations which leaves this quadratic form invariant is the indefinite orthogonal group O 3 1 a Lie group In other words the Lorentz group is O 3 1 As presented in this article any Lie groups mentioned are matrix Lie groups In this context the operation of composition amounts to matrix multiplication From the invariance of the spacetime interval it follows h LThL displaystyle eta Lambda mathrm T eta Lambda and this matrix equation contains the general conditions on the Lorentz transformation to ensure invariance of the spacetime interval Taking the determinant of the equation using the product rule gives immediately det L 2 1 det L 1 displaystyle left det Lambda right 2 1 quad Rightarrow quad det Lambda pm 1 Writing the Minkowski metric as a block matrix and the Lorentz transformation in the most general form h 100I L G aT bM displaystyle eta begin bmatrix 1 amp 0 0 amp mathbf I end bmatrix quad Lambda begin bmatrix Gamma amp mathbf a mathrm T mathbf b amp mathbf M end bmatrix carrying out the block matrix multiplications obtains general conditions on G a b M to ensure relativistic invariance Not much information can be directly extracted from all the conditions however one of the results G2 1 bTb displaystyle Gamma 2 1 mathbf b mathrm T mathbf b is useful bTb 0 always so it follows that G2 1 G 1 G 1 displaystyle Gamma 2 geq 1 quad Rightarrow quad Gamma leq 1 quad Gamma geq 1 The negative inequality may be unexpected because G multiplies the time coordinate and this has an effect on time symmetry If the positive equality holds then G is the Lorentz factor The determinant and inequality provide four ways to classify Lorentz Transformations herein LTs for brevity Any particular LT has only one determinant sign and only one inequality There are four sets which include every possible pair given by the intersections n shaped symbol meaning and of these classifying sets Intersection Antichronous or non orthochronous LTs L L G 1 displaystyle mathcal L downarrow Lambda Gamma leq 1 Orthochronous LTs L L G 1 displaystyle mathcal L uparrow Lambda Gamma geq 1 Proper LTs L L det L 1 displaystyle mathcal L Lambda det Lambda 1 Proper antichronous LTs L L L displaystyle mathcal L downarrow mathcal L cap mathcal L downarrow Proper orthochronous LTs L L L displaystyle mathcal L uparrow mathcal L cap mathcal L uparrow Improper LTs L L det L 1 displaystyle mathcal L Lambda det Lambda 1 Improper antichronous LTs L L L displaystyle mathcal L downarrow mathcal L cap mathcal L downarrow Improper orthochronous LTs L L L displaystyle mathcal L uparrow mathcal L cap mathcal L uparrow where and indicate the determinant sign while for and for denote the inequalities The full Lorentz group splits into the union u shaped symbol meaning or of four disjoint sets L L L L L displaystyle mathcal L mathcal L uparrow cup mathcal L uparrow cup mathcal L downarrow cup mathcal L downarrow A subgroup of a group must be closed under the same operation of the group here matrix multiplication In other words for two Lorentz transformations L and L from a particular subgroup the composite Lorentz transformations LL and LL must be in the same subgroup as L and L This is not always the case the composition of two antichronous Lorentz transformations is orthochronous and the composition of two improper Lorentz transformations is proper In other words while the sets L displaystyle mathcal L uparrow L displaystyle mathcal L L displaystyle mathcal L uparrow and L0 L L displaystyle mathcal L 0 mathcal L uparrow cup mathcal L downarrow all form subgroups the sets containing improper and or antichronous transformations without enough proper orthochronous transformations e g L displaystyle mathcal L downarrow L displaystyle mathcal L downarrow L displaystyle mathcal L uparrow do not form subgroups Proper transformations If a Lorentz covariant 4 vector is measured in one inertial frame with result X displaystyle X and the same measurement made in another inertial frame with the same orientation and origin gives result X displaystyle X the two results will be related by X B v X displaystyle X B mathbf v X where the boost matrix B v displaystyle B mathbf v represents the rotation free Lorentz transformation between the unprimed and primed frames and v displaystyle mathbf v is the velocity of the primed frame as seen from the unprimed frame The matrix is given byB v g gvx c gvy c gvz c gvx c1 g 1 vx2v2 g 1 vxvyv2 g 1 vxvzv2 gvy c g 1 vyvxv21 g 1 vy2v2 g 1 vyvzv2 gvz c g 1 vzvxv2 g 1 vzvyv21 g 1 vz2v2 g gb T gb I g 1 b b Tb2 displaystyle B mathbf v begin bmatrix gamma amp gamma v x c amp gamma v y c amp gamma v z c gamma v x c amp 1 gamma 1 dfrac v x 2 v 2 amp gamma 1 dfrac v x v y v 2 amp gamma 1 dfrac v x v z v 2 gamma v y c amp gamma 1 dfrac v y v x v 2 amp 1 gamma 1 dfrac v y 2 v 2 amp gamma 1 dfrac v y v z v 2 gamma v z c amp gamma 1 dfrac v z v x v 2 amp gamma 1 dfrac v z v y v 2 amp 1 gamma 1 dfrac v z 2 v 2 end bmatrix begin bmatrix gamma amp gamma vec beta T gamma vec beta amp I gamma 1 dfrac vec beta vec beta T beta 2 end bmatrix where v vx2 vy2 vz2 textstyle v sqrt v x 2 v y 2 v z 2 is the magnitude of the velocity and g 11 v2c2 textstyle gamma frac 1 sqrt 1 frac v 2 c 2 is the Lorentz factor This formula represents a passive transformation as it describes how the coordinates of the measured quantity changes from the unprimed frame to the primed frame The active transformation is given by B v displaystyle B mathbf v If a frame F is boosted with velocity u relative to frame F and another frame F is boosted with velocity v relative to F the separate boosts are X B v X X B u X displaystyle X B mathbf v X quad X B mathbf u X and the composition of the two boosts connects the coordinates in F and F X B v B u X displaystyle X B mathbf v B mathbf u X Successive transformations act on the left If u and v are collinear parallel or antiparallel along the same line of relative motion the boost matrices commute B v B u B u B v This composite transformation happens to be another boost B w where w is collinear with u and v If u and v are not collinear but in different directions the situation is considerably more complicated Lorentz boosts along different directions do not commute B v B u and B u B v are not equal Although each of these compositions is not a single boost each composition is still a Lorentz transformation as it preserves the spacetime interval It turns out the composition of any two Lorentz boosts is equivalent to a boost followed or preceded by a rotation on the spatial coordinates in the form of R r B w or B w R r The w and w are composite velocities while r and r are rotation parameters e g axis angle variables Euler angles etc The rotation in block matrix form is simply R r 100R r displaystyle quad R boldsymbol rho begin bmatrix 1 amp 0 0 amp mathbf R boldsymbol rho end bmatrix where R r is a 3d rotation matrix which rotates any 3d vector in one sense active transformation or equivalently the coordinate frame in the opposite sense passive transformation It is not simple to connect w and r or w and r to the original boost parameters u and v In a composition of boosts the R matrix is named the Wigner rotation and gives rise to the Thomas precession These articles give the explicit formulae for the composite transformation matrices including expressions for w r w r In this article the axis angle representation is used for r The rotation is about an axis in the direction of a unit vector e through angle 8 positive anticlockwise negative clockwise according to the right hand rule The axis angle vector 8 8e displaystyle boldsymbol theta theta mathbf e will serve as a useful abbreviation Spatial rotations alone are also Lorentz transformations since they leave the spacetime interval invariant Like boosts successive rotations about different axes do not commute Unlike boosts the composition of any two rotations is equivalent to a single rotation Some other similarities and differences between the boost and rotation matrices include inverses B v 1 B v relative motion in the opposite direction and R 8 1 R 8 rotation in the opposite sense about the same axis identity transformation for no relative motion rotation B 0 R 0 I unit determinant det B det R 1 This property makes them proper transformations matrix symmetry B is symmetric equals transpose while R is nonsymmetric but orthogonal transpose equals inverse RT R 1 The most general proper Lorentz transformation L v 8 includes a boost and rotation together and is a nonsymmetric matrix As special cases L 0 8 R 8 and L v 0 B v An explicit form of the general Lorentz transformation is cumbersome to write down and will not be given here Nevertheless closed form expressions for the transformation matrices will be given below using group theoretical arguments It will be easier to use the rapidity parametrization for boosts in which case one writes L z 8 and B z The Lie group SO 3 1 The set of transformations B z R 8 L z 8 displaystyle B boldsymbol zeta R boldsymbol theta Lambda boldsymbol zeta boldsymbol theta with matrix multiplication as the operation of composition forms a group called the restricted Lorentz group and is the special indefinite orthogonal group SO 3 1 The plus sign indicates that it preserves the orientation of the temporal dimension For simplicity look at the infinitesimal Lorentz boost in the x direction examining a boost in any other direction or rotation about any axis follows an identical procedure The infinitesimal boost is a small boost away from the identity obtained by the Taylor expansion of the boost matrix to first order about z 0 Bx I z Bx z z 0 displaystyle B x I zeta left frac partial B x partial zeta right zeta 0 cdots where the higher order terms not shown are negligible because z is small and Bx is simply the boost matrix in the x direction The derivative of the matrix is the matrix of derivatives of the entries with respect to the same variable and it is understood the derivatives are found first then evaluated at z 0 Bx z z 0 Kx displaystyle left frac partial B x partial zeta right zeta 0 K x For now Kx is defined by this result its significance will be explained shortly In the limit of an infinite number of infinitely small steps the finite boost transformation in the form of a matrix exponential is obtained Bx limN I zNKx N e zKx displaystyle B x lim N to infty left I frac zeta N K x right N e zeta K x where the limit definition of the exponential has been used see also characterizations of the exponential function More generally B z e z K R 8 e8 J displaystyle B boldsymbol zeta e boldsymbol zeta cdot mathbf K quad R boldsymbol theta e boldsymbol theta cdot mathbf J The axis angle vector 8 and rapidity vector z are altogether six continuous variables which make up the group parameters in this particular representation and the generators of the group are K Kx Ky Kz and J Jx Jy Jz each vectors of matrices with the explicit forms Kx 0100100000000000 Ky 0010000010000000 Kz 0001000000001000 Jx 00000000000 10010 Jy 0000000100000 100 Jz 000000 1001000000 displaystyle begin alignedat 3 K x amp begin bmatrix 0 amp 1 amp 0 amp 0 1 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 0 end bmatrix quad amp K y amp begin bmatrix 0 amp 0 amp 1 amp 0 0 amp 0 amp 0 amp 0 1 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 0 end bmatrix quad amp K z amp begin bmatrix 0 amp 0 amp 0 amp 1 0 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 0 1 amp 0 amp 0 amp 0 end bmatrix 10mu J x amp begin bmatrix 0 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 1 0 amp 0 amp 1 amp 0 end bmatrix quad amp J y amp begin bmatrix 0 amp 0 amp 0 amp 0 0 amp 0 amp 0 amp 1 0 amp 0 amp 0 amp 0 0 amp 1 amp 0 amp 0 end bmatrix quad amp J z amp begin bmatrix 0 amp 0 amp 0 amp 0 0 amp 0 amp 1 amp 0 0 amp 1 amp 0 amp 0 0 amp 0 amp 0 amp 0 end bmatrix end alignedat These are all defined in an analogous way to Kx above although the minus signs in the boost generators are conventional Physically the generators of the Lorentz group correspond to important symmetries in spacetime J are the rotation generators which correspond to angular momentum and K are the boost generators which correspond to the motion of the system in spacetime The derivative of any smooth curve C t with C 0 I in the group depending on some group parameter t with respect to that group parameter evaluated at t 0 serves as a definition of a corresponding group generator G and this reflects an infinitesimal transformation away from the identity The smooth curve can always be taken as an exponential as the exponential will always map G smoothly back into the group via t exp tG for all t this curve will yield G again when differentiated at t 0 Expanding the exponentials in their Taylor series obtains B z I sinh z n K cosh z 1 n K 2 displaystyle B boldsymbol zeta I sinh zeta mathbf n cdot mathbf K cosh zeta 1 mathbf n cdot mathbf K 2 R 8 I sin 8 e J 1 cos 8 e J 2 displaystyle R boldsymbol theta I sin theta mathbf e cdot mathbf J 1 cos theta mathbf e cdot mathbf J 2 which compactly reproduce the boost and rotation matrices as given in the previous section It has been stated that the general proper Lorentz transformation is a product of a boost and rotation At the infinitesimal level the product L I z K I 8 J I 8 J I z K I z K 8 J displaystyle begin aligned Lambda amp I boldsymbol zeta cdot mathbf K cdots I boldsymbol theta cdot mathbf J cdots amp I boldsymbol theta cdot mathbf J cdots I boldsymbol zeta cdot mathbf K cdots amp I boldsymbol zeta cdot mathbf K boldsymbol theta cdot mathbf J cdots end aligned is commutative because only linear terms are required products like 8 J z K and z K 8 J count as higher order terms and are negligible Taking the limit as before leads to the finite transformation in the form of an exponential L z 8 e z K 8 J displaystyle Lambda boldsymbol zeta boldsymbol theta e boldsymbol zeta cdot mathbf K boldsymbol theta cdot mathbf J The converse is also true but the decomposition of a finite general Lorentz transformation into such factors is nontrivial In particular e z K 8 J e z Ke8 J displaystyle e boldsymbol zeta cdot mathbf K boldsymbol theta cdot mathbf J neq e boldsymbol zeta cdot mathbf K e boldsymbol theta cdot mathbf J because the generators do not commute For a description of how to find the factors of a general Lorentz transformation in terms of a boost and a rotation in principle this usually does not yield an intelligible expression in terms of generators J and K see Wigner rotation If on the other hand the decomposition is given in terms of the generators and one wants to find the product in terms of the generators then the Baker Campbell Hausdorff formula applies The Lie algebra so 3 1 Lorentz generators can be added together or multiplied by real numbers to obtain more Lorentz generators In other words the set of all Lorentz generators V z K 8 J displaystyle V boldsymbol zeta cdot mathbf K boldsymbol theta cdot mathbf J together with the operations of ordinary matrix addition and multiplication of a matrix by a number forms a vector space over the real numbers The generators Jx Jy Jz Kx Ky Kz form a basis set of V and the components of the axis angle and rapidity vectors 8x 8y 8z zx zy zz are the coordinates of a Lorentz generator with respect to this basis Three of the commutation relations of the Lorentz generators are Jx Jy Jz Kx Ky Jz Jx Ky Kz displaystyle J x J y J z quad K x K y J z quad J x K y K z where the bracket A B AB BA is known as the commutator and the other relations can be found by taking cyclic permutations of x y z components i e change x to y y to z and z to x repeat These commutation relations and the vector space of generators fulfill the definition of the Lie algebra so 3 1 displaystyle mathfrak so 3 1 In summary a Lie algebra is defined as a vector space V over a field of numbers and with a binary operation called a Lie bracket in this context on the elements of the vector space satisfying the axioms of bilinearity alternatization and the Jacobi identity Here the operation is the commutator which satisfies all of these axioms the vector space is the set of Lorentz generators V as given previously and the field is the set of real numbers Linking terminology used in mathematics and physics A group generator is any element of the Lie algebra A group parameter is a component of a coordinate vector representing an arbitrary element of the Lie algebra with respect to some basis A basis then is a set of generators being a basis of the Lie algebra in the usual vector space sense The exponential map from the Lie algebra to the Lie group exp so 3 1 SO 3 1 displaystyle exp mathfrak so 3 1 to mathrm SO 3 1 provides a one to one correspondence between small enough neighborhoods of the origin of the Lie algebra and neighborhoods of the identity element of the Lie group In the case of the Lorentz group the exponential map is just the matrix exponential Globally the exponential map is not one to one but in the case of the Lorentz group it is surjective onto Hence any group element in the connected component of the identity can be expressed as an exponential of an element of the Lie algebra Improper transformations Lorentz transformations also include parity inversion P 100 I displaystyle P begin bmatrix 1 amp 0 0 amp mathbf I end bmatrix which negates all the spatial coordinates only and time reversal T 100I displaystyle T begin bmatrix 1 amp 0 0 amp mathbf I end bmatrix which negates the time coordinate only because these transformations leave the spacetime interval invariant Here I is the 3d identity matrix These are both symmetric they are their own inverses see involution mathematics and each have determinant 1 This latter property makes them improper transformations If L is a proper orthochronous Lorentz transformation then TL is improper antichronous PL is improper orthochronous and TPL PTL is proper antichronous Inhomogeneous Lorentz group Two other spacetime symmetries have not been accounted for In order for the spacetime interval to be invariant it can be shown that it is necessary and sufficient for the coordinate transformation to be of the form X LX C displaystyle X Lambda X C where C is a constant column containing translations in time and space If C 0 this is an inhomogeneous Lorentz transformation or Poincare transformation If C 0 this is a homogeneous Lorentz transformation Poincare transformations are not dealt further in this article Tensor formulationContravariant vectors Writing the general matrix transformation of coordinates as the matrix equation x 0x 1x 2x 3 L00L01L02L03x 0L10L11L12L13x 0L20L21L22L23x 0L30L31L32L33x 0 x0x 0x1x 0x2x 0x3x 0 displaystyle begin bmatrix x 0 x 1 x 2 x 3 end bmatrix begin bmatrix Lambda 0 0 amp Lambda 0 1 amp Lambda 0 2 amp Lambda 0 3 vphantom x 0 Lambda 1 0 amp Lambda 1 1 amp Lambda 1 2 amp Lambda 1 3 vphantom x 0 Lambda 2 0 amp Lambda 2 1 amp Lambda 2 2 amp Lambda 2 3 vphantom x 0 Lambda 3 0 amp Lambda 3 1 amp Lambda 3 2 amp Lambda 3 3 vphantom x 0 end bmatrix begin bmatrix x 0 vphantom x 0 x 1 vphantom x 0 x 2 vphantom x 0 x 3 vphantom x 0 end bmatrix allows the transformation of other physical quantities that cannot be expressed as four vectors e g tensors or spinors of any order in 4d spacetime to be defined In the corresponding tensor index notation the above matrix expression is x n Lnmxm displaystyle x nu Lambda nu mu x mu where lower and upper indices label covariant and contravariant components respectively and the summation convention is applied It is a standard convention to use Greek indices that take the value 0 for time components and 1 2 3 for space components while Latin indices simply take the values 1 2 3 for spatial components the opposite for Landau and Lifshitz Note that the first index reading left to right corresponds in the matrix notation to a row index The second index corresponds to the column index The transformation matrix is universal for all four vectors not just 4 dimensional spacetime coordinates If A is any four vector then in tensor index notation A n LnmAm displaystyle A nu Lambda nu mu A mu Alternatively one writes An Ln mAm displaystyle A nu Lambda nu mu A mu in which the primed indices denote the indices of A in the primed frame For a general n component object one may write X a P L abXb displaystyle X alpha Pi Lambda alpha beta X beta where P is the appropriate representation of the Lorentz group an n n matrix for every L In this case the indices should not be thought of as spacetime indices sometimes called Lorentz indices and they run from 1 to n E g if X is a bispinor then the indices are called Dirac indices Covariant vectors There are also vector quantities with covariant indices They are generally obtained from their corresponding objects with contravariant indices by the operation of lowering an index e g xn hmnxm displaystyle x nu eta mu nu x mu where h is the metric tensor The linked article also provides more information about what the operation of raising and lowering indices really is mathematically The inverse of this transformation is given by xm hmnxn displaystyle x mu eta mu nu x nu where when viewed as matrices hmn is the inverse of hmn As it happens hmn hmn This is referred to as raising an index To transform a covariant vector Am first raise its index then transform it according to the same rule as for contravariant 4 vectors then finally lower the index A n hrnLrshmsAm displaystyle A nu eta rho nu Lambda rho sigma eta mu sigma A mu But hrnLrshms L 1 mn displaystyle eta rho nu Lambda rho sigma eta mu sigma left Lambda 1 right mu nu That is it is the m n component of the inverse Lorentz transformation One defines as a matter of notation Lnm L 1 mn displaystyle Lambda nu mu equiv left Lambda 1 right mu nu and may in this notation write A n LnmAm displaystyle A nu Lambda nu mu A mu Now for a subtlety The implied summation on the right hand side of A n LnmAm L 1 mnAm displaystyle A nu Lambda nu mu A mu left Lambda 1 right mu nu A mu is running over a row index of the matrix representing L 1 Thus in terms of matrices this transformation should be thought of as the inverse transpose of L acting on the column vector Am That is in pure matrix notation A L 1 TA displaystyle A left Lambda 1 right mathrm T A This means exactly that covariant vectors thought of as column matrices transform according to the dual representation of the standard representation of the Lorentz group This notion generalizes to general representations simply replace L with P L Tensors If A and B are linear operators on vector spaces U and V then a linear operator A B may be defined on the tensor product of U and V denoted U V according to A B u v Au Bv u U v V u v U V displaystyle A otimes B u otimes v Au otimes Bv qquad u in U v in V u otimes v in U otimes V T1 From this it is immediately clear that if u and v are a four vectors in V then u v T2V V V transforms as u v Lu Lv Lmnun Lrsvs LmnLrsun vs LmnLrswns displaystyle u otimes v rightarrow Lambda u otimes Lambda v Lambda mu nu u nu otimes Lambda rho sigma v sigma Lambda mu nu Lambda rho sigma u nu otimes v sigma equiv Lambda mu nu Lambda rho sigma w nu sigma T2 The second step uses the bilinearity of the tensor product and the last step defines a 2 tensor on component form or rather it just renames the tensor u v These observations generalize in an obvious way to more factors and using the fact that a general tensor on a vector space V can be written as a sum of a coefficient component times tensor products of basis vectors and basis covectors one arrives at the transformation law for any tensor quantity T It is given by T8 i k a b z La mLb n Lz rL8 sLi y Lk zTsy zmn r displaystyle T theta iota cdots kappa alpha beta cdots zeta Lambda alpha mu Lambda beta nu cdots Lambda zeta rho Lambda theta sigma Lambda iota upsilon cdots Lambda kappa zeta T sigma upsilon cdots zeta mu nu cdots rho T3 where Lx ps is defined above This form can generally be reduced to the form for general n component objects given above with a single matrix P L operating on column vectors This latter form is sometimes preferred e g for the electromagnetic field tensor Transformation of the electromagnetic field Lorentz boost of an electric charge the charge is at rest in one frame or the other Lorentz transformations can also be used to illustrate that the magnetic field B and electric field E are simply different aspects of the same force the electromagnetic force as a consequence of relative motion between electric charges and observers The fact that the electromagnetic field shows relativistic effects becomes clear by carrying out a simple thought experiment An observer measures a charge at rest in frame F The observer will detect a static electric field As the charge is stationary in this frame there is no electric current so the observer does not observe any magnetic field The other observer in frame F moves at velocity v relative to F and the charge This observer sees a different electric field because the charge moves at velocity v in their rest frame The motion of the charge corresponds to an electric current and thus the observer in frame F also sees a magnetic field The electric and magnetic fields transform differently from space and time but exactly the same way as relativistic angular momentum and the boost vector The electromagnetic field strength tensor is given by Fmn 0 1cEx 1cEy 1cEz1cEx0 BzBy1cEyBz0 Bx1cEz ByBx0 SI units signature displaystyle F mu nu begin bmatrix 0 amp frac 1 c E x amp frac 1 c E y amp frac 1 c E z frac 1 c E x amp 0 amp B z amp B y frac 1 c E y amp B z amp 0 amp B x frac 1 c E z amp B y amp B x amp 0 end bmatrix text SI units signature text in SI units In relativity the Gaussian system of units is often preferred over SI units even in texts whose main choice of units is SI units because in it the electric field E and the magnetic induction B have the same units making the appearance of the electromagnetic field tensor more natural Consider a Lorentz boost in the x direction It is given byLmn g gb00 gbg0000100001 Fmn 0ExEyEz Ex0Bz By Ey Bz0Bx EzBy Bx0 Gaussian units signature displaystyle Lambda mu nu begin bmatrix gamma amp gamma beta amp 0 amp 0 gamma beta amp gamma amp 0 amp 0 0 amp 0 amp 1 amp 0 0 amp 0 amp 0 amp 1 end bmatrix qquad F mu nu begin bmatrix 0 amp E x amp E y amp E z E x amp 0 amp B z amp B y E y amp B z amp 0 amp B x E z amp B y amp B x amp 0 end bmatrix text Gaussian units signature text where the field tensor is displayed side by side for easiest possible reference in the manipulations below The general transformation law T3 becomes Fm n Lm mLn nFmn displaystyle F mu nu Lambda mu mu Lambda nu nu F mu nu For the magnetic field one obtains Bx F2 3 L2mL3nFmn L22L33F23 1 1 Bx Bx By F3 1 L3mL1nFmn L33L1nF3n L33L10F30 L33L11F31 1 bg Ez 1 gBy gBy bgEz g B b E yBz F1 2 L1mL2nFmn L1mL22Fm2 L10L22F02 L11L22F12 gb 1 Ey g 1 Bz gBz bgEy g B b E z displaystyle begin aligned B x amp F 2 3 Lambda 2 mu Lambda 3 nu F mu nu Lambda 2 2 Lambda 3 3 F 23 1 times 1 times B x amp B x B y amp F 3 1 Lambda 3 mu Lambda 1 nu F mu nu Lambda 3 3 Lambda 1 nu F 3 nu Lambda 3 3 Lambda 1 0 F 30 Lambda 3 3 Lambda 1 1 F 31 amp 1 times beta gamma E z 1 times gamma B y gamma B y beta gamma E z amp gamma left mathbf B boldsymbol beta times mathbf E right y B z amp F 1 2 Lambda 1 mu Lambda 2 nu F mu nu Lambda 1 mu Lambda 2 2 F mu 2 Lambda 1 0 Lambda 2 2 F 02 Lambda 1 1 Lambda 2 2 F 12 amp gamma beta times 1 times E y gamma times 1 times B z gamma B z beta gamma E y amp gamma left mathbf B boldsymbol beta times mathbf E right z end aligned

rec-icon Recommended Topics
Share this article
Read the free encyclopedia and learn everything...
See more
Read the free encyclopedia. All information in Wikipedia is available. No payment required.
Share this article on
Share
XXX 0C
Tuesday, 11 February, 2025
Follow Us On