The Jones and Mueller Matrices
Generally speaking there are three types of devices that will transmit, but modify the polarization state of an incident wave. They are:
- Polarizer: transmits a single polarization state and removes others
- Phase retarder: Introduces a phase difference between wave components
- Rotator: Rotates the direction of linearly polarized light
As discussed above, we will write these operations as matrix transforms such that
⇀E′=↔M⇀E
where ↔M is the Jones matrix,
⇀E is the input electric field vector, and
⇀E′ is the modified electric field vector. The first element we describe is the linear polarizer. The Jones matrix for a linear polarizer aligned with the x-y axes is
⇀E′=(px00py)⇀E
where 0≤pj≤1,
j=x,y. Then, for complete transmission along the x-axis,
py=0 and
px=1, or
(E′xE′y)=(1000)(E0,xE0,y)=(E0,x0)
and similarly for complete transmission along the y-axis, px=0 and
py=1, or
(E′xE′y)=(0001)(E0,xE0,y)=(0E0,y)
The phase retarder produces a total relative phase shift, φ, between the x- and y-components of the input wave. That means we would write
⇀E′=(eiφ/200e−iφ/2)⇀E
or, just to be concrete
(E′xE′y)=(eiφ/200e−iφ/2)(E0,xE0,y)=(E0,xeiφ/2E0,ye−iφ/2)
and
⇀E′=E0,xeiφ/2ˆi+E0,ye−iφ/2ˆj=eiφ/2(E0,xˆi+E0,ye−iφˆj)
where the last equality explicitly shows that Ey is phase advanced from
Ex by
φ.
Quarter Wave Plate
The Jones matrix representing a quarter wave plate is given when φ=π/2 (remember, π/2 is a quarter of the full wave, which is 2π)
↔MQWP=(eiπ/400e−iπ/4)=eiπ/4(100e−iπ/2)=eiπ/4(100−i)
or simply
↔MQWP=eiπ/4(100−i)
Half Wave Plate
The Jones matrix representing a half wave plate is given when φ=π (remember, π is a half of the full wave, which is 2π)
↔MHWP=(eiπ/200e−iπ/2)=eiπ/2(100e−iπ)=i(100−1)
Rotated Wave Retarders
All of the matrices we have derived just now have been aligned with the x- and y-axes.
It turns out that if we want to rotate a vector counter-clockwise (when the axis about which the vector is being rotated is pointing toward you, ˆz in these examples) you simply multiply the vector by the 2D rotation matrix:
↔R(θ)=(cos(θ)−sin(θ)sin(θ)cos(θ))
see the figure below
For instance, if we started with a vector aligned with the x-axis and wanted to rotate it by that in the counter-clockwise direction, we would have
⇀v′=(cos(θ)−sin(θ)sin(θ)cos(θ))⇀v
or, in this particular example
(cos(θ)sin(θ))=(cos(θ)−sin(θ)sin(θ)cos(θ))(10)
writing this in unit vector notation
⇀v′=cos(θ)ˆi+sin(θ)ˆj
which is simply a vector that points anywhere in the plane with magnitude 1.
Now, if we took that same vector and rotated it by NEGATIVE θ, we would get the figure on the left below here
In math, we would write
⇀v′=↔R(−θ)⇀v=(cos(θ)sin(θ)−sin(θ)cos(θ))⇀v
This turns out to be the same thing as rotating the AXES by a positive angle as you can see in the second figure on the right above. So if we want to rotate AXES we multiply by ↔R(−θ).
Now, if we want to 'rotate' a matrix so that the axes of the matrix are no longer aligned with 'x' and 'y', we write
↔M′=↔R(θ)↔M↔R(−θ)
So for instance, if we want to write the Jones matrix for a general phase retarder, which is
↔Mφ=(eiφ/200e−iφ/2)
in some rotated axes. We would write
↔M′φ=↔R(θ)↔Mφ↔R(−θ)=(cos(θ)−sin(θ)sin(θ)cos(θ))(eiφ/200e−iφ/2)(cos(θ)sin(θ)−sin(θ)cos(θ))
multiplying these products out, we get
↔M′φ=(e−iφ2(sin2(θ)+eiφcos2(θ))isin(2θ)sin(φ2)isin(2θ)sin(φ2)e−iφ2(cos2(θ)+eiφsin2(θ)))
and simplified using Euler's form for sine, sin(φ/2)=12i(eiφ/2−e−iφ/2) and factoring out
eiφ/2, we have
↔M′φ=eiφ/2(cos2(θ)+e−iφsin2(θ)(1−e−iφ)sin(2θ)12(1−e−iφ)sin(2θ)12e−iφcos2(θ)+sin2(θ))
or, getting rid of the factors of 1/2 with a trig. identity Links to an external site.
↔M′φ=eiφ/2(cos2(θ)+e−iφsin2(θ)(1−e−iφ)sin(θ)cos(θ)(1−e−iφ)sin(θ)cos(θ)e−iφcos2(θ)+sin2(θ))
Again, we can use this to write the Jones matrix for a quarter wave plate i.e., when φ=π/2
↔M′QWP(θ)=eiπ/4(cos2(θ)+e−iπ/2sin2(θ)(1−e−iπ/2)sin(θ)cos(θ)(1−e−iπ/2)sin(θ)cos(θ)e−iπ/2cos2(θ)+sin2(θ))
simplified using e−iπ/2=−i
↔M′QWP(θ)=eiπ/4(cos2(θ)−isin2(θ)(1+i)sin(θ)cos(θ)(1+i)sin(θ)cos(θ)sin2(θ)−icos2(θ))
Let's test to make sure this returns what we got before when θ=0
↔MQWP(0)=eiπ/4(100e−iπ/2)=eiπ/4(100−i)
which is indeed what we derived before. And when θ=π/2
↔M′QWP(π/2)=eiπ/4(−i001)
We get a quarter-wave plate with the fast axis horizontal, which is the original QWP rotated by 90o. Then the half-wave plate is given when φ=π
↔M′HWP(θ)=eiπ/2(cos2(θ)+e−iπsin2(θ)(1−e−iπ)sin(θ)cos(θ)(1−e−iπ)sin(θ)cos(θ)e−iπcos2(θ)+sin2(θ))
simplified using e−iπ=−1
↔M′HWP(θ)=eiπ/2(cos2(θ)−sin2(θ)2sin(θ)cos(θ)2sin(θ)cos(θ)sin2(θ)−cos2(θ))=eiπ/2(cos(2θ)sin(2θ)sin(2θ)−cos(2θ))
and again, when θ=0 and using
eiπ/2=i, we get
↔M′HWP(0)=i(100−1)
More generally speaking, when we apply a HWP to an arbitrary input beam
↔M′HWP(α)⇀E(θ)=eiπ/2(cos(2α)sin(2α)sin(2α)−cos(2α))(cos(θ)sin(θ))=eiπ/2(cos(2α−θ)sin(2α−θ))
we get a π phase shift of the y-component relative to the x-component of the wave. Also, note that for instance, when θ=0
↔M′HWP(α)⇀E(0)=eiπ/2(cos(2α)sin(2α))
the output polarization angle is rotated by TWICE the angle that the fast axis of the HWP makes with the x-axis. So, if you rotate a HWP by 10o, the output polarization (assuming it wasn't aligned with either the fast or slow axis) will be rotated by 20o.
Finally, applying the same formalism to a polarizer with transmission axis aligned with the x-axis, we get
↔M′LP(θ)=(cos2(θ)pxsin(θ)cos(θ)pxsin(θ)cos(θ)pxsin2(θ)px)
Examples
Now it turns out that the real power of the Jones and Mueller (shown later) matrices is the fact that they can be used in series to model a complicated optical system.
Do you need to model an optical system composed of a linear polarizer with transmission axis +45o to the x-axis, quarter-wave plate with fast axis on the y-axis, and ANOTHER quarter-wave plate with fast axis on the y-axis? If the incident wave is polarized along the x-axis, just multiply those three Jones matrices together in the following way
⇀Eout=eiπ/22(100−i)(100−i)(1111)⇀Ein
where ⇀Ein=(10), then we would have
⇀Eout=eiπ/22(1−1)
Let's break down what this systems is doing element by element. The incident wave is polarized along the x-axis:
⇀Ein=(10)
or
⇀Ein=E0,xˆi+0ˆj
which is just a linearly polarized wave oscillating in the x-direction. The linear polarizer at +45o to the x-axis (matrix) multiplied by this gives
⇀Eout=12(1111)(10)=12(11)
or (aside from scaling factors)
⇀Eout=E0,xˆi+E0,yˆj
which is just a linearly polarized wave oscillating at +45o from the x-axis. So the action of a polarizer rotated +45o from the x-axis on a wave polarized along the x-axis is to transmit half the power but now the electric field oscillates +45o from the x-axis.
Next we'll tackle the action of the two quarter-wave plates with fast axis aligned with the x-axis:
↔MQWP↔MQWP=(100−i)(100−i)=(100−1)
which, seems oddly familiar. Remember that
↔MHWP(0)=i(100−1)
Woah!! So two quarter wave plates act like a half-wave plate??! YES they do! So the action of two consecutive quarter-wave plates acting on a wave polarized +45o to the x-axis should be to rotate the wave so it is now polarized -45o to the x-axis
⇀Eout=↔MQWP↔MQWP⇀Ein=(100−1)(11)=(1−1)
And there you have it
⇀Eout=(1−1)
or
⇀Eout=E0,xˆi−E0,yˆj
which is a wave polarized -45o w.r.t. the x-axis.
A final note on the Jones matrices: Matrix multiplication is non-commutative, that means order matters. Furthermore, the matrices must be multiplied in 'reverse-order' so-to-speak. This just ensure that the input wave 'encounters' the optics in the correct order. For instance, in the example above, the incident beam went through a polarizer, and two quarter wave plates. The order of the matrices was
↔MQWP↔MQWP↔MLP
which when read aloud, is backwards from what first intuition would say. However, we multiply the incident electric field vector on the RIGHT to get the output field, so the order above is correct.
The following is a table containing Jones and Mueller matrices. While we did not derive any of the Mueller matrices, they operate on the Stokes vectors the same way the Jones matrices operate on the Jones vectors and have the advantage that they work for partially polarized light, unlike the Jones matrices. However, they do come at the cost of higher complexity.