Down the Pigeonhole

The Principle of Least Action states that the true trajectory of a system is that for which the action is stationary.

\delta S = 0\quad \textrm{where}\quad S = \int_{t_1}^{t_2}\mathcal L(q_i, \dot q_i)\mathrm dt

The action itself is a functional, meaning that it depends on the whole of the trajectories $q_i(t)$ . To say that the action is stationary with respect to variations in these trajectories is to say that it behaves as a stationary point with respect to any variation in these trajectories.

Computing the derivative of a functional with respect to a set of functions may seem daunting but we can make the problem more tractable by turning it into a familiar calculus problem. Recall that the action must be stationary with respect to any variation in the trajectories. We're going to pick a very specific kind of variation which will make computing the derivative of $S$ easier.

Let the true trajectories - those which correspond to a stationary point of the action - be denoted $q_i(t)$ . There are many ways to perturb these trajectories and so let's pick one which makes the problem easy:

q_i'(t) = q_i(t) + \epsilon \eta_i(t)

Where $\epsilon$ is just a number and $\eta_i(t)$ are a set of functions which all vanish at the endpoints of the integral. This second condition ensures that the endpoints of the trajectories stay fixed as we perturb them - $q_i(t_1) = q_i'(t_1)$ and $q_i(t_2) = q_i'(t_2)$ . The number $\epsilon$ gives us a way to turn on the perturbation in a smooth way, such that at $\epsilon = 0$ , $q_i'(t) = q_i(t)$ .

I said above that we were going to pick a specific kind of perturbation in order to make the problem easier. While that may have been our initial justification, the expression we've landed on is actually very general. There is enough flexibility in the choice of $\eta_i(t)$ and $\epsilon$ to define any possible perturbation we can think of. What we've really done is to write the perturbation in a convenient way that makes the problem easier.

The statement that the action is stationary for the true trajectories $q_i(t)$ is equivalent to the statement that the action is stationary with respect to $\epsilon$ at $\epsilon = 0$ :

\frac{\mathrm d S}{\mathrm d\epsilon}\bigg\rvert_{\epsilon = 0} = 0

We know how to compute the derivative of a function with respect to a single variable.

Computing the Derivative

Our goal is to compute the derivative:

\frac{\mathrm d S}{\mathrm d\epsilon}\quad \textrm{where}\quad S = \int_{t_1}^{t_2}\mathcal L(q_i', \dot q_i')\mathrm dt

First, since $t_1$ and $t_2$ don't depend on $\epsilon$ (they're just numbers) the derivative can be moved inside the integral.

\frac{\mathrm dS}{\mathrm d\epsilon} = \int_{t_1}^{t_2} \frac{\mathrm d\mathcal L(q_i', \dot q_i')}{\mathrm d\epsilon}\mathrm dt

Using the multivariable chain rule, we can expand the derivative.

\dots = \int_{t_1}^{t_2} \sum_i \left( \frac{\partial \mathcal L}{\partial q_i'}\frac{\mathrm dq_i'}{\mathrm d\epsilon} + \frac{\partial \mathcal L}{\partial \dot q_i'}\frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} \right)\mathrm dt

Where the sum over $i$ has come from the fact that we need to apply the chain rule over all the $q_i$ that $\mathcal L$ depends on. The order of summation (whether infinite or otherwise) doesn't matter and so we can move the summation outside of the integral:

\dots = \sum_i\int_{t_1}^{t_2} \left( \frac{\partial \mathcal L}{\partial q_i'}\frac{\mathrm dq_i'}{\mathrm d\epsilon} + \frac{\partial \mathcal L}{\partial \dot q_i'}\frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} \right)\mathrm dt

Next, recall that,

q_i'(t) = q_i(t) + \epsilon \eta_i(t)

Since $\epsilon$ doesn't depend on $t$ ,

\dot q_i'(t) = \dot q_i(t) + \epsilon \dot\eta_i(t)

And since the unperturbed paths, $q_i$ and $\dot q_i$ , don't depend on $\epsilon$ :

\frac{\mathrm dq_i'}{\mathrm d\epsilon} = \eta_i(t)\qquad \frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} = \dot\eta_i(t)

Substituting back into the integral:

\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\int_{t_1}^{t_2} \left( \frac{\partial \mathcal L}{\partial q_i'}\eta_i(t) + \frac{\partial \mathcal L}{\partial \dot q_i'}\dot\eta_i(t) \right)\mathrm dt = 0

In the left hand term we have an $\eta$ and in the right hand term an $\dot\eta$ . It would be nice if we could turn that $\dot\eta$ into an $\eta$ , since that would allow us to factorise $\eta$ out of the expression all together. There is a simple way to do this which is to integrate the second term by parts (leaving the first term as it is).

\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\left\{\int_{t_1}^{t_2} \frac{\partial \mathcal L}{\partial q_i'}\eta_i(t)\mathrm d t + \left[ \eta_i(t)\frac{\partial \mathcal L}{\partial \dot q_i'} \right]_{t_1}^{t_2} - \int_{t_1}^{t_2}\eta_i(t)\frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\mathrm d t\right\} = 0

Since we defined $\eta_i(t)$ such that $\eta_i(t_1) = \eta_i(t_2) = 0$ , the term in the square brackets simplifies to $0 - 0$ and vanishes. This lets us recombine the integral and factorise out $\eta(t)$ :

\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\int_{t_1}^{t_2} \eta_i(t)\left(\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\right)\mathrm d t = 0

Reducing the Expression

This unweildy expression would generally not be solvable if it were not for one important fact: It is true for all possible choices of $\eta_i(t)$ . This fact can be used in 2 (related) ways to simplify the expression further.

Firstly, we are free to choose the $\eta_i(t)$ such that they all equal zero everywhere, bar one - say at $i=j$ . This is equivalent to saying that we only perturb one of the degrees of freedom of the system and leave the rest at their true values.

Doing this, the sum over $i$ reduces to a single term at $j$ , removing the sum from the expression all together. Since we are free to do this for any choice of $j$ , we can simply say that each individual term of the sum is equal to zero individually.

\int_{t_1}^{t_2} \eta_i(t)\left(\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\right)\mathrm d t = 0\, (\textrm{no sum}) \quad\forall\,i

Where we've now made it clear that we are not summing over the $i$ index. This is a much stricter statement than saying that the sum as a whole equals zero. We were able to make it due to the flexibility afforded to us in our choice of $\eta_i(t)$ .

This flexibility will, in an exactly analagous way, allow us to remove the integral from the expression too. We could always pick $\eta_i(t)$ such that it had a spike at one value of $t$ and was zero everywhere else, therefore (in an infinitesimal sense) isolating a single term in the integral. This is the continuous equivalent of what we did with the discrete sum before, and allows us to write the final Euler-Lagrange equation:

\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'} = 0

A Bit of Pedanticism

Strictly speaking, the equation above is written in terms of the primed (perturbed) trajectories, not the real trajectories which are what we actually want to compute. To remedy this we return to our original statement of the principle of least action.

\frac{\mathrm d S}{\mathrm d\epsilon}\bigg\rvert_{\epsilon = 0} = 0

We notice that our derived Euler-Lagrange equation is strictly only true when $\epsilon = 0$ . Recall that at $\epsilon = 0$ , $q_i'(t) = q_i(t)$ , finally allowing us to write the Euler-Lagrange equation in terms of the real trajectories, $q_i(t)$ :

\frac{\partial \mathcal L}{\partial q_i} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i} = 0

courses

Derivation of the Euler-Lagrange Equation

Computing the Derivative

Reducing the Expression

A Bit of Pedanticism