Logo
down
the
pigeonhole.
classical
Action Principles and Lagrangian Mechanics

Derivation of the Euler-Lagrange Equation

The Principle of Least Action states that the true trajectory of a system is that for which the action is stationary.

δS=0whereS=t1t2L(qi,q˙i)dt\delta S = 0\quad \textrm{where}\quad S = \int_{t_1}^{t_2}\mathcal L(q_i, \dot q_i)\mathrm dt

The action itself is a functional, meaning that it depends on the whole of the trajectories qi(t)q_i(t). To say that the action is stationary with respect to variations in these trajectories is to say that it behaves as a stationary point with respect to any variation in these trajectories.

Computing the derivative of a functional with respect to a set of functions may seem daunting but we can make the problem more tractable by turning it into a familiar calculus problem. Recall that the action must be stationary with respect to any variation in the trajectories. We're going to pick a very specific kind of variation which will make computing the derivative of SS easier.

Let the true trajectories - those which correspond to a stationary point of the action - be denoted qi(t)q_i(t). There are many ways to perturb these trajectories and so let's pick one which makes the problem easy:

qi(t)=qi(t)+ϵηi(t)q_i'(t) = q_i(t) + \epsilon \eta_i(t)

Where ϵ\epsilon is just a number and ηi(t)\eta_i(t) are a set of functions which all vanish at the endpoints of the integral. This second condition ensures that the endpoints of the trajectories stay fixed as we perturb them - qi(t1)=qi(t1)q_i(t_1) = q_i'(t_1) and qi(t2)=qi(t2)q_i(t_2) = q_i'(t_2). The number ϵ\epsilon gives us a way to turn on the perturbation in a smooth way, such that at ϵ=0\epsilon = 0, qi(t)=qi(t)q_i'(t) = q_i(t).

I said above that we were going to pick a specific kind of perturbation in order to make the problem easier. While that may have been our initial justification, the expression we've landed on is actually very general. There is enough flexibility in the choice of ηi(t)\eta_i(t) and ϵ\epsilon to define any possible perturbation we can think of. What we've really done is to write the perturbation in a convenient way that makes the problem easier.

The statement that the action is stationary for the true trajectories qi(t)q_i(t) is equivalent to the statement that the action is stationary with respect to ϵ\epsilon at ϵ=0\epsilon = 0:

dSdϵϵ=0=0\frac{\mathrm d S}{\mathrm d\epsilon}\bigg\rvert_{\epsilon = 0} = 0

We know how to compute the derivative of a function with respect to a single variable.

Computing the Derivative

Our goal is to compute the derivative:

dSdϵwhereS=t1t2L(qi,q˙i)dt\frac{\mathrm d S}{\mathrm d\epsilon}\quad \textrm{where}\quad S = \int_{t_1}^{t_2}\mathcal L(q_i', \dot q_i')\mathrm dt

First, since t1t_1 and t2t_2 don't depend on ϵ\epsilon (they're just numbers) the derivative can be moved inside the integral.

dSdϵ=t1t2dL(qi,q˙i)dϵdt\frac{\mathrm dS}{\mathrm d\epsilon} = \int_{t_1}^{t_2} \frac{\mathrm d\mathcal L(q_i', \dot q_i')}{\mathrm d\epsilon}\mathrm dt

Using the multivariable chain rule, we can expand the derivative.

=t1t2i(Lqidqidϵ+Lq˙idq˙idϵ)dt\dots = \int_{t_1}^{t_2} \sum_i \left( \frac{\partial \mathcal L}{\partial q_i'}\frac{\mathrm dq_i'}{\mathrm d\epsilon} + \frac{\partial \mathcal L}{\partial \dot q_i'}\frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} \right)\mathrm dt

Where the sum over ii has come from the fact that we need to apply the chain rule over all the qiq_i that L\mathcal L depends on. The order of summation (whether infinite or otherwise) doesn't matter and so we can move the summation outside of the integral:

=it1t2(Lqidqidϵ+Lq˙idq˙idϵ)dt\dots = \sum_i\int_{t_1}^{t_2} \left( \frac{\partial \mathcal L}{\partial q_i'}\frac{\mathrm dq_i'}{\mathrm d\epsilon} + \frac{\partial \mathcal L}{\partial \dot q_i'}\frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} \right)\mathrm dt

Next, recall that,

qi(t)=qi(t)+ϵηi(t)q_i'(t) = q_i(t) + \epsilon \eta_i(t)

Since ϵ\epsilon doesn't depend on tt,

q˙i(t)=q˙i(t)+ϵη˙i(t)\dot q_i'(t) = \dot q_i(t) + \epsilon \dot\eta_i(t)

And since the unperturbed paths, qiq_i and q˙i\dot q_i, don't depend on ϵ\epsilon:

dqidϵ=ηi(t)dq˙idϵ=η˙i(t)\frac{\mathrm dq_i'}{\mathrm d\epsilon} = \eta_i(t)\qquad \frac{\mathrm d\dot q_i'}{\mathrm d\epsilon} = \dot\eta_i(t)

Substituting back into the integral:

dSdϵ=it1t2(Lqiηi(t)+Lq˙iη˙i(t))dt=0\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\int_{t_1}^{t_2} \left( \frac{\partial \mathcal L}{\partial q_i'}\eta_i(t) + \frac{\partial \mathcal L}{\partial \dot q_i'}\dot\eta_i(t) \right)\mathrm dt = 0

In the left hand term we have an η\eta and in the right hand term an η˙\dot\eta. It would be nice if we could turn that η˙\dot\eta into an η\eta, since that would allow us to factorise η\eta out of the expression all together. There is a simple way to do this which is to integrate the second term by parts (leaving the first term as it is).

dSdϵ=i{t1t2Lqiηi(t)dt+[ηi(t)Lq˙i]t1t2t1t2ηi(t)ddtLq˙idt}=0\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\left\{\int_{t_1}^{t_2} \frac{\partial \mathcal L}{\partial q_i'}\eta_i(t)\mathrm d t + \left[ \eta_i(t)\frac{\partial \mathcal L}{\partial \dot q_i'} \right]_{t_1}^{t_2} - \int_{t_1}^{t_2}\eta_i(t)\frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\mathrm d t\right\} = 0

Since we defined ηi(t)\eta_i(t) such that ηi(t1)=ηi(t2)=0\eta_i(t_1) = \eta_i(t_2) = 0, the term in the square brackets simplifies to 000 - 0 and vanishes. This lets us recombine the integral and factorise out η(t)\eta(t):

dSdϵ=it1t2ηi(t)(LqiddtLq˙i)dt=0\frac{\mathrm dS}{\mathrm d\epsilon} = \sum_i\int_{t_1}^{t_2} \eta_i(t)\left(\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\right)\mathrm d t = 0

Reducing the Expression

This unweildy expression would generally not be solvable if it were not for one important fact: It is true for all possible choices of ηi(t)\eta_i(t). This fact can be used in 2 (related) ways to simplify the expression further.

Firstly, we are free to choose the ηi(t)\eta_i(t) such that they all equal zero everywhere, bar one - say at i=ji=j. This is equivalent to saying that we only perturb one of the degrees of freedom of the system and leave the rest at their true values.

Doing this, the sum over ii reduces to a single term at jj, removing the sum from the expression all together. Since we are free to do this for any choice of jj, we can simply say that each individual term of the sum is equal to zero individually.

t1t2ηi(t)(LqiddtLq˙i)dt=0(no sum)i\int_{t_1}^{t_2} \eta_i(t)\left(\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'}\right)\mathrm d t = 0\, (\textrm{no sum}) \quad\forall\,i

Where we've now made it clear that we are not summing over the ii index. This is a much stricter statement than saying that the sum as a whole equals zero. We were able to make it due to the flexibility afforded to us in our choice of ηi(t)\eta_i(t).

This flexibility will, in an exactly analagous way, allow us to remove the integral from the expression too. We could always pick ηi(t)\eta_i(t) such that it had a spike at one value of tt and was zero everywhere else, therefore (in an infinitesimal sense) isolating a single term in the integral. This is the continuous equivalent of what we did with the discrete sum before, and allows us to write the final Euler-Lagrange equation:

LqiddtLq˙i=0\frac{\partial \mathcal L}{\partial q_i'} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i'} = 0

A Bit of Pedanticism

Strictly speaking, the equation above is written in terms of the primed (perturbed) trajectories, not the real trajectories which are what we actually want to compute. To remedy this we return to our original statement of the principle of least action.

dSdϵϵ=0=0\frac{\mathrm d S}{\mathrm d\epsilon}\bigg\rvert_{\epsilon = 0} = 0

We notice that our derived Euler-Lagrange equation is strictly only true when ϵ=0\epsilon = 0. Recall that at ϵ=0\epsilon = 0, qi(t)=qi(t)q_i'(t) = q_i(t), finally allowing us to write the Euler-Lagrange equation in terms of the real trajectories, qi(t)q_i(t):

LqiddtLq˙i=0\frac{\partial \mathcal L}{\partial q_i} - \frac{\mathrm d}{\mathrm d t}\frac{\partial \mathcal L}{\partial \dot q_i} = 0