Unizor - Creative Mind through Art of Mathematics

Thursday, November 20, 2025

Physics+ Lagrangian in N Degrees of Freedom: UNIZOR.COM - Physics+ 4 All...

Notes to a video lecture on UNIZOR.COM

Lagrangian
for N-dimensional Systems

Background

In one of the previous lectures we considered a point-mass object oscillating on a spring in an empty one-dimensional space with Cartesian coordinates (one degree of freedom).

In that case we defined a Lagrangian of an object as a difference between its kinetic and potential energies and built a theory equivalent to Newtonian but based on the Euler-Lagrange equation instead of the Newton's Second Law.

In another lecture we discussed a spring pendulum (two degrees of freedom) and used non-Cartesian parameters to define a state of a system - an angle of pendulum from a vertical and a spring's length.
It was possible but rather complicated to use the Newtonian approach, so we applied Lagrangian Mechanics to come up with the differential equations to describe a state of this system.

An important reason for using Lagrangian Mechanics with energies instead of vectors of force and accelerations in that second example was that energies are scalars, while forces and accelerations are vectors.
Manipulations with scalars are simpler, especially dealing with complex systems with more than one degree of freedom.

Obviously, no matter how we solve a problem, the calculated actual physical trajectories of objects in space must be the same.

Before proceeding any further, we strongly recommend to refresh your knowledge about conservative forces, independence of work performed by these forces from a trajectory of an object moved by them, a concept of a field and its potential.
The chapter Laws of Newton of this course is a good source of this information.

Recall that conservative forces are defined as those that depend only on position in space. Their work, performed by moving an object from one position to another, does not depend on trajectory or speed along this trajectory, but depends only on the beginning and ending positions of an object.

From the Energy Conservation Law follows that the work performed by a conservative force that moves an object changes the potential energy of this object by the amount of work performed.

Consider an isolated closed system of N point-mass objects acting on each other with conservative forces and no forces from outside of this system.
The force on the i^th object is
F_i = {F_ix,F_iy,F_iz}.
If this force moved this object by an infinitesimal increment
dr_i = {dx_i,dy_i,dz_i},
it performed some infinitesimal amount of work
dW_i = (F_i · dr_i) =
= F_ix·dx_i+F_iy·dy_i+F_iz·dz_i

From the Energy Conservation Law follows that potential energy of this object U_i should diminish by this amount of work
dU_i = −dW_i

When all N objects are moved by corresponding forces, the total increment of potential energy of an entire system is
dU = −Σ_i∈[1,N]dW_i =
= −Σ_i∈[1,N](F_i · dr_i) =
= −Σ_i∈[1,N](F_ix·dx_i+F_iy·dy_i+F_iz·dz_i)

From this equation follows the relationship between an increment of a total potential energy of an entire system and each individual force
F_ix = −∂U/∂x_i
F_iy = −∂U/∂y_i
F_iz = −∂U/∂z_i

Using vectors and operator
∇_i = {∂/∂x_i,∂/∂y_i,∂/∂z_i}
above equations can be expressed as
F_i = −∇_iU
for all i∈[1,N]

Lagrangian Mechanics was invented to simplify analysis of complex systems acted upon by conservative forces, like electrostatic, gravitational or spring forces.

We are going to prove that Lagrangian Mechanics of a closed (no external forces) mechanical system with its components acted among themselves with some conservative forces produces differential equations of motion that are equivalent to Newtonian equations, but easier to deal with.

Consider a system that consists of N point-masses with each i^th component Ω_i acted upon by conservative forces from all components within this system.
The time-dependent Cartesian coordinates of each Ω_i are {x_i(t),y_i(t),z_i(t)}, which we will denote as a vector r_i(t).

Let a combined conservative force acting on component Ω_i that depends on positions of all components of a system be
F_i(r₁,...r_N)

The Newton's Second Law of motion for each object in a system is, therefore,
F_i(r₁(t),...r_N(t)) = m_i·r"_i(t)
where each F_i is a vector of three components {F_ix,F_iy,F_iz}, and each component is a function of 3N coordinates of all objects in a system,
mass m_i is a mass of Ω_i and
a symbol " means the second derivative of position vector r_i(t) of object Ω_i by time, that is, its vector of acceleration {x"_i(t),y"_i(t),z"_i(t)}.

In coordinate form the above equation can be written as
F_ix(r₁(t),...r_N(t)) = m_i·x"_i(t)
F_iy(r₁(t),...r_N(t)) = m_i·y"_i(t)
F_iz(r₁(t),...r_N(t)) = m_i·z"_i(t)

In summary, we have 3N differential equations of 2-nd order, three for each component of a system of N objects.

In case of multiple objects in three-dimensional space exerting forces on each other (like all the planets of our Solar system or a nucleus with all its electrons in an atom) the vectors of forces are directed in different directions and the system of differential equations based on Newton's Second law is extremely complex.

Let's approach it differently.
Since all forces F_i are conservative, each one can be represented as
F_i(r₁(t),...r_N(t)) =
= −∇_iU(r₁(t),...r_N(t))
where U(r₁(t),...r_N(t)) is a total potential energy of an entire system and
∇_i is a vector of its partial derivatives by each coordinate of object Ω_i.
∇_i = {∂/∂x_i,∂/∂y_i,∂/∂z_i}.

Let's shorten for convenience the results above as
F_i = −∇_iU
and rewrite it in (x,y,z) components of Cartesian coordinates
F_ix = −∂U/∂x_i
F_iy = −∂U/∂y_i
F_iz = −∂U/∂z_i

As we see, knowing the potential energy of a system of objects is sufficient to know all the conservative forces acted on individual objects in this system.

The Newton's Second Law equation for object Ω_i in vector form is
F_i(r₁(t),...r_N(t)) = m_i·r"_i(t)
which in coordinate form would be
F_ix(x₁(t),...z_N(t)) = m_i·x"_i(t)
F_iy(x₁(t),...z_N(t)) = m_i·y"_i(t)
F_iz(x₁(t),...z_N(t)) = m_i·z"_i(t)
Using the potential energy, the same equations for object Ω_i would be
−∂U /∂x_i = m_i·x"_i(t)
−∂U /∂y_i = m_i·y"_i(t)
−∂U /∂z_i = m_i·z"_i(t)
where U=U(x₁(t),...z_N(t)) is a total potential energy of an entire system.

Let's address the m_i·r"_i(t) side of the Newton's Second Law and derive its value from the kinetic energy of an object Ω_i.
We express a vector r"_i(t) in (x,y,z) coordinates as
(x"_i(t),y"_i(t),z"_i(t))

Kinetic energy of Ω_i equals to K_i(t)=½m_i·v_i(t)² where v_i is a linear speed along a trajectory
v_i(t)² = r'_i(t)·r'_i(t) =
= x'_i(t)²+y'_i(t)²+z'_i(t)²

Now we can express the components of an acceleration vector r"_i(t) of Ω_i in terms of its kinetic energy
K_i(t)=½m_i·[x'_i(t)²+y'_i(t)²+z'_i(t)²]

Partial derivatives of kinetic energy by each coordinate of velocity vector produces coordinates of an object's momentum
∂K_i/∂x'_i = m_i·x'_i
∂K_i/∂y'_i = m_i·y'_i
∂K_i/∂z'_i = m_i·z'_i
Derivative by time of a momentum gives the right side of the Newton's Second Law d/dt ∂K_i/∂x'_i = d/dt m_i·x'_i =
= m_i·x"_i
d/dt ∂K_i/∂y'_i = d/dt m_i·y'_i =
= m_i·y"_i
d/dt ∂K_i/∂z'_i = d/dt m_i·z'_i =
= m_i·z"_i

We are ready to express the Newton's Second Law in terms of kinetic and potential energy.

From
−∂U /∂x_i = m_i·x"_i
and
d/dt ∂K_i/∂x'_i = m_i·x"_i
follows
−∂U/∂x_i = d/dt ∂K_i/∂x'_i
Similarly,
−∂U /∂y_i = d/dt ∂K_i/∂y'_i
−∂U/∂z_i = d/dt ∂K_i/∂z'_i

The total kinetic energy of a system is a sum of kinetic energies of its components
K = K₁+...+K_N
Since each K_i depends only on a velocity of the i^th object {x'_i(t),y'_i(t),z'_i(t)}, partial derivative of K_i by x'_i(t), y'_i(t) or by z'_i(t) is the same as partial derivatives of an entire kinetic energy of a system K by the same components x'_i(t), y'_i(t) or z'_i(t) of the velocity of the i^th object
∂K_i/∂x'_i = ∂K/∂x'_i
∂K_i/∂y'_i = ∂K/∂y'_i
∂K_i/∂z'_i = ∂K/∂z'_i

Using total kinetic energy of a system, the formulas describing the laws of motion of object Ω_i would be
−∂U/∂x_i = d/dt ∂K/∂x'_i
−∂U/∂y_i = d/dt ∂K/∂y'_i
−∂U/∂z_i = d/dt ∂K/∂z'_i
Now the only participants in these equations are the total kinetic and potential energies of an entire system - just two numbers that depend on positions and velocities of system's components.

To make the theory more elegant, let's introduce a Lagrangian L=K−U that equals to a difference between kinetic and potential energy of this system.

Since kinetic energy of a system K is independent of positions of its components {x_i(t),y_i(t),z_i(t)},
−∂U/∂x_i=∂(K−U)/∂x_i=∂L/∂x_i
and similar with partial derivatives by y_i and z_i.

Since potential energy of a system U is independent on velocities of its components,
∂K/∂x'_i=∂(K−U)/∂x'_i=∂L/∂x'_i
and similar with partial derivatives by y_i and z_i.

Therefore, our equations look even simpler
∂L/∂x_i = d/dt ∂L/∂x'_i
∂L/∂y_i = d/dt ∂L/∂y'_i
∂L/∂z_i = d/dt ∂L/∂z'_i

The equations above are also differential equations of the second order, like with Newton's Second Law.
There are also 3N of these equations (three coordinates for N objects in a system).
But there is only one number, Lagrangian, also called action, a function of all positions and velocities, to deal with for all objects instead of individual forces for each object.
Lagrangian Mechanics allows to deal with complex system in a simpler way.

As the cherry on top, consider the same equations in the form
d/dt ∂L/∂x'_i − ∂L/∂x_i = 0
d/dt ∂L/∂y'_i − ∂L/∂y_i = 0
d/dt ∂L/∂z'_i − ∂L/∂z_i = 0
In these equations Lagrangian L depends on 3N parameters, positions and velocities of N components of our system, which are, in turn, time-dependent.
So, ultimately, Lagrangian L is a function of time.

On one hand, these are equations that define a motion of a system, that is they define a trajectory of a system changing positions and velocities of its components from one moment in time to another.

On another hand, as we discussed in the Variations chapter of this course, they produce a function of time L(t) that brings to extremum (usually, minimum) the action functional
S = ∫_[t₁,t₂] L(t)·dt

Therefore, we can say that a mechanical system with only conservative forces present changes its state along an N-dimensional trajectory that minimizes the action functional above.

Sunday, September 21, 2025

Physics+ N-dimensional Lagrangian: UNIZOR.COM - Physics+ 4 All - Lagrangian

Notes to a video lecture on UNIZOR.COM

Lagrangian
for N-dimensional Systems

In one of the previous lectures we considered a point-mass object oscillating on a spring in an empty one-dimensional space with Cartesian coordinates (one degree of freedom).

In that case we defined a Lagrangian of an object as a difference between its kinetic and potential energies and built a theory equivalent to Newtonian but based on the Euler-Lagrange equation instead of the Newton's Second Law.

In another lecture we discussed a spring pendulum (two degrees of freedom) and used non-Cartesian parameters to define a state of a system - an angle of pendulum from a vertical and a spring's length.
It was possible but rather complicated to use the Newtonian approach, so we applied Lagrangian Mechanics to come up with the differential equations to describe a state of this system.

An important reason for using energies instead of vectors of force and acceleration in that second example was that energies are scalars, while forces and accelerations are vectors.
Manipulations with scalars are simpler, especially dealing with complex systems with more than one degree of freedom.

Obviously, no matter how we solve a problem, the calculated actual physical trajectories of objects in space must be the same.

Before proceeding any further, we strongly recommend to refresh your knowledge about conservative forces, independence of work performed by these forces from a trajectory of an object moved by them, a concept of a field and its potential.
The chapter Laws of Newton of this course is a good source of this information.

Recall that conservative forces are defined as those that depend only on position in space, independent of time. Their work, performed by moving an object from one position to another, does not depend on trajectory or speed along this trajectory, but depends only on the beginning and ending positions of an object.

From the Energy Conservation Law follows that the work performed by a conservative force that moves an object changes the potential energy of this object by the amount of work performed.

Recall that an increment of potential energy
ΔE_pot=E_pot(P₂)−E_pot(P₁)
of an object moved by a conservative force F (a force of a field, a force of a spring etc.) from position P₁ to P₂ along any trajectory equals to the work performed by this conservative force.
ΔE_pot = ∫_[P₁~P₂]F(P)·dr
where P is a variable position of an object moving along a trajectory from P₁ to P₂,
r=OP is a vector from the origin of coordinates O to a position of an object P as it moves along a trajectory,
multiplication F(P)·dr is a scalar product of two vectors - a vector of force and an infinitesimal vector of increment of a position of an object along its trajectory and
[P₁~P₂] denotes that an integral is taken along a trajectory from P₁ to P₂.

The same conservative force (a force of a field, a force of a spring etc.) can be represented as a vector of gradient of a potential
F = −∇E_pot
where a minus sign '−' indicates that the conservative force (a force of a field, a force of a spring etc.) is always directed towards decreasing of potential.

Lagrangian Mechanics was invented to simplify analysis of complex systems acted upon by conservative forces, like electrostatic, gravitational or spring forces.

We are going to prove that Lagrangian Mechanics of a closed (no external forces) mechanical system with its components acted among themselves with some conservative forces produces differential equations of motion that are equivalent to Newtonian equations, but easier to deal with.

Consider a system that consists of N point-masses with each i^th component Ω_i acted upon by conservative forces (three-dimensional vectors) from all components of this system with a combined force
F_i(r₁,...r_N)
that depends on positions of all components of a system.

The time-dependent Cartesian coordinates of each Ω_i are {x_i(t),y_i(t),z_i(t)}, which we will denote as a vector r_i(t).

The Newton's Second Law of motion for each object in a system is, therefore,
F_i(r₁(t),...r_N(t)) = m_i·r"_i(t)
where each F_i is a vector of three components {F_ix,F_iy,F_iz}, and each component is a function of 3N coordinates of all objects in a system,
mass m_i is a mass of Ω_i and
a symbol " means the second derivative of position vector r_i(t) of object Ω_i by time, that is, its vector of acceleration {x"_i(t),y"_i(t),z"_i(t)}.

In coordinate form the above equation can be written as
F_ix(r₁(t),...r_N(t)) = m_i·x"_i(t)
F_iy(r₁(t),...r_N(t)) = m_i·y"_i(t)
F_iz(r₁(t),...r_N(t)) = m_i·z"_i(t)

In summary, we have 3N differential equations of 2-nd order, three for each component of a system of N objects.

In case of multiple objects in three-dimensional space exerting forces on each other (like all the planets of our Solar system or a nucleus with all its electrons in an atom) the vectors of forces are directed in different directions and the system of differential equations based on Newton's Second law is extremely complex.

Let's approach it differently.
Since all forces F_i are conservative, each one can be represented as
F_i(r₁(t),...r_N(t)) =
= −∇_iU(r₁(t),...r_N(t))
where U(r₁(t),...r_N(t)) is a total potential of an entire system and
∇_i is a vector of its partial derivatives by each coordinate of object Ω_i.
∇_i = {∂/∂x_i,∂/∂y_i,∂/∂z_i}.

Let's shorten for convenience the results above as
F_i = −∇_iU
and rewrite it in (x,y,z) components of Cartesian coordinates
F_ix = −∂U/∂x_i
F_iy = −∂U/∂y_i
F_iz = −∂U/∂z_i

As we see, knowing the potential of a system of objects at each point is sufficient to know all the conservative forces acted on individual objects in this system.

Let's address the m_i·r"_i(t) side of the Newton's Second Law and derive its value from the kinetic energy of an object Ω_i.
We express a vector r"_i(t) in (x,y,z) coordinates as
(x"_i(t),y"_i(t),z"_i(t))

The Newton's Second Law equations for object Ω_i in coordinate form would then be
F_ix = m_i·x"_i(t)
F_iy = m_i·y"_i(t)
F_iz = m_i·z"_i(t)
Using the potential energy, the same equations for object Ω_i would be
−∂U /∂x = m_i·x"_i(t)
−∂U /∂y = m_i·y"_i(t)
−∂U /∂z = m_i·z"_i(t)

Kinetic energy of Ω_i equals to K_i(t)=½m_i·v_i(t)² where v_i is a linear speed along a trajectory
v_i(t)² = r'_i(t)·r'_i(t) =
= x'_i(t)²+y'_i(t)²+z'_i(t)²

Now we can express the components of an acceleration vector r"_i(t) of Ω_i in terms of its kinetic energy
K_i(t)=½m_i·[x'_i(t)²+y'_i(t)²+z'_i(t)²]

Partial derivatives of kinetic energy by each coordinate of velocity vector produces coordinates of an object's momentum
∂K_i/∂x'_i = m·x'_i
∂K_i/∂y'_i = m·y'_i
∂K_i/∂z'_i = m·z'_i
Derivative by time of a momentum gives the right side of the Newton's Second Law d/dt ∂K_i/∂x'_i=d/dt m·x'_i=m·x"_i
d/dt ∂K_i/∂y'_i=d/dt m·y'_i=m·y"_i
d/dt ∂K_i/∂z'_i=d/dt m·z'_i=m·z"_i

We are ready to express the Newton's Second Law in terms of kinetic and potential energy.
From
F_ix = −∂U/∂x_i
and
d/dt ∂K_i/∂x'_i = m·x"_i
follows
−∂U/∂x_i = d/dt ∂K_i/∂x'_i
Similarly,
−∂U /∂y_i = d/dt ∂K_i/∂y'_i
−∂U/∂z_i = d/dt ∂K_i/∂z'_i

The total kinetic energy of a system is a sum of kinetic energies of its components
K = K₁+...+K_N
Since each K_i depends only on a velocity of the i^th object {x'_i(t),y'_i(t),z'_i(t)}, partial derivative of K_i by x'_i(t), y'_i(t) or by z'_i(t) is the same as partial derivatives of an entire kinetic energy of a system K by the same components x'_i(t), y'_i(t) or z'_i(t) of the velocity of the i^th object
∂K_i/∂x'_i = ∂K/∂x'_i
∂K_i/∂y'_i = ∂K/∂y'_i
∂K_i/∂z'_i = ∂K/∂z'_i

Using total kinetic energy of a system, the formulas describing the laws of motion of object Ω_i would be
−∂U/∂x_i = d/dt ∂K/∂x'_i
−∂U/∂y_i = d/dt ∂K/∂y'_i
−∂U/∂z_i = d/dt ∂K/∂z'_i
Now the only participants in these equations are the total kinetic and potential energies of an entire system - just two numbers that depend on positions and velocities of system's components.

To make the theory more elegant, let's introduce a Lagrangian L=K−U that equals to a difference between kinetic and potential energy of this system.

Since kinetic energy of a system K is independent of positions of its components {x_i(t),y_i(t),z_i(t)},
−∂U/∂x_i=∂(K−U)/∂x_i=∂L/∂x_i
and similar with partial derivatives by y_i and z_i.

Since potential energy of a system U is independent on velocities of its components,
∂K/∂x'_i=∂(K−U)/∂x'_i=∂L/∂x'_i
and similar with partial derivatives by y_i and z_i.

Therefore, our equations look even simpler
∂L/∂x_i = d/dt ∂L/∂x'_i
∂L/∂y_i = d/dt ∂L/∂y'_i
∂L/∂z_i = d/dt ∂L/∂z'_i

The equations above are also differential equations of the second order, like withNewton's Second Law.
There are also 3N of these equations (three coordinates for N objects in a system).
But there is only one number, Lagrangian, also called action, a function of all positions and velocities, to deal with for all objects instead of individual forces for each object.
Lagrangian Mechanics allows to deal with complex system in a simpler way.

As the cherry on top, consider the same equations in the form
d/dt ∂L/∂x'_i − ∂L/∂x_i = 0
d/dt ∂L/∂y'_i − ∂L/∂y_i = 0
d/dt ∂L/∂z'_i − ∂L/∂z_i = 0
In these equations Lagrangian L depends on 3N parameters, positions and velocities of N components of our system, which are, in turn, are time-dependent.
So, ultimately, Lagrangian L is a function of time.

On one hand, these are equations that define a motion of a system, that is they define a trajectory of a system changing positions and velocities of its components from one moment in time to another.

On another hand, as we discussed in the Variations chapter of this course, they produce a function of time L(t) that brings to extremum (usually, minimum) the action functional
S = ∫_[t₁,t₂] L(t)·dt

Therefore, we can say that a mechanical system with only conservative forces present changes its state along an N-dimensional trajectory that minimizes the action functional above.

Saturday, August 23, 2025

Physics+ Pendulum in Lagrangian Mechanics: UNIZOR.COM - Physics+ 4 All -...

Notes to a video lecture on UNIZOR.COM

Mathematical Pendulum

Plain Pendulum

We will illustrate the application of Lagrangian mechanics by analyzing the movement of a mathematical pendulum - a problem we have already discussed in the Physics 4 Teens - Mechanics - Pendulum, Spring - Pendulum using the Newtonian approach.

We recommend reviewing the lecture mentioned above and refresh the Newtonian method of deriving the main equation of motion of the pendulum:
α"(t) = −(g/l)·sin(α(t))
which was obtained from properly determining the force F that moves a pendulum as a vector sum of the gravity force directed vertically down and the tension of an unstretchable thread that keeps an object at the free end of a thread on a constant distance from the fixed end of a thread.

The two forces involved in formation of a resulting force F, gravity P=m·g and tension of a thread T, had to be combined using the rules for addition of vectors, which required some thinking.

Let's apply the Lagrangian mechanics to this problem using an angle of a thread with a vertical α as the one and only parameter that determines a position of an object.
This way of identification of a position is more convenient than Cartesian coordinates originated at the fixed end of a thread because both of them can be easily derived from α
x = l·sin(α)
y = −l·cos(α)

The Lagrangian is the difference between kinetic and potential energies.
L(α(t),α'(t)) =
= E_kin(α'(t)) − E_pot(α(t))

Kinetic energy depends on mass M and linear speed of an object along its circular trajectory v=l·α'(t)
E_kin = ½M·v² =
= ½M·l²·[α'(t)]²

Potential energy depends on a mass of an object M, its height over the ground h(t) and an acceleration of free fall g.
E_pot(t) = M·g·h(t)
If the origin of our coordinates, the fixed end of a thread, is at height H over the ground,
h = H − l·cos(α)
and, therefore,
E_pot(t) = M·g·[H−l·cos(α(t))]

Now we can construct the Euler-Lagrange equation
(∂/∂α)L(α(t),α'(t)) =
= (d/dt)(∂/∂α')L(α(t),α'(t))

Calculate left and right sides separately.
(∂/∂α)L(α(t),α'(t)) =
= (∂/∂α)[E_kin(α'(t)) − E_pot(α(t))] =
= (∂/∂α)[−E_pot(α(t))] =
= (∂/∂α)[−M·g·[H−l·cos(α(t))]] =
= −M·g·l·sin(α(t))

(d/dt)(∂/∂α')L(α(t),α'(t)) =
=(d/dt)(∂/∂α')[E_kin(α'(t))−E_pot(α(t))]=
= (d/dt)(∂/∂α')E_kin(α'(t)) =
= (d/dt)(∂/∂α')½M·l²·[α'(t)]² =
= (d/dt)M·l²·α'(t) =
= M·l²·α"(t)

The Euler-Lagrange equation is
−M·g·l·sin(α(t)) = M·l²·α"(t)
or
α"(t) = −(g/l)·sin(α(t))
which is exactly as applying Newtonian mechanics.
If you don't think the Lagrangian mechanics is simpler than Newtonian for those who are familiar with Calculus of partial derivatives, consider the next example.

Spring Pendulum

A weightless spring replaces an unstretchable thread of the previous problem.
The spring and an object on its end are in a weightless frictionless tube that maintains a straight form, so an object has two degrees of freedom - radial inside a tube stretching and squeezing a spring and pseudo-circular as it moves together with a tube in a pendulum like motion.

The problem of specifying the motion of an object is much more complex here because the spring tension is changing not only with an angle α(t) but also because of the movement of an object within a tube.

However, using the Langrangian mechanics, this problem can be analyzed with much less efforts and the corresponding differential equation can be constructed relatively easy.

As before, let's calculate the kinetic and potential energies of an object.

The object's kinetic energy can be calculated as a sum of its radial movement's kinetic energy inside the tube and kinetic energy of its pseudo-circular movement perpendicularly to the tube.
The reason is simple. The object's linear velocity vector can be represented as a sum of two perpendicular to each other vectors, one is inside the tube and another perpendicular to it.
v = v_|| + v_⊥
Since kinetic energy depends on a square of the linear speed, according to Pythagorean Theorem
v² = v_||² + v_⊥²
from which follows that
E_kin = ½M·v² =
= ½M·v_||² + ½M·v_⊥²

Distance of an object from the fixed point of oscillation l is variable and depends on time: l=l(t).
Therefore, v_||(t)=l'(t).
Perpendicular to l component of an object speed is v_⊥(t)=l(t)·α'(t).
Therefore, kinetic energy of an object is
E_kin = ½M·[l'(t)²+l(t)²·α'(t)²]

The object's potential energy can be calculated as a sum of its potential energy due to gravity and potential energy of a stretched or a squeezed spring.
If the fixed point of oscillation is at the height H above the ground, an object is at height h(t)=H−l(t)·cos(α(t)) above the ground, and potential energy of an object related to its position in the gravitational field is
E_grav = M·g·[H−l(t)·cos(α(t))]

Potential energy of a spring depends on the degree of its stretching or squeezing.
Assume, the length of a spring in a neutral state is l₀. Then the length of stretching or squeezing at time t is l(t)−l₀.
Therefore, potential energy of an object related to a spring is
E_spr = ½k·[l(t)−l₀]²
where k is a coefficient of elasticity of a spring.

Total potential energy of an object is
E_pot = E_grav + E_spr =
= M·g·[H−l(t)·cos(α(t))] +
+ ½k·[l(t)−l₀]²

Construct Lagrangian
L(l,l',α,α') = E_kin − E_pot =
= ½M·[l'(t)²+l(t)²·α'(t)²] −
− M·g·[H−l(t)·cos(α(t))] −
− ½k·[l(t)−l₀]²

All which remains is to write the Euler-Lagrange equation for this Lagrangian.

The problem is, we are familiar only with the Euler-Lagrange equation for a system with one degree of freedom, like x(t). Now we have two degrees of freedom - l(t) and α(t).

Fortunately, the Euler-Lagrange equation can be specified for each degree of freedom independently, which will be proven in the next lecture.

Therefore, we can write two independent Euler-Lagrange equations (skipping (t) for brevity):
(∂/∂l)L(l,l',α,α') =
= (d/dt)(∂/∂l')L(l,l',α,α')
and
(∂/∂α)L(l,l',α,α') =
= (d/dt)(∂/∂α')L(l,l',α,α')

The first equation is
M·l·α'²+M·g·cos(α)−k·(l−l₀) =
= M·l"

The second equation is
−M·g·l·sin(α) = M·l²·α"
or
−g·sin(α) = l·α"
or
−(g/l)·sin(α) = α"
which looks exactly the same as in the case above with unstretchable thread instead of a spring.

Wednesday, August 20, 2025

Physics+ Lagrangian: UNIZOR.COM - Physics+ 4 All - Lagrangian

Notes to a video lecture on UNIZOR.COM

Lagrangian

First of all, let's stipulate that the Laws of Newton are based on experiment, they are not derived from some more fundamental theories.

Lagrangian mechanics presents a different approach to analyze the motion than Newtonian mechanics.
In many cases it presents a simpler, more universal way to describe the motion of a mechanical system than Newtonian one.

Let's start with an example where both methodologies lead to the same result.

Spring Oscillation

Consider an ideal spring with one end fixed and a point-mass attached to another end.
The oscillations will occur along the length of a spring that coincides with X-axis.
Position of a point-mass on the spring's end will be described by it X-coordinate x(t) as a function of time t with initial position at time t=0 being an origin of X-coordinate, that is x(0)=0.

According to the Hooke's Law, the force F of a spring applied to an object attached to its end is proportional to a length x by which this spring is stretched or squeezed from its neutral position and directed always towards a neutral point x=0 of no stretching or squeezing.
F = −k·x
where k is a coefficient of elasticity that characterizes physical properties of a spring.

According to the Newton's Second Law, the acceleration a of an object is proportional to a force F applied to it
F = m·a
where m is the object's mass being a coefficient of proportionality.

A linear acceleration a(t), as a function of time, is a derivative of a linear speed v(t) by time t
a(t) = dv(t)/dt = v'(t)
A linear speed v(t) is, in turn, a derivative of a position of an object x(t) by time
v(t) = dx(t)/dt = x'(t)
Therefore, an acceleration is a second derivative of a position by time
a(t) = d²x(t)/dt² = x"(t)

Equating the value of force by Hooke's Law to that of Newton's Second Law, we get a differential equation that defines a motion of the object
−k·x(t) = m·a(t)
or, equivalently,
−k·x(t) = m·x"(t)
Solution to this differential equation of the second order is a trajectory of our object.

Let's approach the same problem from another side.

An object attached to a spring's end that has mass m and linear speed v has kinetic energy that is equal to
E_kin = ½m·v²
Since speed v(t), as a function of time t is just a derivative of a position x(t) by time, we can express kinetic energy in terms of position, as in the case of potential energy above
E_kin = ½m·[x'(t)]²
NOTICE:
E_kin depends explicitly only on speed x'(t) and
d/dt[∂(E_kin)/∂x'] =
= d/dt[m·x'] = mx"(t) = F

A stretched or a squeezed spring has potential energy equal to the amount of work needed to stretch or squeeze it against the force of its elasticity (you can refer to a lecture Physics 4 Teens - Energy - Potential Energy - Spring on UNIZOR.COM).
Thus, a potential energy of a spring squeezed or stretched by the length x(t), as a function of time t, equals to
E_pot = ½k·[x(t)]²
where k is the same coefficient of elasticity as above that characterizes the physical properties of a spring.
NOTICE:
E_pot depends explicitly only on position x(t) and
∂(−E_pot)/∂x = −k·x(t) = F

Based on two NOTICEs above, it is IMPORTANT to see that
∂(−E_pot)/∂x =
= d/dt[∂(E_kin)/∂x'] = F

Since E_pot depends explicitly only on position x(t), not on speed x'(t), and E_kin depends explicitly only on speed x'(t), not on position x(t),
∂(E_kin−E_pot)/∂x =
= ∂(−E_pot)/∂x =
= d/dt[∂(E_kin)/∂x'] =
= d/dt[∂(E_kin−E_pot)/∂x']

At this point it's essential to recall the Euler-Lagrange equation (you can refer to a lecture Physics+ 4 All - Variations - Euler-Lagrange on UNIZOR.COM) - a differential equation of the second order that defines a function f₀(x) that minimizes or maximizes a functional
Φ[f(x)] = ∫_[a,b] F[x,f(x),f '(x)]dx
where F[...] is some known smooth real function of three arguments - real variable x, real value of function f(x) and real value of derivative f '(x).
This Euler-Lagrange differential equation looks like this:
(∂/∂f)F [x,f₀(x),f '₀(x)] =
= (d/dx)(∂/∂f ')F [x,f₀(x),f '₀(x)]

Let's change more abstract symbols x and f(x) to those applicable to our task.
The argument will be time t instead of abstract x. The function will be a position x(t) instead of abstract f₀(x).
Now the Euler-Lagrange equation looks like
(∂/∂x)F [t,x(t),x'(t)] =
= (d/dt)(∂/∂x')F [t,x(t),x'(t)]

Compare this to the equation above that equates partial derivative from E_kin−E_pot by x with its partial derivative by x'.

Obviously, L=E_kin−E_pot satisfies the Euler-Lagrange equation
(∂/∂x)L[t,x(t),x'(t)] =
= (d/dt)(∂/∂x')L[t,x(t),x'(t)]
which in this case is exactly the same as the equation obtained from the Newton's Second Law
−k·x(t) = m·x"(t)
Expression
L[x(t),x'(t)]=E_kin(x')−E_pot(x)
is called Lagrangian.

Consider an object moving along some trajectory x(t) from the moment of time t₁ to the moment of time t₂.
At any moment it has certain kinetic and potential energy, so we can constract a Lagrangian
L(t) = E_kin(x'(t)) − E_pot(x(t))

Consider an integral of this Lagrangian by time
S = ∫_[t₁,t₂]L(t)·dt
This integral is call action.
The trajectory that minimizes or maximizes this action is a solution to an Euler-Lagrange equation
(∂/∂x)L[x(t),x'(t)] =
= (d/dt)(∂/∂x')L[t,x(t),x'(t)]
which has the same solution as Newtonian F=m·a.

Therefore, the trajectory obtained using a Lagrangian approach is the same the one from Newtonian mechanics. BUT IN SOME CASES IT MIGHT BE MUCH MORE CONVENIENT.

The equivalence of a differential equation obtained from the Newton's Second Law and the Euler-Lagrange equation is not just a coincidence peculiar for springs.

In general, kinetic energy always depends on mass and speed
E_kin = ½m·v²
In general, derivative of E_kin by speed v is a momentum of motion p
p = m·v = ∂/∂v(E_kin) =
= ∂/∂v(½mv²)
In general, derivative of momentum p by time t is the force F
dp/dt = d(m·v)/dt = m·a = F
In general, potential energy is, actually, an amount of work W.
Since dW=F·dx, its derivative by x is the force, and a derivative of potential energy by coordinates gives the force as well.

So, the Newton's Second Law and Euler-Lagrange equation are equivalent. Why do we need both?
Practical mechanical problems are rarely as simple as we are taught at high school.
It appears that the more complicated problems with more than one object involved are easier to solve using Lagrangian L=E_kin−E_pot than to deal with complicated forces and their interaction constructing the equations of F=m·a type.

The next lectures will be dedicated to a few important physical problems and their solutions using Euler-Lagrange equation.

Monday, August 18, 2025

Physics+ Brachistochrone: UNIZOR.COM - Physics+ 4 All - Variations

Notes to a video lecture on UNIZOR.COM

Brachistochrone

The approach to choose a path along which any system progresses (light propagates, planet moves around the Sun etc.) based on minimizing some numeric function defined for each path appears to be very valuable in Physics, and it helps to solve certain tasks faster and more efficiently than using only the classic Newton's Laws.

Before generalizing this idea, let's consider a specific problem suggested by Johann Bernoulli in 1696.
It's called the Brachistochrone problem (from Greek 'brachistos' + 'chronus' = 'short' + 'time') and is formulated as follows.

Consider two points A and B in the uniform gravitational field (like near the surface of the Earth) with force of gravity directed vertically down. These points are positioned on different heights and not on the same vertical.

A small object should slide from the top point A(a,A) to the lower point B(b,B) along some frictionless supporting track.

We use a standard Cartesian reference frame with Y-coordinates increasing upwards, and X-coordinates increasing from left to right on a picture above.
The vector of gravity force is directed down along Y-axis.
Therefore, Y-coordinate A is greater then B, and X-coordinate a is less than b.

The supporting track can go straight from A to B or take some curved form.
The straight brown line of descend on a picture above is shorter, but the curved blue or purple lines, while longer, allow for an object to gain speed faster and the resulting time of descend might still be shorter than for a straight line.

The problem is to determine the shape of a supporting track to minimize the time of sliding.

Mathematically speaking, we have to consider all smooth functions f(x) on a segment [a,b] that satisfy the conditions:
f(a) = A and f(b) = B
Then, out of all these functions, we have to find such that represents the curve of fastest descend from A(a,A) to B(b,B).

This simply formulated problem is far from having a simple solution.
Best mathematicians of 17th century worked on it and solved using different methodologies.

Let's solve it using the apparatus developed for finding a minimum of a functional - the Euler-Lagrange equation. This methodology was discussed in the previous lectures of this course.

We have to express the time T of moving from point A to point B as a functional of a trajectory represented by function f(x):
T = Φ[f(x)]
and find a function y=f₀(x) that minimizes this functional.

Hopefully, our functional will look like
Φ[f(x)] = ∫_[a,b] F[x,f(x),f '(x)]dx
where F[...] is some known smooth real function of three arguments - real variable x, real value of function f(x) and real value of derivative f '(x)
and we will be able to apply Euler-Lagrange equation to find y=f₀(x) as its solution.

The picture above illustrates a trajectory of a movement of an object in a uniform gravitational field along a supporting curved track described by a function y=f(x).
The object's weight (the force of gravity vector) is P=m·g, where m is its mass and g is an acceleration of a free falling in the gravitational field.

Besides the gravitational force, a reaction of a supporting curved track vector of force R acts on this object - the force always directed perpendicularly to a tangential line to a curve.

Both the force of gravity P and the reaction force of a curved track R result in the force vector F moving an object along a trajectory and directed along a tangential line to a curved track.

Consider a segment of a trajectory from x to x+dx, where dx is an infinitesimal increment of argument x.
This segment has a length ds and its value satisfies the Pythagorean Theorem
(ds)² = (dx)² + (dy)²
where dy=d(f(x))=f '(x)·dx
so (ds)²=[1+(f '(x))²]·(dx)²
and ds=√1+(f '(x))²·dx

Assume, at point x the linear sliding speed of an object along its trajectory is v(x).
Then the time an object spends passing a segment ds equals to
ds/v(x) = √1+(f '(x))²·dx / v(x)

To find speed of an object v(x), recall the Conservation of Energy Law.

Potential energy of an object depends on its mass and the height over some zero level.
Assume, the zero level of potential energy is at y=0.
Then the initial potential energy U_a of our object at the beginning of its motion is
U_a = m·g·A
where m is an object's mass,
g is an acceleration of free falling and
A is its initial Y-coordinate.

Its kinetic energy K_a at the beginning is zero because its speed along a trajectory is zero at that point.

Then the total initial mechanical energy of an object (potential + kinetic) is
E_a = m·g·A

When our object moved along a curve from its initial position at (a,A) to position (x,f(x)), its potential energy diminished and kinetic energy grew by the same amount because of the Energy Conservation Law.

New potential energy equals to
U_x = m·g·f(x)
New kinetic energy equals to
K_x =m·v²(x)/2.

The decrease in potential energy U_a−U_x should be compensated by an increase in kinetic energy K_x.

From the Energy Conservation Law the total energy should remain the same E_a = E_x
which leads us to an equation
m·g·[A−f(x)] = m·v²(x)/2
Therefore,
v(x) = √2g·[A−f(x)]

Now the time an object spends passing a segment ds equals to

dT(x) =

√1+(f '(x))²

√2g·[A−f(x)]

Integrating this by x from a to b gives a total time of moving along a trajectory - the functional we need to minimize
Φ[f(x)] = ∫_[a,b] dT(x)
or

Φ[f(x)]=∫_[a,b]

√1+(f '(x))²

√2g·[A−f(x)]

At this point we can drop 2g from the denominator, as this does not change the function-argument f₀(x) that minimizes functional Φ[f(x)].
So, our task is to minimize a functional

Φ[f(x)]=∫_[a,b]

√1+(f '(x))²

√A−f(x)

Recall from the previous lecture that for a given functional
Φ[f(x)] = ∫_[a,b] F[x,f(x),f '(x)]dx
the function f₀(x) that minimizes or maximizes it should satisfy the Euler-Lagrange differential equation
(∂/∂f)F [x,f₀(x),f '₀(x)] −
− (d/dx)(∂/∂f ')F [x,f₀(x),f '₀(x)] = 0

Let's construct this equation for our case.
To shorten formulas, let's temporarily use
h(x) instead of f '(x) and
omit (x) from both f(x) and h(x).

Using this substitution, our functional looks like

Φ[f(x)]=∫_[a,b]

√1+h²

√A−f

From this follows that an expression under an integral is

F[x,f,h]=

√1+h²

√A−f

In terms of these functions, the Euler-Lagrange equation looks like
(∂/∂f)F [x,f,h] =
= (d/dx)(∂/∂h)F [x,f,h]

Let's calculate each term separately.

Left side of an equation is
(∂/∂f)F [x,f,h] =

= (∂/∂f)

√1+h²

√A−f

=

= √1+h²·(∂/∂f)(A−f)^−½ =
= √1+h²·(−½)·(A−f)^−3/2·(−1) =

√1+h²

2√(A−f)³

Right side of an equation is
(d/dx)(∂/∂h)F [x,f,h] =

= (d/dx)(∂/∂h)

√1+h²

√A−f

=

= (d/dx)

h

√1+h²·√A−f

=

=

h'

√1+h²·√A−f

−

−

h·2h·h'

2√(1+h²)³ ·√A−f

+

+

h·f '

2√1+h²·√(A−f)³

Note that we've agreed to substitute h(x) for f '(x) in the numerator of the last term.

Equating left and right sides of the Euler-Lagrange equation, multiplying both sides by 2√(1+h²)³ ·√(A−f)³ and opening the parenthesis leads to a simpler equation
1 + h² − 2h'·(A−f) = 0

Returning back to original symbols, replace f with f(x) and h with f '(x) getting a second order differential equation for function y=f(x)
1+[f '(x)]²−2f "(x)·[A−f(x)] = 0
or, equivalently,
1+y'²−2y"·(A−y) = 0

The solution y=f₀(x) to this second order differential equation is the function that minimizes the functional Φ[f(x)].

A not so obvious transformation can reduce this second order differential equation to the first order one.

Integration is easy if a subject of an integration is a derivative of some function
∫ s'(x)·dx = s(x) + C
where C is some constant.

The left side of the equation above is not a derivative of any function, but let's see what happens if we multiply it by −y'.
−y'·[1+y'²−2y"·(A−y)] =
= −y'·(1+y'²)+2y'·y"·(A−y)

Notice that we can substitute −y' with (A−y)' and 2y'·y" with (1+y'²)' to get
(A−y)'·(1+y'²) +
+ (A−y)·(1+y'²)'
which is a derivative of a product of (A−y) by (1+y'²).

Therefore, if y=f₀(x) is a solution to Euler-Lagrange equation
1+y'²−2y"·(A−y) = 0,
it's also a solution to the equation
−y'·[1+y'²−2y"·(A−y)] =
[(A−y)·(1+y'²)]' = 0
or,
{[A−f₀(x)]·[1+f₀'(x)²]}' = 0

Integrating this, we get the first order differential equation
[A−f₀(x)]·[1+f₀'(x)²] = C
or, in a shorter notation,
(A−y)·(1+y'²) = C
where C is some constant.

We can solve this differential equation for function y=f₀(x) as follows.

Let's resolve this differential equation for y':

dy/dx = ±√

C−A+y

A−y

From physical considerations, our function must decrease with increase of x. Therefore, it's derivative must be negative, and we will choose a minus sign in all transformations below.

This differential equation can be transformed to integrate separately by x and y:

−√

A−y

C−A+y

·dy = dx

We can integrate now both sides separately.

Without getting into the details of integration, we can just write the result of the integration by y of the left size.

− ∫√

A−y

C−A+y

·dy =

= − C·arctan√

C−A+y

A−y

−

− √(A−y)·(C−A+y) + D
where C and D are some constants.

Integration of dx produces x+E, where E is yet another constant, but we can combine it into constant D that appears in integration by dy.

Therefore, our function y=f₀(x) that minimizes the object's time of sliding down can be expressed as

x = −C·arctan√

C−A+y

A−y

−

− √(A−y)·(C−A+y) + D
where C and D are some constants.

It's not really expressed as y being a function of x, just the opposite way, but it should not stop us to explore this curve.

As we see, the function depends on two constants C and D. At the same time we have two initial conditions:
f₀(a) = A and
f₀(b) = B
Substituting x=a, y=A into an equation above will produce on equation for C and D.
Substituting x=b, y=B into this equation will produce another equation for C and D.
These two equations determine the values of C and D needed to specify the full solution to a problem.

As an example, we used points A(1,5) and B(4,1) and calculated approximate values
C=5.8 and D=−10.1.
Here is the graph that, in particular, crosses points A(1,5) and B(4,1).

At the end of this discussion about brachistochrone it's appropriate to mention that the curve we have found as a solution to the Euler-Lagrange equation is a cycloid - a trajectory of a point on a circle that is rolling along a straight line.
We have not discussed this because our main purpose was just to show how to use the Euler-Lagrange equation to find an extremum of a functional.

Sunday, August 3, 2025

Physics+ Euler Lagrange Equation: UNIZOR.COM - Physics+ 4 All - Variations

Notes to a video lecture on UNIZOR.COM

Euler-Lagrange Equation

Based on theoretical knowledge of functionals, their extremums and calculus of variations (method of finding these extremums using directional derivatives), let's see what is the result of application of this method in many cases occurring in Physics.

The spectrum of functions, that these functionals are defined on, in many cases is reduced to smooth (sufficiently differentiable) real functions defined on some segments [a,b] with fixed values at the ends of this segment:
f(a)=A; f(b)=B.
For examples, to find the best in some sense trajectory we have to know the starting and ending points of this trajectory and search for the best one among functions with fixed values at the beginning (start) point and at the ending (finish) point of movement.

Many problems in Physics are related to finding extremums of specific type of functionals of the above mentioned functions:
Φ[f(x)] = ∫_[a,b] F[x,f(x),f '(x)]dx
where F[...] is some known smooth real function of three arguments - real variable x, real value of function f(x) and real value of derivative f '(x).
This function F[...] is derived from known laws of Physics.
The function f(x), the argument to a functional Φ[f(x)], is the function from a class of smooth functions defined on some segment with fixed values at the ends described above.

Our task is to find function f₀(x) where functional Φ[f(x)] has an extremum.

The plan is:

1. Assuming f₀(x) is a point where Φ[f(x)] reaches its extremum, increment this function by some Δ(x) to f₁(x)=f₀(x)+Δ(x), keeping in mind that f₁(x) should belong to the same class as f₀(x), that is it should be smooth, defined on the same segment [a,b] and have the same values A and B at the ends of this segment, which means that Δ(x) should be smooth, defined on the same [a,b] and be equal to zero at the ends of this segment.

2. Consider all functions of type g(x,t)=f₀(x)+t·Δ(x).
This set of functions is parameterized by parameter t and g(x,0)=f₀(x).
This set of functions can be considered as filling some neighborhood of function f₀(x) on a "line" from f₀(x) in a direction defined by increment Δ(x).

3. Since f₀(x) is a point where functional Φ[f(x)] has an extremum, changing the value of t closer and closer to zero would result in moving Φ[g(x,t)] closer and closer to Φ[f₀(x)], which is an extremum.
Therefore, a derivative of Φ[g(x,t)] by t at t=0 (that is, when g(x,t)=g(x,0)=f₀(x)) should be equal to zero.
That gives an equation where f₀(x) and Δ(x) participate.

4. Since increment Δ(x) can be chosen relatively freely (as long as it's a smooth function with values of zero on both ends of [a,b]), the equation obtained at step 3 above should be true for any Δ(x), which gives additional condition that might lead to identification of f₀(x).

Let's follow the plan step by step.

1. f₁(x)=f₀(x)+Δ(x)
Δ(a)=Δ(b)=0

2. g(x,t)=f₀(x)+t·Δ(x)
Φ[g(x,t)] =
= ∫_[a,b] F[x,g(x,t),g'(x,t)]dx
(apostrophe is a derivative by variable x)
where g(x,t)=f₀(x)+t·Δ(x).
Now the functional Φ[g(x,t)] can be considered as a function of one real argument t that characterizes how close function g(x,t) is to an assumed point of extremum f₀(x).

3. The derivative of functional Φ[g(x,t)] considered as a function of one real argument t by parameter t must be equal to zero at t=0 and g(x,0)=f₀(x):
(d/dt)Φ[g(x,t)]|_t=0 = 0
For many practical problems in Physics functional Φ[g(x,t)] is an integral by x presented in the beginning, while differentiation above is by t.
These two variables (x and t) and corresponding operations (integration by x and differentiation by t) are independent. If, instead of integration by x we had a sum by some index i, we would not hesitate to replace a derivative of a sum with a sum of derivatives.
Integration is just a sum of infinitesimal parts and the same rule of interchanging the order of operations can be applied.
Therefore,
(d/dt)Φ[g(x,t)] =
= (d/dt)∫_[a,b] F[x,g(x,t),g'(x,t)]dx =
= ∫_[a,b] (d/dt)F[x,g(x,t),g'(x,t)]dx

Since argument of differentiation t is contained inside a function F[...] of multiple arguments, the derivative by t should be taken using partial derivative by each argument (∂F/∂x, ∂F/∂g, ∂F/∂g') multiplied by an inner derivative of this argument by t (correspondingly, dx/dt, dg/dt, dg'/dt): (d/dt)F[x,g(x,t),g'(x,t)] =
= (∂/∂x)F[x,g(x,t),g'(x,t)]·(dx/dt) +
+ (∂/∂g)F[x,g(x,t),g'(x,t)]·(dg/dt) +
+ (∂/∂g')F[x,g(x,t),g'(x,t)]·(dg'/dt)

Since the first argument of function F[...] is just x and does not depend on t, its derivative by t is zero:
dx/dt = 0
Since g(x,t)=f₀(x)+t·Δ(x), its derivative by t is
dg(x,t)/dt = Δ(x)
Since g'(x,t)=f '₀(x)+t·Δ'(x),
dg'(x,t)/dt = Δ'(x)
Now the derivative by t looks like
(d/dt)F[x,g(x,t),g'(x,t)] =
= (∂/∂g)F[x,g(x,t),g'(x,t)]·Δ(x) +
+ (∂/∂g')F[x,g(x,t),g'(x,t)]·Δ'(x)

Therefore, the expression for a derivative of our functional Φ[g(x,t)] by parameter t is:

(d/dt)Φ[g(x,t)] =
= ∫_[a,b] (d/dt)F[x,g(x,t),g'(x,t)]dx =
=∫_[a,b] (∂/∂g)F[x,g(x,t),g'(x,t)]·Δ(x)dx+
+∫_[a,b] (∂/∂g')F[x,g(x,t),g'(x,t)]·Δ'(x)dx

As mentioned above, the condition
(d/dt)Φ[g(x,t)]|_t=0 = 0
is necessary for f₀(x)=g(x,0) to be a function-argument where our functional reaches its extremum.
Substituting t=0 and changing g(x,0) to f₀(x), we have an equation

0 = (d/dt)Φ[f₀(x)] =
=∫_[a,b] (∂/∂f)F[x,f₀(x),f '₀(x)]·Δ(x)dx+
+∫_[a,b] (∂/∂f ')F[x,f₀(x),f '₀(x)]·Δ'(x)dx

To shorten the formulas, let's replace
(∂/∂f)F[x,f₀(x),f '₀(x)]
with
F_∂f [x,f₀(x),f '₀(x)]
and
(∂/∂f ')F[x,f₀(x),f '₀(x)]
with
F_{∂f '} [x,f₀(x),f '₀(x)]
With this substitution the equation above looks like
0 = (d/dt)Φ[f₀(x)] =
=∫_[a,b] F_∂f [x,f₀(x),f '₀(x)]·Δ(x)dx+
+∫_[a,b] F_{∂f '} [x,f₀(x),f '₀(x)]·Δ'(x)dx

If only the first integral in the above expression participated in the equation, the requirement that this integral should be equal to zero regardless of Δ(x) would cause the smooth function under an integral to be zero everywhere on [a,b].

Existence of the second integral complicates the picture, but we can change the second integral to contain only Δ(x) instead of Δ'(x) by using the formula of integrating by parts for two functions u(x) and v(x):
∫_[a,b]u·dv = u·v|_[a,b] − ∫_[a,b]v·du

Let's apply this formula for the second integral in the equation above, substituting
u(x) = F_{∂f '}[x,f₀(x),f '₀(x)]
v(x) = Δ(x)
Then
du(x) = u'(x)·dx =
= (d/dx)F_{∂f '}[x,f₀(x),f '₀(x)]·dx
dv(x) = v'(x)·dx = Δ'(x)·dx

Using these substitutions we transform the second integral in the above equation as follows
∫_[a,b] F_{∂f '} [x,f₀(x),f '₀(x)]·Δ'(x)dx =
= ∫_[a,b] F_{∂f '} [x,f₀(x),f '₀(x)]·dΔ(x) =
= F_{∂f '} [x,f₀(x),f '₀(x)]·Δ(x)|[a,b] −
− ∫_[a,b] Δ(x)·dF_{∂f '} [x,f₀(x),f '₀(x)] =

The first component of the above expression equals to zero:
F_{∂f '} [x,f₀(x),f '₀(x)]·Δ(x)|[a,b] = 0
because Δ(a)=Δ(b)=0

So, the final expression for a variation of our functional contains two integrals. both with Δ(x) as a factor:
0 = (d/dt)Φ[f₀(x)] =
=∫_[a,b] F_∂f [x,f₀(x),f '₀(x)]·Δ(x)dx−
−∫_[a,b] Δ(x)·dF_{∂f '} [x,f₀(x),f '₀(x)] =
= ∫_[a,b] h(x)·Δ(x)·dx
where
h(x) = F_∂f [x,f₀(x),f '₀(x)] −
− (d/dx)F_{∂f '} [x,f₀(x),f '₀(x)]

Since the last integral must be equal to zero for any Δ(x), function h(x) must be equal to zero for any x∈[a,b].

Therefore, the necessary condition for function-argument f₀(x) to be a point where functional Φ[f(x)] reaches its extremum is that f₀(x) is a solution to a differential equation h(x)=0 or, in terms of original functional Φ[f(x)],
F_∂f [x,f₀(x),f '₀(x)] −
− (d/dx)F_{∂f '} [x,f₀(x),f '₀(x)] = 0

CONCLUSION

Consider a class Ω of all sufficiently differentiable real functions f(x) on segment [a,b] that take fixed values on the ends of this segment:
f(a)=A and f(b)=B.
Given functional
Φ[f(x)] = ∫_[a,b] F[x,f(x),f '(x)]dx
defined for all f(x)∈Ω
and where F[...] is some known sufficiently differentiable real function of three arguments
real variable x∈[a,b],
real value of function f(x)∈Ω
and real value of its derivative f '(x).

If function f₀(x) from the same class Ω is where the above functional reaches its extremum (minimum or maximum) than this function should be a solution to Euler-Lagrange differential equation
(∂/∂f)F [x,f₀(x),f '₀(x)] −
− (d/dx)(∂/∂f ')F [x,f₀(x),f '₀(x)] = 0

Sunday, July 27, 2025

Physics+ MIN/MAX Problem 1: UNIZOR.COM - Physics+ 4 All - Variations

Notes to a video lecture on UNIZOR.COM

Min/Max Variation Problem 1

Problem 1

Among all smooth (sufficiently differentiable) functions f(x) defined on segment [a,b] and taking values at endpoints f(a)=A and f(b)=B find the one with the shortest graph between points (a,A) and (b,B).

Solution

First of all the length of a curve representing a graph of a function is a functional with that function as an argument. Let's determine its explicit formula in our case.

The length ds of an infinitesimal segment of a curve that represents a graph of function y=f(x) is
ds = [(dx)² + (dy)²]^½ =
= [(dx)² + (df(x))²]^½ =
= [1 + (df(x)/dx)²]^½·(dx) =
= [1 + f '(x)²]^½·(dx)
The length of an entire curve would then be represented by the following functional of function f(x):
Φ[f(x)] = ∫_[a,b][1 + f '(x)²]^½dx

We have to minimize this functional within a family of smooth functions defined on segment [a,b] and satisfying initial conditions
f(a)=A and f(b)=B

As explained in the previous lecture, if functional Φ[f(x)] has local minimum at function-argument f₀(x), the variation (directional derivative)
d/dt Φ[f₀(x)+t(f₁(x)−f₀(x))]
at function-argument f₀(x) (that is, for t=0) in the direction from f₀(x) to f₁(x) should be equal to zero regardless of location of f₁(x) in the neighborhood of f₀(x).

Assume, f₀(x) is a function that minimizes the functional Φ[f(x)] above.
Let f₁(x) be another function from the family of functions defined on segment [a,b] and satisfying initial conditions
f(a)=A and f(b)=B
Let Δ(x) = f₁(x) − f₀(x).
It is also defined on segment [a,b] and, according to its definition, satisfies the initial conditions Δ(a)=0 and Δ(b)=0.

Using an assumed point (function-argument) of minimum f₀(x) of our functional Φ[f(x)], another point f₁(x) that defines the direction of an increment of a function-argument and real parameter t, we can describe a subset of points (function-arguments) linearly dependent on f₀(x) and f₁(x) as
f₀(x)+t·(f₁(x)−f₀(x)) =
= f₀(x) + t·Δ(x)

Let's calculate the variation (we will use symbol δ for variation) of functional Φ[f(x)] at any point (function-argument) defined above by function-argument f₀(x) minimizing our functional, directional point f₁(x) and real parameter t :
δ_{[f₀,f₁,t]} Φ[f(x)] =
= d/dt Φ[f₀(x)+t(f₁(x)−f₀(x))] =
= d/dt Φ[f₀(x)+t·Δ(x)] =
(use the formula for a length of a curve)
= d/dt ∫_[a,b][1+((f₀+t·Δ)')²]^½dx
In the above expression we dropped (x) to shorten it.
The derivative indicated by an apostrophe is by argument x of functions f(x) and Δ(x).

Under very broad conditions, when smooth functions are involved, a derivative d/dt and integral by dx are interchangeable.
So, let's take a derivative d/dt from an expression under an integral first, and then we will do the integration by dx.

d/dt [1+((f₀(x)+t·Δ(x))')²]^½ =
= d/dt [1+(f₀'(x)+t·Δ'(x))²]^½ =

=

2(f₀'(x)+t·Δ'(x))·Δ'(x)

2[1+(f₀'(x)+t·Δ'(x))²]^½

=

(f₀'(x)+t·Δ'(x))·Δ'(x)

[1+(f₀'(x)+t·Δ'(x))²]^½

Now we can integrate the above expression by x on segment [a,b].

Let's use the integrating by parts using a known formula for two functions u(x) and v(x)
∫_[a,b]u·dv = u·v|_[a,b] − ∫_[a,b]v·du

Use it for

u(x) =

f₀'(x)+t·Δ'(x)

[1+(f₀'(x)+t·Δ'(x))²]^½

v(x) = Δ(x)
and, therefore,
dv(x) = dΔ(x) = Δ'(x)·dx
with all participating functions assumed to be sufficiently smooth (differentiable to, at least, second derivative).

Since
v(a) = Δ(a) = 0 and
v(b) = Δ(b) = 0,
the first component of integration is zero
u·v|_[a,b]=u(b)·v(b)−u(a)·v(a)=0

Now the variation of our functional is
δ_{[f₀,f₁,t]} Φ[f(x)] =
= −∫_[a,b]v(x)·du(x,t)
where

u(x,t) =

f₀'(x)+t·Δ'(x)

[1+(f₀'(x)+t·Δ'(x))²]^½

v(x) = Δ(x)

As we know, the necessary condition for a local minimum of functional Φ[f(x)] at function-argument f₀(x) is equality to zero of all its directional derivatives at point f₀(x) (that is at t=0).
It means that for any direction defined by function f₁(x) or, equivalently, defined by any Δ(x)=f₁(x)−f₀(x), the derivative by t of functional Φ[f₀(x)+t·Δ(x)] should be zero at t=0.

So, in our case of minimizing the length of a curve between two points in space, the proper order of steps would be

(1) calculate the integral above getting variation of a functional δ_{[f₀,f₁,t]}Φ[f₀(x)+t·Δ(x)] which is a functional of three variables:
- real parameter t,
- function f₀(x) that is an argument to an assumed minimum of functional Φ[f(x)],
- function Δ(x) that signifies an increment of function f₀(x) in the direction of function f₁(x);
(2) set t=0 obtaining a directional derivative of functional Φ[f(x)] at assumed minimum function-argument f₀(x) and increment Δ(x):
δ_{[f₀,f₁,t=0]}Φ[f₀(x)];
(3) equate this functional to zero and find f₀(x), that solves this equation regardless of argument shift to f₁(x).

Integration in step (1) above is by x, while step (2) sets the value of t.
Since x and t are independent variables, we can exchange the order and, first, set t=0 and then do the integration.

This simplifies the integration to the following
δ_{[f₀,f₁,t=0]} Φ[f(x)] =
= −∫_[a,b]Δ(x)·du(x,0)
where

u(x,0)=

f₀'(x)+0·Δ'(x)

[1+(f₀'(x)+0·Δ'(x))²]^½

f₀'(x)

[1+(f₀'(x))²]^½

Therefore,
δ_{[f₀,f₁,t=0]} Φ[f(x)] =

= −∫_[a,b]Δ(x)·d

f₀'(x)

[1+(f₀'(x))²]^½

And the final formula for variation δ_{[f₀,f₁,t=0]} Φ[f(x)] is

−∫_[a,b]Δ(x)·[

f₀'(x)

[1+f₀'(x)²]^½

]'dx

For δ_{[f₀,f₁,t=0]} Φ[f(x)] to be equal to zero for t=0 regardless of Δ(x), or, in other words, for the integral above to be equal to zero at t=0 regardless of function Δ(x), function u'(x,0) (the one in [...]') must be identically equal to zero for all x∈[a,b].

If u'(x,0) is not zero at some point x (and in some neighborhood of this point since we deal with smooth functions), there can always be constructed such function Δ(x) that makes the integral above not-zero.

From this follows that u(x,0)=const.
Therefore,

f₀'(x)

[1+f₀'(x)²]^½

= u(x,0) = const

from which easily derived that f₀'(x)=const and, therefore, the function f₀(x), where our functional has a minimum, is a linear function of x.

All that remains is to find a linear function f₀(x) that satisfies initial conditions f₀(a)=A and f₀(b)=B.

Obviously, it's one and only function
f₀(x) = (B−A)·(x−a)/(b−a) + A
whose graph in (X,Y) Cartesian coordinates is a straight line from (a,A) to (b,B).

Unizor - Creative Mind through Art of Mathematics

Thursday, November 20, 2025

Physics+ Lagrangian in N Degrees of Freedom: UNIZOR.COM - Physics+ 4 All...

Sunday, September 21, 2025

Physics+ N-dimensional Lagrangian: UNIZOR.COM - Physics+ 4 All - Lagrangian

Saturday, August 23, 2025

Physics+ Pendulum in Lagrangian Mechanics: UNIZOR.COM - Physics+ 4 All -...

Wednesday, August 20, 2025

Physics+ Lagrangian: UNIZOR.COM - Physics+ 4 All - Lagrangian

Monday, August 18, 2025

Physics+ Brachistochrone: UNIZOR.COM - Physics+ 4 All - Variations

Sunday, August 3, 2025

Physics+ Euler Lagrange Equation: UNIZOR.COM - Physics+ 4 All - Variations

Sunday, July 27, 2025

Physics+ MIN/MAX Problem 1: UNIZOR.COM - Physics+ 4 All - Variations

Facebook Badge

Blog Archive

About Me