Monday, March 31, 2025

Physics+ 2nd Order Curves: UNIZOR.COM - Physics+ 4 All - Laws of Newton

Notes to a video lecture on UNIZOR.COM

Laws of Newton -
2nd-order Curves


We talk here about second-order curves because they describe the trajectories that objects move along in a central gravitational field.
These curves were discussed in the UNIZOR.COM course Math+ 4 All in the part Geometry, where you can find lectures on ellipse, hyperbola and parabola, their defining characteristics and equations in Cartesian and polar coordinates.

Here is a quick recap of this material.

Ellipse
Ellipse is a locus of points on a plane that satisfy the following condition.
The sum of distances from each such point P to two given points F1 and F2 (called foci) is equal to a given positive real number called its length of major axis (2a on a graph below), that is supposed to be greater than the distance between its two foci (2c on a graph below).

The value 2b, where b²=a²−c², is called the length of minor axis.
The equation of an ellipse in Cartesian coordinates (x,y) with X-axis coinciding with the line between the foci and origin of coordinates is a midpoint between foci is
x²/a² + y²/b² = 1
The equation in polar coordinates (r,θ) with an origin at one of the ellipse' foci and a base axis coinciding with the line between the foci is
r= a·(1−e²)/[1−e·cos(θ)]
where the ratio e=c/a is called eccentricity of an ellipse and it's always less than 1.

Hyperbola
Hyperbola is a locus of points on a plane that satisfy the following condition.
The difference of distances from each such point P from two given points F1 and F2 (called foci) is equal by absolute value to a given positive real number called its length of major axis (2a on a graph below), that is supposed to be less than the distance between its two foci (2c on a graph below).

The value 2b, where b²=c²−a², is called the length of minor axis.
The equation of a hyperbola in Cartesian coordinates (x,y) with X-axis coinciding with the line between the foci and origin of coordinates is a midpoint between foci is
x²/a² − y²/b² = 1
The equation in polar coordinates (r,θ) with an origin at one of the hyperbola' foci and a base axis coinciding with the line between the foci is
r= a·(e²−1)/[1−e·cos(θ)]
where the ratio e=c/a is called eccentricity of a hyperbola and it's always greater than 1.

Parabola
Parabola is a locus of points on a plane that satisfy the following condition.
Each such point P is equidistant from a given point F (called focus) and a given straight line d (called directrix) with the distance between a focus and a directrix being a given positive number (2c on a graph below).
The equation of an ellipse in Cartesian coordinates (x,y) with X-axis coinciding with the perpendicular line from a focus to a directrix and origin of coordinates is a midpoint between focus and directrix
y² = 4c·x
The equation in polar coordinates (r,θ) with an origin at parabola' focus and a base axis coinciding with the perpendicular line from a focus to a directrix is
r= 2c/[1−cos(θ)]

Generally speaking, a second-order curve is defined in Cartesian coordinates as a locus of (x,y) points on a plane that satisfy a general equation of a second order for two variables
Ax²+Bxy+Cy²+Dx+Ey+F=0

We can show that any second-order curve belongs to one of the three types presented above, it's either ellipse or hyperbola, or parabola.

If coefficient B at xy is zero, the expression above can be easily converted into one of the three canonical equations by shifting and stretching coordinates (which shifts and stretches the graph without changing its type, an ellipse will remain an ellipse etc.) and/or exchanging places of x and y (which changes the orientation of a graph from horizontal to vertical).
If coefficient B is not zero, we can turn the system of coordinates in such a way that the equation of the same curve in the new system will not contain a non-zero coefficient at xy.
Details of the transformation of a general equation into one of three canonical forms are at the end of these notes.

The main purpose of the above recap of the properties of second-order curves is to analyze the motion of a point-mass object in the central gravitational field of another point-mass object fixed at the origin of an inertial reference frame.

More precisely, we will prove in the next lecture that a trajectory of an object in a central gravitational field is some second-order curve (ellipse, hyperbola or parabola) depending on its position and velocity relative to the center of gravitational field at initial moment of time t=0.

To prove it, we will, primarily, rely on representation of second-order curves in polar coordinates.
The reason for this is that all polar coordinate representations of three types of curves (ellipse, hyperbola or parabola) are very similar and differ only in certain parameters.
If we derive the equation of a trajectory in polar coordinates as
r(θ) = A/[B+C·cos(θ)],
where A, B and C depend only on position and velocity of our object relative to a center of the gravitational field at the beginning of motion, we can say that a trajectory would be either ellipse or hyperbola, or parabola.

Moreover, analyzing the values of position and velocity of our object relative to a center of the gravitational field at the beginning of motion, we will be able to predict the type of trajectory it will take.


Transformation of General Second-Order Equation into One of Three Canonical Forms

The general form of a second-order curve on an XY-plane as an equation in Cartesian coordinates is
A·x²+B·x·y+C·y²+D·x+E·y+F=0
It's called 'second-order' because X- and Y-coordinates participate in this equation as polynomial with combined exponent 2.

Our task is to analyze the shape of a curve defined by such an equation depending on coefficients A, B. C etc.

Let's start with a simpler case of B=0.
Then our equation looks like
A·x²+C·y²+D·x+E·y+F=0

In this form we will analyze the shape of a curve in two different cases:
1. Either A or C is zero.
2. Both A and C are not equal to zero.
Both of them cannot be equal to zero because then the equation would not be of a second order.

1. CASE Either A or C is zero

We will consider two separate subcases.

Subcase A≠0 but C=0

In this case we can easily transform the original equation into a form
A·(x+½D/A)²+E·y=K
where
K = D²/(4A) − F

Let's shift our coordinate system parallel to itself moving the origin to point (½D/A,0).
This transformation will not change the shape of a curve, but will allow to express it in a simple form in new coordinates
A·x² + E·y = K
or
E·y = −A·x² + K
which is a parabola for any E≠0.
For example, for E=1, A=−1, K=−4 this parabola looks like this:
Physical trajectory described by this equation would be when an object moves towards but slightly off the source of gravity. The gravitational force will turn it around the origin, but it will not be sufficient to keep it on an orbit around a center, and an object will move away from the center of gravitation to infinity.

Subcase A=0 but C≠0

In this case we can easily transform the original equation into a form
C·(y+½E/C)²+D·x=K
where
K = E²/(4C) − F
Let's shift our coordinate system parallel to itself moving the origin to point (0,½E/C).
This transformation will not change the shape of a curve, but will allow to express it in a simple form in new coordinates
C·y²+D·x=K
or
D·x= −C·y² + K
which is a parabola for any D≠0.
For example, for D=1, C=−1, K=−4 this parabola looks like this:


2. CASE A≠0 and C≠0

In this case we can easily transform the original equation into a form
A·(x+½D/A)²+C·(y+½E/C)²=K
where
K = D²/(4A) + E²/(4C) − F

Let's shift our coordinate system parallel to itself moving the origin to point (½D/A,½E/C).
This transformation will not change the shape of a curve, but will allow to express it in a simple form in new coordinates
A·x²+C·y²=K

Subcase A and C are
both positive or both negative


If K is of the same sign as A and C, it's an ellipse.
For example, if A=4, C=9 and K=36, this ellipse looks like this
which, from the physical standpoint, would be an orbit of an object moving around a central point-mass.

If K=0, the only point (0,0) fits the equation. You can consider it as a degenerate ellipse with zero dimension in any direction.
Physical meaning of this is that this is a trajectory of an object in central gravitational field with its initial position at its center and initial velocity zero.
So, it's stuck at the center where the source of gravity is located and will not escape without any initial velocity, which is meaningless for our analysis.

Finally, if K is of the opposite sign to A and C, there are no points that satisfy the equation.
In a physical sense it means that there is no object in the gravitational field.

Subcase A and C are
of opposite signs


The curve will be a hyperbola

(if signs of A and K are the same)
or

(if signs of C and K are the same)
From the physical standpoint the trajectory like that will be of an object moving fast enough to overcome the force of gravitation and, after its trajectory has been curved by this force, still manage to fly away from the source of gravitation force to infinity.

If K=0 we will have two straight lines defined by equation
A·x² = C·y²
which can be simplified to
y=±√(A/C)x
with the graph looking like this The physical meaning of this trajectory corresponds to an object moving straight towards the center of gravity or directly from it.

All the above cases assumed that in the original equation of a second-order curve
A·x²+B·x·y+C·y²+D·x+E·y+F=0
the coefficient B=0.

What if it's not?

We will show that with a proper turning of our system of Cartesian coordinates (which would not change the shape of a curve, only its representation as a second-order equation in Cartesian coordinates) we can change the equation into a form with B=0.
That would reduce a general task of analysis of the shape of any second-order curve to a case analyzed above, when B=0.

Assume, we can turn our Cartesian system of coordinates by angle β.
Recall that the new system will have coordinates u and v that are related to old coordinates x and y in the original system as
u = x·cos(β) + y·sin(β)
v = −x·sin(β) + y·cos(β)
or in vector form
u
v
=
x
y
cos(β)sin(β)
−sin(β)cos(β)

Old coordinates x and y can be expressed in terms of new ones u and v by turning the new system of coordinates by angle −β:
x = u·cos(β) − v·sin(β)
y = u·sin(β) + v·cos(β)
or in vector form
x
y
=
u
v
cos(β)−sin(β)
sin(β)cos(β)

Now we can substitute new coordinates u and v into original second-order equation for a curve getting another second order equation for the same curve in terms of new coordinates.

Can we find such an angle β that all mixed terms that contain a product u·v will cancel each other?
If yes, that would prove that in some coordinate system the equation of a second-order curve contains no terms with product of different coordinates, like x·y.

In terms of u and v the new equation will contain terms with , u·v, , u, v and a constant.
Let's collect only terms with a product of different coordinates u·v that we want to nullify.

These are:
A·x²−2·A·u·v·cos(β)·sin(β)
B·x·yB·u·v·(cos²(β)−sin²(β))
C·y²2·C·u·v·sin(β)·cos(β)
We would like the sum of them to be zero.
After obvious usage of the formulas for sin and cos of double angle, it means that we have to find such an angle β that
A·sin(2β)=B·cos(2β)+C·sin(2β)

The above trigonometric equation can be easily solved by dividing by cos(2β) getting
tan(2β) = B/(A−C)
β = ½arctan[B/(A−C)]
It works only for A≠C. In case A=C the equation for β looks like
B·cos(2β) = 0
from which follows
β=π/4

Therefore, by rotating the system of Cartesian coordinates we can find the one where the second-order equation of our curve will have no terms with a product of different coordinates x·y.

After that we can use the technique presented above to shift the origin of our system of coordinates to simplify this equation even further to a form
A·x² + E·y = K (parabola open along Y-axis) or
C·y² + D·x = K (parabola open along X-axis) or
A·x² + C·y² = K (ellipse if all multipliers are positive or hyperbola if A and C are of opposite signs)

Saturday, March 29, 2025

Math+ Parabola Optics: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Parabola Optics

Though the word "optics" sounds very much like a topic of Physics, we will consider strictly mathematical aspect of it.
One of the properties of parabola is that if its contour is reflective, a ray of light emitted from its focus will be reflected parallel to its axis of symmetry regardless of the direction it was sent.

Reflection of a curve occurs exactly as if, instead of a curve at the point of incidence, there was a tangential line to a curve, and reflection was of that tangential straight line.

Consider a parabola with focus F(c,0), directrix d, point P(x,y) on this parabola and tangential BC to a parabola at point P(x,y).
The ray of light is emitted from a focus F(c,0), goes to point P(x,y) and reflects along PS.
Because of the laws of reflection, angles ∠FPB and ∠SPC are equal, they are marked as α.
If we prove that these angles α are also equal to angle ∠ψ the tangential at point P line BC makes with the axis of symmetry of parabola BX, it will prove the parallelism of reflected ray PS with line BC, regardless of position of point P on a parabola.

Our plan to prove it is to prove that tan(α)=tan(ψ), from which the equality α=ψ follows because function tan() is monotonic for these angles.

Since sum of angles of a triangle ΔBPF equals to π,
α + ψ + (π−φ) = π
Therefore,
α = φ − ψ
We can calculate tan(α) using a formula for tangent of difference between angles, we can express tan(α) in terms of tan(φ) and tan(ψ).
tan(α) =
=
[tan(φ)−tan(ψ)]/[1+tan(φ)·tan(ψ)]

Angle ∠φ=∠XFP.
Knowing coordinates of points P(x,y) and F(c,0), it's easy to calculate
tan(φ) = y/(x−c)

As we know, a tangent of an angle between a tangential line to function f(x) at some point and X-axis is a function's derivative f'(x) at that point.
Therefore, tan(ψ)=y'(x)
Since an equation of a parabola is y²=4c·x, differentiating this equation we get
2y·y' = 4c
Hence, y' = 2c/y = tan(ψ).

Now we have all the components to calculate tan(α):
tan(α) =
=
[y/(x−c)−2c/y]/[1+(y/(x−c))·(2c/y)]
Let's simplify this expression.
Its numerator equals to
(y²−2c·x+2c²)/(x·y−c·y)
But for each point of a parabola y²=4c·x.
Use it in the formula above, getting the same numerator as
(4c·x−2c·x+2c²)/(x·y−c·y) =
= 2c·(x+c)/(x·y−c·y)

The denominator in the formula above can be simplified as
1 + (y/(x−c))·(2c/y) =
= 1 + 2c·y/((x−c)·y) =
= (x·y−c·y+2c·y)/((x−c)·y) =
= y·(x+c)/(x·y−c·y)


Dividing the numerator
2c·(x+c)/(x·y−c·y)
by denominator
y·(x+c)/(x·y−c·y)
we get the value of tan(α) as
tan(α) = 2c/y

But this is the same value as tan(ψ), which proves that angles ∠α and ∠ψ are equal, which, in turn, proves that lines PS (reflected ray of light) and FX (axis of symmetry of a parabola) are parallel regardless of the position of point P on a parabola.

Therefore, all the rays from the parabola's focus directed in any direction will be reflected in one direction - parallel to the axis of symmetry of this parabola.

Friday, March 28, 2025

Math+ Parabola: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Parabola

Parabola is a class of curves on a plane with the following defining properties.

For each curve of this class (that is, for each parabola) there is one specific point called its focus and a specific straight line called its directrix not going through a focus, such that this parabola consists of all points on a plane (or is a locus of all points on a plane) equidistant from its focus and directrix.

As we see, the position of a focus point F and a directrix d (should not go through focus F) uniquely identifies a parabola.

Obviously, a midpoint of a perpendicular from focus F onto directrix d belongs to a parabola defined by them.
From this point we can draw a parabola point by point maintaining equality of the distance from each point to both a focus and a directrix.

Choosing an X-axis perpendicular to a directrix with an origin of coordinates 0 at midpoint between a focus and a directrix, we will have the following picture of a parabola on a coordinate plane.
We can derive an equation that defines this parabola using the main characteristic property of every point on this parabola to be equidistant from focus and a directrix.

The distance from any point P(x,y) on a parabola to a focus F that has coordinates (c,0) can be calculated using the known formula of a distance between two points.
The distance from point P to a directrix is calculated along a perpendicular from point P to a directrix that is parallel to X-axis.

If the distance between a focus and a directrix is 2c, as on a drawing above, the equality of the distances to a focus and a directrix is
(x−c)²+(y−0)² = x−(−c)

We can simplify this as follows
(x−c)²+(y−0)² = (x+c)²
y² = (x+c)²−(x−c)²
y² = 4c·x

The only parameter that defines the shape of a parabola is c (half of a distance between a focus and a directrix), which is called the focal distance (or focal length) of a parabola.

The perpendicular from a focus to a directrix is an axis of symmetry of a parabola.
The midpoint of this perpendicular is called a vertex of a parabola.

Another item of interest is parabola's focal width. This is the length of a segment drawn through a focus parallel to a directrix with its endpoints being its intersections with a parabola.
Using a formula of a parabola y²=4c·x we determine that if x=c then y²=4c² and y=2c, which is a half of a segment described above.
Therefore, parabola's focal width is 4c.

Let's derive a formula r=r(θ) for a parabola in polar coordinates with an origin at its focus and base axis perpendicular to a directrix.
The distance from point (r,θ) to a focus is, obviously, r.
The distance from this point to a directrix is
AB = AF + FB = 2c+r·cos(θ).
From this the equation of a parabola in polar coordinates is

r = 2c/[1−cos(θ)]


Wednesday, March 26, 2025

Math+ Hyperbola Asymptotes: UNIZOR.COM - Math+ 4All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Hyperbola Asymptotes

Let's start with a definition of an asymptote.
From less rigorous standpoint an asymptote to a curve is a straight line to which a curve goes infinitesimally close as a point of observation on our straight line moves to infinity.

More precisely, we assume that there is a plane with Cartesian coordinates on it.
There is some relationship between coordinates f(x,y)=0 that defines a curve on a coordinate plane that graphically represents it.
This relationship can be expressed as a function of one coordinate of another, like y=1/x or not in a functional way, like x·y=1.

In most cases we will consider the functional representation of dependency between coordinates, like y=f(x).

Vertical (parallel to Y-axis) asymptote to this function at point x=c where c, is a constant, is a line parallel to Y-axis and crossing X-axis at point x=c such that any one particular or any combination of the conditions below is true
limx→c f(x) = +∞ or = −∞
limx→c+0 f(x) = +∞ or = −∞
limx→c−0 f(x) = +∞ or = −∞
where x→c means that an argument x tends to value c in any possible way,
x→c+0 means that an argument x tends to value c while always being greater than c (tending from the right),
x→c−0 means that an argument x tends to value c while always being less than c (tending from the left).

The value of a function at exact point x=c might or might not be defined and is irrelevant.

Here is an example of a function with a vertical asymptote that satisfies the condition
limx→c f(x) = +∞
where f(x)=1/(x−2)² and c=2. Here is an example of a function with a vertical asymptote that satisfies the condition
limx→c+0 f(x) = −∞
where f(x)=ln(x−2) and c=2. Here is an example of a function with a vertical asymptote that satisfies the conditions
limx→c+0 f(x) = +∞
limx→c−0 f(x) = −∞
where f(x)=1/(x−2) and c=2.
In any of the above cases the distance between a point on a vertical asymptote and a point on a curve in the direction perpendicular to an asymptote (that is, along a line parallel to X-axis) tends to zero as an argument tends to point c.

Horizontal (parallel to X-axis) asymptote has a different definition.
First of all, it's about a curve that represents a function getting infinitesimally close to a horizontal line as an argument tends to +∞ or −∞.
Obviously, we assume that this function is defined on an infinite segment.
Line y=c (c is a constant) is an asymptote for a function y=f(x) if
limx→+∞f(x) = c or
limx→−∞f(x) = c
Here is an example of a function with a horizontal asymptote that satisfies the condition
limx→−∞ f(x) = c
where f(x)=2+ex and c=2. The distance between a point on a horizontal asymptote and a point on a curve in the direction perpendicular to an asymptote (that is, along a line parallel to Y-axis) tends to zero as an argument x tends to infinity.

Oblique (or slanted) asymptote for some function y=f(x) is a straight line L described by an equation y=a·x+b (a≠0) which is not parallel to any of the axes of a coordinate system, such that a distance from a point P[p,q]∈L (that is, q=a·p+b) to a point S[x,y] on a curve representing a graph of our function (that is, y=f(x)) in a direction perpendicular to line L (that is, PSL) will be an infinitesimal variable, as a point P on a line L goes to infinity (that is if p→+∞ or p→−∞).

Here is an example of an oblique asymptote that satisfies a condition of infinitesimal closeness of a graph of a function to an asymptote as a value of an argument goes to infinity (positive or negative).

If a function is represented by a graph, and we expect that some straight line is an asymptote, the distance between any point on an asymptote and a graph of a function in a direction perpendicular to an asymptote must tend to zero as our point moves to infinity along an asymptote.
Let's derive the relationship between a function y=f(x) and its asymptote y=a·x+b.
Since PR=QR·cos(φ),
it will tend or not tend to zero exactly as QR=f(x)−a·x−b.
Therefore, if
limx→∞[f(x)−a·x−b]=0
then y=a·x+b is an asymptote.

Let's use this result to find asymptotes of a hyperbola defined by an equation
x²/a² − y²/b² = 1
Let's concentrate on the top right branch of this hyperbola, that we can express as function
y = √x²·b²/a² − b²

Without subtracting a constant under a square root, the equation above would look like y=(b/a)·x - straight line.
This prompts us that a straight line y=(b/a)·x might be an asymptote.
Let's check it out.

limx→∞[x²·b²/a²−b²−(b/a)·x]=
multiply and divide by x²·b²/a²−b²+(b/a)·x
= 0
because we will have a constant −b² in the numerator and a function in the denominator that monotonically tends to infinity.

Analogously, a line y=−(b/a)·x is an asymptote to the top left side of a graph.

Sunday, March 23, 2025

Math+ - Hyperbola Optics: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Hyperbola Optics

Though the word "optics" sounds very much like a topic of Physics, we will consider strictly mathematical aspect of it.
One of the properties of hyperbola is that if its contour is reflective, a ray of light emitted from one of its foci will be reflected along a line that crosses another focus regardless of the direction it was sent.

Consider a hyperbola with foci F1(−c,0) and F2(c,0), point P(x,y) on this hyperbola and tangential AR to a hyperbola at this point P(x,y).

Here F2P is an incident ray, PS is a reflected ray. The optical property of a hyperbola that we want to prove is that reflected ray PS lies on a line that crosses a focus point F1 regardless of a position of point P on a hyperbola.

As mentioned in the lecture Ellipse Optics of this course, reflection of the curve at some point occurs exactly as if the ray of light reflects of the tangential to this curve at the point of incidence.

The segment F1P lies on a continuation of a reflected ray. Angle of F2P with X-axis is α.
Angle of F1P with X-axis is β.
Angle of a tangent to a hyperbola at point P(x,y) with X-axis is γ.
Angle between a tangent and incident ray F2P is φ.
Angle between a tangent and reflected ray F1P is ψ.

To prove that the light emitted from focus F2(c,0) will reflect at point P(x,y) on a hyperbola and will lie on a line that will hit point F1(−c,0), it is sufficient to prove that angles φ and ψ are equal.

Simple considerations, based on the theorem that the sum of angles of any triangle equals to π, lead us to the following equalities
φ = α − γ
ψ = γ − β

All the above angles are in the range from 0 to π. In this interval equality of angles follows from equality of their tangents since in this interval tangent is a monotonic function.
So, let's determine tangents of φ and ψ and prove that they are equal.

In both cases we will use the formula for tangent of a difference between angles.
tan(φ) =
tan(α)−tan(γ)
1+tan(α)·tan(γ)
tan(ψ) =
tan(γ)−tan(β)
1+tan(γ)·tan(β)

Tangents of α and β are easy to calculate since we know coordinates of segments F1P and F2P.

tan(α) = (y−0)/(x−c) = y/(x−c)
tan(β) = (y−0)/(x+c) = y/(x+c)

The issue with tan(γ) is a bit more complicated since we know that this is a slope of a tangential line to a hyperbola at point P(x,y), and we don't have coordinates of a second point A on this line to calculate a slope using the same method as with angles α or β.

However, we know that the same slope is a derivative of variable y, as a function of variable x, describing our hyperbola.

Using the equation of hyperbola in Cartesian coordinates, we can calculate it as follows.
1. Equation of hyperbola is
x²/a² − y²/b² = 1
where parameter a is a semi-major axis of a hyperbola and equals to a distance from the origin of coordinates to a point of intersection of a hyperbola with X-axis,
parameter c is a semi-focal distance and is equal to a distance from the origin of coordinates to a focus point and
parameter b is called a semi-minor axis and is defined by an equation b²=c²−a².
2. Differentiate both sides by x, taking into consideration that y is a function of x
2x/a² − (2y/b²)·y'(x) = 0
(where y'(x) is a derivative of y by x).
3. Find the slope of our tangential to hyperbola line at point P(x,y)
y' = x·b²/(y·a²) = tan(γ)

Now we have all components to calculate tan(φ) and tan(ψ) in terms of calculated tangents of angles α, β, γ.
If we prove that tan(φ)=tan(ψ), we can say that a light ray emitted from focus F2 towards any point P(x,y) on a hyperbola will be reflected from point P(x,y) along a line that crosses focus F1.

So, here are the calculations of tan(φ) and tan(ψ).

We start with calculations of tan(φ) based on values of tan(α) and tan(γ).
We will use the original relation between x and y that reflects the fact that point P(x,y) lies on a hyperbola
x²/a²−y²/b²=1,
from which follows
x²·b²−y²·a²=a²·b²

1. tan(α) − tan(γ) =
= y/(x−c) − x·b²/(y·a²) =
= (y²·a²−x²·b²+c·x·b²)/(x·y·a²−c·y·a²) =
= (−a²·b²+c·x·b²)/(x·y·a²−c·y·a²)


2. 1 + tan(α)·tan(γ) =
= 1 +
[y/(x−c)]·[x·b²/(y·a²)] =
= 1 + x·y·b²/(x·y·a²−c·y·a²) =
= (x·y·a²−c·y·a²+x·y·b²)/(x·y·a²−c·y·a²) =
[since a²+b²=c²]
= (x·y·c²−c·y·a²)/(x·y·a²−c·y·a²)


3. tan(φ) = [tan(α)−tan(γ)] / [1+tan(α)·tan(γ)] =
= (−a²·b²+c·x·b²)/(x·y·c²−c·y·a²)


Now let's calculate the value of tan(ψ) based on values of tan(β) and tan(γ).

1. tan(γ) − tan(β) =
= x·b²/(y·a²) − y/(x+c) =
= (x²·b²+c·x·b²−y²·a²)/(x·y·a²+c·y·a²) =
= (a²·b²+c·x·b²)/(x·y·a²+c·y·a²)


2. 1 + tan(γ)·tan(β) =
= 1 +
[x·b²/(y·a²)]·[y/(x+c)] =
= 1 + x·y·b²/(x·y·a²+c·y·a²) =
= (x·y·a²+c·y·a²+x·y·b²)/(x·y·a²+c·y·a²) =
[since a²+b²=c²]
= (x·y·c²+c·y·a²)/(x·y·a²+c·y·a²)


3. tan(ψ) = [tan(γ)−tan(β)] / [1+tan(γ)·tan(β)] =
= (a²·b²+c·x·b²)/(x·y·c²+c·y·a²)


What remains is to show that two calculated values, tan(φ) and tan(ψ) are the same.
Both are fractions with numerator and denominator. Equality between two fractions P/Q=R/S is equivalent to equality P·S=Q·R.
Let's check equality of our fractions using this method.

Numerator of tan(φ) multiplied by denominator of tan(ψ) is
(−a²·b²+c·x·b²)·(x·y·c²+c·y·a²) =
= −x·y·a²·b²·c² + x²·y·b²·c³ −
− y·a4·b²·c + x·y·a²b²·c² =
= x²·y·b²·c³ − y·a4·b²·c


Denominator of tan(φ) multiplied by numerator of tan(ψ) is
(x·y·c²−c·y·a²)·(a²·b²+c·x·b²) =
= x·y·a²·b²·c² − y·a4·b²·c +
+ x²·y·b²·c³ − x·y·a²·b²·c² =
= − y·a4·b²·c + x²·y·b²·c³


Final expressions for two cases above are identical, which proves that tan(φ)=tan(ψ).

This, as we mentioned above, is a sufficient condition for a ray of light emitted from one focus point of a hyperbola to any direction after reflecting from the hyperbola to end up at the other focus point.

Thursday, March 20, 2025

Math+ - Ellipse Optics: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Ellipse Optics

Though the word "optics" sounds very much like a topic of Physics, we will consider strictly mathematical aspect of it.
One of the properties of ellipse is that if its contour is reflective, a ray of light emitted from one of its foci will come to another foci regardless of the direction it was sent.

Let's start with a mathematical meaning of reflection of the contour of ellipse, which is not a straight line.
We do know what a reflection of the straight line is. Simply speaking, angles of incidence and reflection are equal to each other. Reflection of a curve occurs exactly as if, instead of a curve at the point of incidence, there was a tangential line to a curve, and reflection was of that tangential straight line.

Consider an ellipse with foci F1(−c,0) and F2(c,0), point P(x,y) on this ellipse and tangential to an ellipse at this point P(x,y).
Angle of F1P with X-axis is α.
Angle of F2P with X-axis is β.
Angle of a tangent to an ellipse at point P(x,y) with X-axis is γ.
Angle between a tangent and F1P is φ.
Angle between a tangent and F2P is ψ.

To prove that the light emitted from focus F1(−c,0) will reflect at point P(x,y) on an ellipse and will hit point F2(c,0), it is sufficient to prove that angles φ and ψ are equal.

Simple considerations, based on the theorem that the sum of angles of any triangle equals to &pi, lead us to the following equalities
φ = α − γ
ψ = γ − β + π

All the above angles are in the range from 0 to π. In this interval equality of angles follows from equality of their tangents since in this interval tangent is a monotonic function.
So, let's determine tangents of φ and ψ and prove that they are equal.

In both cases we will use the formula for tangent of a difference between angles and, in case of angle ψ, will take into consideration that tangent is periodic with a period π.
tan(φ) =
tan(α)−tan(γ)
1+tan(α)·tan(γ)
tan(ψ) =
tan(γ)−tan(β)
1+tan(γ)·tan(β)
As we know, the tangent of an angle from a positive direction of the X-axis and a straight line connecting to points on a plane A(ax,ay) and B(bx,by) (usually called a slope of a line) equals to (by−ay)/(bx−ax).

Therefore,
tan(α) = (y−0)/(x+c) = y/(x+c)
tan(β) = (y−0)/(x−c) = y/(x−c)

The issue with tan(γ) is a bit more complicated since we know that this is a slope of a tangential line to an ellipse at point P(x,y), and we don't have a second point on this line to calculate a slope using the same method as with angles α or β.

However, we know that the same slope is a derivative of variable y, as a function of variable x, describing our ellipse.

Using the equation of ellipse in Cartesian coordinates, we can calculate it as follows.
1. Equation of ellipse is
x²/a² + y²/b² = 1
2. Differentiate both sides by x, taking into consideration that y is a function of x
2x/a² + (2y/b²)·y'(x) = 0
(where y'(x) is a derivative of y by x).
3. Find the slope of our tangential to ellipse line at point P(x,y)
y' = −x·b²/(y·a²) = tan(γ)

Now we have all components to calculate tan(φ) and tan(ψ) in terms of calculated tangents of angles α, β, γ.
If we prove that tan(φ)=tan(ψ), we can say that a light ray emitted from focus F1 towards any point P(x,y) on an ellipse will be reflected from point P(x,y) towards focus F2.

So, here are the calculations of tan(φ) and tan(ψ).

We start with calculations of tan(φ) based on values of tan(α) and tan(γ).
We will use the original relation between x and y that reflects the fact that point P(x,y) lies on an ellipse
x²/a²+y²/b²=1,
from which follows
x²·b²+y²·a²=a²·b²

1. tan(α) − tan(γ) =
= y/(x+c) + x·b²/(y·a²) =
= (y²·a²+x²·b²+c·x·b²)/(x·y·a²+c·y·a²) =
= (a²·b²+c·x·b²)/(x·y·a²+c·y·a²)


2. 1 + tan(α)·tan(γ) =
= 1 − x·y·b²/(x·y·a²+c·y·a²) =
= (x·y·a²+c·y·a²−x·y·b²)/(x·y·a²+c·y·a²)


3. tan(φ) = [tan(α)−tan(γ)] / [1+tan(α)·tan(γ)] =
= (a²·b²+c·x·b²)/(x·y·a²+c·y·a²−x·y·b²)


Now let's calculate the value of tan(ψ) based on values of tan(β) and tan(γ).

1. tan(γ) − tan(β) =
= −x·b²/(y·a²) − y/(x−c) =
= (−x²·b²+c·x·b²−y²·a²)/(x·y·a²−c·y·a²) =
= (−a²·b²+c·x·b²)/(x·y·a²−c·y·a²)


2. 1 + tan(γ)·tan(β) =
= 1 +
[−x·b²/(y·a²)]·[y/(x−c)] =
= (x·y·a²−c·y·a²−x·y·b²)/(x·y·a²−c·y·a²)


3. tan(ψ) = [tan(α)−tan(γ)] / [1+tan(α)·tan(γ)] =
= (−a²·b²+c·x·b²)/(x·y·a²−c·y·a²−x·y·b²)


What remains is to show that two calculated values, tan(φ) and tan(ψ) are the same.
Both are fractions with numerator and denominator. Equality between two fractions P/Q=R/S is equivalent to equality P·S=Q·R.
Let's check equality of our fractions using this method.

Numerator of tan(φ) multiplied by denominator of tan(ψ) is
(a²·b²+c·x·b²)·(x·y·a²−c·y·a²−x·y·b²) =
= x·y·a4·b² + x²·y·a²·b²·c −
− y·a4·b²·c − x·y·a²b²·c² −
− x·y·a²·b4 − x²·y·b4·c

Notice that
x·y·a4·b² − x·y·a²·b4 =
= x·y·a²·b²·(a²−b²) =
= x·y·a²·b²·c²

which cancels the analogous term with a minus sign in the above expression.
This leaves our expressions equal to
(a²·b²+c·x·b²)·(x·y·a²−c·y·a²−x·y·b²) =
= x²·y·a²·b²·c − y·a4·b²·c − x²·y·b4·c

Next simplification is
x²·y·a²·b²·c − x²·y·b4·c =
= x²·y·b²·c·(a²−b²) = x²·y·b²·c³

That leaves our expression to
x²·y·b²·c³ − y·a4·b²·c

Denominator of tan(φ) multiplied by numerator of tan(ψ) is
(x·y·a²+c·y·a²−x·y·b²)·(−a²·b²+c·x·b²) =
= −x·y·a4·b² − y·a4·b²·c + x·y·a²·b4 +
+ x²·y·a²·b²·c + x·y·a²·b²·c² − x²·y·b4·c

Notice that
−x·y·a4·b² + x·y·a²·b4 =
= −x·y·a²·b²·(a²−b²) =
= −x·y·a²·b²·c²

which cancels the analogous term with a plus sign in the above expression.
This leaves our expressions equal to
(x·y·a²+c·y·a²−x·y·b²)·(−a²·b²+c·x·b²) =
= − y·a4·b²·c + x²·y·a²·b²·c − x²·y·b4·c

Next simplification is
x²·y·a²·b²·c − x²·y·b4·c =
= x²·y·b²·c·(a²−b²) = x²·y·b²·c³

That leaves our expression to
x²·y·b²·c³ − y·a4·b²·c
Final expressions for two cases above are identical, which proves that tan(φ)=tan(ψ).

This, as we mentioned above, is a sufficient condition for a ray of light emitted from one focus point of an ellipse to any direction after reflecting from the ellipse to end up at the other focus point.

Monday, March 17, 2025

Math+ Hyperbola: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Hyperbola

Hyperbola is another class of curves on a plane with the following defining properties.

For each curve of this class (that is, for each hyperbola) there are two specific points called foci (plural of focus) or focuses and a specific positive real number called its major axis that is supposed to be smaller than the distance between its two foci, such that this hyperbola consists of all points on a plane (or is a locus of all points on a plane) with the difference of their distances to its two foci equaled to this hyperbola's major axis, a constant for all points on a hyperbola.

As we see, the position of two focus points F1, F2 and a major axis 2a (should be smaller than the length 2c of segment F1F2) uniquely identify a hyperbola.

Assume, we fix the position of foci F1 and F2 with the distance between them being 2c. The parameter c is called a focal distance.
Assume further that we have chosen the length of a major axis 2a smaller than the length 2c of a focal line F1F2 between foci.
Consider a hyperbola on a Cartesian coordinate plane with foci to be at points F1(−c,0) and F2(0,c).

The following drawing represents a hyperbola defined by these two parameters, a half of focal distance c and a half of major axis a.

Point O is a midpoint of segment F1F2.
Points A and B are located on the focal line F1F2 on a distance a from point O.

Point A lies on this hyperbola because the distance from A to F1 is c−a and the distance from A to F2 is a+c. The difference of these distances is
(a+c)−(c−a)=2a,
which is a condition of any point on a hyperbola.

Similarly, point B lies on a hyperbola because BF1=a+c, BF2=c−a and
BF1−BF2=(a+c)−(c−a)=2a,
which is a condition of any point on a hyperbola.

Segment AB constitutes the major axis of this hyperbola.

Our goal is to find an equation for x and y that describes the condition on point P(x,y) to lie on a hyperbola with focal distance 2c and major axis length of 2a.

The definition of hyperbola requires that difference between distances from point P(x,y) to foci is 2a.
Therefore, an equation on x and y that is a necessary and sufficient condition for point P(x,y) to lie on our hyperbola is
(x+c)²+y² − √(x−c)²+y² = 2a

Fortunately, this cumbersome equation can be converted into a much simpler form

x²/a² − y²/b² = 1

where parameter b is called a minor axis and is defined by an equation b²=c²−a².

The transformation into this simple form is straightforward but, unfortunately, lengthy and boring.
(x+c)²+y² − √ (x−c)²+y² = 2a
(x+c)²+y² = 2a + √ (x−c)²+y²

Square both sides of an equation
x²+2xc+c²+y² =
= 4a²+4a√ (x−c)²+y²+x²−2xc+c²+y²

Cancel equal terms on both sides of an equation
2xc = 4a²+4a√ (x−c)²+y²−2xc
Separate a square root
into the left side of an equation

−4a√ (x−c)²+y² = 4a²−4xc
Divide both sides by 4 and square the equation
a²x²−2a²xc+a²c²+a²y² =
= a4−2a²xc+x²c²

Cancel equal terms on both sides of an equation and combine expressions for variables x and y in the right side of an equation
a²(c²−a²) = (c²−a²)x²−a²y²,
Using relation b²=c²−a²,
it can be simplified

b²x²−a²y² = a²b²
Divide both sides by a²b²
to get a canonical equation for hyperbola

x²/a² − y²/b² = 1

Our next task is to represent a hyperbola in polar coordinates r (a distance from a center of polar coordinates to a point under consideration) and θ (angle from a polar axis to a direction from a center to a point under consideration).

It makes sense to set a polar base axis coinciding with the line between foci with an origin of polar coordinates positioned at one of the foci of a hyperbola. Then the distance from any point P(r,θ) of a hyperbola to this focus is r.
The difference between this value and a distance from P(r,θ) to another focus should be equal to 2a by absolute value.

While it seems more natural to take a midpoint between the foci as an origin of polar coordinates, the equation is simpler to derive if this origin is at the focus of a hyperbola because a distance to one of the foci in this case would be just equal to r.

Another reason for choosing a focus as an origin of polar coordinates is that, when we will analyze the trajectory of an object in the central gravitational field, we will derive the equation of this trajectory in polar coordinates originated at the source of gravity - a single focus point, the only fixed point we would know.

For definitiveness, let's concentrate on the right branch of a hyperbola and place the origin of polar coordinates at focus F2.

Consider a triangle ΔPF1F2 formed by two foci and a point P(r,θ) on the hyperbola.
The length of side PF2 is r.
We can use the Law of Cosines to determine PF1 by two other sides PF2=r and F1F2=2c.
(PF1)²=r²+4c²−4rc·cos(π−θ)=
= r²+4c²+4rc·cos(θ)


This allows to get an equation of a hyperbola in polar coordinates with a focus F2 as the origin of coordinates:
r²+4c²+4r·c·cos(θ) − r = 2a

Now we have to transform it into a form of distance r of point P from an origin of polar coordinates (focus F2) as a function of a polar angle θ.

Straight forward way is to separate a square root in one side of an equation and square both sides.
r²+4c²+4r·c·cos(θ) = r + 2a
r²+4c²+4r·c·cos(θ) =
= r²+4a·r+4a²

4a·r−4r·c·cos(θ) = 4c²−4a²
[a−c·cos(θ)] = c²−a²
Therefore,
r= (c²−a²)/[a−c·cos(θ)] =
= a²(c²/a²−1)/
[a−c·cos(θ)]

The ratio e=c/a is called eccentricity of a hyperbola and its greater than 1. Using it, the formula can be transformed to

r= a·(e²−1)/[1−e·cos(θ)]

The above is an equation of a hyperbola in polar coordinates, when one of the foci is taken as an origin of coordinates and polar base axis is directed from it towards another focus.
This equation is expressed in terms of semi-major axis a and eccentricity e=c/a, where c is half of a distance between foci.

The distance r from focus F2 is not always positive for all values of polar angle θ, which means that the ray from F2 will not intersect a hyperbola in every direction.

For example, for θ=0 cos(θ)=1, and the denominator in the equation above will be negative, which means there is no point of a hyperbola in that direction.
On the other hand, for θ=π cos(θ)=−1, and
r= a·(e²−1)/[1+e] = a·(e−1) =
= c − a


In general, r is positive and defines a particular point on a hyperbola when 1−e·cos(θ) is positive, which is true for angle θ to be outside of interval from −arccos(1/e) to arccos(1/e), which is the same as from −arccos(a/c) to arccos(a/c).

Friday, March 14, 2025

Math+ Ellipse: UNIZOR.COM - Math+ 4 All - Geometry

Notes to a video lecture on http://www.unizor.com

Geometry+ Ellipse

Certain curves on a plane have common properties that allow to combine them into classes.

For example, a circle is a class of curves with the following property.
For each curve of this class (that is, for each circle) there is one specific point called its center and a specific positive real number called its radius, such that this circle consists of all points on a plane located on the distance equaled to this circle's radius from its center.

Ellipse is another class of curves on a plane with the following defining properties.
For each curve of this class (that is, for each ellipse) there are two specific points called foci (plural of focus) or focuses and a specific positive real number called its length of major axis that is supposed to be greater than the distance between its two foci, such that this ellipse consists of all points on a plane (or is a locus points on a plane) with the sum of their distances to its two focuses equaled to this ellipse's length of major axis.

As we see, the position of two focus points F1, F2 and a length of major axis 2a (should be greater than the length of segment F1F2) uniquely identify an ellipse.

Using this definition of ellipse, we can draw it on the board using two nails and a thread by fixing a thread on both ends at the nails, tightening a thread using a pencil and moving a pencil around keeping a thread tightened.
The result of this procedure will be an ellipse.

Below is a drawing of such an ellipse and the prove that, if a sum of distances from any point on an ellipse to its two foci is 2a, the length of its major axis - a segment P4P3 between intersections of a focal line F1F2 with an ellipse - is also 2a.

Assume, we fix the position of foci F1 and F2 with the distance between them being 2c. The parameter c is called a focal distance.
Assume further that we have chosen the length of a major axis 2a greater than the length 2c of a focal line F1F2 between focus points.
The following drawing represents an ellipse defined by these two parameters, half a major axis a and half of focal distance c.
Let point O be a midpoint of segment F1F2.
Point C is an intersection of an ellipse with a midpoint perpendicular to F1F2, so OCF1F2.
Points A and B are intersection of the focal line F1F2 with an ellipse.

Consider an ellipse on a Cartesian coordinate plane with foci to be at points F1(−c,0) and F2(0,c).

Point A(−a,0) lies on the X-axis and on this ellipse because the distance from A to F1 is a−c and the distance from A to F2 is a+c. The sum of these distances is 2a, which is a condition of any point on an ellipse.

Similarly, point B(0,a) lies on the X-axis and on this ellipse because the distance from B to F1 is a+c and the distance from B to F2 is a−c. The sum of these distances is 2a, which is a condition of any point on an ellipse.

Segment AB constitutes the major axis of this ellipse.


Consider points C(0,b) and D(0,−b), where b is a positive real number that satisfies the Pythagorean equation b²=a²−c² and, therefore, a²=b²+b².
Point C lies on the Y-axis and on this ellipse because the distance from C to F1 is (0+c)²+(b−0)²=a and the distance from C to F2 is (0−c)²+(b−0)²=a. The sum of these distances is 2a, which is a condition of any point on an ellipse.

Similarly, point D lies on the Y-axis and on this ellipse because the distance from D to F1 is (0+c)²+(−b−0)²=a and the distance from D to F2 is (0−c)²+(−b−0)²=a. The sum of these distances is 2a, which is a condition of any point on an ellipse.

Our goal is to find an equation for x and y that describes the condition on point P(x,y) to lie on an ellipse with focal distance 2c and major axis length of 2a.

The definition of ellipse requires that sum of distances from point P(x,y) to both foci is 2a.
Therefore, an equation on x and y that is a necessary and sufficient condition for point P(x,y) to lie on our ellipse is
(x+c)²+y² + √(x−c)²+y² = 2a

Fortunately, this cumbersome equation can be converted into a much simpler form

x²/a² + y²/b² = 1

where substitutes a²−c².

The transformation into this simple form is straightforward but, unfortunately, lengthy and boring. We have decided to omit it from this lecture and recommend you to do it yourself by repeatedly separating one of the square roots into one side of an equation and squaring both sides.

Alternatively, you can look at any Web page that presents this transformation in details.
For example, detail steps of such a transformation can be found at
https://courses.lumenlearning.com/odessa-collegealgebra/chapter/deriving-the-equation-of-an-ellipse-centered-at-the-origin/

Our next task is to represent an ellipse in polar coordinates r (a distance from a center of polar coordinates to a point under consideration) and θ (angle from a polar axis to a direction from a center to a point under consideration).

It makes sense to set a polar base axis coinciding with the line between foci with an origin of polar coordinates positioned at the focus F1 of an ellipse and direction of a base axis to be towards another focus. Then the distance from any point P(r,θ) of an ellipse to this focus is r.
This value, summed with a distance from P(r,θ) to another focus F2, should be equal to 2a.

While it seems more natural to take a midpoint between the foci as an origin of polar coordinates, the equation is simpler to derive if this origin is at the focus of an ellipse.

Consider a triangle ΔPF1F2 formed by two foci and a point P(r,θ) on the ellipse.
We can use the Law of Cosines to determine PF2 by two other sides PF1=r and F1F2=2c.
(PF2)²=r²+4c²−4r·c·cos(θ)

This allows to get an equation of an ellipse in polar coordinates with a focus as an origin of coordinates as
r + √r²+4c²−4r·c·cos(θ) = 2a

Now we have to transform it into a form of distance r of a point P from an origin of polar coordinates (focus F1) as a function of a polar angle θ.

Straight forward way is to separate a square root in one side of an equation and square both sides.
r²+4c²−4r·c·cos(θ) = 2a − r
r²+4c²−4r·c·cos(θ) =
= 4a²−4a·r+r²

4a·r−4r·c·cos(θ) = 4a²−4c²

Using a²=b²+c², the equation for an ellipse would be
[a−c·cos(θ)] = b²
Therefore,
r= b²/[a−c·cos(θ)]

The ratio e=c/a is called eccentricity of an ellipse. Using it, the formula can be transformed to
r= (b²/a)/[1−(c/a)·cos(θ)]
or, using again formula a²=b²+c²,

r= a·(1-e²)/[1−e·cos(θ)]

The above is an equation of an ellipse in polar coordinates, when one of the foci is taken as an origin of coordinates and polar base axis is directed from it towards another focus.
This equation is expressed in terms of semi-major axis a and eccentricity e=c/a, where c is half of a distance between foci.

Let's fix the major axis of an ellipse and change the distance between foci.
The smaller the distance between the foci (that is, the smaller c with fixed a) - the more our ellipse resembles a circle. When e=0 the equation of an ellipse looks like an eqiation of a circle in polar coordinates:
r = a
(that is, constant distance from the origin)

If we increase the eccentricity, that is if e increases to its maximum value of 1, its foci get closer and closer to endpoints of a major axis and the ellipse's size in a direction perpendicular to its major axis (that is, 2b) decreases to zero.

Sunday, February 9, 2025

Physics+ Newton Laws, Planet Orbits: UNIZOR.COM - Physics+ - Laws of Newton

Notes to a video lecture on UNIZOR.COM

Laws of Newton -
Planet Orbits


Kepler's First Law states that all planets move around the Sun on elliptical orbits with the Sun in one of the two focus points of their orbits.
Kepler had come up with this law based on many years of observation, that is, experimentally.

On our quest to analytically prove the First Kepler's Law we begin with the proof that the trajectory of a point-mass moving in a central gravitational field produced by another point-mass fixed at some location in space is a flat curve, that is a curve all points of which belong to the same plane.

Consider a gravitational field produced by a point-mass M fixed at some point O in space.
Assume, a point-mass m is moving in this field and at some moment of time t is at point P.

Gravitational field is a central force field, and in the Central Force Field lecture of this chapter of this course we have proven the Angular Momentum Conservation Law.
Here is a short recap.

The point O, the fixed location of the source of gravitational field, is chosen as the origin of coordinates.
Let r(t) be a position vector OP of an object moving in this field at time t.
Let v(t) be the derivative of this position vector by time, that is a vector of velocity of our point-mass m:
v(t) = (dr/dt)(t) = r'(t).
Let a(t) be the derivative of a velocity vector by time, that is a vector of acceleration of our point-mass m, that is the second derivative of a position vector:
a(t) = v'(t) = r"(t).

We assume that our object does not move directly towards or from the center of gravity O, that is vectors r and v are not collinear.
If vectors of position and velocity are collinear, an object would move along a straight line towards or away from the source of gravity, which is a trivial case that we will not consider here.

The Angular Momentum Conservation Law states that vector L(t)=m·r(t)v(t) is a constant of motion, that is L(t) does not depend on time, which is equivalent to its derivative by time is a zero-vector:
dL(t)/dt = L'(t) = 0

The proof is simple and is based on the fact that a vector product of collinear vectors is a zero-vector.
L'(t) = d[m·r(t)v(t)]/dt =
=
[r(t)v'(t) + r'(t)v(t)] =
= m·
[r(t)a(t) + v(t)v(t)]
The first vector product is a zero-vector because, according to the Second Law of Newton acceleration of an object is collinear with the force, which is central, that is collinear with position vector r(t). So, r(t) and a(t) are collinear and, therefore, their vector product is a zero-vector.
The second vector product in the above expression is a zero-vector because the vector product by itself is always a zero-vector, since any vector is collinear to itself.

Since the derivative L'(t) is zero-vector, the Angular Momentum vector L(t) is independent of time and is fixed in space, which allows us to omit (t) from its value.

Recall that the result of a vector product of two vectors is a vector perpendicular to each of them.
Therefore,
Lr(t) and Lv(t).

Let's construct a plane in space going through vector r(t) (that is, points O and P) and vector v(t) (that is, through endpoint of this vector, thus having three points that define a plane). This plane will be perpendicular to a constant vector L.

Let's define a Cartesian system of coordinates in space with XY-plane to be the plane that we just constructed based on vectors r and v.
The Z-axis in our system of coordinates can be chosen coinciding with vector L that is perpendicular to XY plane.

Both vectors r(t) and v(t) lie in the constructed XY plane, and their Z-coordinate is zero.
Hence, the incremented position of our object at infinitesimally incremented time t+dt, that is a vector
r(t+dt) = r(t) + v(t)·dt
will also lie in the same plane, it Z-coordinate is zero, which means that the movement from point P will continue within the same XY plane.

Repeating this logic to consecutive incremental positions, we see that an entire trajectory will lie in our XY plane.
In other words, Z-coordinate of vector r(t) is zero at any time t.

This concludes the proof that a trajectory of an object in the central gravitational field is a curve lying in one plane.

Wednesday, January 15, 2025

Physics+ Newton Laws, Gravity Two Objects: UNIZOR.COM - Physics+ - Laws ...

Notes to a video lecture on UNIZOR.COM

Laws of Newton -
Gravitation of Two Objects


We recommend to refresh your knowledge of the gravitational field in three-dimensional space and the concept of field potential (scalar defined at each point of a field) using the material presented in this chapter in lectures
Field & Potential,
Work Lemmas,
Potential Theorem and
Problem 1.

The Problem 1 in this chapter of this course presented a proof that gravitation field of a point-mass M located at some point Q in three-dimensional space is conservative.

The work performed by this field when it moves some test object of mass m from point A to point B depends only on the location of these endpoints of its trajectory.

The potential of the field at point P is an amount of work needed by a field force to move a test object of a unit mass from point P to infinity.

Also, the gradient of a potential equals to the field intensity - the force (vector) acting on a test object of unit mass at any point.

Earlier in this course the gravitational force F(P) at some point P of the field was presented as
F(P) = F(r) = −G·M·m·r/
where r=QP is the relative position vector from the source of gravity Q to point P,
r is the magnitude (scalar) of a relative position vector r,
M is a mass of an object that is the source of gravitation,
m is a mass of a test object and
G is the Universal Gravitational Constant.

Of course, r²=x²+y²+z²
where (x,y,z) are Cartesian coordinates of vector QP (point Q is the source of gravity).

In coordinate form, components of the gravitational force vector F(P)=F(r) are:
Fx(P)=−G·M·m·x/(x²+y²+z²)3/2
Fy(P)=−G·M·m·y/(x²+y²+z²)3/2
Fz(P)=−G·M·m·z/(x²+y²+z²)3/2

The potential of a gravitational field was determined as a scalar function defined at any point P of the field as
U(P) = U(r) = G·M/r =
= G·M/(x²+y²+z²)1/2 =
= G·M·(x²+y²+z²)−1/2


Its gradient, a vector of partial derivatives by coordinates, is
U(r)={∂U/∂x, ∂U/∂y, ∂U/∂z}
which should be equal to a vector of gravitational field intensity (the force per unit of test object's mass).

Indeed,
∂U(P)/∂x =
= −½G·M·(x²+y²+z²)−3/2·2x =
= −G·M·x/(x²+y²+z²)3/2 =
= (1/m)·Fx(P)


Similarly, partial derivatives by y and z produce the corresponding components of the gravitational field intensity - the force acting on a unit of mass of a test object.
∂U(P)/∂y =
= −G·M·y/(x²+y²+z²)3/2 =
= (1/m)·Fy(P)

∂U(P)/∂z =
= −G·M·z/(x²+y²+z²)3/2 =
= (1/m)·Fz(P)


We came to conclusion that
U(P)={∂U/∂x,∂U/∂y,∂U/∂z}=
= (1/m)·(Fx(P),Fy(P),Fz(P)) =
= E(P)

where E(P) is a vector of field intensity (the force per unit of test object's mass) at point P.

Let's determine a potential of the gravitational field produced by two point-masses Ma and Mb fixed at positions A and B correspondingly within some inertial frame of reference.

At any point P in space we can consider two positional vectors:
ra=AP originated at A (first source of gravitation) and ending at point P and
rb=BP originated at B (second source of gravitation) and ending at the same point P.

Assume, coordinates of all points involved are
A(xa,ya,za)
B(xb,yb,zb)
P(xp,yp,zp)
Then the coordinates of vectors ra and rb are
ra = (xp−xa, yp−ya, zp−za)
rb = (xp−xb, yp−yb, zp−zb)

According to a principle of superposition, the combined gravitational force of our two sources acting on a test object of mass m positioned at point P should be a vector sum of two separate forces, that is
F(P) = −(G·Ma·m)·(ra/ra³) −
− (G·Mb·m)·(rb/rb³)


The field intensity (the force per unit of mass of a test object) at point P, therefore, is
E(P) = −(G·Ma)·(ra/ra³) −
− (G·Mb)·(rb/rb³)

Notice, the above formula involves the addition of vectors.

From the fact that field potential is an amount of work needed to move a test object of a unit mass from its initial location P to infinity and that the work is, generally speaking, an additive quantity intuitively follows that potential of two fields should be equal to a sum of potentials.

More rigorously, work is an integral of a scalar product of force by differential of distance
W = [P,∞](F·dl)

If F=Fa+Fb, the work of a combined force is the sum of work of its components
W = [P,∞]((Fa+Fb)·dl) =
= [P,∞](Fa·dl) + [P,∞](Fb·dl) =
= Wa + Wb

which proves additive characters of work and, consequently, of field potential in case a field is generated by more than one source.

The sum of potentials involves the addition of scalars, which is easier than addition of vectors of forces, so dealing with potentials is a preferable way.

Now consider a different scenario.

Assume, our two objects of mass Ma and Mb are not fixed at particular points A and B in space, but can move freely, and their corresponding position vectors are ra and rb.
Assume further that there are no external forces that can influence the movement of our system of two objects, and the only force that somehow affects their movement is the force of gravitation of one to another.
We will prove that in this case a center of mass of these two objects moves uniformly in space along a straight line with constant velocity.

The only force acting on the a-object with mass Ma is the gravitation force sourced at b-object with mass Mb.
This force is directed along a straight line connecting these two objects and its magnitude is
Fab(t) = G·Ma·Mb/d²(t).
where d=|rarb| is a time-dependent distance between these objects.

In a more appropriate vector form it looks like
Fab(t) =
= G·Ma·Mb·
[ra(t)−rb(t)]/d³(t).

Similarly, the only force acting on the b-object with mass Mb is the gravitation force sourced at a-object with mass Ma.
This force is directed along a straight line connecting these two objects and its magnitude is
Fba(t) = G·Mb·Ma/d²(t).

In a more appropriate vector form it looks like
Fba(t) =
= G·Ma·Mb·
[rb(t)−ra(t)]/d³(t).

So, the magnitudes of these forces, Fab(t) and Fba(t), are the same, but directions are opposite to each other, Fab(t) is directed from a-object to b-object, while Fba(t) is directed from b-object to a-object:
Fab(t) = −Fba(t)

Using the Newton's Second Law, knowing the forces and masses, we obtain equations for accelerations of these two objects, second derivatives of position vectors ra" and rb".
ra" · Ma = Fab
rb" · Mb = Fba
In the equations above and following it is assumed that all variables except masses are time-dependent functions. We just skip (t) for brevity.

If masses Ma and Mb are positioned at ra and rb, the center of mass is at position
r = (Ma·ra + Mb·rb)/(Ma+Mb)
The acceleration of this point is it's second derivative of position
r" = (Ma·ra" + Mb·rb)"/(Ma+Mb) =
= (Fab + Fba)/(Ma+Mb) = 0

because, as we stated above, Fab(t)=−Fba(t)

Since acceleration of the center of mass is zero, it velocity is a constant vector, which means that the center of mass in the absence of external forces moves along straight line with the same velocity.