Whittle : EXTRAGALACTIC ASTRONOMY
8. STELLAR DYNAMICS II : 3-D SYSTEMS
(1) Introduction
We have, of course, already begun our study of Stellar Dynamics :
Topic 6 considered the highly
restricted situation of nearly circular motion in cool galaxy disks.
Here we broaden the discussion considerably to consider motion within more
general 3-D systems.
In large part, these notes follow (though simplify) the treatment in B&T.
(a) Gas/Fluid Physics and Stellar Dynamics
To set the stage, lets first compare stellar systems with atomic
(or molecular) gases :
(b) A Path Through the Subject
There are a number of themes to cover, and chosing the right sequence isn't
straightforward
Here is an outline to help navigate the upcoming (sometimes dense) material.
(2) Potential Theory
(a) Preliminaries
- We initially characterize mass distributions as smooth functions
(r)
(this is usually legitimate for galaxies, see § 8.10 below)
- The gravitational potential energy is a scalar field
its gradient gives the net gravitational force (per unit mass)
which is a vector field :
|
(8.1a) (8.1b) |
- evaluating the divergence of F(r) gives :
|
(8.2a) (8.2b) (8.2c) |
8.2b is Poisson's equation, for locations within the mass distribution
8.2c is Laplace's equation, for locations outside the mass distribution
- For a volume V with surface A enclosing mass M we have
(using Divergence/Gauss's Theorem) :
|
(8.3a) (8.3b) |
- Since the force field is the gradient of a potential, it
is conservative,
ie the energy required to move mass from
r1 to r2 is independent of the path
the total Potential Energy is therefore well defined
setting
= 0 at r =
we
get (B&T-2 p 59) :
|
(8.4) |
Note that, with this definition, potential energy is always negative
(b) Selected Examples of Density-Potential Pairs
Often, choosing a simple form for
(r)
[or
(r)] yields a complex form for
(r)
[or
(r)]
There are, however, a number of useful illustrative analytic
(r)
(r) pairs :
(i) Point Mass
(r) = - GM / r   ;   F(r) = -
= -d
/dr = -GM / r2
Vc2(r) = GM / r = -
(r)   ;
 
Vesc2(r) = 2GM / r = - 2
(r)
where Vc & Vesc are the circular and escape velocities,
respectively.
This is called a Keplerian Potential, since it pertains
to the solar system.
(ii) Uniform Spherical Shell
Outside :
(r) = - GM / r   (Keplerian)
Inside :
(r) = const   ;   F(r) = 0
(iii) Homogeneous Sphere
Sphere radius = a, with
(r) = const (r < a)
Outside :
(r) = - GM / r   (Keplerian)
Inside :
(r) = -2
G
(a2 - r2/3)   ;  
Fr = -G M(r) / r2 = -(4/3)
G
× r
which gives SHM with period Pr = (3
/ G
)½     and free-fall tff ~ ¼ Pr ~ (G
)-½
Vc =
[(4/3)
G
]½ × r   so that  
(r) = const    
solid body rotation
note also that Pc = Pr
(iv) Logarithmic Potentials from Flat Rotation Curves
Many rotation curves are flat at large radii : Vc = Vo, so we have :
|
(8.5) |
(v) Spherical Systems
- Power Laws :
=
o (r/a)-
have M(<r) = (4
G a3
o) / (3 -
) × (r / a)3-
and
(r) = -(4
G a2
o) / [(3 -
)(
- 2)] × (r / a)2-
= Vc2 / (
- 2)
= 3 is a break point:
For
  >   3,   M(<r)
  for r
0 : we have infinite mass at the origin.
For
  <   3,  
M(<r)
  for
r
: mass diverges at large r.
However for 2 <
< 3 the potential is finite, as are Vc and Vesc, at all radii.
The case
  =   2 is special : it is the
singular isothermal sphere
with Vc   =   (4
G a2
o)½   =   const at all radii, yielding
(r) = 4
G
a2
o ln(r / a)
See § 8.8a,b for other isothermal and related (King) spheres
- Hernquist (1990) and Jaffe (1983) models : have
r-4 at large r
which fits E gals well, and is theoretically grounded in violent relaxation
at small r, Jaffe core is steeper than Hernquist core :
|
(8.6a)   (8.6b) |
- Plummer (1911) Sphere : is analytic solution of hydrostatic support for
polytropic stellar system of index 5; see § 8.8c :
(r) matches GCs well, but is too steep
at large r for Ellipticals (
r-5).
|
(8.7) |
- Plummer; Isothermal; Jaffe; and Hernquist density laws are
shown here : [ image]
(vi) Axisymmetric Thin Disks
- Before considering global potentials for disks, first consider the vertical potential near z = 0
We have two conditions :
within a disk of volume density
o near
the plane
above a disk of surface density
Using equation 8.3b we have :
|
(8.8a) (8.8b) |
- Usually, calculating global
and F for disks
is algebraically dense.
Unlike spherical systems, disk potentials usually depend on mass outside R.
Here are a few simple examples :
- Mestel's disk :
(R) =
o Ro / R, has constant
Vc :
Vc2(R) = 2
G
o Ro = GM(<R) / R
this is unusual in that Vc(R) doesn't depend
on mass outside R
- Exponential disk :
(R) =
oexp(-R/Rd)
this fits the light profile of sprial disks much better than Mestel's disk, and has circular velocity
|
(8.9) |
where y = R / 2Rd, and In Kn are Bessel functions
or the 1st and 2nd kind
see [eqn 6.7] for an analytic approximation and rotation curve.
- Kuzmin (1956) disk : has a simple form for both R and z potential
|
(8.10a) |
- Note that because Poisson's equation is linear :
differences between any two
pairs is
also a
pair, and
differentials of
or
(eg w.r.t. shape constants) are also
pairs
for example :
- Toomre (1962) disks : a series of n disks derived from Kuzmin
disks by differentiation w.r.t. a2 :
(n=1 is a Kuzmin disk, n=
is a Gaussian disk
|
(8.10b) |
- Unfortunately, inverting Vc(R) to get
(R) is very sensitive to data noise (eg spiral
perturbations).
(vii) Axisymmetric Flattened Systems
Spirals with bulge and disk are, of course, neither just spherical nor just thin disks
We need potentials which are both combined, ie flattened potentials
- Miyamoto-Nagai (1975) flattened system [ images ]:
reduces to the Plummer model if a=0 and the Kuzmin disk if b=0
(Satoh flattened systems are derived in similar manner to the Toomre disks) [ images ] :
|
(8.11a)   (8.11b) (8.11c) |
(viii) Triaxial Ellipsoids
- more complicated, (see B&T-2 § 2.5)
(ix) Multipole Expansion
An arbitrary mass distribution
sums of spherical
shells of non-uniform surface density.
Calculating the potential involves solving
2
= 0 in spherical polar coordinates
Solutions involve spherical harmonics : Yl,m
(
,
)
Pl|m|( cos
) exp(i m
)
where Pl|m|(x) are associated Legendre functions.
The potential
(r,
,
) is the sum of a monopole (l=0), a dipole (l=2)
quadrupole (l=4) etc...
each with associated amplitudes
For coordinates with origin at center of mass :
- monopole : largest term; associated with Vc(r) ~
(GM / r)½
- dipole : zero outside the mass (because no -ve mass)
- quadrupole : first significant non-spherical term
(3) Orbit Classes
TBD
(4) Numerical N-Body Methods
Several methods are used :
See B&T-2 § 2.9 and
Josh Barnes's nice writeup for more details : 
(5) The Virial Theorem
This fundamental result describes how the total energy (E) of a self-gravitating
system is
shared between kinetic energy (K) and potential energy (W)
Specifically, we are interested in their ratio :
= K / |W|   (note K is always +ve, W always -ve)
We begin by looking at two illustrative cases and then deal with the general case.
(a) Simple Illustrations
(i) Circular Orbit
-
Consider a satellite mass m in circular orbit about M (>>m) :
m V2 / r = G m M / r2
multiply by r :  
m V2 = G m M / r    
    2K = -W   or 2K + W = 0
=
K / |W| = ½ and E = - K
Kinetic energy is half the (-ve) potential energy
The total energy E = K + W is -ve and equal to (minus) the kinetic energy
-
As we shall see,
= ½ is a characteristic shared by a wide range of systems.
Note that in this case, the instantaneous values are also equal to the time averaged values
(ii) Time Averaged Keplerian Orbit
-
In general,
= K / |W| changes along a
Keplerian orbit path [image].
eg compare
at pericenter and apocenter :
p /
a = ra / rp
1   (using rp Vp = ra Va from AM conservation)
-
However, taking time averages over an orbit, we find :
< -W > = < GM/r > = GM < 1/r > = GM × (1/a), and
< K > = < ½ V2 > = GM < (1/r - 1/2a) > = ½GM × (1/a)
and we recover, once again : <
> = ½ and E = -< K >
-
Note that time averages for single non-Keplerian orbits do not usually
have <
> = ½
As we will see, however,
= ½ always holds
when we average over all particles in a system
For our Keplerian orbit, m and M are the whole system
(with M having ~zero KE)
(b) The General Case
The general case comprises an isolated system of self-gravitating masses
Once again, we ask what is
, the ratio of
kinetic to potential energies
-
There are 3 equations of motion for member
(i represents x, y, z) :
|
(8.12) |
- take the 1st moment in position : multiply by xj
and sum over
(j represents x,y,z)
dimensionally, we have changed an equation of forces into an
equation of energies
after some algebra, we get a set of 9 equations
these can be neatly written using 3 × 3 matrices (ie tensors of order 2)
this set of equations constitute the Tensor Virial Theorem :
|
(8.13) |
where the five tensors are :
|
(8.14) (a,b,c,d,e) |
- For steady state systems, d2Iij / d t2 = 0
  and we get
|
(8.15a) |
the Kinetic and potential energies are related for each tensor element
for example, they are related separately along each axis
- Considering just the diagonal terms, we also have :
Trace(T)   +   ½ Trace(
)  
  K   =   total kinetic energy, and
Trace(W)  
  W   =   total potential energy
so for the static case, we get the Scalar Virial Theorem :
|
(8.15b) |
- Considering the total energy, E, we find :
|
(8.15c) |
So the total energy is negative : the system is bound !
its value is equal to either
minus the (+ve) Kinetic Energy, or
half the (-ve) Potential Energy
- Here is a very useful little diagram to illustrate the situation :
[image]
- Briefly reviewing the conditions necessary to use these simple equations :
the system must be self gravitating
the system must be in steady state (orbit timescale << evolution timescale)
quantities must be time averaged (or many objects sampled with random
orbital phase)
the system must be isolated (or at least embedded in a slowly varying
potential)
Note that the system may be either collisionless (stellar) or collisional (gaseous)
(c) Mass Determination
- The most famous use of the virial theorem is to determine the masses
of stellar systems.
For a system of total mass M and mean squared velocity <v2>,
K is simply ½ M <v2>
the virial theorem then gives :
<v2> = -W / M
GM / Rg
which in practice defines :the gravitational radius : Rg
Knowing Rg and measuring <v2> allows us to
determine M, the system mass.
What to use for Rg isn't obvious for most stellar systems
with no clear "edge" or "size"
However, we can make use of the median radius : Rm which
encloses half the mass
For many stellar systems, it turns out that Rm  
  0.4 Rg  
(note Rm is written rh in B&T)
we then have :
|
(8.16) |
- notice that for our circular Keplerian orbit, we recover the simple
relation : <v2> = G M / R
(d) Binding Energy : Energy Released During Collapse
(e) Stellar Systems Have Negative Specific Heat
(f) Rotational Flattening
- Consider an axisymmetric system rotating about the z axis
By symmetry :
T,
, and W are all diagonal
x & y elements of these tensors are the same
- The tensor virial theorem gives :
2 Txx   +  
xx
  +   Wxx   =   0
2 Tzz   +  
zz
  +   Wzz   =   0
- We also have :
Tzz   =   0   (rotation about z  
  no drift
 
  to z)
2 Txx   =   ½
<V
>2
d3r   =   ½ M Vo2
  (Vo is the mass weighted rotation speed)
xx   =   M
o2  
(
o is the mass weighted dispersion)
zz  
 
(1 -
)
xx  
=   (1 -
) M
o2   (
  <   1, measures anisotropy)
Wxx / Wzz  
 
(A/B)0.9   =   (1 -
)
-0.9 (A/B is axis ratio of isodensity surfaces)
- Finally, substituting all these into the ratio of the two tensor relations
above, we get :
|
(8.17a) |
B&T-1 fig 4.5 shows this relation for several
,
including projection corrections [ images ]
for isotropic velocities,
  =   0,
and we get, for small
:
|
(8.17b) |
- In this case, the inclination
corrections to Vo /
o
and
are similar, so the prediction is robust
- Observationally, in Topic 7 we found (B&T-1 fig 4.6;
[ images ]
Low luminosity Ellipticals and Bulges follow the isotropic relation
Luminous Ellipticals often fall in the anisotropic
(
> 0) region
(6) Describing Collisionless Systems
We first consider collisionless dynamics :
"Collision", here, means star-star deflection, not direct impact
For the collisionless case, stars are assumed to move in a completely smooth potential
For galaxies this is almost always a very good approximation
(in § 8.10 we consider when and how star-star encounters are relevant)
(a) The Distribution Function (DF) : f(r, v, t)
(b) Collisionless Boltzmann (Vlasov) Equation (CBE)
- Look for a continuity equation, since :
no stars created/destroyed : flow conserves stars
stars do not jump across the phase space (ie no deflective encounters)
View the DF as a moving fluid of stars in 6-D space (r, v),   ie x,y,z,vx,vy,vz
stars move/flow through the region as their positions and velocities change
- Consider a 1-D example using x and vx, and recall f is a number
density
focus on a small element of phase space at x and vx with size
dx by dvx
this [image] will help vizualise the situation
- In interval dt, net flow in x is :
|
(8.18a) |
the net flow due to the velocity gradient is
|
(8.18b) |
the sum of these equals the net change to f in the region, ie at x, vx of size dx dvx
|
(8.18c) |
or, dividing by dx dvx dt, we get
|
(8.19a) |
but since
|
(8.19b) |
we have
|
(8.19c) |
adding the y and z dimensions, which are independent, we finally have
This is the collisionless Boltzmann equation (CBE)
-
The CBE describes how the DF changes in time
It is a direct consequence of :
| 1 conservation of stars |
| 2 stars follow smooth orbits |
| 3 flow of stars through r defines implicily the location v   (= dr/dt) |
4 flow of stars through v is given explicitly by - |
|
- Since
f/
t
is a Eulerian (partial) differential, it describes the change in
DF at a point in phase space
- However, consider the Lagrangian (total, or convective)
derivative : Df/Dt
df/dt.
This describes the change in f as we follow along the "orbit" through phase space
But, this Lagrangian derivative is nothing more than the LHS of the CBE
|
(8.20) |
Clearly, the phase space density (f) along the star's orbit is constant
ie the flow is "incompressible" in phase-space
for example
if a region gets more dense,
will increase
if a region expands,
will decrease
example -- marathon race : start : n high,
v high ;
end : n low,
v low
- The CBE applies to all sub-populations of stars (eg each
spectral class)
even though no single class determines the potential
in § 8.7b we introduce a self-consistent f which itself generates
:
(c) The Jeans Equation(s)
- As it stands, the CBE is of rather limited use :
- the constraints it provides are still insufficient to find
f(r,v,t)
- the complexity of f(r,v,t) renders it observationally inaccessible.
- What we observe are :
- mean velocities : < v >
- velocity dispersions :
(which is related to < v2 > )
- stellar densities : n (also
for mass density, or j for luminosity denisty)
We need to recast the CBE in terms of these quantities.
-
Clearly, these observable quantities are contained within the DF :
f (r,v,t)
they can be extracted by taking appropriate averages or
moments
for example :
number density   =   n(r, t)     =  
f(r, v, t) d3v  
            =   0th moment in v
mean velocity   =   <vi(r, t)>   =
 (1/n)
vi f(r, v, t)
d3v   =   1st moment in v
If we take moments of the CBE, we transform it into equations in these
new variables.
Lets look in more detail at these first two moments in v (see B&T-2 §4.8) :
- Using the 1-D x axis as example, simply integrate the CBE (eq 8.19c)
over all vx
We obtain (0th moment in vx) :
|
(8.21) |
where n
n(x,t) is the space density and
<vx> is the mean drift velocity along x
This is a simple continuity equation for the number of stars along the
x axis.
- Now multiply the CBE (eq 8.19c again) through by vx and
again integrate over all vx
on rearranging and using eq 8.21 above, we obtain (1st moment in vx) :
|
(8.22a) |
where
x2 is the velocity
dispersion about the mean velocity,
it arises from <vx2>
  =   <vx>2 +
x2
- repeating this in 3-D requires a little care (B&T-2 § 4.8) :
we obtain the Jeans Equation (for coordinate j) :
|
(8.22b) |
where the summation convention applies (sum over repeated indices)
here, i=1,2,3 and j=1,2,3 refer to x,y,z, eg x2
y and v2
vy
-
This Jeans equation is akin to Newtons's second law : dv/dt = F/m   with :
LHS is the derivative of <v>
RHS are force terms
- It is instructive to compare this to
Euler's Equation for fluid flow :
|
(8.23) |
which is clearly analogous.
-
In 8.22b n
i,j2 is a
stress tensor which takes the role of an anisotropic pressure
(hence the phrase "pressure supported")
in a fluid, pressure is a scalar and is therefore always isotropic
for stellar systems,
i,j is a
tensor which can be anisotropic
-
i,j is symmetric, : i.e. axes exist where
1,1,
2,2,
3,3 are semi-axes of a velocity ellipsoid
if
1,1 =
2,2 =
3,3 we have isotropic dispersion
Jeans and Euler equations are identical
-
For collisionless systems there is
no equation of state linking pressure (
i,j2) to density
Usually, therefore, we are forced to assume
i,j (or, equivalently, the anisotropy
parameter
)
Recently, however, the LOSVD has been used
to constrain
(see T 5.7a :
).
(d) Applications of the Jeans Equation
The Jeans equation, when combined with
observations, has a number of applications :
-- deriving M/L profiles in spherical galaxies (B&T-1 4.2.1d)
-- deriving the flattening of a rotating spheroid with isotropic velocity
dispersion (B&T-1 4.2.1e)
-- analysis of asymmetric drift (B&T-1 4.2.1a)
-- surface density (and volume density) in the galactic disk (B&T-1 4.2.1b)
-- analysis of the local velocity ellipsoid in terms of Oort's constants (B&T-1 4.2.1c)
Here we look briefly at the first and second :
(i) Spherically Symmetric Steady State Systems
-
This is, of course, an important special case to consider :
For steady state, the first term in eq 22b is zero
For spherical symmetry : <vr> = <v
> = 0,
giving < vr2 > =
r2   and  
< v
2 > = 
2.
After transforming to spherical polar coordinates, the Jeans Equation reads
   :
|
(8.24a) |
Introducing anisotropy parameters   :  

=
1 - 
2 /
r2
  and 
=
1 - 
2 /
r2
and writing 2
for 
+ 
and Vrot for
<v
>   this becomes
|
(8.24b) |
which is equivalent to the equation of hydrostatic support :
dp /dr   +   anisotropic correction +   centrifugal correction   =   Fgrav
-
Going a little further, recasting
d
/ dr   as  
GM(<r) / r2 = Vc2 / r  
(Vc = circular velocity)
and rewriting the first term in eq 8.24b in logarithmic gradients,
we have :
|
(8.24c) |
This parallels the equation for hydrostatic support of an ideal gas, where
p = nkT
the equivalences are :
r2  
  T
d(ln n) / d(ln r)  +   d(ln T) / d(ln r)  
  (n/p) dp / dr
2
and Vrot2 / r  
are anisotropy and rotation correction terms
-
By measuring brightness profiles and velocity dispersion & rotation profiles,
we can derive (assuming
) : M(r) and hence
M/L (r)
This is very important, eg, in the search for nuclear black holes (see Topic 14.2 :
link)
(ii) Rotational Flattening Revisited.
TBD
(iii) Vertical Disk Structure.
TBD
(7) Steady State : The DF as f(E, |L|, Lz)
Taking moments of the CBE lost almost all detailed information from the
DF
Rather than working with the full DF, the Jeans equation works with just n, <v> and <v2>
Can we reintroduce the full DF and regain a more complete description of
a system ?
The answer is yes, by introducing two new powerful constraints :
demand that the system is in steady
state (
in equilibrium)
demand that the DF generate the
full potential (not just act as a tracer population)
We consider these in turn
(a) Integrals of Motion and the Jeans Theorem
- When a system is in steady state,
and f are not explicit functions of time
In this case, we may introduce a powerful new
entity : Integrals of motion
An "integral of motion" is a function I (x, v) which
is constant along a star's orbit (B&T-1 § 3.1.1)
Obvious examples of possible integrals of motion are :
| E (r, v) | =   ½v2 + (x) | =   energy per unit mass |
in a static potential |
|
L (r, v) | =   r × v | = total AM | in a spherical static potential |
|
Lz (r, v)   | =   (x2 + y2)½ v
| =   z component of AM | in an axisymmetric static potential
| |
- Since I (x, v) is constant along an orbit, it is also
a solution to the steady state CBE
specifically :
|
(8.25) |
- Since the CBE is a linear equation, then functions of solutions are
themselves solutions
This yields the Jeans Theorem :
|
Any function of integrals of motion f (I1, I2, I3, ..... ) is also a solution of the steady state CBE
|
- This is extremely useful since it allows us to construct
legitimate DFs using integrals of motion :
eg,: the DF : f (E, Lz) = No (E2 +
3Lz5/2) is a solution to the CBE for an
axisymmetric potential
- Going further, the Strong Jeans Theorem states (B&T-1 § 4.4) :
steady state DFs are functions only of three (or less)
independent (isolating) integrals.
- In the special case of steady state spherical systems,
it is easy to show (B&T-1 § 4.4.2) that :
-
DFs must have the form f(E, |L|)
-
DFs of the form f(E) must have an isotropic velocity dispersion
r  =  

  =  

-
DFs of the form f(E, |L|) must have an anisotropic velocity dispersion
r  
 

  =  

- Summarizing : these theorems provide a very useful way to begin constructing working models :
Rather than trying to find DF solutions to the CBE of the form f(r, v),
instead :
Choose DFs which are expressed as functions of, for example,   E,   |L|,   Lz
These DFs now automatically satisfy the continuity condition expressed
by the steady state CBE.
(b) Self-Consistency
- Both the CBE and the Jeans Equation include a potential gradient,

In neither equation, however, are these potentials linked explicitly to the
DF
(recall
f(r, v) d3v =
n(r)
(r) which
could, in principle, define
)
As it stands, the DFs only describe tracer populations.
- Clearly, an important step is to require that the DF also
yields the potential
(r)
ie :
|
(8.26a) (8.26b) |
where f here is the mass DF (ie we've multiplied f by the mean stellar mass)
- Taking the spherical form for
2, this reads (eg for a DF of the form f (E, |L|) :
|
(8.27) |
This is now a fundamental equation describing spherical equilibrium systems.
Solutions not only have self consistent
and f, but
f also satisfies the steady state CBE.
Such a solution now describes a self-consistent, physically plausible stellar
dynamical system.
- When using this equation to solve the structures of many systems,
we introduce (B&T-1 § 4.4) :
-
relative potential :
  =  
o -
 
-
relative energy : E   =  
- ½ v2
-
note : both
and E are more +ve
for more bound stars deeper in the system
-
choose
o so that f > 0 for E > 0 (bound)
-
at given
:   E spans range 0 to
, as v spans the range from
(2
)
( = Vesc) to 0
(c) Spherical Isotropic Systems : DF = f(E)
(d) Deriving f(E) from
(r) for
Non-Rotating Spherical Systems
(e) From f(E)d3r d3v   to   N(E)dE
- For N-body simulations, it is often useful to evaluate N(E) dE :
i.e. the total number of stars as a function of energy, E.
Note that N(E) is not simply the DF f(E) since this describes
the number of stars
of energy E at each point in phase space r, v
in the range d3r d3v
- For example, while the Boltzmann (exponential) distribution gives
f(E) for molecules in a gas
the Maxwell-Boltzmann distribution gives N(E) [conventionally written as
N(c), c = speed].
-
Consider just the region at r
now E =
(r) - ½v2, so
v = 2(
- E)½ and
dv = -(
- E)-½dE
the velocity phase space element : d3v = 4
v2 dv
so the number of stars at r of energy E in the interval dE is
N(E, r) dE d3r  
  f(E) × (
(r) - E)-½ dE d3r
Notice that at r, E ranges from 0   (v=vesc) to
(r)   (v=0)
so if f(E) is, for example, the King form : exp (E /
2) - 1   (see § 8.8b), then :
N(E, r) starts at 0 for v=0, it then
rises and then falls again to 0 when v=vesc
Integrating over all positions, we find (B&T-1 4.4.5) :
|
(8.30) |
where rm = largest radius out to which a star with E can be found   i.e. v=0 at
(rm) = E
While f(E) typicallyincreases exponentially with E
N(E) typically decreases with E, with a maximum near E ~ 0
 (where f(E) is usually small)
Most stars are nearly unbound (E ~ 0)
Few stars are deeply bound (E ~
(0) )
Examples of N(E) dE for the deVaucouleurs, King, and two Jaffe models :
(B&T fig 4.15) [ images ]
(8) Model Building Using DFs
We begin with the simplest cases : equilibrium, non-rotating, spherical systems,
ie DF
f(E)
With equations 8.28a,b now in hand, we are ready to construct specific models
The process goes as follows :
(1) Choose a DF which is a function of energy : f(E)  
  f(
- ½v2)
from Jeans Theorem, f(E) is already a solution to the steady state CBE,
so our solutions will naturally satisfy the basic phase space continuity condition
(2) Integrate the DF over v to find
(
)     (ie evaluate 8.26a)
(3) Solve Poisson's equation (8.28a) to find
(r)
(4) Combine
(
)
and
(r)
to give the mass distribution   :  
(r)
Here are some examples
(a) Polytropic Sphere : Power Law f(E)
- consider a power law DF   :   f(E) = F En-(3/2)
  for E > 0     (otherwise f(E) = 0)
Integrate f(E) over velocity to find the density in terms
of
  (eq 8.26a) :
|
(8.31) |
after substituing v = (2
)½cos
,
  we find
(
) = cn
n   (
> 0)
where cn is a constant depending on n and F.
- Substitute this into the spherical version of Poisson's equation (eqn 8.28a) :
|
(8.32) |
- This is the Lane-Emden equation, first studied as the equation describing hydrostatic
equilibrium of a self-gravitating sphere of polytropic gas
(ie equation of state : p

)
Thus, we find that for a self-gravitating sphere, the density profile
(r) is the same for
stars with DF
En-(3/2), and
gas with polytropic equation of state and
= 1 + (1/n)
- Simple solutions only exist for n = 5 (
= 6/5)
this is the Plummer Sphere with
(r)
(1 + (r/b)2)-5/2
it has finite mass and is well behaved at r = 0
it is a good match to Globular Clusters but is too steep at large r for Ellipticals
- n > 5 systems are more extended and have infinite mass
density profiles for n=0,1,2,3,4,5 are shown here :
density; potential; rotation &
image for a Plummer sphere are shown here : [image]
- n =
  so
= 1 and p
which is
the isothermal equation of state (recall P = n k T )
for n =
the above analysis breaks down, but we have an alternative approach :
(b) Isothermal Sphere : Exponential f(E)
- Consider an exponential (Boltzmann) DF
|
(8.33) |
Recall, more +ve
& E means more bound.
Also, note f(E) > 0 for E < 0: there are unbound stars! .... we anticipate problems at large radii.
OK, substituting
- ½v2 for E and integrating f(E) over v gives
=
1 exp
(
/
2)
-
Plugging this into Poisson's equation gives :
|
(8.34) |
This is, in fact, the equation for a hydrostatic
sphere of isothermal gas, with
2 =
kT/m
Why is this ?
At every point, N(v)
exp(-½v2/
2), for both
the stellar system and a gas of atoms
it is irrelevant, therefore, whether the stars are collisionless or not, they
mimic a gas of atoms.
-
Traditionally, we consider the solutions to 8.34 as (i) "a special case" and
(ii) "the rest" :
(i) Singular Isothermal Sphere (SIS)
-
for the central boundary condition
(0) =
  we have
(r)   =  
2 / (2
G r2)
this is the singular isothermal sphere   :  
 
 
r-2
-
circular velocity :   Vc = const =

2
-
dispersion velocity :
<v2> = 3
2  
everywhere (isothermal !);   1-D   :   <vr2> =
2 -
but the model has infinite density at r = 0, and has infinite mass
as r
!
-
Density; potential; rotation &
image for SIS are shown here : [image]
(ii) General Isothermal Sphere
- Choose as central boundary conditions at r = 0 :
(0) =
o  
          finite central density
(d
/dr )r=0 = 0   flat central density
profile
from eq 8.34, as r
0 we have LHS
(2/
) (d
/dr)   and  
RHS
0
Integration of 8.34 yields
(r)    
( [ images ] : B&T-1 figs 4.7, 4.8)
-
We find a constant near-nuclear density :
(r) ~
o within a radius ro =
3
/ (4
G
o)½
this is a core and ro is called the King (or core) radius
I(ro) = 0.5013 I(0) , so ro is appropriately defined
ro is also the scale length of the
r-2 envelope (see below) : big cores are in big galaxies
circular velocity : Vc = -
(d ln
/ d ln r)½
-
When plotted as log (
/
o) vs log (r / ro), there is only
one isothermal profile
-
The scale length and central density together
define the dispersion :
2
o
ro2
for a given central density, hotter galaxies are larger
for a given core radius, hotter galaxies are denser
basically, stars are bound and must not escape
-
Quantitatively :
2 = (4 / 9)
G
o
ro2
To simplify calculations, use G = 4.5 × 10-3 in units of pc, km/s, and M
Eg for
= 100 km/s, ro = 100 pc we have
o = 159 M
pc-3
-
A good isothermal core match to the centers of Ellipticals can be used
to estimate central M/L
obtain ro and I(0) from isothermal fits to I(R), and measure
(express I(0) in units of L
pc-2 to allow simplified calculations with G = 4.5 × 10-3)
j(0) = 0.5 I(0) / ro
(0) = 9
2/
(4
G ro2)
M/L =
(0) / j(0)
This method is called "core fitting" or "King's method"
typical values for ellipticals cores are
10-20 h M
/ L
suggesting minimal/no dark matter
-
There is a problem with all isothermal models : they have infinite total mass
It is easy to see why the system is at least infinite in extent :
at any given radius, stars have isotropic dispersion
at this radius at least some stars are therefore moving outward
but further out the dispersion is still
,
and stars are moving outward
the system must have infinite extent
Ultimately, this arises because f(E) > 0 for negative E, i.e. the model includes unbound stars.
To rectify this problem, we attempt to modify things slightly by removing the unbound stars:  
(c) Lowered Isothermal (King) : Truncated Exponential f(E)
- Suppress stars at large radius (ie as E
0,
  we want f(E)
0)
modify the exponential DF :
|
(8.35) |
where
o is a (dispersion like)
parameter.
-
Repeating the same analysis as before, we get for Poisson's eqn :
|
(8.36) |
Solve this by integration, choosing boundary conditions at r = 0 :
(0) = q
o2
    (q > 0, large q = deep central potential)
d
/ dr = 0   (as before)
- Inner regions : like isothermal, with core (King) radius ~ ro (defined
as before)
Outer regions :
(r) decreases & approaches 0 at rt
recall : velocity range at
is 0
(2
)
so density =
f d3v = 0 at rt
  = tidal or truncation radius = edge of sphere
larger
(0) (larger q)
larger rt &
Mtot
M(rt)
Alternative parameter to
(0) or q is
concentration   c = log10 (rt / ro)
([ images ] : B&T fig 4.9, 4.10, 4.11)
-
Single sequence of King models by varying (equivalently) :
(0); q; c
([ images ] : B&T fig 4.9)
Empirically, we find :
c = 0.75 - 1.75 (
q = 3 - 7)   fit GCs very
well
c > 2.2 (
q > 10)   fit some Ellipticals quite
well
c = 1.7 (
q = 8)   fits Hubble law well
c =
(
q =
) is the isothermal sphere
-
King models are not isothermal :
2
<v2>
o2 within ro but
drops at larger radii
([ images ] : B&T fig 4.11)
However, as with isothermal models, for each c (or q) we have a range of
King models
each of different
o, subject to
o2
o ro2
eg for given ro, high
o has
high
o
(d) Spherical Models with Rotation : DF = f(E, |L|)
TBD
(e) Axisymmetric Systems
- From the strong Jeans Theorem, in general we expect DFs of the form
f(E, Lz, I3)
Finding such DFs is usually very difficult
We therefore first consider the subset of simpler cases with f(E,
Lz)   (cases i & ii below)
Note that since DF
f(E), we
expect anisotropic velocities (see §
8.7b
).
(i) Thin Disks with DF = f(E, Lz)
-
For thin disks, the problem is simpler : DF = f(E, Lz)
For given
(R), virial theorem sets total
KE (since 2KE + W = 0)
but this KE is shared between ordered (rotational) and
random (dispersion) components
-
Example :
consider DF = f(E, Lz) = F exp (E /
2)Lzq  
  (Lz > 0; otherwise f = 0)
Applying the earlier methods, we derive a Mestel disk :
(R) =
o r / Ro   with Vc2 = 2
G
o Ro
The parameter q = (Vc /
)
2 - 1   measures the degree of centrifugal support
we find :
2 = <VR2>
= the radial dispersion, while
<V
> =
×
(½q) /
(½(1-q))   is the streaming rotation
for q = -1   the disk doesn't rotate but is held up by dispersion
for q =
we have pure circular orbits with
<V
> = Vc
Note : this illustrates the phenomenon of asymmetric drift  
(see T 5.4d
)
(ii) Thick Axisymmetric Systems with DF = f(E, Lz)
-
In general, axisymmetric systems have DFs with three integrals : f(E, Lz, I3)
These are difficult (see below), so the
subset with f(E, Lz) has received more attention
So far, most chosen DFs have not yielded very realistic models (B&T 4.5.2)
- What about reversing the process, and starting with a density distribution
As for spherical systems (§8.7d above), methods exist to convert any
(R,z) into f(E, Lz)
unfortunatly, they are highly unstable when trying to use observational data
- There is one nice result for systems with f = f(E, Lz)   (B&T p250) :
The radial and z velocity dispersions are equal :
<VR2> = <Vz2>
this is not true when a third integral is present.
(iii) Thick Axisymmetric Systems with DF = f(E, Lz, I3)
As just mentioned : whenever <VR2>
<Vz2>, we know a third integral I3 must
exist
-
Example 1 : In the solar neighborhood, we measure :
<VR2> ~ 2 × <Vz2>
the DF for the MW disk must involve some I3
in addition to E and Lz
This quite shocked Jeans in ~1915, who concluded the MW was not in steady state
now, we suspect MW is in steady state, but don't yet know the form of
I3
-
Example 2 : some flattened elliptical galaxies don't rotate much
(see Topic 7)
we conclude they are anisotropic, with <VR2>   >  
<V
2>
These ellipticals must have DFs which require I3
- In general, construction of DFs with 3 integrals is very difficult :
I3(r, v) is usually not analytically simple.
The DF is now a funciton of three other functions, adding to the
difficulties.
More progress is made by employing action variables
J = Jr, Ja, Jl   (B&T § 4.5.3)
Entire conferences have been devoted to "The Third Integral"
(f) Triaxial Systems (eg Bars & Some Ellipticals)
(9) Violent Relaxation
-
The previous discussion focussed on static systems, since
they are relatively tractable.
Varying potentials are usually intractable and
require a numerical approach.
There are, however, a few situations which can be treated analytically
Paradoxically, one of these is when the potential is maximally fluctuating
This is the case of violent relaxation, which we now describe briefly
-
For galaxies, 2-body encounters are negligible and evolution is
determined by the CBE
For a static potential, energy (E) of a star is conserved and the
DF doesn't change
Isolated galaxies in steady state do not, therefore, evolve dynamically
(we're ignoring gas & 2-body processes here)
- For a galaxy to change, there needs to be a changing potential
For each star, dE / dt   =   
/
t   at the star
The DF evolves and the structure of the galaxy changes
This occurs during (i) inhomogeneous collapse, and (ii)
encounters (Topic 12)
These are brief traumatic times :
"galaxy changes are by revolution rather than by evolution"
(nice quote from Binney's EAA article)
-
In collapse of large cold system,
changes
rapidly
stars gain and lose energy, which broadens f(E)
energy is redistributed via collective interactions
this acts like a relaxation process
-
Note : the total energy remains constant : this is a
non-dissipational process
energy is not radiated away, as with dissipational (gaseous) collapse
If the total energy is initially zero (eg diffuse system at rest), then
following collapse :
some stars will be strongly bound, but some must also have been ejected.
-
Note : scattering is independent of the star's mass
fundamentally different from 2-body relaxation
no segregation by mass (eg heavy stars don't sink to center)
-
Phase mixing helps smooth out lumps fairly quickly
distribution is ~smooth after ~few collapse times
violent relaxation timescale is
~few × dynamical (collapse) timescale
-
If relaxation is complete, then fully random scattering occurs
  results in isotropic velocity field and Boltzmann-like f(E)
Usually, however, the density distribution becomes smooth before
scattering is complete
relaxation ceases and is incomplete  
 
residual anisotropies & phase-space substructures
(
)
-
N-Body example : van Albada 1982 (B&T 4.7.3)  
([ images ] : B&T figs 4.19-23)
start with ~ homogeneous sphere with low
1st infall  
  dense center
settles into ~ R¼ law
drops with radius
(anisotropy) is 0 at nucleus,  
  1 at edge
(most scattering occurred at small r on 1st infall  
  most stars have low AM)
N(E) dE spreads out, most stars have E ~ 0, few are deeply bound
If the initial distribution is hotter  
 
less concentrated
If the initial distribution is rotating slowly  
  less concentrated & rotating oblate figure
If the initial distribution is rotating faster  
  even less concentrated & prolate/bar figure
If the initial distribution is ellipsoidal  
  rotating ellipsoid, anisotropic everywhere
(10) Introducing Star-Star Encounters
So far, we have considered star motion in a perfectly smooth potential
However, in reality, individual stars render this potential bumpy on fine scales
How does this affect the motion of stars --- ie is the "collisionless"
assumption valid ?
(a) Estimating Encounter and Relaxation Timescales
Substituing, we get
|
(8.38c) |
Surprisingly, this only depends on N, the total number of stars in the
system
Notice that trelax > tcross   for  
N
30
to good approximation, stars usually orbit in the overall potential
Equal logarithmic intervals in b contribute equally to long term
deflection.
eg the ranges : R to ½R ;   ½R to ¼R ; .......
2bmin to bmin   all contribute equally
However, since the deflection drops rapidly with b as
V / V
1 / b
for systems with R >> bmin most scattering is due to
weak encounters (
V << V)
Example :
galaxy with R ~ 10 kpc ; bmin ~ 1 AU (1 M
stars); so ln
= 20
half deflection from encounters outside b1 where ln R/b1 = 10
3/4 deflection from encounters outside b2 where ln R/b2 = 15
b1 = 0.5 pc for which
V / V =
bmin / b1
10-5
b2 = 0.003 pc for which
V / V =
bmin / b2
0.15%
Need care with N-Body simulations when N << Nstars
is more grainy than reality, and
trelax(simulation) << trelax(reality)
avoid by softening star potentials to increase bmin
care : lose structure on scales R < bmin
Finally, note that this treatment of 2-body relaxation is almost identical
to the derivation of Dynamical Friction in [ Topic 12.3a]
(b) Timescales for Real Stellar Systems
- Here are rough timescales (in years) for a number of stellar systems :
| System |
N |
R (pc) |
V (km/s) |
tcross |
trelax |
tage |
age/relax |
| Open Cluster |
102 |
2 |
0.5 |
106 |
107 |
108 |
10 |
| Globular Cluster |
105 |
4 |
10 |
5 ×105 |
4 ×108 |
1010 |
20 |
| Dwarf Galaxy |
109 |
103 |
50 |
2 ×107 |
1014 |
1010 |
10-4 |
| Elliptical |
1011 |
104.5 |
250 |
108 |
4 ×1016 |
1010 |
10-7 |
| Spiral Disk |
1011 |
104.5 |
20 |
1.5 ×109 |
6 ×1017 |
1010 |
10-8 |
| MW Nucleus |
106 |
1 |
150 |
104 |
108 |
1010 |
100 |
| Luminous Nucleus |
108 |
10 |
500 |
2 ×104 |
1010 |
1010 |
1 |
| (Galaxy Cluster) |
102 |
5 ×105 |
500 |
109 |
(3 ×109) |
1010 |
(3) |
-
The presence of dark matter complicates the situation in clusters (see
[Topic 13.4c])
In practice, 2-body relaxation is not as fast as our simple analysis
suggests.
-
2-Body relaxation may be relevant for star clusters and
galaxy nuclei
-
For most galaxies, 2-body relaxation is utterly negligible.
Because this course deals specifically with galaxies (and not star clusters)
we will only briefly consider the ramifications of relaxation.
-
Dont forget, relaxation times can vary greatly within a given system
for example, a GC core can be relaxing while the halo is not
(c) Analytic Treatment : The Fokker-Planck Equation
-
You may be wondering when Max Planck (& Adriaan Fokker) worked on stellar
dynamics....
They didn't : much of this work has its roots in plasma physics
Unlike neutral gases, charges in plasmas have long range Coulomb interactions
The early work on plasmas has been appropriated and applied to stellar systems
we continue.....
-
For a smooth potential, the DF obeys the CBE : df / dt = 0
with encounters, stars scatter into and out of the phase space
trajectory :   df / dt =
(f)
(f) is a collision term and itself
depends on f
- let w = r, v be the 6-D phase space vector, and
S(w,
w)d3w
t   =   probability of scattering by
w in time
t
(B&T use symbol
for S)
the net change is the sum of the collisions in minus the
sum of the collisions out,
giving the master equation :
|
(8.39) |
-
Now, the majority of scattering comes from distant encounters
with small
w
so expand the first term in
,
(scattering in), as Taylor series up to 2nd order
since 0th order term is 2nd term in
, these both
disappear on subtraction.
The remaining terms give the Fokker-Planck equation (where df / dt
LHS of CBE) :
|
(8.40) |
where D(
wi) and
D(
wi,
wj) are diffusion coefficients
giving the rate that stars diffuse through phase space due to encounters
eg : D(
wi)   =  
wi
S(w,
w) d3w
-
Once D(
wi) and
D(
wi,
wj) are know, the F-P eqn is a
PDE which is, in principle, solvable.
Much work has been invested in solving the F-P equation for stellar systems.
Several good approximations help further :
- (1) Orbit Averaging :
for trelax >> tcross  
w
is small during a single orbit
average scattering over single orbit, and then follow drift of orbit parameters
best achieved by working in action-angle variables J,
(instead of r, v)
in these coordinates, orbit average is simply integration over
d3
F-P eqn is now collapsed to a PDE in 4-D (J+time)
for spherical symmetry, we lose another degree of freedom
F-P eqn further reduced to a PDE in 3-D (Jr, L, & time)
- (2) Local Approximation :
since equal logarithmic intervals in b contribute equally to scattering,
contributions from scales ~R are small  
 
most scattering occurs locally (b << R)
consequently :
-
scattering occurs only in v, and not in r
diffusion terms D(
xi) =
D(
xi,
xj) = D(
xi,
vj)
0   they all vanish
-
encounters occur as Keplerian hyperbolae
- (3) Isotropic Field Star Velocity Distribution :
The symmetry of the scattering population eliminates many of the
diffusion terms
Only three are left, expressed relative to the direction of the test star :
D(
v
),
D(
v
2),
D(
v
2)
Together, these allow analytic expressions for diffusion coefficients (B&T App 8.A) :
|
(8.41a) (8.41b) (8.41c) |
where subsript `a' refers to field stars
- Combining these, one may look at the rate of change of KE of a star :
|
(8.42) |
the 1st/2nd terms represent heating by/of the field stars
note : these cancel when mava2 = mv2
equipartition
- Often, a Maxwellian for fa(va) is used to evaluate
the diffusion coefficients explicitly
from here, one can now solve the Fokker-Planck eqn.
Solutions to the Fokker-Planck Equation
Several methods have been explored :
- Moment Equations :
following earlier : 1st moment of CBE  
Jeans Equation
similarly 1st moment of F-P
useful moment equations
as before, one needs to assume a form for the DF to proceed
a Maxwellian is often quite successful
(eg Larson; Lynden-Bell and Eggleton)
- Monte Carlo :
Numerically calculate a sample (~1000) of orbits,
subject each to appropriately chosen random perturbations
follow the response of the orbits and, therefore, the system
(eg Spitzer; Henon)
- Direct Integration :
the Fokker-Planck eqn is a PDE in 3 variables (Jr, L, t)
solutions are now possible by numerical integration.
(eg Cohn)
(d) Results : The Effects of Encounters
There are a number of distinct phenomena which result from 2-body encounters :
(i) Relaxation
-
As 2-body relaxation proceeds, stellar systems evolve to states of
higher entropy
-
For self-gravitating systems, this can be a rather interesting process
recall from [§8.5e]
that such systems have negative specific heat
if you remove energy (heat), stars fall deeper in the gravitational well
they therefore speed up, and that part of the system gets hotter
-
In its simplest form, this relaxation renders clusters more centrally
concentrated
stellar encounters in the core pass energy to envelope stars
the core contracts and heats, the envelope expands and cools
after some time the envelope developes a density
profile
(r)
r-3.5
radial anisotropy increases with time and radius
(stars have been kicked out from encounters in the core and carry little AM)
a successful DF is due to Michie, and is f(E,L)
exp(-L2/Lo2) × [exp(E /
2) - 1]
-
After about 15 trelax, the process takes off in a runaway
gravothermal catastrophe
(*** TBD : explain why in intuitive terms)
This "event" is called core collapse and leaves a density law
(r)
r-2.23 (infinite at r=0 !)
since GC are about 20 × trelax old, at least some have
probably undergone core collapse
In practice, core collapse is not as dramatic as its name suggests :
either
- the core "runs out of stars" before densities become exotic
- a hard binary forms which
(a) scatters core stars, heating the core and halting core collapse
(b) ejects stars from the system, accelerating evaporation
(the binary acts like a source of nuclear burning in a star --- see below)
- Similar behaviour is found in stars : contracting cores heat
while expanding envelopes cool
Lynden-Bell and Eggleton (1980) derive power-law of -9/4 (-2.25)
for a conducting gas sphere
(obviously, this had no nuclear energy source, so could undergo gravothermal
collapse)
(ii) Equipartition
-
Violent relaxation during formation leaves all stars the same velocity
distribution
consequently heavier stars have more kinetic energy
this is unlike a gas, where molecules have the same
kinetic energy (heavier ones move slower)
-
2-body encounters mimic molecular interactions: energy passed from
high mass to low mass stars
in the limit of complete interaction, energy is shared equally (hence equipartition)
-
More massive stars begin to sink deeper  
 
mass segregation.
Probably occurred in GCs, though difficult to check since (visible) giants
all have similar mass.
May have played role in galaxy clusters, but other effects (dynamical
friction, mergers) confuse interpretation.
(iii) Escape (Ejection and Evaporation)
-
Encounters can result in stars with V > Vesc
this can occur in two ways :
a single encounter gives the star sufficient energy to escape (ejection)
a star slowly wanders into unbound phase space due to many distant encounters
From the analysis above (10.a.iii), the second is much more important
-
Using the fact that Vesc2 = -2
(r),   it is easy to show (B&T p 490) that
<Vesc2> = 4 <V2>
so the rms escape velocity is just twice the rms velocity
for a Maxwellian, the fraction with V > 2Vrms = 7 ×10-3
so this fraction is lost every trelax  
  tevap
140 trelax
more detailed calculations confirm this
-
The process speeds up in a galaxy tidal field, since
Vesc is reduced (see
[Topic 12.3.b])
-
Evaporation + equipartition
less massive
stars evaporate first (higher velocities)
explains unusually low M/L (~2) for GCs compared to other pop II objects
(M/L ~ 10)
-
The observed distribution of trelax for the ~150 MW GCs
shows essentially none < 108 years
selection effect : since tevap ~ 100 trelax
these GCs have probably already evaporated
suggests young MW may have had many more GCs
(iv) Inelastic Encounters
-
Occasionally close enough to raise tides
dissipate energy
lose KE
of star
tcoll
trelax × 0.2 ln N
× (V*/V)4 where V* is escape
velocity from star's surface
if V* < V   then tcoll < trelax
and collisions dominate
-
While not dominant, a few stars in GCs will have been tidally affected
-
Important in galaxy clusters :   V*
300 km/s while V
1000 km/s
as expected, tidally influenced galaxies are common in clusters
(11) Further Topics
We defer a few topics of Stellar Dynamics to later sections :