08 September 2020

Energy partitions part one

Now it’s time for a more general description of what Boltzmann and Planck did with their different versions of discrete energy elements distributed in equilibrium among their respective systems of molecules (Boltz) or resonators (Planck). This description goes by the name “energy partitions”. I'm not sure who invented this generic terminology, but it was put to good use in 1922 by Darwin and Fowler in their paper "On the partition of energy".


The reason for the invention of energy partitioning by Boltzmann and its later use by Planck was to calculate entropy.


Energy partitions describe the number of ways in which discrete energy elements can be distributed among a collection of particles capable of absorbing and emitting energy. You may not have encountered the name “energy partitions” before, but if you’ve studied thermodynamics and kinetic theory beyond the freshman level, you’ve encountered the equipartition theorem and the partition function. Since I’m providing links to those terms, I won’t describe them further except to say they are based on the idea of energy partitions.

 

Occupancy numbers and occupation numbers

 

There are two equivalent ways energy partitions can be expressed: the number of energy elements per particle, or the number of particles per energy level. The first use of energy partitions in a way that later became standard was by Boltzmann in 1877, in his paper “On the Relationship between the Second Fundamental Theorem of the Mechanical Theory of Heat and Probability Calculations Regarding the Conditions for Thermal Equilibrium.”


In this paper, Boltzmann described how to calculate the number of different ways kinetic energy elements could be partitioned among the molecules of a gas, and he considered his “molecules” to be structureless particles possessing only translational kinetic energy. In his model, portions of kinetic energy that are integer multiples of a very small indivisible element of energy, ε, are transferred between the molecules during elastic collisions. (Check on Goodstein or d’Abro or somebody and 1/3 of molecules traveling in each x, y, z direction, no angles of collision needed in calculations.)

 

A more rigorous mathematical description of Boltzmann’s energy partitions was developed by Alexander Bach in the late 1980s. (Bach claims that Boltzmann used the idea of indistinguishable classical particles, and also introduced Bose-Einstein statistics, which was supposedly not discovered until 1924 by Bose and Einstein. I’ll discuss both of these controversial claims later.) Bach’s scheme describes energy partitions in terms of cells that contain a certain number of particles, where the cells and particles can be either energy elements or physical particles (molecules or resonators in our case).  

 

Bach uses the term “occupancy number” for the number of energy elements per physical particle, and “occupation number” for the number of physical particles per energy level. (Bach also has a category called "configurations," but that would get us into the problem of particle distinguishability, which I'm not ready to do yet.)

 

Let’s look at occupancy numbers first. In Boltzmann’s notation the total kinetic energy of his gas molecules is L = λε and the number of molecules is n.  The overall occupancy number for his gas is therefore λ energy elements (particles) distributed among the n molecules (cells).  Individual-cell occupancy numbers are needed to calculate probabilities, and those are dealt with in the following section.


It may seem that just inverting the overall occupancy number description would give the overall occupation number as n molecules (particles) distributed over λ energy levels (cells), but here we run into a little counting problem. Or maybe I should say an accounting problem. 

 

--------

 

One part of the problem is that Boltzmann introduced another constant in addition to λ to count the number of energy elements, which, remember, are necessarily kinetic energy elements. He says “we assume that the kinetic energies of each molecule form an arithmetic progression, such as the following:  0, ε, 2ε, 3ε,…, pε.  We call P the largest value of the kinetic energy, ."


But this little p seems to me to be the same as λ, because the largest possible kinetic energy any one molecule can have is the total kinetic energy. Then all others would have zero energy, and p would equal λ.

 

(This would never happen in a realistic setting with many molecules, with “never” meaning an infinitesimal probability of something like 1/n.  But we can imagine a video-game (not much of a game, but never mind that) in which n-1 molecules are at rest at the bottom of the screen, and we send in the nth molecule from the top of the screen with kinetic energy pε = λε.  Eventually, this total energy would be distributed evenly, in an average way, among the molecules. Equilibrium would be established, with the usual mean-square fluctuations about the equilibrium value or average value of the kinetic-energy-per-molecule.)

 

But Boltzmann was no dummy.  He introduced p and λ for a reason.  He wanted “the highest possible value of the kinetic energy” to be different from the constant total energy. Apparently he meant this to be interpreted as the highest possible kinetic energy of any single distribution, with p = λ being only one such p, and a highly unlikely one at that.

 

But the strangest thing is that Boltzmann later gives an example in which he allows p to go to infinity while  λ = 7 (see below), so there’s some kind of madness in his method (rather than method in his madness), but it’s too much for me to juggle at the moment, and I’ll just continue with occupancy numbers and occupation numbers now.

 

--------

 

I have to backtrack a little bit, because of the accounting problem. Boltzmann’s states the main calculational purpose of his 1877 paper this way:  “I want to establish the probability of a state distribution regardless of how it is created, or more specifically, I want to find all possible combinations of the p+1 kinetic energy values allowed to each of the n molecules and then establish how many of these combinations correspond to each state distribution.”

 

But nobody counts zero as a value when we’re talking about occupancy numbers.  There are λ energy elements. There are n molecules.  No matter how many of them have zero energy, there are still λ energy elements distributed over n molecules.  Zero as an energy element is not counted as an occupancy number. However, again, Boltz was no dummy, and his (English-translated) statement is certainly correct: each molecule when considered independently of the others, has equal access to p+1 kinetic energy values.

But this is not very significant, since nobody uses occupancy numbers in calculations of probability anyway, except me (see below).  Occupation numbers are used instead, and since occupation numbers have cells that are energy levels, the zero-energy level must be counted. 

For occupation numbers, then, we have p+1 cells, and the overall occupation number is n molecules (particles) distributed over p+1 energy levels (cells). The counting problem I mentioned above is now accounted for.

 

Occupancy versus occupation calculations for n=3 and λ=p=5

 

Before any calculations are done, the total-number-of-particles constraint and the total-energy constraint must be expressed in equations. An occupancy-number variable and an occupation-number variable are needed in order to do this.

 

Occupancy number variable:  kj = the number of energy elements in molecule j. The sum of the kj’s from j = 1 to j = n is the total number of energy elements λ. Let’s say there are three molecules and five energy elements. Then

 

k1 + k2 + k3 = 5.

 

This equation is both a constraint on the energy (the sum = λ = 5) and a constraint on the number of molecules (the upper limit of the sum = n = 3)

 

Occupation number variable: wi is the number of molecules in energy level i. The sum of the wi’s from i = 0 to i = λ is the total number of molecules n. Using three molecules and five energy elements tambien, we have

w0 + w1 + w2 + w3 + w4 + w5 = 3

 

w0 + w1 + 2·w2 + 3·w3 + 4·w4 + 5·w5 = 5.

 

Now let’s make a table.  It's helpful to show occupancy and occupation number tables together.  So, here’s my two-tables-in-one, with occupancy numbers on the left, occupation numbers on the right, using n=3 and λ=5:


        occupancy #’s                                                occupation #’s

k1

k2

k3

---

w0

w1

w2

w3

w4

w5

5

0

0

 

2

0

0

0

0

1

0

5

0

 

2

0

0

0

0

1

0

0

5

 

2

0

0

0

0

1

 

 

 

 

 

 

 

 

 

 

4

1

0

 

1

1

0

0

1

0

4

0

1

 

1

1

0

0

1

0

0

4

1

 

1

1

0

0

1

0

1

4

0

 

1

1

0

0

1

0

1

0

4

 

1

1

0

0

1

0

0

1

4

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3

2

0

 

1

0

1

1

0

0

3

0

2

 

1

0

1

1

0

0

0

3

2

 

1

0

1

1

0

0

2

3

0

 

1

0

1

1

0

0

2

0

3

 

1

0

1

1

0

0

0

2

3

 

1

0

1

1

0

0

 

 

 

 

 

 

 

 

 

 

2

2

1

 

0

1

2

0

0

0

2

1

2

 

0

1

2

0

0

0

1

2

2

 

0

1

2

0

0

0

 

 

 

 

 

 

 

 

 

 

3

1

1

 

0

2

0

1

0

0

1

3

1

 

0

2

0

1

0

0

1

1

3

 

0

2

0

1

0

0

 


(to be continued with permutations, combinations, particle distinguishability (or not), and the multinomial theorem)