The Smoluchowski Ensemble—Statistical Mechanics of Aggregation

Matsoukas, Themis

doi:10.3390/e22101181

Open AccessEditor’s ChoiceArticle

The Smoluchowski Ensemble—Statistical Mechanics of Aggregation

by

Themis Matsoukas

Department of Chemical Engineering, Pennsylvania State University, University Park, PA 16802, USA

Entropy 2020, 22(10), 1181; https://0-doi-org.brum.beds.ac.uk/10.3390/e22101181

Submission received: 15 September 2020 / Revised: 12 October 2020 / Accepted: 13 October 2020 / Published: 20 October 2020

(This article belongs to the Special Issue Generalized Statistical Thermodynamics)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

We present a rigorous thermodynamic treatment of irreversible binary aggregation. We construct the Smoluchowski ensemble as the set of discrete finite distributions that are reached in fixed number of merging events and define a probability measure on this ensemble, such that the mean distribution in the mean-field approximation is governed by the Smoluchowski equation. In the scaling limit this ensemble gives rise to a set of relationships identical to those of familiar statistical thermodynamics. The central element of the thermodynamic treatment is the selection functional, a functional of feasible distributions that connects the probability of distribution to the details of the aggregation model. We obtain scaling expressions for general kernels and closed-form results for the special case of the constant, sum and product kernel. We study the stability of the most probable distribution, provide criteria for the sol-gel transition and obtain the distribution in the post-gel region by simple thermodynamic arguments.

Keywords:

statistical thermodynamics; irreversible aggregation; Smoluchowski equation; gelation; phase transitions

1. Introduction

Aggregation is the process of forming structures through the merging of clusters. This generic process is encountered in a large variety of systems, from polymerization and colloidal aggregation to the clustering of social groups and the merging of galaxies. The mathematical foundations of aggregation were set by Smoluchowski [1], whose particular interest was in Brownian coagulation. The aggregation equation, more commonly known as Smoluchowski equation, is a rate equation on a distribution of clusters whose size (mass) changes by binary aggregation events. For a discrete population of clusters with integer masses in multiples of a unit mass (“monomer”) it takes the form [1],

\frac{d c_{k}}{d t} = \frac{1}{2} \sum_{j = 1}^{k - 1} c_{k - j} c_{j} K_{k - j, j} - \sum_{j = 1}^{\infty} c_{k} c_{j} K_{k, j},

(1)

where

c_{k}

is the number concentration of clusters with mass k and

K_{i, j}

the aggregation kernel, a rate constant for the merging of masses i and j. A large body of literature has focused on the theory of the Smoluchowski equation, the existence of analytic solution and the scaling limit [2]. Of particular interest is gelling, a condition that arises under the product kernel

K_{i, j} = i j

; it refers to the formation of a giant structure, as in polymer gels, and is manifested by the failure of the Smoluchowski equation to conserve mass. This process is commonly described as a phase transition, suggesting the possibility that statistical thermodynamics, a theory developed for equilibrium states of interacting particles, may perhaps be applicable in this clearly irreversible process.

Studies of Smoluchowski aggregation broadly fall in one of two categories, kinetic and stochastic. The kinetic approach is based on Equation (1) and its solution. Stable solutions conserve mass; gelling is identified as the point where mass conservation breaks down [3,4]. Post-gel solutions require additional assumptions as to how the gel and the dispersed phase interact [5]. The stochastic approach views clusters as entities that merge with probabilities proportional to the aggregation kernel. It was first formulated by Marcus [6] for a discrete finite population, and its formal mathematical treatment was developed by Lushnikov, who obtained solutions for certain special cases, including gelation [7,8,9,10,11]. In Lushnikov’s method all feasible distributions are given a probability, whose evolution in time is tracked via a generating functional. The approach is explicitly probabilistic and views the Smoluchowski equation as the mean-field approximation of the underlying stochastic process [12]. A different approach within the probabilistic realm makes use of combinatorial methods. This treatment originated with Stockmayer [13] and was further explored by Spouge [14,15,16,17]. The combinatorial approach considers the number of ways to build a particular distribution of clusters and assigns probabilities in proportion to that combinatorial weight. The ensemble of distributions is then reduced to the most probable distribution, which is identified by maximizing the combinatorial weight. This approach has two appealing advantages. It deals with a time-free ensemble in which time appears implicitly via the mean cluster mass. More importantly, it brings the problem closer to the viewpoint of statistical mechanics and the notion that an ensemble may be represented in the scaling limit by its most probable element. Stockmayer recognized this connection and his treatment of gelation is replete with references to the theory of phase transitions [13]. The analogy between aggregation and thermodynamics was not formalized, however. Stockmayer obtained the gel point by mathematical, not thermodynamic methods, and arrived at a post-gel solution that is not consistent with the kinetics of aggregation [5].

We have previously shown that gelation can be indeed treated as a formal phase transition and have presented solutions for the product kernel in the pre- and post-gel regions [18] based on our earlier work on the cluster ensemble [19,20]. Here we generalize the methodology to formulate a rigorous thermodynamic theory of Smoluchowski aggregation. We begin with a finite population that starts from a well defined state and construct the set of all possible distributions that can be reached in a fixed number of elementary transitions. The probability of distribution in this ensemble is governed by the kinetics of the elementary processes that act on the population. In the thermodynamic limit the most probable distribution is overwhelmingly more probable than all others and is governed by a set of mathematical relationships that we recognize as thermodynamics. The work is organized as follows. In Section 2 we define the Smoluchowski ensemble of distributions and their probabilities. In Section 3 we formulate the probability of distribution in terms of a special functional W that introduces the partition function and the Shannon entropy of distribution. In Section 4 we treat the scaling limit and derive the thermodynamic relationships of the Smoluchowski ensemble. In Section 5 we obtain solutions of the Gibbs form for the classical kernels, constant, sum and product. We analyze the stability and phase behavior of the ensemble in Section 6 and treat the sol-gel process as a phase transition. In Section 7 we express the results in the continuous domain and finally offer concluding remarks in Section 8.

2. The Smoluchowski Ensemble

We consider a population of clusters composed of

i = 1, 2 \dots

units (monomers). In binary aggregation two clusters merge to form a new cluster that conserves mass, via the schematic reaction

(i) + (j) \overset{K_{i, j}}{\to} (i + j) .

(2)

The merging of a pair constitutes an elementary stochastic event, whose probability depends on the aggregation kernel

K_{i, j}

. At the initial state the population consists of

N_{0} = M

single members (monomers). This distribution constitutes generation

g = 0

. The next generation is constructed by implementing every possible aggregation event in the distribution of generation

g = 0

. The set of distributions formed in this manner constitutes the microcanonical ensemble of generation

g = 1

. We continue recursively to form the ensemble of distributions in generation g by implementing all possible aggregation events, one at a time, in all distributions of the parent ensemble. We represent a distribution of clusters by the vector

n = (n_{1}, n_{2} \dots)

, where

n_{i}

is the number of clusters with i members. All distributions in generation g satisfy the conditions

\sum_{i} n_{i} = M - g = N, \sum_{i} i n_{i} = M .

(3)

The first condition expresses the fact each elementary event decreases the number of clusters by 1, according to the stoichiometry of binary merging; the second condition expresses the fact that the number of members is conserved. Conversely, any distribution that satisfies the conditions in Equation (3) is a member of the ensemble of generation g because it can be formed in g steps from M monomers. We view the two equations in Equation (3) as the constraints that define the ensemble of feasible distributions. We call this ensemble microcanonical to indicate that it is conditioned by two extensive constraints that fix the mean cluster mass

M / N = \bar{x}

in all distributions of the ensemble.

The evolution of the ensemble may be represented in the form of a layered graph (Figure 1), whose vertices represent distributions and edges represent elementary transitions according to Equation (2). Edges are directed from parent in generation

g - 1

to offspring in generation g. Layers are organized by generation and contain all distributions in a generation. The graph begins in generation

g = 0

with a distribution of all monomers and ends when all units have joined the same cluster. Stochastic aggregation is a random walk on this graph. A trajectory is a possible sequence of connected edges from top to bottom. Our goal is to establish the probability

P (n)

of distribution in generation

g = 0, 1, \dots M - 1

, in terms the aggregation kernel

K_{i, j}

for any M.

2.1. Kinetics

When cluster masses

i - j

and j, in distribution

n^{'}

of generation

g - 1

, merge to form a cluster of mass i, the parent distribution

n^{'}

is transformed to offspring distribution

n

via the transition

n^{'} \overset{(i - j) + (j) \to (i)}{\to} n .

(4)

This transition is represented by an edge in the graph of Figure 1. Its rate

R_{i - j, j}

is proportional to the number of ways to choose the reactants and the proportionality factor is the aggregation kernel:

R_{i - j, j} = K_{i - j, j} \frac{n_{i - j}^{'} (n_{j}^{'} - δ_{i - j, j})}{1 + δ_{i - j, j}} .

(5)

The total rate by which parent

n^{'}

produces offspring is

R (n^{'}) = K (n^{'}) \frac{N^{'} (N^{'} - 1)}{2},

(6)

where

N^{'} = \sum_{i} n_{i}^{'}

is the number of clusters and

K (n^{'})

is the mean kernel in parent distribution

n^{'}

:

K (n^{'}) = \frac{2}{N^{'} (N^{'} - 1)} \sum_{i} \sum_{j} K_{i, j} \frac{n_{i}^{'} (n_{j}^{'} - δ_{i, j})}{1 + δ_{i, j}} .

(7)

In physical terms the aggregation kernel

K_{i, j}

is the rate constant for the reaction between masses i and j. Its mathematical form may be constructed on the basis of a kinetic model for the particular problem. It is beyond the scope of this work to review the numerous kernels that have been proposed in the literature. We mention a selected few that are important for their physical, mathematical and historical significance, and summarize them in Table 1.

The Brownian coagulation kernel was derived by Smoluchowski [1] to describe the kinetics of diffusion limited aggregation in colloidal systems. The constant kernel was adopted by Smoluchowski [1] as an approximation for the Brownian kernel, a simplification that allows analytic results. This kernel is obtained by setting

i = j

in the Brownian kernel. The Flory/Stockmayer kernel [13,21] is a model for polymerization of chains composed of monomers with f functional groups. Assuming no cycles, a polymer with i monomers contains

f i - 2 i + 2

unreacted functional groups that are available to react. The Flory/Stockmayer kernel is the product of the unreacted functional groups in the two chains that merge. This kernel leads to gelation [13]. The product kernel is the limiting form of the Flory/Stockmayer kernel when the number of functional groups approaches infinity. It also leads to gelation, and being a simpler kernel than the Flory/Stockmayer, it serves as the standard model to study gelation. The sum kernel is proportional to the number of units in each cluster. This kernel may be viewed as the limiting form of a Flory/Stockmayer type kernel with two kinds functional groups [15], but its significance is primarily mathematical as one of a handful of kernels that lead to analytic solutions.

We discuss the constant, sum and product kernel in detail in Section 5. For now we leave the kernel general and unspecified. We only place the minimum conditions,

K_{i, j} = K_{j, i} > 0

, which are required from elementary physical considerations; additionally, we adopt the normalization

K_{1, 1} = 1

.

2.2. Probabilities

We assign a probability

P (n)

to each distribution

n

within generation g and formulate its propagation in proportion to the probability of the parent and the rate of the parent-offspring transition:

P (n) = \sum_{n^{'}} P (n^{'}) \frac{R_{i - j, j}}{{〈R〉}_{g - 1}} .

(8)

Here

n

is a distribution in generation g,

n^{'}

is its parent of

n

in generation

g - 1

via the reaction

(i - j) + (j) \to (i)

,

R_{i - j, j}

is the rate of the reaction, and

{〈R〉}_{g - 1}

is the mean reaction rate in parent generation

g - 1

:

{〈R〉}_{g - 1} = \sum_{n^{'}} P (n^{'}) R (n^{'}) .

(9)

In both Equations (8) and (9) the summations are over all distributions

n^{'}

in generation

g - 1

. Expressing the transition rate in terms of the aggregation kernel we obtain

\frac{R_{i - j, j}}{{〈R〉}_{g - 1}} = \frac{2}{N^{'} (N^{'} - 1)} \frac{K_{i - j, j}}{{〈K〉}_{g - 1}} \frac{n_{i - j}^{'} (n_{j}^{'} - δ_{i - j, j})}{1 + δ_{i - j, j}} .

(10)

with

{〈K〉}_{g - 1} = \sum_{n^{'}} P (n^{'}) K (n^{'}) .

(11)

We may confirm that

P (n)

as defined in Equation (8) satisfies normalization over all distributions in the same generation. Beginning with

P (n_{0}) = 1

at the initial state, Equation (8) uniquely determines the probabilities of all distributions in all future generations.

2.3. Smoluchowski Equation

The mean number of clusters with mass k in generation g is

〈n_{k}〉 = \sum_{n} n_{k} P (n),

(12)

with the summation going over all distributions in the same generation. We will derive the evolution of the mean distribution from parent generation

g - 1

to generation g. The probability of distribution

P (n)

is given by Equation (8) and is expressed as a summation over its parents

n^{'}

. By the stoichiometry of the transition in Equation (2), the parent and offspring distributions satisfy

n_{k} = n_{k}^{'} + δ_{k, i} - δ_{k, i - j} - δ_{k, j} .

(13)

Combining this relationship with (12) and (8) the result is (see Supplementary Material)

\begin{matrix} 〈n_{k}〉 - 〈n_{k}^{'}〉 = \frac{2}{N (N + 1) {〈K〉}_{M, N + 1}} {〈\frac{1}{2} \sum_{j = 1}^{k - 1} n_{k - j} (n_{j} - δ_{k - j, j}) K_{k - j, j} - \sum_{j = 1}^{\infty} n_{k} (n_{j} - δ_{k, j}) K_{k, j}〉}_{M, N + 1} . \end{matrix}

(14)

The left-hand side is the change in the mean number of k-mers between generations; the right-hand side is the ensemble average of the production and depletion of k-mers within all distributions of the parent ensemble. Define the mean time increment

Δ t

from parent to offspring generation as

Δ t |_{g - 1 \to g} = \frac{2}{N (N + 1) {〈K〉}_{M, N + 1}};

(15)

then Equation (14) reads

{\frac{Δ 〈n_{k}〉}{Δ t}|}_{g - 1 \to g} = {〈\frac{1}{2} \sum_{j = 1}^{k - 1} n_{k - j} (n_{j} - δ_{k - j, j}) K_{k - j, j} - \sum_{j = 1}^{\infty} n_{k} (n_{j} - δ_{k, j}) K_{k, j}〉}_{M, N + 1} .

(16)

In the mean-field approximation we reduce the ensemble into a single distribution,

n^{*}

. This resolves the ensemble averages trivially and leads to the governing equation for

n^{*}

:

\frac{Δ n_{k}^{*}}{Δ t} = \frac{1}{2} \sum_{j = 1}^{k - 1} n_{k - j}^{*} (n_{j}^{*} - δ_{k - j, j}) K_{k - j, j} - \sum_{j = 1}^{\infty} n_{k}^{*} (n_{j}^{*} - δ_{k, j}) K_{k, j} .

(17)

This is the Smoluchowski equation for binary aggregation, the discrete finite equivalent of Equation (1). The mean field approximation, which is invoked to obtain (17), implies that a single distribution is representative of the entire ensemble. In Section 4 and Section 6 we examine the conditions under which this is true.

3. Thermodynamic Formalism

3.1. Partition Function and Selection Functional

We now formulate the probability of distribution in terms of a special functional,

W (n)

. It is through this formulation that we will make contact with statistical thermodynamics. We begin by writing the probability

P (n)

in generation g in the form

P (n) = \frac{n! W (n)}{Ω_{M, N}},

(18)

where

n!

is the multinomial coefficient of vector

n

,

n! = \frac{(n_{1} + n_{2} \dots)!}{n_{1}! n_{2}! \dots} = \frac{N!}{n_{1}! n_{2}! \dots},

(19)

N = M - g

is the number of clusters in all distributions of generation g,

W (n)

is a functional of distribution

n

, to be determined, and

Ω_{M, N}

is the partition function. By the normalization condition on

P (n)

the partition function satisfies

Ω_{M, N} = \sum_{n} n! W (n),

(20)

with the summation over all distributions in generation

g = M - N

.

3.2. Shannon Entropy

The multinomial coefficient represents the combinatorial multiplicity of distribution

n

, namely, the number of ways to order the clusters in the distribution, if clusters with the same number of units are treated as indistinguishable. In the Stirling approximation,

log x! = x log x + O (log x)

, the log of the multinomial coefficient is

log n! = - \sum_{i} n_{i} log \frac{n_{i}}{N} ≐ H (n) .

(21)

It is a concave functional of

n

with functional derivatives

\frac{\partial H (n)}{\partial n_{i}} = - log \frac{n_{i}}{N} .

(22)

It is also homogeneous in

n

with degree 1 and satisfies the Euler condition

H (n) = \sum n_{i} \frac{\partial H (n)}{\partial n_{i}} .

(23)

Setting

p_{i} = n_{i} / N

and applying H to vector

p

we obtain

H (p) = - \sum_{i} p_{i} log p_{i} .

(24)

In this form H reverts to the familiar entropy functional, historically associated with Boltzmann, Gibbs and Shannon. We will call it Shannon functional and avoid the generic term “entropy,” whose meaning across disciplines varies. For our purposes the Shannon functional is defined as

H (a) = H (a_{1}, a_{2} \dots) = - \sum_{i} a_{i} log \frac{a_{i}}{\sum_{k} a_{k}}

(25)

and may be applied to any vector

a

with non-negative elements regardless of normalization.

3.3. The Selection Functional

Functional

W (n)

biases the statistical weight of distribution

n

relative to its combinatorial multiplicity. We call it selection functional because it effectively selects distributions relative to each other. The functional derivative of

log W

is

log w_{i; n} = \frac{\partial log W (n)}{\partial n_{i}},

(26)

and defines the cluster function

w_{i; n}

, a property cluster mass i in distribution

n

. The cluster function

w_{i; n}

depends not only on i but also on the distribution

n

on which this factor is evaluated. In the special case that

log W

is linear functional of

n

the functional derivative is a function of i alone and is the same in all distributions. This special condition is associated with Gibbs distributions, which are discussed in Section 5.

If

W (n) = 1

for all distributions, then the probability of distribution is proportional to its combinatorial multiplicity

n!

. If this special condition is met we will call the ensemble unbiased. The partition function of the unbiased ensemble can be easily determined by a combinatorial argument: it is equal to number of ways to assign M objects into N groups and is given by [22]

Ω_{M, N}^{\circ} = (\binom{M - 1}{N - 1}) .

(27)

Accordingly, the probability of distribution in this special case is

P^{\circ} (n) = n! /(\binom{M - 1}{N - 1}) .

(28)

In a population undergoing transformations, for example aggregation, fragmentation and so forth, the selection functional is determined by the kinetic details of the mechanisms that produce these transformations; in the case of aggregation it is determined by the aggregation kernel

K_{i, j}

. The question arises whether the unbiased ensemble is a possible solution of the Smoluchowski ensemble under some kernel. The answer is yes, and is given in Section 5.

3.4. Propagation Equations

At the initial state all clusters are monomers and the distribution is

n_{i, 0} = M δ_{i, 1}

. We set

W (n^{0}) = 1

and since

n^{0}! = 1

we also have

Ω_{M, M} = 1

. We insert Equation (18) into Equation (8) and express the summation over parents of

n

as a summation over all pairs

(i - j, j)

that produce mass i in distribution

n

. The result is (see Supplementary Material)

\frac{Ω_{M, N + 1}}{Ω_{M, N}} = (\frac{M - N}{N} \frac{1}{{〈K〉}_{M, N + 1}}) (\sum_{i = 2}^{\infty} \frac{n_{i}}{M - N} \sum_{j = 1}^{i - 1} K_{i - j, j} \frac{W (n^{'})}{W (n)}) .

(29)

Here N is the number of clusters in distribution

n

of generation

g = M - N

,

n^{'}

is the parent distribution via the transition

(i - j) + (j) \to (i)

and

{〈K〉}_{M, N + 1}

is the mean kernel in the parent generation

g^{'} = g - 1

. The left-hand side of Equation (29) depends solely on M and N whereas the second term on the right-hand side contains functionals of distribution

n

. This term must be the same for all distributions

n

in the same generation in order to produce a result that is a function of M and N alone. From Equation (18) it is clear that W and

Ω_{M, N}

may be defined within a proportionality constant

α_{M, N}

; as long as this constant is common for all distributions in a generation it has no effect on probabilities and may be chosen arbitrarily. We choose it to satisfy the following criterion: if

W = constant

for all distributions, we require this constant to be 1. The choice that satisfies this condition is to set the double summation in Equation (29) to 1. Equation (29) now splits into two separate recursions, one for the partition function,

\frac{Ω_{M, N + 1}}{Ω_{M, N}} = \frac{M - N}{N} \frac{1}{{〈K〉}_{M, N + 1}}

(30)

and one for the selection functional,

\sum_{i = 2}^{\infty} \frac{n_{i}}{M - N} \sum_{j = 1}^{i - 1} \frac{W (n^{'})}{W (n)} K_{i - j, j} = 1 .

(31)

The recursion for the partition function is readily solved to produce the partition function in generation

g = M - N

:

Ω_{M, N} = Ω_{M, N}^{\circ} \prod_{γ = 0}^{M - N + 1} {〈K〉}_{M, M - γ} .

(32)

Accordingly, the partition function is equal to the unbiased partition function times the product of all mean kernels from generation 0 up to the parent generation

g - 1

. We write the recursion for the selection functional in the form

W (n) = \sum_{i = 2}^{\infty} \frac{n_{i}}{M - N} \sum_{j = 1}^{i - 1} K_{i - j, j} W (n^{'}) .

(33)

The result gives the selection functional of the offspring as a linear combination of selection functionals of all its parents. In principle this can be solved recursively for any distribution in any generation. For certain special cases the recursion can be solved in closed form. These are discussed in Section 5.

4. Scaling Limit

4.1. Most Probable Distribution

We define the scaling limit by the condition

M, N \to \infty

at fixed

M / N = \bar{x}

. The expectation is that in this limit the intensive mean distribution

〈n_{k}〉 / N

must converge to a limiting distribution

{\bar{p}}_{k}

that is independent of M and N and depends only on

M / N = \bar{x}

:

\frac{〈n_{k}〉}{N} \to {\bar{p}}_{k} .

(34)

We further anticipate that the probability of distribution

P (n)

becomes infinitely sharp around a single distribution,

n^{*} = N p^{*}

, such that

p_{k}^{*}

is not merely the most probable distribution, it is overwhelmingly more probable than any other distribution in the ensemble. This further implies that the mean distribution and most probable distribution converge to each other:

〈p_{k}〉 \to p_{k}^{*} .

(35)

This convergence is an implicit requirement for the validity of the Smoluchowski equation: the mean-field approximation is exact if a single distribution is representative of the entire ensemble. This is possible only if

P (n)

peaks very sharply about the most probable distribution. When a single term dominates the summation that defines the partition function in Equation (20), the log of the sum converges to the log of the maximum term,

log Ω_{M, N} = H (n^{*}) + log W (n^{*}),

(36)

with

H (n^{*}) = log n^{*}!

. As a further consequence of the intensive convergence in (34) we have the Euler relationship for

log W

:

log W (n^{*}) = \sum_{i} n_{i}^{*} log w_{i}^{*} .

(37)

where

log w_{i}^{*} = log w_{i; n^{*}}

is the functional derivative of

log W (n^{*})

,

log w_{i}^{*} = \frac{\partial log W (n^{*})}{\partial n_{i}^{*}} .

(38)

Equation (37) expresses the fact that

log W

is homogeneous functional of the MPD. This condition follows from Equation (36) and the homogeneity properties of

H (n^{*})

and

log Ω_{M, N}

.

The most probable distribution (MPD) maximizes the probability in Equation (18) among all distributions that satisfy the constraints in Equation (3). By Lagrange maximization we obtain the MPD in the form

\frac{n_{k}^{*}}{N} = w_{k}^{*} \frac{e^{- β i}}{q},

(39)

and q and

β

are parameters related to the Lagrange multipliers. We insert the MPD into Equation (36) to obtain

log Ω_{M, N} = β M + (log q) N .

(40)

This fundamental equation relates the partition function to the primary variables of the ensemble: the macroscopic variables

(M, N)

that define the ensemble, and the Lagrange multipliers

(β, q)

that appear in the MPD. The convergence of

n_{k}^{*} / N

to intensive limit

p_{k}^{*}

implies that

β

and q are intensive, that is, they are functions of

\bar{x} = M / N

but not of M or N individually. This further implies that Equation (40) is homogeneous function of M and N with degree 1 and thus must satisfy Euler’s theorem:

log Ω_{M, N} = (\frac{\partial log Ω_{M, N}}{\partial M}) M + (\frac{\partial log Ω_{M, N}}{\partial N}) N .

(41)

Direct comparison with Equation (40) leads to:

\begin{matrix} β = {(\frac{\partial log Ω_{M, N}}{\partial M})}_{N}, \end{matrix}

(42)

\begin{matrix} log q = {(\frac{\partial log Ω_{M, N}}{\partial N})}_{M} . \end{matrix}

(43)

Thus the Lagrange multipliers that appear in the MPD are the partial derivatives of the partition function. Differentiation of Equation (40) with respect to all variables that appear on the right-hand side gives

M d β + N d log q = 0 .

(44)

This is the Gibbs-Duhem equation associated with the Euler equation for

log Ω_{M, N}

in Equation (40). It may be written as

\bar{x} = - \frac{d log q}{d β} .

(45)

In this form its expresses the relationship between

β

, q and

\bar{x}

.

The MPD maximizes the log of the microcanonical weight,

H (n) + log W (n)

and its maximum is

log Ω_{M, N}

. Therefore we have the inequality:

log Ω_{M, N} \geq H (n) + log W (n) .

(46)

It is satisfied by all distributions

n

in the

(M, N)

ensemble with the equal sign only for

n = n^{*}

. This is the fundamental variational principle of the ensemble: it defines the MPD and generates all relationships of this section.

4.2. Thermodynamics

We recognize the equations of the previous section as those of familiar statistical thermodynamics. Equation (39) is the generalized canonical distribution, a member of the exponential family, whose parameters

β

and q are related to the microcanonical partition function via Equations (40), (42) and (43). We define the extensive form of the canonical partition function

Q (β, N)

via the Legendre transformation of

log Ω

:

log Q = log Ω - M {(\frac{\partial log Ω}{\partial M})}_{N} = N log q,

(47)

and thus we recognize

q = Q^{1 / N}

as the intensive form of the canonical partition function.

The variational condition that produces the set of thermodynamic relationships is the inequality in Equation (46), which defines the MPD as the distribution that maximizes the microcanonical weight. Expressing

H (n^{*})

and

log W (n^{*})

in terms of the Euler relationships (21) and (37), respectively, this inequality takes the form

\frac{log Ω_{M, N}}{N} \geq - \sum_{i} p_{i} log \frac{p_{i}}{w_{i}^{*}},

(48)

where

p_{i} = n_{i} / N

. The inequality is satisfied by all distributions

p_{i}

with mean

\bar{x} = M / N

and the equality applies only to

p_{i} = p_{i}^{*}

. With

w_{i}^{*} = 1

it reduces the second law: the log of the microcanonical partition function is equal to the Shannon entropy of the most probable distribution, and this is larger than the entropy of any other distribution with the same mean.

Table 2 summarizes these relationships. They are consequences of the maximization of the probability in Equation (18) and are independent of the details of aggregation. These details enter only through Equations (32) and (33), which express the partition function and the selection functional in terms of the aggregation kernel.

5. Gibbs Distributions

A special type of functional is of the form

W (n) = \prod_{i} w_{i}^{n_{i}},

(49)

whose log is linear in

n

log W (n) = \sum_{i} n_{i} log w_{i}

(50)

with functional derivative

log w_{i}

. Here

w_{i}

is a function of i alone and does not depend on

n

. If the selection functional is given by Equation (49) the probability of distribution in Equation (18) takes the form

P (n) = \frac{N!}{Ω_{M, N}} \prod_{i} \frac{w_{i}^{n_{i}}}{n_{i}!} .

(51)

Probability distributions of this type are called Gibbs distributions [23] and are frequently encountered in stochastic processes [24]. Several important results can be obtained in analytic form. In particular, the mean distribution is [22]:

\frac{〈n_{k}〉}{N} = w_{k} \frac{Ω_{M - k, N - 1}}{Ω_{M, N}} .

(52)

The result is exact for all

1 \leq N \leq M

,

1 \leq k \leq M - N + 1

.

We apply this selection functional of Equation (49) to the transition

(i - j) + (j) \to (i)

that converts parent distribution

n^{'}

to offspring

n

. By the stoichiometry of the transition we have

\frac{W (n^{'})}{W (n)} = \frac{w_{i - j} w_{j}}{w_{i}} .

(53)

Inserting into Equation (31) we obtain

\sum_{i = 2}^{\infty} \frac{n_{i}}{M - N} \sum_{j = 1}^{i - 1} \frac{w_{i - j} w_{j}}{w_{i}} K_{i - j, j} = 1 .

(54)

One possible solution that satisfies this equation for all distributions

n

is

w_{i} = \frac{1}{i - 1} \sum_{j = 1}^{i - 1} w_{i - j} w_{j} K_{i - j, j}; w_{1} = 1 .

(55)

This is not the only possible solution for W in Equation (31) and may or may not be acceptable; if it is, we have obtained a Gibbs distribution and the kernel is a Gibbs kernel.

We have identified three kernels for which Equation (55) is the correct solution. These are the constant kernel,

K_{i, j} = 1,

(56)

the sum kernel

K_{i, j} = \frac{i + j}{2}

(57)

and their linear combinations. The product kernel is a quasi-Gibbs kernel and is discussed in Section 5.3.

Here we provide detailed solutions for the constant and sum kernels. We will not discuss the linear combination in part because the results are more involved but mainly because this kernel reverts to the sum kernel when cluster masses are large thus it does not contribute to our understanding of aggregation beyond what we learn by studying the constant and sum kernels separately.

5.1. Constant Kernel

With

K_{i, j} = 1

Equation (55) gives

w_{i} = 1

for all i. Accordingly, the ensemble is unbiased and its partition function is given by Equation (27):

Ω_{M, N} = Ω_{M, N}^{\circ} = (\binom{M - 1}{N - 1}) .

(58)

The mean distribution follows from Equation (52) and is given by

\frac{〈n_{k}〉}{N} = (\binom{M - k - 1}{N - 2})/ (\binom{M - 1}{N - 1}) .

(59)

To obtain the most probable distribution we calculate the parameters

β

and q from Equations (42) and (43) along with (58). The differentiations may be done by first replacing the factorials in the partition function with the Stirling expression. Alternatively we may obtain these parameters by the discrete difference form of these derivatives and apply the asymptotic conditions

M, N ≫ 1

. The latter method is simpler:

\begin{matrix} β = log \frac{Ω_{M + 1, N}}{Ω_{M, N}} = \frac{M}{M - N + 1} \to \frac{\bar{x}}{\bar{x} - 1}, \end{matrix}

(60)

\begin{matrix} q = \frac{Ω_{M, N + 1}}{Ω_{M, N}} = \frac{M - N}{N} = \bar{x} - 1 . \end{matrix}

(61)

We obtain the MPD from (39) with

w_{k}^{*} = 1

:

\frac{n_{k}^{*}}{k} = \frac{1}{\bar{x} - 1} {(\frac{\bar{x}}{\bar{x} - 1})}^{- k} .

(62)

For large

\bar{x}

this goes over to the exponential distribution

f (x) = \frac{e^{- x / \bar{x}}}{\bar{x}},

(63)

which is the well known result for the constant kernel. Here x stands for the continuous cluster mass.

5.2. Sum Kernel

The ensemble average of the sum kernel is

{〈K〉}_{M, N} = \frac{M}{N} .

(64)

We obtain the partition function from Equation (32). The result is

Ω_{M, N} = N! \frac{M^{M - N}}{M!} (\binom{M - 1}{N - 1}) .

(65)

The factors

w_{k}

that satisfy Equation (55) are

w_{k} = \frac{k^{k - 1}}{k!}

(66)

and the mean distribution follows from (52),

\frac{〈n_{k}〉}{N} = \frac{k^{k - 1}}{k!} \frac{{(M - k)}^{M - N - k}}{M^{M - N - 1}} \frac{(N - 1) (M - N)!}{N (M - N - k + 1)!} .

(67)

This is an exact result for all

1 \leq N \leq M

,

1 \leq k \leq M - N + 1

. The parameters

β

and q are obtained similarly to those for the constant kernel:

\begin{matrix} β = \frac{Ω_{M + 1, N}}{Ω_{M, N}} \to \frac{M - N}{M} - log \frac{M - N}{M}, \end{matrix}

(68)

\begin{matrix} q = \frac{Ω_{M, N + 1}}{Ω_{M, N}} \to \frac{M - N}{M} . \end{matrix}

(69)

Combining with Equation (39) we obtain the MPD in the form

\frac{n_{k}^{*}}{N} = \frac{k^{k - 1}}{k!} θ^{k - 1} e^{k θ},

(70)

with

θ = 1 - 1 / \bar{x}

. We use the Stirling formula for the factorial the MPD in the continuous limit takes the form

f (x) = \frac{θ^{x - 1}}{\sqrt{2 π}} \frac{e^{- x θ}}{x^{3 / 2}} .

(71)

Figure 2 shows the MPD for

\bar{x} = 10

and the mean distribution from Equation (67) at fixed

M / N = 10

for various values of M and N. In the scaling limit the mean distribution converges to the MPD.

5.3. Quasi-Gibbs Kernels—The Product Kernel

We are able to obtain closed-form expressions for the partition function of the constant and sum kernels and heir linear combinations because they all satisfy the condition

{〈K〉}_{M, N} = K (n)

(72)

for all

n

. This states that the mean kernel is the same in all distributions of the ensemble, therefore also equal to the ensemble average kernel. In this case the calculation of the ensemble average kernel is trivial, as it does not require knowledge of the probabilities

n

. The constant kernel, sum kernel and their linear combinations are the only kernels that satisfy (72) in the strictest sense, that is, for all

n

that satisfy the two constraints in (3). We refer to Equation (72) as the Gibbs condition because it generates Gibbs distributions. We may relax the requirement that all distributions obey the Gibbs condition with the milder requirement that it be obeyed by most distributions. This is the case of the produce kernel. The product kernel is defined

K_{i, j} = i j,

(73)

and its mean within distribution

n

is

K (n) = \frac{N}{N - 1} ({〈i〉}^{2} - \frac{〈i^{2}〉}{N}) .

(74)

Here

〈i〉 = M / N

and

〈i^{2}〉

are the normalized first moment and second moments of

n

, respectively. In the limit

N \to \infty

,

M / N = fixed

, this scales as

K (n) \sim {〈i〉}^{2} = {(\frac{M}{N})}^{2},

(75)

in most distributions except those that contain clusters of the order M. (The largest cluster size in the ensemble is

k_{\max} = M - N + 1

and for

M ≫ N

it is of the order M.) According to Equation (75) the product kernel is a quasi-Gibbs kernel: it satisfies the Gibbs condition in Equation (72) asymptotically in most but not all feasible distributions. We proceed to obtain the Gibbs distribution of the product kernel and test its validity.

Inserting (75) into (32) we obtain the partition function:

Ω_{M, N} = {(N! \frac{M^{M - N}}{M!})}^{2} (\binom{M - 1}{N - 1}) .

(76)

We complete the solution by evaluating

w_{k}

from Equation (55),

w_{k} = \frac{2^{k - 1} k^{k - 2}}{k!} .

(77)

The mean distribution is obtained by inserting these results into Equation (52):

\frac{〈n_{k}〉}{N} = \frac{2^{k - 1} k^{k - 3} (N - 1) M! M^{1 - 2 (M - N)} (M - N)! {(M - k)}^{2 (M - N - k)}}{N^{2} (k - 1)! (M - k - 1)! (M - N - k + 1)!} .

(78)

Unlike Equations (62) and (67) this result is not exact. This can be demonstrated numerically by the fact this distribution is not normalized to unity and its mean is not

M / N

for finite M, N; its approaches proper normalization in the asymptotic limit. This failure arises from the fact that Equation (52) requires a Gibbs probability distribution that strictly applies to all distributions of the

(M, N)

ensemble.

We obtain

β

and

log q

from Equations (42) and (43):

\begin{matrix} β = \frac{M - N}{M} - 2 log \frac{M - N}{M} \end{matrix}

(79)

\begin{matrix} q = \frac{N (M - N)}{M^{2}} . \end{matrix}

(80)

Using

θ = 1 - 1 / \bar{x}

the MPD is

\frac{n_{k}^{*}}{N} = \frac{{(2 θ k)}^{k - 2}}{k!} \frac{2 θ}{1 - θ} e^{- 2 θ k}

(81)

and in the continuous limit

f (x) = \frac{2^{x} e^{x (1 - 2 θ)} θ^{x - 1}}{\sqrt{8 π} (1 - θ) x^{5 / 2}} .

(82)

These results are summarized in Table 3 along with those for the constant and sum kernels.

The relationship between the mean and the most probable distribution of the product kernel is shown in Figure 3 for two values of the mean cluster,

\bar{x} = 1.75

and

\bar{x} = 4

. At

\bar{x} = 1.75

the mean distribution calculated from Equation (78) is not exact but its moments asymptotically approach the correct values as M and N are increased at fixed

M / N

. At

\bar{x} = 4

the behavior is different. A peak develops at the long tail of the distribution. It is pushed to ever larger sizes but never vanishes. In this region the mean distribution from Equation (78) is not correct: its mean does not converge to

\bar{x}

when M and N are increased, but to a value smaller than

\bar{x}

, which implies that mass conservation is not satisfied. This breakdown is manifestation of gelation, the emergence of an infinite cluster that is not captured by the mean field theory. The precise nature of the gel phase is discussed in the next section.

6. Phase Behavior

6.1. Stability

The fundamental inequality of the ensemble is Equation (46) that defines the most probable distribution. This condition implies that the microcanonical functional is concave and this in turn implies that

log Ω_{M, N}

is a concave function of M and N and requires (see Supplementary Material)

\frac{d β}{d \bar{x}} \leq 0 or \frac{d log q}{d \bar{x}} \geq 0 .

(83)

These equivalent conditions guarantee the existence of the MPD in the form of Equation (39). In thermodynamic language they ensure that the MPD represents a stable state. The parameters

β

and q of the constant, sum and product kernel are plotted in Figure 4a,b, respectively, as a function of the progress variable

θ = 1 - 1 / \bar{x}

. According to Equation (83) stability requires

β

to be decreasing function of

\bar{x}

and q increasing function of

\bar{x}

. The constant kernel is stable at all

θ

:

β_{const}

decreases and

q_{const}

increases monotonically over the entire range of

θ

. The sum kernel is also stable at all

θ

but reaches the limit of stability at

θ = 1

or

\bar{x} = \infty

. This kernel is borderline-stable: it is stable for all finite times and reaches instability at

t = \infty

. The product kernel is stable up to

\bar{x} = 0.5

beyond which point both

β_{prod}

and

q_{prod}

violate the stability criteria.

To survey the stability landscape of aggregation we employ the power-law kernel,

K_{i, j} = {(i j)}^{ν / 2},

(84)

with arbitrary exponent

ν \geq 0

. This is a homogeneous kernel with degree

ν

. It reverts to the product kernel with

ν = 2

and to the constant kernel with

ν = 0

. We treat this as a quasi-Gibbs kernel by analogy to the product kernel. We take the ensemble average power-law kernel to scale as

{〈K〉}_{M, N} \sim {(\frac{M}{N})}^{ν},

(85)

and obtain the parameters

β

and q as

β = ν θ - log θ, q = θ {(1 - θ)}^{ν - 1} .

(86)

With

ν = 0

and

ν = 2

these revert, as expected, to the results for the constant and product kernels, respectively. Interestingly, with

ν = 1

we obtain the

(β, q)

parameters for the sum kernel. This behavior turns the power-law kernel into a useful tool, a homogeneous kernel that reproduces the correct

(β, q)

values of the constant, sum and product kernels, and which may be used to interpolate (and cautiously extrapolate) to other homogeneous kernels by varying the exponent

ν

.

The stability limit in power-law aggregation is reached at

θ^{*} = 1 / ν .

(87)

Accordingly, the MPD is stable in

0 \leq θ \leq θ^{*}

and unstable in

θ^{*} < θ \leq 1

. The phase diagram is shown in Figure 4a,b with the stable region indicated by the shaded area. For

ν \leq 1

the system is stable at all

θ

from 0 to 1. For

ν = 1

the limit of stability appears at

θ = 1

, which is reached in infinite time. In practice the system is stable at all finite times. For

ν > 1

the stability limit is reached within finite time at the point where the mean size reaches the critical value

{\bar{x}}^{*} = \frac{1}{1 - θ^{*}} = \frac{ν}{ν - 1} .

(88)

For

ν = 2

(product kernel) the limit of stability is reached at

{\bar{x}}^{*} = 2

. We see from Figure 4 that both

β

and q reach the limit of stability simultaneously.

6.2. Phase Splitting—The Sol-Gel Transition

When the system crosses into the unstable region its state is no longer represented by the MPD but by a mixture of two phases, each with its own MPD. What are these phases? To answer this question we first observe that the elements of the ensemble are fundamentally discrete distributions; the apparent continuity in the scaling limit is a mathematical artifact, a great convenience, but not a fundamental quality of the ensemble. To understand therefore the nature of the gel phase we must consider the discrete finite system. Given a distribution of M particles partitioned into N clusters, the maximum cluster mass possible is

k_{\max} = M - N + 1

and is found in a single distribution of the ensemble, in which one cluster contains

M - N + 1

units and the remaining

N - 1

clusters contain one unit mass each. The region

(k_{\max} + 1) / 2 < k \leq k_{\max}

is special: it is either empty, or it contains a single cluster. It cannot contain more than one cluster because there is not enough mass to have two clusters that are both larger than

(k_{\max} + 1) / 2

. In the event that it does contain a cluster, its mass is of the order of

k_{\max} = M - N + 1

, and in the asymptotic limit, of the order M. This means that the mass in the region

k > k_{\max} + 1) / 2

is of the same order as that in

k < (k_{\max} + 1) / 2

. A cluster in

k > (k_{\max} + 1) / 2

represents a giant component, a single element of the population that carries a finite fraction of the total mass contained in the distribution.

The set of distributions that do not contain a giant cluster constitute the sol phase; sol distributions satisfy the scaling form of the mean kernel in Equation (85) and the Gibbs condition in Equation (72). Distributions that contain a cluster in the gel region violate the Gibbs condition and will be treated as a mixture of a sol phase

(k \leq (k_{\max} + 1) / 2)

and a gel phase (

k > (k_{\max} + 1) / 2

). Given an individual distribution

n

, a certain fraction of mass is contained in the sol region with the rest in the gel region. The ensemble averages of these fractions define, respectively, the sol fraction,

ϕ_{sol}

, and gel fraction,

ϕ_{gel}

, in the ensemble:

ϕ_{sol} = \frac{1}{M} \sum_{n} P (n) \sum_{k = 1}^{k^{'}} n_{k}; ϕ_{gel} = \frac{1}{M} \sum_{n} P (n) \sum_{k = k^{'} + 1}^{k_{\max}} n_{k}; ϕ_{sol} + ϕ_{gel} = 1,

(89)

with

k^{'} = (k_{\max} + 1) / 2

. If

P (n)

is such that in the scaling limit

ϕ_{gel} \to 0

, the ensemble consists of a single phase, the sol, and is represented by the MPD in Equation (39). If

ϕ_{gel} > 0

the ensemble is represented by a mixture of the two phases. We will determine their distributions and construct the tie line between the two phases.

We suppose that the state at

(M, N)

consists of a sol phase with

M_{sol}

,

N_{sol} = N - 1

, and a gel phase with

M_{gel} = M - M_{sol}

. The evolution of the sol phase is governed by Equation (30), which we now write as

{(\frac{Ω_{M + 1, N}}{Ω_{M, N}})}_{sol} = q (θ_{sol}) = θ_{sol} {(1 - θ_{sol})}^{ν - 1} .

(90)

This must be satisfied by the sol phase at all times. In the pre-gel region the state is a single phase, sol, with

θ_{sol} = θ = 1 - N / M

and

(β, q)

parameters from Equation (86). In the post-gel region it is a mixture of two phases: a sol phase with mass

M_{sol}

and number of clusters

N_{sol} = N - 1

; and gel phase with mass

M_{gel} = M - M_{sol}

found in a single cluster (

N_{gel} = 1

). The sol phase is determined from Equation (90) with

θ_{sol} = 1 - M_{sol} / (N - 1)

and its

β

-q parameters are given by Equation (86) with

θ = θ_{sol}

. The mass of the gel phase is then obtained from the conservation conditions

M_{gel} = M - M_{sol}

. These steps are summarized below.

Pre-Gel Region

0 \leq θ < θ^{*}

The system consists of a sol phase and its MPD is

\frac{n_{k}^{*}}{N} = w_{k}^{*} \frac{e^{- β (θ)}}{q (θ)}

(91)

with

β = ν θ - log θ, q = θ {(1 - θ)}^{ν - 1}, θ = 1 - N / M .

(92)

Post-Gel Region

θ^{*} \leq θ < 1

The system consists of a sol phase with mass fraction

ϕ_{sol}

and a gel phase with fraction

ϕ_{gel} = 1 - ϕ_{sol}

.

Obtain $θ_{sol}$ by solving

$q (θ_{sol}) = q (θ), θ_{sol} \leq θ^{*} .$

(93)

with $q (θ)$ from Equation (86) and $θ = 1 - N / M$ .
Obtain $ϕ_{sol}$ and ${\bar{x}}_{sol}$ from

$ϕ_{sol} = \frac{1 - θ}{1 - θ_{sol}}, {\bar{x}}_{sol} = \frac{1}{1 - θ_{sol}} .$

(94)
Obtain the gel fraction from mass balance:

$ϕ_{gel} = 1 - ϕ_{sol} = \frac{θ - θ_{sol}}{1 - θ_{sol}} .$

(95)

The mean size of the gel cluster is ${\bar{k}}_{gel} = ϕ_{gel} M$ , where M is the total mass in the system. In the scaling limit the gel fraction is 1 and the size of the gel cluster is ∞.

The gel fraction and the mean cluster size for the product kernel

(ν = 2)

are shown in Figure 5 as a function of

θ

. The gel fraction is zero up until the gel point (

θ^{*} = 0.5

) and increases according to Equation (95) once in the post-gel region. The mean cluster size increases in the pre-gel region but decreases in the post-gel region, as clusters in the sol are lost by reaction with the gel. At

θ \to 1

(

t \to \infty

) all mass is found in the gel phase except for a single sol particle with unit mass. This is the infinite dilution limit of the sol phase, to borrow the terminology of solution thermodynamics.

The evolution of

{\bar{x}}_{sol}

past the gel point retraces its pre-gel history. This is a consequence of Equation (93), which resolves the sol phase in the two-phase region. The symmetry of

q (θ)

about

θ^{*} = 0.5

in the case of the product kernel produces a correspondingly symmetric evolution of

{\bar{x}}_{sol}

, as shown in Figure 5b. The dashed lines are Monte Carlo simulations with

M = 200

particles and are shown for comparison (the simulations are discussed in the next section). The deviation from theory near the gel point is due to finite size effects. In these simulations a relatively small number of particles was used to permit the collection of a large number of realizations within reasonable computational time. These simulations are discussed next.

6.3. Monte Carlo Simulations

We demonstrate the theory with simulations performed by the constant-V Monte Carlo method [25]. The method tracks a sample of clusters that undergo binary aggregation with probability proportional to the transition rate

R_{i, j}

given in Equation (5). At each step the simulation box contains a sample of N clusters, with N decreasing from M to 1 as clusters merge. A pair of clusters are chosen at random and is combined into a single cluster according the following criterion: draw a random number rnd in the interval

(0, 1)

and accept the merging of the clusters if

rnd < \frac{K_{i, j}}{K_{\max}},

(96)

where

K_{i, j}

is the aggregation kernel between the chosen clusters and

K_{\max}

is the maximum aggregation kernel in the simulation box. If the criterion is satisfied the event is accepted and the reactant particles are deleted and replaced by a cluster with their combined mass. If the event is rejected, a new pair is chosen and the process is repeated. The simulation begins with M monomers and continues until a single cluster is formed. This amounts to a random walk along the edges of the graph in Figure 1 that spans its entire range from

θ = 0

to

θ = 1 - 1 / M

. A trajectory from the top to the bottom of the graph consists of a sequence of M sampled distributions, one from each generation. By averaging trajectories we obtain the mean distribution in each generation, which may then be compared to the mean distribution predicted by the theory.

Figure 6 shows the evolution of the mean distribution obtained by MC simulation with the product kernel using

M = 200

. Up until the gel point is reached the state is a single sol phase. It is characterized by a population of clusters whose tail decays fast enough that its moments are finite. Above the gel point a gel peak emerges. It becomes more pronounced and moves to larger sizes as aggregation progresses. Past the gel point the sol distribution contracts and retraces its steps back to the monomeric state as

θ

increases. For example, the sol distribution at

θ = 0.9

is identical to that at

θ = 0.1

except that it carries less mass. In the Smoluchowski literature this is known as the Flory solution to gelation [5]. A competing solution by Stockmayer [13] predicts that the intensive distribution of the sol phase remains constant past the gel point except for the fact that its mass gradually decreases as it is transferred to the gel. As it turns out, Stockmayer solution implicitly assumes that

P (n)

is strictly a Gibbs distribution. In this case the sol-gel tie line is obtained by equating the temperatures of the two phases and the sol distribution is indeed found to be constant throughout the post gel region [20]. An analysis of the Stockmayer solution is beyond the scope of this work but a commentary is given in Reference [22].

7. Continuous Limit

We define the continuous limit by the conditions

(M, N) \to \infty, M / N ≫ 1 .

Thus in addition to the scaling limit we require the mean cluster size to be much larger than the unit mass, such that the cluster mass may be treated as a continuous variable, which we denote as x. Equations (63), (71) and (82) refer to this limit. We present the corresponding expressions for the partition function and the selection functional.

In the continuous domain all intensive properties of the ensemble are functions of the mean cluster size

\bar{x}

. Thus we write

β = β (\bar{x})

,

q = q (\bar{x})

,

w_{i}^{*} = w (x; \bar{x})

, and express the partition function in intensive form

log ω (\bar{x}) = (log Ω_{M, N}) / N

. The MPD is

f (x) = w (x; \bar{x}) \frac{e^{- x β (\bar{x})}}{q (\bar{x})}

(97)

and satisfies the normalizations

\int_{0}^{\infty} f (x) d x = 1, \int_{0}^{\infty} x f (x) d x = \bar{x} .

(98)

The log of the cluster function

w (x; \bar{x})

is the functional derivative of the selection functional at the MPD:

log w (x; \bar{x}) = \frac{δ log W [f]}{δ f}

(99)

and the notation

w (x; \bar{x})

indicates this function of x will generally depend on

\bar{x}

as well since the functional derivative of non linear functionals depend on the function on which the derivative is evaluated. Since the microcanonical probability peaks sharply about the MPD (we are assuming a stable single-phase state) all ensemble averages revert to averages over the continuous MPD. The ensemble average kernel is then equal to the mean kernel within the MPD

{〈K〉}_{M, N} \to \bar{K} (\bar{x}) = \int_{0}^{\infty} d x \int_{0}^{\infty} d y K (x, y) f (x) f (y) .

(100)

The log of the intensive partition function,

log ω (\bar{x}) = log Ω_{M, N} / N

, satisfies

log ω = β \bar{x} + log q,

(101)

with

β = \frac{d log ω}{d \bar{x}} .

(102)

These are the intensive forms of Equations (40) and (42), respectively. The partition function of aggregation is obtained from Equation (32) by expressing the summation over

log {〈K〉}_{g}

and an integral over

\bar{K} (\bar{x})

:

log ω = 1 + log \bar{x} + \bar{x} \int_{0}^{\bar{x}} log \bar{K} (y) \frac{d y}{y^{2}} .

(103)

The parameter

β

is obtained from Equation (102) and

log q

from (101):

\begin{matrix} β = \frac{1}{\bar{x}} + \frac{log \bar{K} (\bar{x})}{\bar{x}} + \int_{0}^{\bar{x}} log \bar{K} (y) \frac{d y}{y^{2}}, \end{matrix}

(104)

\begin{matrix} q = \frac{\bar{x}}{\bar{K} (\bar{x})} . \end{matrix}

(105)

From Equation (54) we obtain

\frac{1}{\bar{x}} \int_{0}^{\infty} d x \int_{0}^{x} f (x) \frac{w (x - y; \bar{x}) w (y; \bar{x})}{w (x; \bar{x})} K (x - y, y) = 1,

(106)

which expresses a condition on

w (x; \bar{x})

.

Equations (97), (103)–(106) provide an equivalent mathematical description of Smoluchowski aggregation in the continuous limit. These are accompanied by the variational condition

log ω \geq - \int_{0}^{\infty} p (x) log \frac{p (x)}{w (x; \bar{x})} d x,

(107)

which is the continuous form of Equation (48) and is satisfied by all distributions

p (x)

with mean

\bar{x}

. The equality defines the solution to the Smoluchowski process, the MPD,

f (x)

.

As a demonstration we apply these results to the constant kernel. The right-hand side of Equation (106) is zero and we obtain

W [f] = 1

at all times. It follows that

w (x; \bar{x}) = 1

. The parameters

β

and q are

\begin{matrix} β = 1 / \bar{x}, q = \bar{x}, \end{matrix}

(108)

and the MPD becomes

f (x) = \frac{e^{- x / \bar{x}}}{\bar{x}} .

(109)

This is the well-known solution of the constant kernel in the continuous domain. The partition function of the constant kernel is

log ω = 1 + log \bar{x},

(110)

and the inequality in Equation (107) becomes

1 + log \bar{x} \geq - \int p (x) log p (x) d x = H [p],

(111)

whose right-hand side is the Shannon entropy of distribution

p (x)

. For fixed

\bar{x}

it is maximized by the exponential distribution, whose entropy is

1 + log \bar{x}

: the inequality is indeed satisfied.

8. Summary

With the results obtained here we have made contact with several previous works in the literature. The mean distribution for the constant kernel in Equation (59) was given by Hendriks [26], who also obtained a recursion for the partition function similar to that in Equation (30). The combinatorial treatment of Hendriks has in fact several common elements to ours but is limited to the constant and sum kernels and lacks the thermodynamic element of this work. The recursion for the cluster weights in Equation (55) has appeared in various treatments of aggregation, both deterministic [2,27] and stochastic [15,17,26]. The mean distributions in the continuous limit for the mean and sum kernels and for the product kernel in the pre-gel region are well known results in the literature [2]. The instability of power-law kernels has been discussed by Ziff et al. [3] in the context of the Smoluchowski equation. These connections to prior literature serve to validate the theory presented here and demonstrate that the thermodynamic treatment provides a unified theory of aggregation that brings previously disconnected results under a single formalism, the Smoluchowski ensemble.

The Smoluchowski ensemble is a probability space of distributions that are feasible under the rules of binary aggregation. The structure of this space, that is, the connectivity of the graph in Figure 1, is solely determined by the condition that aggregation is a binary event; the probability measure over this space is determined by the rate expression prescribed by the aggregation model. In Smoluchowski aggregation the rate is directly proportional to the number of clusters that appear on the reactant side of the aggregation reaction and on the aggregation kernel. In the scaling limit the probability of distribution is sharply peaked around a single distribution of the ensemble, its most probable distribution (MPD). In this limit all ensemble averages reduce to averages of the MPD, a distribution that alone suffices to generate all properties of the ensemble. The Smoluchowski coagulation equation is the time evolution of the most probable distribution in the asymptotic limit.

The step that turns the Smoluchowski ensemble into a thermodynamic ensemble is Equation (18). It expresses the probability of distribution in terms of two special functionals, the multinomial coefficient

n!

and the selection functional

W (n)

. This formulation introduces the partition function

Ω_{M, N}

as the central property of the ensemble to which al other properties are connected. The thermodynamic calculus, summarized by the equations in Table 2, is a mathematical consequence of the variational condition that defines the most probable distribution in Equation (39) as the solution to the constrained maximization of the probability

P (n)

in Equation (18). The constraints are given by Equations (3) that fix the zeroth and first order moments of the distribution. These constraints define a microcanonical ensemble of distributions with fixed mean

\bar{x} = M / N

.

The MPD obtained by the method of constrained maximization is stable, provided that the partition function is concave in its independent variables. In extensive terms,

log Ω_{M, N}

must be concave in

M, N

; in intensive terms,

log ω (\bar{x})

must be concave in

\bar{x}

. The two conditions are equivalent and define the stability criterion of the MPD. As in regular thermodynamics, when the stability criterion is violated the system experiences phase splitting and exists a mixture of two phases—mathematically, as a linear combination of two independent MPD’s. In aggregation these phases are the sol phase, which is represented by the MPD in Equation (39) and the gel phase (giant component), which in the scaling limit is represented by a delta function at ∞. The splitting into a sol and gel phase is treated by the theory in a natural and rigorous manner.

Notably, entropy in this treatment plays no special role. The Shannon entropy of distribution is the log of the multinomial coefficient. In the scaling limit, entropy is a component of the partition function through Equation (36),

log Ω_{M, N} = H (n^{*}) + log W (n^{*}),

(112)

where

H (n^{*})

is the Shannon functional evaluated at the MPD. In the special case of the constant kernel

W (n^{*}) = 1

. In this case the partition function reduces to the Shannon entropy of the MPD,

log Ω_{M, N} = - N \sum_{i} \frac{n_{i}}{N} log \frac{n_{i}}{N}; (constant kernel),

(113)

and the variational condition reads,

H (n) \leq H (n^{*}) = Ω_{M, N}; (constant kernel) .

(114)

In this form we have recovered the inequality of the second law as stated in statistical thermodynamics: the entropy of the equilibrium distribution (MPD) is at maximum with respect to all feasible distributions, namely, all distributions with the same mean. As is well known this distribution is exponential. The constant kernel is special. With

W (n) = 1

the probability of distribution is proportional to

n!

; accordingly, all ordered sequences of N clusters with total mass M are equally probable. The ordered sequence of cluster masses in this case is analogous to microstate in statistical mechanics and the condition

W = 1

analogous to the postulate if equal a priori probabilities. In the general case the Shannon entropy and the log of the microcanonical partition function are not the same. The fundamental functional that is maximized is the microcanonical weight

n! W (n)

, whose log is

H (n) + log W (n) .

(115)

The selection functional incorporates the effect of the aggregation kernel and in this sense it the point of contact between thermodynamics and the mathematical model of the stochastic process that gives rise to the probability space of interest. In Smoluchowski aggregation the model is defined by the transition rate in Equation (10) and the corresponding governing equation for W is Equation (33).

The thermodynamic formalism developed here is not limited to aggregation. Two alternative derivations that make no reference to stochastic process that gives rise to the probability space have been given in References [20,28]. As long as

log W

is a homogeneous functional with degree 1, the thermodynamic relationships follow as a direct consequence of the maximization of the microcanonical probability in Equation (18) under the constraints in Equation (3). The details of aggregation enter through Equations (32) and (33) that give the partition function and selection functional in terms of the aggregation kernel. The approach may be generalized to other processes including growth by monomer addition and breakup. These will be treated elsewhere.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/1099-4300/22/10/1181/s1.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

Smoluchowski, M. Versuch einer mathematischen Theorie der Koagulationkinetic kolloider Loesungen. Z. Phys. Chem. 1917, 92, 129–168. [Google Scholar]
Leyvraz, F. Scaling theory and exactly solved models in the kinetics of irreversible aggregation. Phys. Rep. 2003, 383, 95–212. [Google Scholar] [CrossRef] [Green Version]
Ziff, R.M.; Hendriks, E.M.; Ernst, M.H. Critical Properties for Gelation: A Kinetic Approach. Phys. Rev. Lett. 1982, 49, 593–595. [Google Scholar] [CrossRef]
Hendriks, E.M.; Ernst, M.H.; Ziff, R.M. Coagulation Equations with Gelation. J. Stat. Phys. 1983, 31, 519–563. [Google Scholar] [CrossRef] [Green Version]
Ziff, R.M.; Stell, G. Kinetics of polymer gelation. J. Chem. Phys. 1980, 73, 3492–3499. [Google Scholar] [CrossRef]
Marcus, A. Stochastic Coalescence. Technometrics 1968, 10, 133–143. [Google Scholar] [CrossRef]
Lushnikov, A.A. Coagulation in finite systems. J. Colloid Interface Sci. 1978, 65, 276–285. [Google Scholar] [CrossRef]
Lushnikov, A.A. Exact kinetics of a coagulating system with the kernel K = 1. J. Phys. A Math. Theor. 2011, 44, 335001. [Google Scholar] [CrossRef]
Lushnikov, A.A. Exact kinetics of the sol-gel transition. Phys. Rev. E 2005, 71, 046129. [Google Scholar] [CrossRef]
Lushnikov, A.A. Exact particle mass spectrum in a gelling system. J. Phys. A Math. Gen. 2005, 38, L35. [Google Scholar] [CrossRef]
Lushnikov, A.A. From Sol to Gel Exactly. Phys. Rev. Lett. 2004, 93, 198302. [Google Scholar] [CrossRef] [PubMed]
Aldous, D.J. Deterministic and stochastic models for coalescence (aggregation and coagulation): A review of the mean-field theory for probabilists. Bernoulli 1999, 5, 3–48. [Google Scholar] [CrossRef]
Stockmayer, W.H. Theory of Molecular Size Distribution and Gel Formation in Branched-Chain Polymers. J. Chem. Phys. 1943, 11, 45–55. [Google Scholar] [CrossRef]
Spouge, J.L. Analytic solutions to Smoluchowski’s coagulation equation: A combinatorial interpretation. J. Phys. Math. Gen. 1985, 18, 3063. [Google Scholar] [CrossRef]
Spouge, J.L. Equilibrium polymer size distributions. Macromolecules 1983, 16, 121–127. [Google Scholar] [CrossRef]
Hendriks, E.M.; Spouge, J.L.; Eibl, M.; Schreckenberg, M. Exact solutions for random coagulation processes. Z. Phys. B Condens. Matter 1985, 58, 219–227. [Google Scholar] [CrossRef]
Spouge, J.L. The size distribution for the A_gRB_f-g Model of polymerization. J. Stat. Phys. 1983, 31, 363–378. [Google Scholar] [CrossRef]
Matsoukas, T. Statistical Thermodynamics of Irreversible Aggregation: The Sol-Gel Transition. Sci. Rep. 2015, 5, 8855. [Google Scholar] [CrossRef]
Matsoukas, T. Abrupt percolation in small equilibrated networks. Phys. Rev. E 2015, 91, 052105. [Google Scholar] [CrossRef]
Matsoukas, T. Statistical thermodynamics of clustered populations. Phys. Rev. E 2014, 90, 022113. [Google Scholar] [CrossRef] [Green Version]
Flory, P.J. Molecular Size Distribution in Three Dimensional Polymers. I. Gelation. J. Am. Chem. Soc. 1941, 63, 3083–3090. [Google Scholar] [CrossRef]
Matsoukas, T. Generalized Statistical Thermodynamics: Thermodynamics of Probability Distributions and Stochastic Processes; Springer International Publishing: Berlin/Heidelberg, Germany, 2019. [Google Scholar] [CrossRef]
Berestycki, N.; Pitman, J. Gibbs Distributions for Random Partitions Generated by a Fragmentation Process. J. Stat. Phys. 2007, 127, 381–418. [Google Scholar] [CrossRef] [Green Version]
Kelly, F.P. Reversibility and Stochastic Networks; Cambridge University Press: Cambridge, UK, 2011; (Reprint of the 1979 edition by Wiley). [Google Scholar]
Smith, M.; Matsoukas, T. Constant-number Monte Carlo simulation of population balances. Chem. Eng. Sci. 1998, 53, 1777–1786. [Google Scholar] [CrossRef]
Hendriks, E.M. Cluster size distributions in equilibrium. Z. Phys. B Condens. Matter 1984, 57, 307–314. [Google Scholar] [CrossRef]
Lushnikov, A.A. Evolution of coagulating systems. J. Colloid Interface Sci. 1973, 45, 549–556. [Google Scholar] [CrossRef]
Matsoukas, T. Thermodynamics Beyond Molecules: Statistical Thermodynamics of Probability Distributions. Entropy 2019, 21, 890. [Google Scholar] [CrossRef] [Green Version]

Figure 1. The aggregation graph for

M = 7

. Each layer contains all feasible distributions in that generation.

Figure 1. The aggregation graph for

M = 7

. Each layer contains all feasible distributions in that generation.

Figure 2. Approach to scaling limit for the sum kernel at

\bar{x} = 10

(θ = 0.9)

. The most probable distribution (MPD) is calculated from Equation (70) and the mean distribution from Equation (67) with

M = \bar{x} N

,

N = 20, 40, 80, 160

.

Figure 2. Approach to scaling limit for the sum kernel at

\bar{x} = 10

(θ = 0.9)

. The most probable distribution (MPD) is calculated from Equation (70) and the mean distribution from Equation (67) with

M = \bar{x} N

,

N = 20, 40, 80, 160

.

Figure 3. Approach to the scaling limit for the product kernel with (a)

\bar{x} = 1.75

and (b)

\bar{x} = 4

. The MPD is calculated from Equation (71) and the mean distribution (dashed lines) from Equation (78). The distributions for

\bar{x} = 4

are not stable.

Figure 3. Approach to the scaling limit for the product kernel with (a)

\bar{x} = 1.75

and (b)

\bar{x} = 4

. The MPD is calculated from Equation (71) and the mean distribution (dashed lines) from Equation (78). The distributions for

\bar{x} = 4

are not stable.

Figure 4. Phase diagram of power-law kernels: In the shaded region the system is stable and is represented by its MPD. The unshaded region is unstable and the system is split into two phases, a sol phase and a gel phase, each represented by its own MPD. (a,b) provide equivalent criteria of stability.

Figure 5. (a) Gel fraction and (b) mean sol cluster size as a function of the progress variable

θ

. Past the gel point the mean size in the sol retraces its pre-gel history back to its initial size

{\bar{x}}_{sol} = 1

. The dashed lines are Monte Carlo (MC) simulations with

M = 200

particles.

Figure 5. (a) Gel fraction and (b) mean sol cluster size as a function of the progress variable

θ

. Past the gel point the mean size in the sol retraces its pre-gel history back to its initial size

{\bar{x}}_{sol} = 1

. The dashed lines are Monte Carlo (MC) simulations with

M = 200

particles.

Figure 6. Monte Carlo snapshots of the mean distribution of the product kernel with

M = 200

particles (open circles are MC results, solid lines are calculated from theory). The gel phase emerges at

θ^{*} = 0.5

and moves towards ever larger sizes (arrows mark the theoretical predictions). The distribution of the sol grows in the pre-gel region range

0 < θ < 0.5

but contracts once past the post-gel point (

θ > 0.5

).

Figure 6. Monte Carlo snapshots of the mean distribution of the product kernel with

M = 200

particles (open circles are MC results, solid lines are calculated from theory). The gel phase emerges at

θ^{*} = 0.5

and moves towards ever larger sizes (arrows mark the theoretical predictions). The distribution of the sol grows in the pre-gel region range

0 < θ < 0.5

but contracts once past the post-gel point (

θ > 0.5

).

Table 1. Selected aggregation kernels.

Brownian coagulation	$K_{i, j} = \frac{1}{4} (2 + {(\frac{i}{j})}^{1 / 3} + {(\frac{j}{i})}^{1 / 3})$
Constant kernel	$K_{i, j} = 1$
Flory/Stockmayer kernel	$K_{i, j} = \frac{(f i - 2 i + 2) (f j - 2 j + 2)}{f^{2}}$
Product kernel	$K_{i, j} = i j$
Sum kernel	$K_{i, j} = \frac{i + j}{2}$

Table 2. Summary of thermodynamic relationships.

Most Probable Distribution	$\frac{n_{k}^{}}{N} = w_{k}^{} \frac{e^{- β k}}{q}$	Equation (39)
Partition Function	$Ω_{M, N} = β M + (log q) N$	Equation (40)
	$β = {(\frac{\partial log Ω}{\partial M})}_{N}$	Equation (42)
	$log q = {(\frac{\partial log Ω}{\partial M})}_{M}$	Equation (43)
Gibbs-Duhem Equation	$M d β + N d log q = 0$	Equation (44)
Variational Condition(Second Law)	$\frac{log Ω_{M, N}}{N}$ $\geq - \sum_{i} p_{i} log \frac{p_{i}}{w_{i}^{*}}$	Equation (48)

Table 3. Summary of Constant, Sum and Product Kernel; in all cases

θ = 1 - 1 / \bar{x}

.

Table 3. Summary of Constant, Sum and Product Kernel; in all cases

θ = 1 - 1 / \bar{x}

.

	Constant Kernel	Sum Kernel	Product Kernel $^{†}$
$K_{i, j}$	1	$(i + j) / 2$	$i j$
$Ω$	$(\binom{M - 1}{N - 1})$	$N! \frac{M^{M - N}}{M!} (\binom{M - 1}{N - 1})$	${(N! \frac{M^{M - N}}{M!})}^{2} (\binom{M - 1}{N - 1})$
$β$	$- log θ$	$θ - log θ$	$2 θ - log θ$
q	$\frac{θ}{1 - θ}$	$θ$	$θ (1 - θ)$
$w_{k}$	1	$\frac{k^{k - 1}}{k!}$	$\frac{2^{k - 1} k^{k - 2}}{k!}$
MPD	$(1 - θ) θ^{k - 1}$	$\frac{k^{k - 1}}{k!} θ^{k - 1} e^{k θ}$	$\frac{{(2 θ k)}^{k - 2}}{k!} \frac{2 θ}{1 - θ} e^{- 2 θ k}$

^{†}

Valid only for

θ \leq 1 / 2

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Matsoukas, T. The Smoluchowski Ensemble—Statistical Mechanics of Aggregation. Entropy 2020, 22, 1181. https://0-doi-org.brum.beds.ac.uk/10.3390/e22101181

AMA Style

Matsoukas T. The Smoluchowski Ensemble—Statistical Mechanics of Aggregation. Entropy. 2020; 22(10):1181. https://0-doi-org.brum.beds.ac.uk/10.3390/e22101181

Chicago/Turabian Style

Matsoukas, Themis. 2020. "The Smoluchowski Ensemble—Statistical Mechanics of Aggregation" Entropy 22, no. 10: 1181. https://0-doi-org.brum.beds.ac.uk/10.3390/e22101181

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Smoluchowski Ensemble—Statistical Mechanics of Aggregation

Abstract

1. Introduction

2. The Smoluchowski Ensemble

2.1. Kinetics

2.2. Probabilities

2.3. Smoluchowski Equation

3. Thermodynamic Formalism

3.1. Partition Function and Selection Functional

3.2. Shannon Entropy

3.3. The Selection Functional

3.4. Propagation Equations

4. Scaling Limit

4.1. Most Probable Distribution

4.2. Thermodynamics

5. Gibbs Distributions

5.1. Constant Kernel

5.2. Sum Kernel

5.3. Quasi-Gibbs Kernels—The Product Kernel

6. Phase Behavior

6.1. Stability

6.2. Phase Splitting—The Sol-Gel Transition

6.3. Monte Carlo Simulations

7. Continuous Limit

8. Summary

Supplementary Materials

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI