A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis

Claus, Tamme; Bünger, Jonas; Torrilhon, Manuel

doi:10.3390/mca26030051

Open AccessFeature PaperArticle

A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis

by

Tamme Claus

^*

,

Jonas Bünger

and

Manuel Torrilhon

Research Lab for Applied and Computational Mathematics (ACoM), Department of Mathematics, RWTH Aachen University, Schinkelstr. 2, 52062 Aachen, Germany

^*

Author to whom correspondence should be addressed.

Math. Comput. Appl. 2021, 26(3), 51; https://0-doi-org.brum.beds.ac.uk/10.3390/mca26030051

Submission received: 17 May 2021 / Revised: 4 July 2021 / Accepted: 9 July 2021 / Published: 14 July 2021

Download

Browse Figures

Versions Notes

Abstract

:

The spatial resolution of electron probe microanalysis (EPMA), a non-destructive method to determine the chemical composition of materials, is currently restricted to a pixel size larger than the volume of interaction between beam electrons and the material, as a result of limitations on the underlying k-ratio model. Using more sophisticated models to predict k-ratios while solving the inverse problem of reconstruction offers a possibility to increase the spatial resolution. Here, a k-ratio model based on the deterministic M1-model in Boltzmann Continuous Slowing-Down approximation (BCSD) will be utilized to present a reconstruction method for EPMA which is implemented as a PDE-constrained optimization problem. Iterative gradient-based optimization techniques are used in combination with the adjoint state method to calculate the gradient in order to solve the optimization problem efficiently. The accuracy of the spatial resolution still depends on the number and quality of the measured data, but in contrast to conventional reconstruction methods, an overlapping of the interaction volumes of different measurements is permissible without ambiguous solutions. The combination of k-ratios measured with various electron beam configurations is necessary for a high resolution. Attempts to reconstruct materials with synthetic data show challenges that occur with small reconstruction pixels, but also indicate the potential to improve the spatial resolution in EPMA using the presented method.

Keywords:

electron probe microanalysis; spatial resolution; material reconstruction; material imaging; m1-model; adjoint state method; inverse problem; k-ratio

1. Increasing the Spatial Resolution in EPMA

Electron probe microanalysis (EPMA) [1,2] is a non-destructive method to determine the chemical composition of a material sample that is applied in various fields, for example geophysics, material science, or engineering. The composition is not measured directly, but reconstructed from the intensity of X-rays, which are produced by exciting a material sample with a focused beam of electrons. Atoms inside the sample are ionized by collision with the highly energetic beam electrons and subsequently relax from the excited state by emitting characteristic X-ray radiation. A part of the radiation is absorbed, the other part leaves the sample and is measured by a detector. The characteristic radiation is quantified in the form of k-ratios

k = I / I^{std}

, where the measured radiation intensity I is normalized by standard intensities

I^{std}

measured for a known reference sample. A sketch of the physical processes is shown in Figure 1.

We characterize the chemical composition of a material by its mass fraction field

c (x)

. Due to the statistical nature of the k-ratio measurements, the mass fractions

c (x)

, as the solution to the inverse problem of reconstruction, are also probabilistic and all information gained by the experiment is encoded in the statistical moments of

c (x)

[3]. However, the probabilistic inversion is challenging; thus, this article deals with the minimization of the squared error of k-ratios obtained from an experiment

k^{\exp}

and a forward model

k (c (x))

. The result can be interpreted as a maximum likelihood estimate under the assumption that the noise of the experimental data is independent and identically Gaussian distributed.

A parametrization

p

that restricts the unknown

c (x; p)

to finite dimensions states a regularization of the generally ill-posed inverse problem. The minimization can then be written as

p^{*} = \underset{p}{arg min} {∥k (c (x; p)) - k^{\exp}∥}_{2}^{2} .

(1)

Currently, forward models typically used in reconstruction (ZAF,

ϕ (ρ z)

) [2,4,5,6,7] assume either a fully homogeneous or a layered structure interaction volume (the part of the sample in which beam electrons excite atoms). The ZAF model uses correction factors for the atomic number

Z (c)

, the absorption

A (c)

, and the fluorescence

F (c)

and is implemented as a fixed-point iteration to reconstruct homogeneous mass fractions from k-ratios obtained from a single beam configuration. This idea is extended to layered samples using a correction model based on the depth distribution of X-ray generation

ϕ (ρ z)

. The material is scanned by the electron beam, ensuring that their interaction volumes do not overlap, and each interaction volume is reconstructed independently. In this procedure, the forward models limit the size of the reconstruction pixels to a size that corresponds approximately to the size of the interaction volume.

Previous analysis [8,9,10] of the analytical spatial resolution highlight the need to resolve small structures inside the sample, e.g., the inclusion of spherical droplets in lunar rock which forms due to exposure to energetic particles or small crystals of magnetite embedded in glass. In [8,9,10], the analytical spatial resolution of EPMA is defined as the size of the interaction volume and approaches which physically decrease this volume are described. In contrast, we propose to decouple the spatial resolution and the size of the interaction volume by employing a more sophisticated reconstruction method that harnesses k-ratio measurements from multiple experiments with overlapping interaction volumes.

To analyze the size of the interaction volume of inhomogeneous structures, Monte Carlo models [11,12,13,14,15] are used. Additionally, Monte Carlo models can be used to implement a k-ratio model that allows inhomogeneous material structures by first estimating the electron transport on the basis of Monte Carlo sampling and then approximating the measured X-ray intensity. However, Monte Carlo models suffer from statistical noise which restricts the use of gradient based optimization techniques to solve the inverse problem [16].

In this work, we use a deterministic k-ratio model that bases on the

M_{1} - model

[16,17,18,19,20] in Boltzmann Continuous Slowing-Down approximation (BCSD) and is able to deal with inhomogeneous interaction volumes. The model has previously been validated against Monte Carlo models [16]. Our model additionally allows a fast and precise gradient computation via the adjoint state method, hence efficient gradient based optimization techniques can be applied. The combination of a deterministic model and gradient based optimization methods allows the implementation of a reconstruction method with a higher spatial resolution. Instead of prohibiting overlapping interaction volumes, this method merges sequential k-ratio measurements with multiple beam configurations to accomplish a higher resolution with a pixel size smaller than the size of the interaction volume.

The source code accompanying this article is made available via GitHub: https://github.com/tam724/m1epma (accessed on 9 July 2021) [21].

2. Deterministic k-Ratio Model

2.1. Parameterized Material Description

A sample occupying a domain

Ω \subset R^{3}

, which consists of

n_{e}

chemical elements, is characterized by the mass fractions of all elements

c (x; p) : Ω \times P \to {[0, 1]}^{n_{e}}

at every point

x \in Ω

. The mass fraction

c_{i} = \frac{m_{i}}{m_{tot}}

(2)

describes the ratio of partial

m_{i}

to total mass

m_{tot}

in a (infinitesimal small) volume

d x

around

x

for one element

i = 1, \dots, n_{e}

and is therefore constrained by

0 \leq c (x; p) \leq 1

and

\sum_{i = 1}^{n_{e}} c_{i} (x; p) = 1

. The choice of a global set of parameters

p \in P \subseteq R^{n_{p}}

discretizes the mass fractions and limits the unknown to finite dimensions.

In this work, the mass density

ρ (x)

is modeled by a linear combination of the mass fractions and the density of the pure elements. Such a model neglects the influence of different molecular structures and assumes that only mass fractions sufficiently describe the material.

2.2. k-Ratios as a Function of the Electron Number Density

The measured quantities in EPMA, the radiation intensities of X-rays with wavelengths characteristic to elements occurring in the material, are considered in terms of k-ratios

k = I / I^{std}

. The measured intensity I gets normalized by the intensity

I^{std}

measured for a known reference sample, to account for experimental inaccuracies, e.g., the detector efficiency. Furthermore, an experiment is conducted with

n_{s}

setups (here, we consider different beam energies and beam positions) yielding different measurements. An enumeration of all

n_{k}

k-ratios allows for collecting all measured k-ratios into

k^{\exp} \in R^{n_{k}}

.

We model a k-ratio

k_{j}

by

k_{j} = \frac{1}{I_{j}^{std}} \int_{Ω} e^{- \int_{d (x)} μ_{j} (c (y; p)) d y} N_{i}^{V} (c (x; p)) \int_{ϵ_{0}}^{ϵ_{1}} σ_{j}^{emiss} (ϵ) ψ^{0} (x, ϵ) d ϵ d x,

(3)

representing the integral of the zeroth moment of the electron fluence

ψ^{0} (x, ϵ)

over the spatial domain

x \in Ω

and all considered energies

ϵ \in [ϵ_{0}, ϵ_{1}]

, weighted by attenuation and emission/fluorescence factors. The presence of element i at a position

x

is taken into account by the number density of atoms per unit volume

N_{i}^{V} = c_{i} (x) ρ (x) / A_{i}

, with the atomic mass

A_{i}

. The ionization cross-section

σ_{i}^{emiss}

, which combines fluorescence yield and ionization cross-section, describes the fraction of electron-atom collisions leading to X-ray generation. X-ray attenuation is approximated by the Beer–Lambert law with the linear attenuation coefficient being averaged over paths

d (x)

from each point

x

to the detector. The attenuation coefficient of a compound affecting an X-ray j is given by

μ_{j} (c (y; p)) = ρ (y) \sum_{m = 1}^{n_{e}} c_{m} (y; p) {(μ / ρ)}_{m, j}

. Emission cross-section and mass attenuation coefficients

(μ / ρ)

are interpolated from tabulated data; for details, see [22,23,24,25]. Note that the integrals in Equation (3) model the intensity

I_{j}

; hence, the standard intensities

I_{j}^{std}

can also be calculated this way by inserting the respective mass fractions.

2.3. M1-Model of Electron Transport

To model

ψ^{0} (x, ϵ)

, we use the

M_{1} - model

[16,17,18,19,20] in BCSD, which describes the first two moments of the electron fluence inside the material. They are defined as

ψ^{(0)} (x, ϵ) = \int_{S^{2}} ∥v (ϵ, Θ)∥ n (x, v) d Θ and ψ^{(1)} (x, ϵ) = \int_{S^{2}} Θ ∥v (ϵ, Θ)∥ n (x, v) d Θ,

(4)

using the velocity

v (ϵ, Θ)

of electrons (with magnitude defined by the electron energy

ϵ

and direction

Θ

) and the stationary number density of electrons

n (x, v)

. Based on the Boltzmann equation in Continuous Slowing-Down approximation, the

M_{1} - model

belongs to a class of models

M_{N}

which are derived from closing the system of moments by minimization of the Boltzmann entropy. With a closure after the first moment,

M_{1}

, the system takes the form of a nonlinear hyperbolic partial differential equation (PDE) in space and energy, given by

G (ψ, p) : = - \partial_{ϵ} (S (\begin{matrix} ψ^{(0)} \\ ψ^{(1)} \end{matrix})) + \nabla \cdot \underset{= F (ψ) = (F_{1} (ψ), F_{2} (ψ), F_{3} (ψ))}{\underset{︸}{(\begin{matrix} {(ψ^{(1)})}^{T} \\ ψ^{(0)} (\frac{1 - χ}{2} I_{3} + \frac{3 χ - 1}{2} \frac{α}{∥α∥} \otimes \frac{α}{∥α∥}) \end{matrix})}} - T (\begin{matrix} ψ^{(0)} \\ ψ^{(1)} \end{matrix}) = 0 .

(5)

Thereby,

α = ψ^{(1)} (x, ϵ) / ψ^{(0)} (x, ϵ)

is the anisotropy parameter and

χ (∥α∥)

is the Eddington factor, which implicitly depends on

α

and is approximated by a rational function. The identity matrix is denoted by

I_{3} \in R^{3 \times 3}

. Stopping power S and transport coefficient T for a compound are obtained from the mass fractions

c

, the density

ρ

, and the respective coefficients of pure elements

S_{i}

and

T_{i}

:

S (ϵ, x) = ρ (x) \sum_{i = 1}^{n_{e}} c_{i} (x; p) S_{i} (ϵ) T (ϵ, x) = (\begin{matrix} 0 & 0 \\ 0 & ρ (x) I_{3} \sum_{i = 1}^{n_{e}} c_{i} (x; p) T_{i} (ϵ) \end{matrix})

(6)

The analytic forms of

S_{i}

,

T_{i}

and the function that approximates

χ (| | α | |)

as well as the Jacobian of the flux function

F (ψ)

are given in Appendix A. For a detailed description of the

M_{1} - model

, we refer to [16,17,18,19,20]. A solution

ψ = {(ψ^{(0)}, ψ^{(1)})}^{T} : Ω \times [ϵ_{0}, ϵ_{1}] \to R^{1 + 3}

of the

M_{1} - model

depends on the material parameters

p

because stopping power and transport coefficient depend on the mass fractions which are parameterized by

p

.

Boundary Conditions

Initial

ϵ_{1}

and cutoff energy

ϵ_{0}

of the computational domain

Ω \times [ϵ_{0}, ϵ_{1}]

are chosen such that the beam is sufficiently captured

ϵ_{b e a m} < ϵ_{1}

and all minimal ionization energies are considered

ϵ_{0} < {min}_{i = 1 \dots n_{e}} ϵ_{e d g e, i}

. The edge energy

ϵ_{e d g e, i}

of an element i is the lowest energy at which ionization still occurs. Initially, no beam electrons are present inside the domain

Ω

, hence

ψ (x, ϵ = ϵ_{1}) = 0 .

(7)

The spatial boundary

\partial Ω

consists of one surface that corresponds to the samples surface

S_{impact}

where the electron beam impacts. The other boundaries lie inside the sample and are assumed to be sufficiently far from the interaction volume so as not to influence the solution. A schematic of the 2D setup is provided by Figure 2. The electron beam is modeled as Gaussian in space and energy and Dirac in angle, yielding the moments

ψ_{b e a m} (x, ϵ) = (\begin{matrix} 1 \\ D_{b e a m} \end{matrix}) φ (ϵ | ϵ_{b e a m}, σ_{b e a m}) φ (x | x_{b e a m}, Σ_{b e a m}),

(8)

where

D_{b e a m} \in R^{3}

is a unit vector describing the direction of the beam,

φ

the normal distribution, given the respective means,

ϵ_{b e a m}

and

x_{b e a m}

, and variances,

σ_{ϵ}

and

Σ_{b e a m}

. This leads to the following spatial boundary condition:

\begin{matrix} ψ (x \in \partial Ω, ϵ) = \{\begin{matrix} ψ_{b e a m} (x, ϵ) & x \in S_{impact} \\ 0 & otherwise \end{matrix} \end{matrix}

(9)

which will in the following be referred to as

B_{k} (ψ (x, ϵ)) = 0

for multiple beam configurations

k = 1, \dots n_{s}

.

For the implementation of the

M_{1} - model

with its boundary conditions using the finite volume library Clawpack [26,27,28], we refer to Section 4.

3. Inverse Problem of Material Reconstruction

3.1. PDE-Constrained Optimization Problem

Given measured k-ratios

k^{\exp}

, the reconstruction of material parameters can now be written as a PDE-constrained optimization problem. Incorporating the k-ratio model, consisting of the integral equation (Equation (3)) and the

M_{1} - model

G (Equation (5)) with boundary conditions

B_{k}

, the inverse problem of reconstruction is:

\begin{matrix} p^{*} = \underset{p}{arg min} \underset{: = H (p)}{\underset{︸}{{∥k (ψ_{1}, \dots, ψ_{n_{s}}, p) - k^{\exp}∥}_{2}^{2}}} \\ \begin{matrix} s . t . & G (ψ_{k}, p) = 0 & x \in Ω, ϵ \in [ϵ_{0}, ϵ_{1}] \\ ψ_{k} (x, ϵ) = 0 & ϵ = ϵ_{1} \\ B_{k} (ψ_{k}) = 0 & x \in \partial Ω \\ k = 1, \dots, n_{s} . \end{matrix} \end{matrix}

(10)

This reads as follows. Find the parameters

p^{*}

such that the objective function

H (p)

, the least-squares error of the residuals between modeled and measured k-ratios, is minimal. Later, we will refer to the residuals as

h : = k - k^{\exp}

. Here, the moments of the electron fluence

ψ_{k}

, which are used to determine the k-ratios

k

, are subject to the implicit constraint to fulfill the

M_{1} - model

G (ψ_{k}, p) = 0

with its boundary conditions

B_{k} (ψ_{k}) = 0

. To obtain a set of k-ratios

k

, a solution of the PDE-constaints

G_{k}

is calculated and integrated according to Equation (3).

Note the direct influence of the mass fraction parameters

p

on the modeled k-ratios

k (ψ_{1}, \dots, ψ_{n_{s}}, p)

via Equation (3) and the indirect influence via the moments of the electron fluence

ψ_{k}

. Therefore, a perturbation in

p

will not only change the objective function H due to their direct dependency, but also due to the implied perturbation in

ψ_{k}

.

3.2. Iterative Gradient Based Optimization

To find a solution to the inverse problem, an iterative optimization method of the form

p_{k + 1} = p_{k} + Δ p_{k}

starting with an initial guess

p_{0}

is used. Many optimization methods calculate the update

Δ p

based on the gradient of the objective function

H (p)

and possibly also higher order derivatives—for example, using the negative gradient of the objective in combination with a line search method to find the step length yields the method of steepest descent [29]. The Levenberg–Marquardt [29] algorithm additionally exploits the least-squares form of the objective function and can be interpreted as a combination of a gradient descent approach on the objective function H and a Gauss–Newton method acting on the residuals

h

. In contrast to the method of steepest descent, which only requires the evaluation of the objective function H and its gradient, the Levenberg–Marquardt method additionally needs the computation of the Jacobian of the residuals

J_{p} h (p)

with respect to

p

. In [29,30], these two and several other methods of a similar form are extensively described, with most methods sharing the need to calculate derivatives. Note that such methods only converge to a local optimum and do not guarantee a global minimization.

For the PDE-constrained problem (10), the cost of such methods is dominated by the cost of evaluating the PDE-constraint as part of the objective function and the cost of evaluating its derivatives. It is advantageous to choose an optimization method which requires a low number of evaluations of the objective function and its derivatives, but the efficiency of each evaluation still remains crucial. A naive approach to determine the derivative is a finite difference approximation, which, applied to our problem, requires at least

n_{p} + 1

evaluations of the objective function. Each evaluation of the objective function includes solving

n_{s}

computationally expensive PDEs. Given many parameters

n_{p}

or experimental setups

n_{s}

, which are required for high resolution, this approach becomes computationally too expensive. Furthermore, the choice of the spacing for the finite difference approximation is non-trivial. Another option to determine derivatives is the implementation of additional sensitivity equations [31] besides the PDE-constraint. However, the cost also scales with the number of parameters

n_{p}

in this case, which inhibits their application for our problem.

3.3. The Adjoint State Method

The adjoint state method provides an alternative to calculate the gradient. Details on the method can be found in [31,32,33], while here we give the general steps that will later be needed to apply it for problem (10).

The Levenberg–Marquardt method requires the Jacobian of the residuals

h

, while, for the steepest descent, the gradient of the objective H is needed. Covering both cases, we consider the gradient of a scalar valued functional

\nabla_{p} h (ψ, p)

with respect to

p

. The functional h could be one component of the residual

h

, to gradually calculate the Jacobian for the Levenberg–Marquardt method, or the objective H for the gradient used in steepest descent. Without loss of generality, but, for simpler notation, the discussion here is limited to only one PDE constraint where we consider the state variable

ψ

, which is implicitly related to the parameters

p

by

G (ψ, p) = 0

(omitting the index k for the beam configurations).

The introduction of a Lagrange multiplier, the adjoint state variable

λ

, allows for stating the Lagrange function of the optimization problem:

L (ψ, p, λ) = h (ψ, p) + {〈 λ, G (ψ, p) 〉}_{⋆} .

(11)

Here,

{〈 ψ, ϕ 〉}_{⋆} = \int_{ϵ_{0}}^{ϵ_{1}} \int_{Ω} ψ {(x, ϵ)}^{T} ϕ (x, ϵ) d x d ϵ

denotes the scalar product in function spaces,

〈 \cdot, \cdot 〉

is the scalar product in

R^{n}

. For

G (ψ, p) = 0

, one recognizes the equality of

L

and h, which implies the equality of their gradients

\nabla_{p} L = \nabla_{p} h

.

Consider the directional derivative

\frac{d L}{d q} = 〈 \nabla_{p} L, q 〉

of the Lagrangian

L

in a direction

q

. Using

ψ_{q}

, the direction of change of the state variable corresponding to a small perturbation of the parameters, such that

G (ψ + τ ψ_{q}, p + τ q) = O (τ^{2})

, the directional derivative can be written as

\frac{d}{d q} L = {〈 \frac{δ h (ψ, p)}{δ ψ}, ψ_{q} 〉}_{⋆} + 〈 \frac{δ h (ψ, p)}{δ p}, q 〉 + {〈 λ, [\frac{δ G (ψ, p)}{δ ψ}] (ψ_{q}) + [\frac{δ G (ψ, p)}{δ p}] (q) 〉}_{⋆} .

(12)

Here,

δ h / δ ψ

denotes the functional derivative,

δ h / δ p

the vector of partial derivatives with respect to

p

, not including the variation due to

ψ_{p}

, and

[δ G / δ \cdot]

the corresponding Fréchet derivative operators. Multiple calculations of Equation (12), while iterating over the unit vectors as direction

q

, accumulate to the gradient

\nabla_{p} h

, but would include multiple expensive calculations of the direction

ψ_{q}

. To avoid this calculation, Equation (12) is rearranged to

\frac{d}{d q} L = {〈 \frac{δ h (ψ, p)}{δ ψ} + {[\frac{δ G (ψ, p)}{δ ψ}]}^{*} (λ), ψ_{q} 〉}_{⋆} + 〈 \frac{δ h (ψ, p)}{δ p} + \frac{δ {〈 λ, G (ψ, p) 〉}_{⋆}}{δ p}, q 〉,

(13)

where

{[δ G / δ ψ]}^{*}

is the adjoint operator of the Fréchet derivative

[δ G / δ ψ]

. The crux of the adjoint state method is to force

λ

to satisfy the adjoint state equation

\frac{δ h (ψ, p)}{δ ψ} + {[\frac{δ G (ψ, p)}{δ ψ}]}^{*} (λ) \overset{!}{=} 0,

(14)

such that the first scalar product in Equation (13) including

ψ_{q}

vanishes and only the second scalar product remains as the directional derivative of

L

. Then, the gradient

\nabla_{p} h

is identified as

\nabla_{p} h = \frac{δ}{δ p} (h (ψ, p) + {〈 λ, G (ψ, p) 〉}_{⋆}) .

(15)

To calculate the full gradient, one evaluation of the adjoint state Equation (14) is sufficient because it does not depend on the direction

q

. Its structure is usually similar to the structure of the forward equation

G (ψ, p) = 0

but depends on the specific form of the adjoint operator. Being an essential component of the method, the adjoint operator for our problem (Equation (10)) will now be derived under consideration of the form of the PDE-constraint G (Equation (5)).

3.4. Adjoint State Method for the M1-Model Constraint

The adjoint state method bases on the identity

{〈 λ, [\frac{δ G (ψ, p)}{δ ψ}] (ψ_{q}) 〉}_{⋆} = {〈 {[\frac{δ G (ψ, p)}{δ ψ}]}^{*} (λ), ψ_{q} 〉}_{⋆} .

(16)

The adjoint operator

{[δ G / δ ψ]}^{*}

allows for isolating the direction

ψ_{q}

in the scalar product of Equation (13), which subsequently enables stating the adjoint state Equation (14). To derive the adjoint state equation for the

M_{1} - model

, this identity must be considered, taking G from Equation (5) into account. The Fréchet derivative of G can be found using the Jacobians of the flux functions

J_{ψ} F_{d} (ψ) : = A_{d} (ψ)

. Inserting into Equation (16) and proceeding using integration by parts yields

\begin{matrix} {〈 λ, [\frac{δ G (ψ, p)}{δ ψ}] (ψ_{q}) 〉}_{⋆} = {〈 λ, - \partial_{ϵ} (S ψ_{q}) + \sum_{d = 1}^{3} \partial_{x_{d}} (A_{d} (ψ) ψ_{q}) - T ψ_{q} 〉}_{⋆} \\ = \int_{ϵ_{0}}^{ϵ_{1}} \int_{Ω} λ^{T} (- \partial_{ϵ} (S ψ_{q}) + \sum_{d = 1}^{3} \partial_{x_{d}} (A_{d} (ψ) ψ_{q}) - T ψ_{q}) d x d ϵ \\ = \int_{ϵ_{0}}^{ϵ_{1}} \int_{Ω} {(S \partial_{ϵ} λ - \sum_{d = 1}^{3} A_{d}^{T} (ψ) \partial_{x_{d}} λ - T λ)}^{T} ψ_{q} d x d ϵ \\ + \underset{\overset{!}{=} 0}{\underset{︸}{\int_{Ω} {[- S λ^{T} ψ_{q}]}_{ϵ_{0}}^{ϵ_{1}} d x + \int_{ϵ_{0}}^{ϵ_{1}} \int_{\partial Ω} (\begin{matrix} λ^{T} A_{x_{0}} (ψ) ψ_{q} \\ ⋮ \\ λ^{T} A_{x_{D}} (ψ) ψ_{q} \end{matrix}) \cdot n d s d ϵ}} \\ = {〈 S \partial_{ϵ} λ - \sum_{d = 1}^{3} A_{d}^{T} (ψ) \partial_{x_{d}} λ - T λ, ψ_{q} 〉}_{⋆} \\ = {〈 {[\frac{δ G (ψ, p)}{δ ψ}]}^{*} (λ), ψ_{q} 〉}_{⋆} . \end{matrix}

(17)

The boundary integrals, which are enforced to be zero, yield boundary conditions for the adjoint state variable

λ

. At energy

ϵ_{1}

, the state variable

ψ

is specified by the boundary condition of the

M_{1} - model

which does not depend on the parameters

p

; therefore,

ψ_{q} (ϵ_{1}) = 0

. At energy

ϵ_{0}

, we have to prescribe the adjoint state variable

λ (ϵ_{0}) = 0

. In addition, the boundary term for the spatial boundary

\partial Ω

with outer normal vector

n

has to vanish, which yields boundary values for

λ

on

\partial Ω

. Their numerical implementation is discussed along with the boundary conditions for the forward model in Section 4.

Then, the adjoint state equation to be solved for the adjoint state variable

λ

is given by

\frac{δ h (ψ, p)}{δ ψ} + S \partial_{ϵ} λ - \sum_{d = 1}^{D} A_{d}^{T} (ψ) \partial_{x_{d}} λ - T λ = 0 .

(18)

To solve the adjoint equation, the solution

ψ

of the forward equation

G (ψ, p) = 0

has to be known, since

A_{d}^{T} (ψ)

and

δ h (ψ, p) / δ ψ

depend on it.

The term

δ h / δ ψ

acts as a source to the adjoint state variable and depends on the choice of the functional h to be differentiated. For the Levenberg–Marquardt algorithm, the calculation of the gradient of each residual is required, thus we choose

h = h_{j} = k_{j} (ψ, p) - k_{j}^{\exp}

. Since in Equation (3) the first component of the state variable

ψ^{(0)}

enters the integral linearly, the functional derivative of one residuum

h_{j}

is given by

\frac{δ h_{j}}{δ ψ} = (\begin{matrix} ξ_{j} (x, ϵ) \\ 0^{3 \times 1} \end{matrix}), where ξ_{j} (x, ϵ) = e^{- \int_{d} μ_{j} d y} N_{i}^{V} σ_{j}^{emiss} .

(19)

Here,

ξ_{j}

collects all weighting factors of the integral in Equation (3).

To implement the steepest descent method, the gradient of the objective H is required; therefore,

h = H = {∥h∥}_{2}^{2}

and, by a similar argument as above, its functional derivative is

\frac{δ H}{δ ψ} = (\begin{matrix} 2 \sum_{j = 1}^{n_{k}} h_{j} ξ_{j} (x, ϵ) \\ 0^{3 \times 1} \end{matrix}) .

(20)

Having the adjoint equation determined by the adjoint operator and the respective source for the adjoint state variable, the calculation of the gradient via the adjoint state method can be summarized by

the solution of the forward equation $G (ψ, p) = 0$ for $ψ$ ,
the solution of the adjoint Equation (18) for $λ$ and
the calculation of the gradient $\nabla_{p} h (ψ, p) = \frac{δ}{δ p} (h (ψ, p) + 〈 λ, G (ψ, p) 〉)$

In comparison to a finite difference approximation with

n_{p} + 1

solutions of a PDE, the adjoint state method only requires the solving of the forward (first step) and the adjoint (second step) equation to calculate the gradient. The calculation of the gradient (third step) can usually be implemented efficiently. Consequently, the adjoint state method offers an efficient way to determine the gradient to be used in an iterative optimization method for our problem (10).

4. Numerical Implementation

4.1. Material Parameterization

The subdivision of the material into reconstruction pixels (see Figure 1) motivates the discretization of the mass fractions as piecewise constant functions. Each parameter describes one of the constant mass fractions in one of the reconstruction pixels. Note that, due to the conditions on

c

(c.f. Section 2),

n_{e} - 1

parameters per pixel are sufficient.

4.2. Solving the M1-Model Using Clawpack

The form of Equation (5) motivates the application of finite volume methods to approximate its solution. Our solver of the

M_{1} - model

in two spatial dimensions is implemented using the finite volume library Clawpack/pyclaw [27]. A pseudo time variable

t (ϵ) = ϵ_{1} - ϵ

transforms the energy interval to be compatible with Clawpack’s formulation. The term

- \partial_{ϵ} (S ψ)

can then be split into

- ψ \partial_{ϵ} S

and

S \partial_{t} ψ

, where the first acts like a source term and the second fits to Clawpack’s formulation with S being the capacity function. The capacity coefficient and the source term are applied to the solution via the ‘before-step’ and the ‘source’ function defined by Clawpack. Furthermore, it is important to choose the grid of finite volume cells at least as fine as the reconstruction pixels and to assure that their cell/pixel boundaries coincide.

To implement a Riemann solver, we, as suggested in [26], approximate the Jacobian

A \approx J_{ψ} F_{d} (\frac{ψ_{L} + ψ_{R}}{2})

at each cell interface using the average of the state variable

ψ

. (See Appendix A.4 for the analytical form of the Jacobian of the flux function

J_{ψ} F_{1} (ψ)

.) Then, A is numerically decomposed in eigenvalues and eigenvectors

A = R Λ R^{- 1}

on the basis of which the wave speeds

Λ

, the wave strengths

R^{- 1} Δ ψ = R^{- 1} (ψ_{R} - ψ_{L})

and right and left going fluctuations

a p d q = R max {Λ, 0} R^{- 1} Δ ψ

and

a m d q = R min {Λ, 0} R^{- 1} Δ ψ

can be identified.

Spatial boundary conditions on

\partial Ω

are implemented in Clawpack using ghost cells surrounding the computational domain. At the boundary where the beam enters the domain, the ghost cells are updated using Equation (8). The remaining ghost cells are set to zero, justified by the assumption that the computational domain is large enough such that no electrons reach those boundaries.

4.3. Implementation of the k-Ratio Model

Clawpack discretizes the electron fluence into ‘snapshots’ of the solution at specific energies with piecewise constant cell values in space. The spatial integral for the k-ratios k in Equation (3) reduces to a weighted summation over all cells. The attenuation coefficients for each cell is approximated using the path from its midpoint to the detector, which is split in segments overlapping the reconstruction pixels. For the integral in energy, the trapezoidal rule is applied.

4.4. Adaptions to Solve the Adjoint State Equation and Compute the Gradient

Analogous to the

M_{1} - model

, the adjoint Equation (18) is also implemented using Clawpack/pyclaw taking into consideration the reversed direction of computation. Using another pseudo time variable for the adjoint

τ (ϵ) = ϵ - ϵ_{0}

, the term

S \partial_{ϵ} λ

can be written as

S \partial_{τ} λ

so that it matches Clawpack’s interface. For the wavespeeds, wavestrengths and fluctuations, the same Riemann solver as for the forward problem is employed. It is only modified to calculate the waves of the transposed Jacobians

A_{d}^{T} (ψ)

instead of the Jacobians

A_{d} (ψ)

. To evaluate the Jacobians, the whole forward solution is stored in memory. Via Clawpack’s ‘source’ function, the source term of the adjoint state variable, depending on the required gradient, is integrated. At energy

ϵ_{0}

, we choose

λ (ϵ_{0}) = 0

as the initial condition, while the ghost cells of the spatial boundary are set to Clawpack’s ‘extrapolate’ setting.

The partial derivatives in Equation (15) are computed using the algorithmic differentiation tool jax [34].

5. Numerical Experiments

The following numerical examples illustrate the application of the presented methods and demonstrate their capabilities and limitations. We use synthetic measurements in our experiments which are computed using the same forward model as used for the inversion. The utilized optimization method does not guarantee convergence to a global minimum, and no constraints are enforced on

c

. Overcoming those limitations and implementing a more robust reconstruction method is left for future work.

5.1. Reconstruction on a Small Grid

5.1.1. Zeroth Moment of the Electron Fluence: M1-Model

Under investigation is a domain

Ω = [0, 1000] n m \times [- 800, 0] n m

of a sample consisting of copper and manganese. The domain

Ω

is partitioned into

10 \times 10

reconstruction pixels, with a width of

100 n m \times 80 n m

, as indicated by the grid in Figure 3. The mass fraction of copper

c_{C u}

in each cell is randomly chosen from a uniform distribution over

[0, 1]

; consequently, the mass fractions of manganese are

c_{M n} = 1 - c_{C u}

. The solution of the

M_{1} - model

is computed on a smaller grid with

160 \times 120

grid cells, with 170 time/energy steps between the initial energy

ϵ_{1} = 13 k e V

and the cutoff energy

ϵ_{0} = 5 k e V

.

Bombarding this sample with an electron beam from above in a negative y-direction (beam energy

ϵ_{b e a m} = 12 k e V, σ_{ϵ} = 0.1 k e V

, beam position

x_{b e a m} = {(500, 0)}^{T} n m

,

σ_{x} = 30 n m

), produces the zeroth moment of the electron fluence

ψ^{0}

shown in Figure 3. It displays three snapshots of

ψ^{0}

in energy space as a heat map over the domain

Ω

. Figure 3 also illustrates the interaction volume, the volume where

ψ^{0} > 0

, which is larger than the reconstruction pixels

p_{1, \dots 4}

.

While the number of electrons with high energy is highest near the point of electron impact, electrons with lower energy are more widely distributed in the sample. The contribution of each pixel to the k-ratios scales linearly with the zeroth moment of the electron fluence (Equation (3)), hence a measurement carries most information from pixels close to the surface near the point of electron impact. It is impossible to gain information about the chemical composition in pixels where

ψ^{0}

is zero, so only pixels lying sufficiently close to the surface can be reconstructed.

5.1.2. Convergence to a Particular Solution

In order to show convergence of the

M_{1} - model

solver and the k-ratio model to a particular solution, we investigate the difference of a solution

ψ^{0}

calculated with different resolutions to a reference solution

ψ_{*}^{0}

calculated with high resolution (

320 \times 240

, cell size:

3.125 n m \times 3.3 n m

). Additionally, the k-ratios

k

based on the variable resolution

ψ^{0}

are compared to the k-ratios

k_{*}

computed with the high resolution

ψ_{*}^{0}

. Figure 4 shows the convergence to the reference solution with decreasing cell sizes. The different resolutions as well as other model parameters are shown in Table 1.

5.1.3. Synthetic Measurements

To mimic a reconstruction problem, we generate synthetic k-ratios (

C u K α

and

M n K α

) from the six different beam configurations given by the beam positions and parameters in Table 2. This yields a set of 12 k-ratios, which we consider as synthetic data

k^{\exp}

. All other model parameters are taken from the previous example (Table 1).

5.1.4. Objective Function

In order to give an insight into the behavior of the inverse problem of reconstruction, we will evaluate the objective function

{∥k (ψ, p) - k^{\exp}∥}_{2}^{2}

. All k-ratios, synthetic measurements and modeled k-ratios are computed using the same discretization (spatial grid:

80 \times 60

, energy steps: 100) of the

M_{1} - model

. Both plots in Figure 5 visualize the dependency of the objective function on two of the four parameters describing the mass fractions in the reconstruction pixels. The parameters

p_{1}

and

p_{2}

describe reconstruction pixels that are next to each other while the pixel belonging to

p_{3}

is above that of

p_{2}

, cf. the marked cells in Figure 3. The parameters in all other reconstruction pixels are kept to the values used to calculate the synthetic data. The crosses mark the hidden truth, i.e, the parameter values used to calculate the synthetic data. Both contour plots show a unique minimum with a convex objective function, which, however, cannot be expected for the case where all parameters are simultaneously variable. Nevertheless, Figure 5 gives an intuition into the nature of the problem. The shape of the objective function, which increases less rapidly in the direction in which the parameters cancel each other out, e.g., increasing

p_{a}

while decreasing

p_{b}

, shows that the pixels can partially compensate for each other. It is not detectable in which pixel the x-rays were generated, so an increase in the mass fraction of an element in one pixel may be counteracted to some extent by a decrease in its mass fraction in an adjacent pixel. Therefore, a high spatial resolution can only be achieved by combining measurements obtained from experiments with different electron beam positions and energies.

In addition, the greater elongation in the right plot indicates that it is more difficult to resolve the vertical direction than the lateral direction. The intensity of detected X-rays generated deeper inside the material is lower due to the decreased electron energy and the increased absorption, hence the sensitivity of the k-ratios to changes in the mass fractions of deeper pixels decreases. Using different electron beam energies, the depth of pixels which get excited can be varied and better discrimination of vertical pixels can be achieved. However, the reconstruction of vertical pixels is more difficult than the reconstruction of lateral pixels because, in lateral reconstruction, different electron beam positions can be used to excite only certain pixels. In vertical reconstruction, higher lying pixels are always excited as well.

5.1.5. Reconstruction of Four Parameters

Using the Levenberg–Marquardt optimization algorithm to minimize the objective function of the artificial reconstruction problem yields the optimum illustrated in the left-hand plot of Figure 6, together with the initial values and their evolution during the optimization. Synthetic measurements are computed using the following discretization of the

M_{1} - model

:

160 \times 120

spatial grid, 300 energy steps. For the reconstruction, multiple discretizations are used: a

40 \times 30

spatial grid with 70 energy steps, a

80 \times 60

spatial grid with 100 energy steps and a

160 \times 120

spatial grid with 300 energy steps.

The hidden truth values are marked with solid lines, which after around 10 steps are identified by the optimization algorithm. The dotted, dashed, and dashdotted lines show the evolution of the parameters during the optimization for the different discretizations. Using the same resolution as used for the synthetic measurements (

160 \times 80

, dashdotted line) yields a perfect reconstruction. For other resolutions, the reconstructed parameters differ from the hidden truth but converge to similar values. This is due to the fact that the k-ratios also differ when being calculated from the same parameters but with different discretizations. The right-hand plot shows the value of the objective function over the steps of the Levenberg–Marquardt iterations. Again, deviations are visible for different discretizations.

The size of the reconstruction pixel is smaller than the size of the interaction volume (c.f. Figure 3); nevertheless, the iterative reconstruction method is able to reveal the mass fractions. The combination of measurements with different electron beam configurations and overlapping interaction volumes allows the reconstruction with a high spatial resolution.

5.2. Reconstruction on Small Vertical Layers

We now consider the reconstruction of a material consisting of iron, nickel, and chromium arranged in vertical layers with a size of

50 n m

, which is smaller than the size of the interaction volume. The material is excited by beams with an energy of

17 k e V

and positions centered on each of the vertical layers. All model parameters and discretizations used in this example are given in Table 3. In Figure 7, the iterative reconstruction of the mass fractions is illustrated. The hidden truth configuration of all layers is shown as the background color, where blue is chromium, red is nickel, and the remaining orange part is iron. The Levenberg–Marquardt iterations for the mass fractions of chromium and nickel in each layer are shown by the blue solid and red dotted lines. The mass fraction of iron can be deduced (c.f. Section 2). From the initial configuration (the leftmost point in each layer), all parameters are reconstructed successfully. The rightmost point in each layer shows the mass fractions after six iterations. The algorithm was able to identify the mass fractions in all vertical layers.

6. Conclusions and Outlook

The results presented in this paper illustrate the potential of using high-resolution deterministic models (e.g., the

M_{1} - model

in BCSD) for the implementation of a reconstruction method that improves the spatial resolution of EPMA. Although a compromise must always be found between higher resolution and the number of measured values or the computing effort, the size of the reconstruction pixels is no longer limited by the size of the interaction volume and can be selected independently.

The reconstruction can be formalized as a deterministic PDE-constrained optimization problem and can be implemented using iterative gradient-based minimization techniques. Hereby, the computation of the derivative of the objective function forms the major part of the calculation of the next iterate. Hence, the adjoint state method, which efficiently calculates the gradient, yields significant reductions in computing time over e.g., the finite difference method.

Having the possibility to resolve fine structures in materials using EPMA raises the question about the confidence in a reconstruction. By means of the numerical examples, it was illustrated that reconstruction pixels close to the surface are easier to reconstruct than pixels deeper inside the material, and other pixels are not reconstructible. A high number of parameters, for example in a 3D reconstruction, increases the problem of underdetermination and raises the need to include prior knowledge about the material, e.g., in the form of additional regularization. Furthermore, an uncertainty quantification of the reconstruction result based on the uncertainty of the model and the measurements would provide deeper insight. In any case, the noise of real experimental data must be considered since the strength of the noise influences the accuracy of the reconstruction. Reconstruction experiments based on real measurements should be accompanied by a careful parameter study, since the uncertainty for many of the parameters used in this model is unclear. Nevertheless, the examples presented in this paper indicate the potential of using deterministic k-ratio models to improve the spatial resolution of EPMA compared to conventional methods by implementing a more sophisticated reconstruction method.

Author Contributions

Conceptualization and methodology, T.C., J.B., and M.T.; software, validation and visualization, T.C.; writing—original draft preparation, T.C.; writing—review and editing, J.B. and M.T.; supervision, J.B.; project administration, M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the German Research Foundation (DFG) Grant No. TO414/4-1.

Data Availability Statement

The source code accompanying this article is made available via GitHub: https://github.com/tam724/m1epma (accessed on 9 July 2021) [21].

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Additional M1-Model Equations

Appendix A.1. Stopping Power

The stopping power follows from modeling the energy loss of electrons as a continuous process in BCSD. In this work, we evaluate the stopping power of an element i at electron energy

ϵ

by the analytical Bethe-loss formula [1,18]:

S_{i} (ϵ) = \frac{2 π e^{4} Z^{i}}{A_{i} {(4 π ϵ_{0})}^{2} ϵ} ln (b \frac{ϵ}{J_{i}}) .

(A1)

Except for the electron energy

ϵ

, all remaining quantities are constants: the vacuum permittivity

ϵ_{0}

, the elementary charge e, a relativistic constant

b = \sqrt{\frac{e}{2}}

as well as the atomic mass

A_{i}

and the atomic number

Z_{i}

. The mean ionization potential

J_{i}

of element i can be calculated from its atomic number

Z_{i}

J_{i} = e \{\begin{matrix} 9.76 Z_{i} + 58.8 {(Z_{i})}^{- 0.19} & , Z_{i} > 6 \\ 11.5 Z_{i} & , Z_{i} \leq 6 \end{matrix} .

(A2)

Appendix A.2. Transport Coefficient

The transport coefficient used in this work neglects inelastic collisions and is based on the screened Rutherford cross-section [1,18]

\begin{matrix} T_{i} (ϵ) = & \frac{2 π e^{4}}{16 {(4 π ϵ_{0})}^{2} ϵ^{2}} \frac{{(Z_{i})}^{2}}{A_{i}} \\ (\frac{8}{cos (θ_{0, i} (ϵ)) - 3} + 4 (ln (3 - cos (θ_{0, i} (ϵ))) - ln (1 - cos (θ_{0, i} (ϵ))))) . \end{matrix}

(A3)

Here,

θ_{0, i}

is the screening angle,

R_{i}

is the screening radius, and

λ (ϵ)

the De-Broglie wavelength

θ_{0, i} (ϵ) = \frac{λ (ϵ)}{2 π R_{i}}, R_{i} = a_{H} {(Z_{i})}^{- \frac{1}{3}}, λ (ϵ) = \frac{h}{\sqrt{2 m_{0} ϵ}} .

(A4)

In addition to the constants defined in Appendix A.1, the Bohr radius

a_{H}

, Planck’s constant h, and the electron rest mass

m_{0}

are used.

Appendix A.3. Eddington Factor

The Eddington factor

χ

implicitly depends on

| | α | |

and is approximated by a rational function in

| | α | |

(c.f. [18])

χ (| | α | |) \approx \frac{a_{6} {| | α | |}^{6} + a_{4} {| | α | |}^{4} + a_{2} {| | α | |}^{2} + a_{0}}{{| | α | |}^{4} + b_{2} {| | α | |}^{2} + b_{0}},

(A5)

with the coefficients given in the following table.

i	6	4	2	0
$a_{i}$	$0.720371$	$- 0.139318$	$0.348509$	$0.621529$
$b_{i}$	-	-	$- 1.32002$	$1.87095$

Appendix A.4. Jacobian of the Flux Function

In the following, we will derive an analytical expression of the Jacobian of the flux function with respect to

ψ = {(ψ^{(0)}, {(ψ^{(1)})}^{T})}^{T} = {(ψ^{(0)}, ψ_{x}^{(1)}, ψ_{y}^{(1)}, ψ_{z}^{(1)})}^{T}

. Exemplarily, we will only consider the flux function into direction of the first unit vector

F_{1} (ψ) = F (ψ) e_{1}

.

Using the definition

α = \frac{ψ^{(1)}}{ψ^{(0)}}

and the fact that

ψ^{(0)} \geq 0

[18], the outer product in the flux function can be simplified to

\frac{α}{| | α | |} \otimes \frac{α}{| | α | |} = \frac{1}{| | ψ^{(1)} {| |}^{2}} ψ^{(1)} \otimes ψ^{(1)} = \frac{1}{| | ψ^{(1)} {| |}^{2}} (\begin{matrix} {(ψ_{x}^{(1)})}^{2} & ψ_{x}^{(1)} ψ_{y}^{(1)} & ψ_{x}^{(1)} ψ_{z}^{(1)} \\ ψ_{y}^{(1)} ψ_{x}^{(1)} & {(ψ_{y}^{(1)})}^{2} & ψ_{y}^{(1)} ψ_{z}^{(1)} \\ ψ_{z}^{(1)} ψ_{x}^{(1)} & ψ_{z}^{(1)} ψ_{y}^{(1)} & {(ψ_{z}^{(1)})}^{2} \end{matrix}) .

(A6)

For a compact notation, we, if possible, replace occurring terms with

α

,

| | α | |

, and

| | ψ^{(1)} | |

and abbreviate

t_{1} : = \frac{1 - χ (| | α | |)}{2}, t_{2} : = \frac{3 χ (| | α | |) - 1}{2 | | ψ^{(1)} {| |}^{2}},

(A7)

t_{3} : = \frac{χ^{'} (| | α | |)}{2 {(ψ^{(0)})}^{2} | | α | |} and t_{4} : = \frac{3 t_{3} | | ψ^{(1)} {| |}^{2} - 3 χ (| | a | |) + 1}{| | ψ^{(1)} {| |}^{4}},

(A8)

which allow for writing the flux function

F_{1} (ψ)

(c.f. Equation (5)) as

F_{1} (ψ) = (\begin{matrix} ψ_{x}^{(1)} \\ ψ^{(0)} \underset{: = t_{5}}{\underset{︸}{((\begin{matrix} t_{1} \\ 0 \\ 0 \end{matrix}) + t_{2} (\begin{matrix} {(ψ_{x}^{(1)})}^{2} \\ ψ_{x}^{(1)} ψ_{y}^{(1)} \\ ψ_{x}^{(1)} ψ_{z}^{(1)} \end{matrix}))}} \end{matrix}) .

(A9)

We differentiate the individual parts separately for

ψ^{(0)}

and

ψ^{(1)}

. Starting with

{(t_{1}, 0, 0)}^{T}

, differentiation yields

J_{ψ^{(0)}} (\begin{matrix} t_{1} \\ 0 \\ 0 \end{matrix}) = t_{3} (\begin{matrix} {| | α | |}^{2} ψ^{(0)} \\ 0 \\ 0 \end{matrix}) and J_{ψ^{(1)}} (\begin{matrix} t_{1} \\ 0 \\ 0 \end{matrix}) = t_{3} (\begin{matrix} - {(ψ^{(1)})}^{T} \\ 0^{1 \times 3} \\ 0^{1 \times 3} \end{matrix}) : = T_{1}

(A10)

Similarly, for

t_{2}

, we get

J_{ψ^{(0)}} (t_{2}) = - \frac{3 t_{3}}{ψ^{(0)}} and J_{ψ^{(1)}} (t_{2}) = t_{4} (\begin{matrix} ψ_{x}^{(1)} & ψ_{y}^{(1)} & ψ_{z}^{(1)} \end{matrix}) .

(A11)

The combined derivative of

t_{2}

and the first column of the tensor product

ψ^{(1)} \otimes ψ^{(1)}

results in

J_{ψ^{(0)}} (t_{2} (\begin{matrix} {(ψ_{x}^{(1)})}^{2} \\ ψ_{x}^{(1)} ψ_{y}^{(1)} \\ ψ_{x}^{(1)} ψ_{z}^{(1)} \end{matrix})) = - 3 ψ_{x}^{(1)} t_{3} α and

(A12)

J_{ψ^{(1)}} (t_{2} (\begin{matrix} {(ψ_{x}^{(1)})}^{2} \\ ψ_{x}^{(1)} ψ_{y}^{(1)} \\ ψ_{x}^{(1)} ψ_{z}^{(1)} \end{matrix})) = ψ_{x}^{(1)} t_{4} (ψ^{(1)} \otimes ψ^{(1)}) + t_{2} (\begin{matrix} 2 ψ_{x}^{(1)} & 0 & 0 \\ ψ_{y}^{(1)} & ψ_{x}^{(1)} & 0 \\ ψ_{z}^{(1)} & 0 & ψ_{x}^{(1)} \end{matrix}) : = T_{2} .

(A13)

With the two auxiliary matrices

T_{1}, T_{2} \in R^{3 \times 3}

, the derivative of the flux function

J_{ψ} F_{1} (ψ)

is given by

J_{ψ} (F_{1} (ψ)) = (\begin{matrix} 0 & \begin{matrix} 1 & 0 & 0 \end{matrix} \\ t_{5} + ψ^{(0)} [(\begin{matrix} t_{3} {| | α | |}^{2} ψ^{(0)} \\ 0 \\ 0 \end{matrix}) - 3 ψ_{x}^{(1)} t_{3} α] & ψ^{(0)} (T_{1} + T_{2}) \end{matrix}) .

(A14)

While exploiting similarities in the other two flux function, their Jacobians

J_{ψ} F_{2} (ψ)

and

J_{ψ} F_{3} (ψ)

can be derived accordingly.

References

Reimer, L. Scanning Electron Microscopy: Physics of Image Formation and Microanalysis, 2nd ed.; Springer Series in Optical Sciences; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar] [CrossRef]
Bastin, G.; Heijligers, H. Quantitative Electron Probe Microanalysis of Ultra-Light Elements (Boron-Oxygen). In Electron Probe Quantitation; Springer: Boston, MA, USA, 1991; Chapter 8; pp. 145–161. [Google Scholar] [CrossRef] [Green Version]
Tarantola, A. Inverse Problem Theory and Methods for Model Parameter Estimation; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2005. [Google Scholar] [CrossRef] [Green Version]
Pouchou, J.L.; Pichoir, F. Quantitative Analysis of Homogeneous or Stratified Microvolumes Applying the Model “PAP”. In Electron Probe Quantitation; Springer: Boston, MA, USA, 1991; Chapter 4; pp. 31–75. [Google Scholar] [CrossRef]
Bastin, G.; Dijkstra, J.; Heijligers, H. PROZA96: An improved matrix correction program for electron probe microanalysis, based on a double Gaussian ϕ(ρz) approach. X-Ray Spectrom. 1998, 27, 3–10. [Google Scholar] [CrossRef]
Packwood, R.; Brown, J. A Gaussian expression to describe ϕ(ρz) curves for quantitative electron probe microanalysis. X-Ray Spectrom. 1981, 10, 138–146. [Google Scholar] [CrossRef]
Love, G.; Sewell, D.; Scott, V. An improved absorption correction for quantitative analysis. J. Phys. Colloq. 1984, 45, C2-21–C2-24. [Google Scholar] [CrossRef] [Green Version]
Buse, B.; Kearns, S.L. Spatial resolution limits of EPMA. IOP Conf. Ser. Mater. Sci. Eng. 2020, 891, 012005. [Google Scholar] [CrossRef]
Moy, A.; Fournelle, J. Analytical Spatial Resolution in EPMA: What is it and How can it be Estimated? Microsc. Microanal. 2017, 23, 1098–1099. [Google Scholar] [CrossRef] [Green Version]
Carpenter, P.K.; Jolliff, B.L. Improvements in EPMA: Spatial Resolution and Analytical Accuracy. Microsc. Microanal. 2015, 21, 1443–1444. [Google Scholar] [CrossRef] [Green Version]
Ammann, N.; Karduck, P. A further developed Monte Carlo model for the quantitative EPMA of complex samples. In Microbeam Analysis; Michael, J., Ingram, P., Eds.; San Francisco Press: San Francisco, CA, USA, 1990; pp. 150–154. [Google Scholar]
Joy, D.C. An introduction to Monte Carlo simulations. Scanning Microsc. 1991, 5, 329–337. [Google Scholar]
Ritchie, N.W. A new Monte Carlo application for complex sample geometries. Surf. Interface Anal. 2005, 37, 1006–1011. [Google Scholar] [CrossRef]
Gauvin, R.; Lifshin, E.; Demers, H.; Horny, P.; Campbell, H. Win X-ray: A new Monte Carlo program that computes X-ray spectra obtained with a scanning electron microscope. Microsc. Microanal. 2006, 12, 49–64. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Salvat, F. PENELOPE-2014: A Code System for Monte Carlo Simulation of Electron and Photon Transport; OECD/NEA Data Bank: Issy-les-Moulineaux, France, 2015. [Google Scholar]
Buenger, J.; Richter, S.; Torrilhon, M. A deterministic model of electron transport for electron probe microanalysis. IOP Conf. Ser. Mater. Sci. Eng. 2018, 304, 012004. [Google Scholar] [CrossRef] [Green Version]
Mevenkamp, N.; Pinard, P.; Richter, S.; Torrilhon, M. On a hyperbolic conservation law of electron transport in solid materials for electron probe microanalysis. Bull. Braz. Math. Soc. New Ser. 2016, 47, 575–588. [Google Scholar] [CrossRef]
Mevenkamp, N. Inverse Modeling in Electron Probe Microanalysis Based on Deterministic Transport Equations. Master’s Thesis, RWTH Aachen University, Aachen, Germany, 2013. [Google Scholar]
Duclous, R.; Dubroca, B.; Frank, M. A deterministic partial differential equation model for dose calculation in electron radiotherapy. Phys. Med. Biol. 2010, 55, 3843. [Google Scholar] [CrossRef] [PubMed]
Larsen, E.W.; Miften, M.M.; Fraass, B.A.; Bruinvis, I.A.D. Electron dose calculations using the Method of Moments. Med. Phys. 1997, 24, 111–125. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Claus, T.; Torrilhon, M.; Bünger, J. tam724/m1epma: Initial Release v0.1.0. 2021. Available online: https://0-doi-org.brum.beds.ac.uk/10.5281/zenodo.4742827 (accessed on 9 July 2021).
Claus, T. Application of the Adjoint Method in Gradient-Based Optimization to the M1-Model in Electron Beam Microanalysis. Bachelor’s Thesis, RWTH Aachen University, Aachen, Germany, 2018. Available online: https://publications.rwth-aachen.de/record/818520 (accessed on 9 July 2021).
Hubbell, J.; Seltzer, S. Tables of X-ray Mass Attenuation Coefficients and Mass Energy-Absorption Coefficients (Version 1.4); National Institute of Standards and Technology: Gaithersburg, MD, USA, 2004; Available online: http://physics.nist.gov/xaamdi (accessed on 9 July 2021).
Powell, C. Cross sections for inelastic electron scattering in solids. Ultramicroscopy 1989, 28, 24–31. [Google Scholar] [CrossRef]
Hubbell, J.H.; Trehan, P.N.; Singh, N.; Chand, B.; Mehta, D.; Garg, M.L.; Garg, R.R.; Singh, S.; Puri, S. A Review, Bibliography, and Tabulation of K, L, and Higher Atomic Shell X-Ray Fluorescence Yields. J. Phys. Chem. Ref. Data 1994, 23, 339–364, Erratum in 2004, 33, 621. [Google Scholar] [CrossRef]
LeVeque, R.J.L.; Crighton, D. Finite Volume Methods for Hyperbolic Problems; Cambridge Texts in Applied Mathematics; Cambridge University Press: Cambridge, UK, 2002. [Google Scholar] [CrossRef]
Clawpack Development Team. Clawpack Software, 2019. Version 5.6.0. Available online: https://0-doi-org.brum.beds.ac.uk/10.5281/zenodo.3237295 (accessed on 9 July 2021).
Mandli, K.T.; Ahmadia, A.J.; Berger, M.; Calhoun, D.; George, D.L.; Hadjimichael, Y.; Ketcheson, D.I.; Lemoine, G.I.; LeVeque, R.J. Clawpack: Building an open source ecosystem for solving hyperbolic PDEs. PeerJ Comput. Sci. 2016, 2, e68. [Google Scholar] [CrossRef]
Nocedal, J.; Wright, S. Numerical Optimization, 2nd ed.; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 2006. [Google Scholar] [CrossRef] [Green Version]
Madsen, K.; Nielsen, H.B.; Tingleff, O. Methods for Non-Linear Least Squares Problems, 2nd ed.; Technical University of Denmark: Lyngby, Denmark, 2004. [Google Scholar]
Rackauckas, C.; Ma, Y.; Dixit, V.; Guo, X.; Innes, M.; Revels, J.; Nyberg, J.; Ivaturi, V. A Comparison of Automatic Differentiation and Continuous Sensitivity Analysis for Derivatives of Differential Equation Solutions. arXiv 2018, arXiv:1812.01892. [Google Scholar]
Plessix, R.E. A review of the adjoint-state method for computing the gradient of a functional with geophysical applications. Geophys. J. Int. 2006, 167, 495–503. [Google Scholar] [CrossRef]
Métivier, L.; The SEISCOPE Group. Numerical Optimization and Adjoint State Methods for Large-Scale Nonlinear Least-Squares Problems; Joint Inversion Summer School: Barcelonnette, France, 2015. [Google Scholar]
Bradbury, J.; Frostig, R.; Hawkins, P.; Johnson, M.J.; Leary, C.; Maclaurin, D.; Necula, G.; Paszke, A.; VanderPlas, J.; Wanderman-Milne, S.; et al. JAX: Composable transformations of Python+NumPy Programs. 2018. Available online: http://github.com/google/jax (accessed on 9 July 2021).

Figure 1. Sketch of the physical processes in EPMA: electron path with multiple collisions (dotted line), vacant electron (hollow circle), relaxation (black arrow), X-ray radiation and attenuation/detection (wavy lines), and reconstruction pixel (dashed lines).

Figure 2. 2D schematic of the domain

Ω

, the exposed sample surface

S_{impact}

(solid line), the internal surfaces (dashed lines), the interaction volume (dotted line), and the beams’ spatial distribution

ψ_{b e a m}^{(0)}

.

Figure 2. 2D schematic of the domain

Ω

, the exposed sample surface

S_{impact}

(solid line), the internal surfaces (dashed lines), the interaction volume (dotted line), and the beams’ spatial distribution

ψ_{b e a m}^{(0)}

.

Figure 3. The electron fluence

ψ^{0} (x, ϵ)

induced by a centered electron beam (

ϵ_{b e a m} = 12 k e V

,

σ_{ϵ} = 0.1 k e V

and

x_{b e a m} = {(500, 0)}^{T} n m, σ_{x} = 30 n m

) for three different energies

ϵ \in {11, 9, 7} k e V

. Discretization of the

M_{1} - model

in

(n_{x}, n_{y}, n_{ϵ}) = (160, 120, 170)

. White lines indicate the size of the reconstruction pixel. The white dashdotted (

p_{1}

), solid (

p_{2}

), dashed (

p_{3}

) and dotted (

p_{4}

) pixel are reconstructed in Section 5.1.5.

Figure 3. The electron fluence

ψ^{0} (x, ϵ)

induced by a centered electron beam (

ϵ_{b e a m} = 12 k e V

,

σ_{ϵ} = 0.1 k e V

and

x_{b e a m} = {(500, 0)}^{T} n m, σ_{x} = 30 n m

) for three different energies

ϵ \in {11, 9, 7} k e V

. Discretization of the

M_{1} - model

in

(n_{x}, n_{y}, n_{ϵ}) = (160, 120, 170)

. White lines indicate the size of the reconstruction pixel. The white dashdotted (

p_{1}

), solid (

p_{2}

), dashed (

p_{3}

) and dotted (

p_{4}

) pixel are reconstructed in Section 5.1.5.

Figure 4. Error of the solution of the

M_{1} - model

ψ^{0}

and the k-ratios

k

compared to a reference solution calculated with high resolution

ψ_{*}^{0}

and

k_{*}

. The cell size of the reference solution is

3.125 n m

, and the energy interval is discretized in 350 steps. For reference, the gray dashed and dotted lines indicate the first and second order.

Figure 4. Error of the solution of the

M_{1} - model

ψ^{0}

and the k-ratios

k

compared to a reference solution calculated with high resolution

ψ_{*}^{0}

and

k_{*}

. The cell size of the reference solution is

3.125 n m

, and the energy interval is discretized in 350 steps. For reference, the gray dashed and dotted lines indicate the first and second order.

Figure 5. Contour lines of the objective function, the least squares error of

C u

and

M n

k-ratios from six electron beam positions and energies. In the left-hand plot, two parameters describing pixels lying next to each other are varied, and, in the right-hand plot, two parameters lying below each other are varied, see Figure 3 for the scenario, pixel size, and position. In both cases, all other parameters are fixed to their hidden truth values. The cross marks the hidden truth of the variable parameters.

Figure 5. Contour lines of the objective function, the least squares error of

C u

and

M n

k-ratios from six electron beam positions and energies. In the left-hand plot, two parameters describing pixels lying next to each other are varied, and, in the right-hand plot, two parameters lying below each other are varied, see Figure 3 for the scenario, pixel size, and position. In both cases, all other parameters are fixed to their hidden truth values. The cross marks the hidden truth of the variable parameters.

Figure 6. The iterative reconstruction of the mass fractions of copper in each of the four variable pixels and the value of the objective function using the Levenberg–Marquardt algorithm. The mass fraction of manganese can be deduced from

1 - c_{C u}

. Dotted, dashed, and dashdotted lines show the reconstructions where different discretizations of the

M_{1} - model

are used. Solid lines show the hidden truth of all four parameter values. The synthetic measurements are generated with a resolution of

160 \times 120

, hence the perfect reconstruction using this resolution. Using other resolutions, the reconstruction also converges to similar values.

Figure 6. The iterative reconstruction of the mass fractions of copper in each of the four variable pixels and the value of the objective function using the Levenberg–Marquardt algorithm. The mass fraction of manganese can be deduced from

1 - c_{C u}

. Dotted, dashed, and dashdotted lines show the reconstructions where different discretizations of the

M_{1} - model

are used. Solid lines show the hidden truth of all four parameter values. The synthetic measurements are generated with a resolution of

160 \times 120

, hence the perfect reconstruction using this resolution. Using other resolutions, the reconstruction also converges to similar values.

Figure 7. Reconstruction of a vertically layered material. Electron beams bombard the sample from above with

17 k e V

centered at each of the

50 n m

layers, whose interfaces are indicated by vertical black lines. The background color shows the hidden truth mass fraction of each layer (blue: chromium, red: nickel, orange: iron). The lines (blue solid: chromium, red dotted: nickel) in each layer show the Levenberg–Marquardt steps from initial (left) to sixth iteration (right). These two parameters are sufficient to describe all three mass fractions (see Section 2). No additional constraint was applied during the optimization, hence the nonphysical behavior

c_{C r} + c_{N i} > 1

during the optimization. Nevertheless, all parameters are reconstructed successfully.

Figure 7. Reconstruction of a vertically layered material. Electron beams bombard the sample from above with

17 k e V

centered at each of the

50 n m

layers, whose interfaces are indicated by vertical black lines. The background color shows the hidden truth mass fraction of each layer (blue: chromium, red: nickel, orange: iron). The lines (blue solid: chromium, red dotted: nickel) in each layer show the Levenberg–Marquardt steps from initial (left) to sixth iteration (right). These two parameters are sufficient to describe all three mass fractions (see Section 2). No additional constraint was applied during the optimization, hence the nonphysical behavior

c_{C r} + c_{N i} > 1

during the optimization. Nevertheless, all parameters are reconstructed successfully.

Table 1. Discretization and model parameters used for the convergence analysis.

spatial domain $Ω$	$[0, 1000] nm \times [- 800, 0] nm$
spatial grid	$40 \times 30$ $(25 n m \times 26.6 n m)$ , $80 \times 60$ $(12.5 n m \times 13.3 n m)$ , $160 \times 120$ $(6.25 n m \times 6.6 n m)$
spatial grid (reference)	$320 \times 240$ $(3.125 n m \times 3.3 n m)$
energy range	$[5, 13] k e V$
energy steps	350
beam configuration	$12 k e V$ $σ_{ϵ} = 0.1 k e V$ , $500 n m$ $σ_{x} = 30 n m$
k-ratios	$C u K α$ , $M n K α$
detector position	$(500, 50) n m$

Table 2. Beam configurations used to generate synthetic k-ratio measurements.

beam energies and positions	$(450 nm, 12 keV)$ , $(450 nm, 11 keV)$ , $(450 nm, 10 keV)$ , $(550 nm, 12 keV)$ , $(550 nm, 11 keV)$ , $(550 nm, 10 keV)$
beam widths	$σ_{ϵ} = 0.1 keV$ , $σ_{x} = 30 nm$

Table 3. Discretization and model parameters used for the reconstruction of a layered material.

spatial domain $Ω$	$[0, 1000] nm \times [- 800, 0] nm$
reconstruction pixel	$20 \times 1$ $(50 n m)$
spatial grid	$40 \times 40$ $(25 n m \times 20 n m)$
energy range	$[5, 18] k e V$
energy steps	200
beam positions	${25, 75, 125, 175, \dots 925, 975} n m$ $σ_{x} = 30 n m$
beam energy	$17 k e V$ $σ_{ϵ} = 0.125 k e V$
k-ratios	$C r K α$ , $N i K α$ , $F e K α$
detector position	$(500, 50) n m$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Claus, T.; Bünger, J.; Torrilhon, M. A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis. Math. Comput. Appl. 2021, 26, 51. https://0-doi-org.brum.beds.ac.uk/10.3390/mca26030051

AMA Style

Claus T, Bünger J, Torrilhon M. A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis. Mathematical and Computational Applications. 2021; 26(3):51. https://0-doi-org.brum.beds.ac.uk/10.3390/mca26030051

Chicago/Turabian Style

Claus, Tamme, Jonas Bünger, and Manuel Torrilhon. 2021. "A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis" Mathematical and Computational Applications 26, no. 3: 51. https://0-doi-org.brum.beds.ac.uk/10.3390/mca26030051

Article Menu

A Novel Reconstruction Method to Increase Spatial Resolution in Electron Probe Microanalysis

Abstract

1. Increasing the Spatial Resolution in EPMA

2. Deterministic k-Ratio Model

2.1. Parameterized Material Description

2.2. k-Ratios as a Function of the Electron Number Density

2.3. M1-Model of Electron Transport

Boundary Conditions

3. Inverse Problem of Material Reconstruction

3.1. PDE-Constrained Optimization Problem

3.2. Iterative Gradient Based Optimization

3.3. The Adjoint State Method

3.4. Adjoint State Method for the M1-Model Constraint

4. Numerical Implementation

4.1. Material Parameterization

4.2. Solving the M1-Model Using Clawpack

4.3. Implementation of the k-Ratio Model

4.4. Adaptions to Solve the Adjoint State Equation and Compute the Gradient

5. Numerical Experiments

5.1. Reconstruction on a Small Grid

5.1.1. Zeroth Moment of the Electron Fluence: M1-Model

5.1.2. Convergence to a Particular Solution

5.1.3. Synthetic Measurements

5.1.4. Objective Function

5.1.5. Reconstruction of Four Parameters

5.2. Reconstruction on Small Vertical Layers

6. Conclusions and Outlook

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Additional M1-Model Equations

Appendix A.1. Stopping Power

Appendix A.2. Transport Coefficient

Appendix A.3. Eddington Factor

Appendix A.4. Jacobian of the Flux Function

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI