Affine Differential Geometric Control Tools for Statistical Manifolds

Hirica, Iulia-Elena; Pripoae, Cristina-Liliana; Pripoae, Gabriel-Teodor; Preda, Vasile

doi:10.3390/math9141654

Open AccessArticle

Affine Differential Geometric Control Tools for Statistical Manifolds

¹

Faculty of Mathematics and Computer Science, University of Bucharest, Academiei 14, RO-010014 Bucharest, Romania

²

Department of Applied Mathematics, The Bucharest University of Economic Studies, Piata Romana 6, RO-010374 Bucharest, Romania

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2021, 9(14), 1654; https://0-doi-org.brum.beds.ac.uk/10.3390/math9141654

Submission received: 27 June 2021 / Revised: 10 July 2021 / Accepted: 12 July 2021 / Published: 14 July 2021

(This article belongs to the Special Issue Stochastic Models and Methods with Applications)

Download Versions Notes

Abstract

:

The paper generalizes and extends the notions of dual connections and of statistical manifold, with and without torsion. Links with the deformation algebras and with the Riemannian Rinehart algebras are established. The semi-Riemannian manifolds admitting flat dual connections with torsion are characterized, thus solving a problem suggested in 2000 by S. Amari and H. Nagaoka. New examples of statistical manifolds are constructed, within and beyond the classical setting. The invariant statistical structures on Lie groups are characterized and the dimension of their set is determined. Examples for the new defined geometrical objects are found in the theory of Information Geometry.

Keywords:

statistical manifold; dual connections; Fisher metric; Gibbs entropy; invariant connections on Lie groups; Information Geometry; dually flat connections; deformation algebras; Riemannian Rinehart algebras; bi-algebras; control tools

1. Introduction

A triple (

M, g, \nabla

) is called statistical manifold if (

M, g

) is a semi-Riemannian manifold and ∇ is a torsion-free affine connection on M such that

(\nabla_{Z} g) (X, Y) = (\nabla_{Y} g) (X, Z) .

(1)

This notion was defined by S. Amari in [1], as a geometrical model for some facts in Statistics: M is a parameters space of distributions of probability, g is the Rao–Fisher metric deduced from the Bolzmann–Gibbs–Shannon entropy function, and ∇ is a tool for asymptotic estimations.

The geometrization based on both a semi-Riemannian metric and an affine connection was already used (until now, without great succes) in different attempts to unify relativistic gravity models with electromagnetism ones (Weyl, Eddington, Einstein, Kaluza, etc., in the first half of the 20th century; see in [2] for a recent review). Instead, in Statistics, the model was considered important and fruitful. Today it constitutes a modern and promising area of active research (see, for example, in [3,4,5,6]).

Two connections

\nabla^{1}

and

\nabla^{2}

are dual in

(M, g)

if

X g (Y, Z) - g (\nabla_{X}^{1} Y, Z) - g (Y, \nabla_{X}^{2} Z) = 0 .

From a differential geometric point of view, the dualistic structure first generalizes somehow the invariance of the inner product under parallel translation through metric connections. Moreover, the existence of a dually flat structure on a manifold points out some topological and geometrical properties of the manifold. The notion traces back to Norden and was adapted on statistical manifolds, where remarkable families of dual connections contain information about the dualistic properties of exponential families of probability distributions [7].

This initial (and already classical) setting was generalized in several ways. We point out here only a direction opened by Kurose and Matsuzoe, who considered statistical manifolds with non-symmetric connections ∇ [8,9,10], intended for quantum field theories. As statistical manifolds have close relations to the geometry of affine immersions, statistical manifolds admitting torsion have relations to the geometry of affine distributions.

In this paper, we review in a creative manner the fundamentals of dual connections and of statistical manifolds and we give new examples (Section 2 and Section 4). The families of these new examples depend on many “parameters” and thus are susceptible to fit in various applications. We establish “controls” over the parameters manifold M and its various affine modules of connections. These “controls” are provided by deformation algebras defined by the difference of two connections, or by some Riemannian Rinehart bi-algebras associated to the Riemannian metric and to the canonical Lie–Rinehart algebra of the manifold. The deformation algebras were intensively studied during 1970–1990, as a natural translation between algebraic and differential geometric properties of differentiable manifolds. The Riemannian Rinehart structures are more recent and constitute a promising area of research. In Section 2, we give some hints for the respective literature related to both these algebraic objects. We show that arbitrary pairs of dual connections are determined by pairs of a metric connection and an arbitrary connection, or by triples formed by a metric connection, an arbitrary connection and a function (thus generalizing the pairs of the so called

α

-connections).

In Section 3, we prove formulas for the main invariants associated to the dual connections. We determine links between their Bianchi identities and express the Jacobi equation for geodesics in terms of dual connections.

In Section 4, some very general families of statistical manifolds are defined, depending on many parameters. Here, we characterize the semi-Riemannian manifolds admitting flat dual connections with torsion, thus solving a problem suggested in [7].

In Section 5, we define nine new families of statistical manifolds, denoted SMAT

_{1}, \dots,

SMAT

_{9}

, which generalize the known ones, by using some special hypothesis on the curvature and on the tensor vector fields.

In Section 6, we determine how many independent bi-invariant statistical structures may exist on a compact Lie group and how many independent left invariant statistical structures may exist on an arbitrary Lie group.

Section 7 is devoted to some examples of statistical manifolds, in particular frameworks from Information Geometry.

2. Dual Connections and Controls over Some Affine Modules of Connections

Let (

M, g

) be a semi-Riemannian manifold with the Levi–Civita connection

\nabla^{0}

. We denote by

C (M)

and

C_{s} (M)

the sets of affine connections and of symmetric (i.e., torsion-free) connections on M, respectively, endowed with the canonical structure of affine

F (M)

-module. We define

C_{m} (M, g) : = {\nabla \in C (M) ∣ (\nabla_{Z} g) (X, Y) = 0 for all X, Y, Z \in X (M)},

C_{c} (M, g) : = {\nabla \in C (M) ∣ (\nabla_{Z} g) (X, Y) = (\nabla_{Y} g) (X, Z) for all X, Y, Z \in X (M)}

and

C_{s c} (M, g) : = C_{s} (M) \cap C_{c} (M, g)

. As

{\nabla^{0}} = C_{s} (M) \cap C_{m} (M, g)

, we have the following inclusions of (non-void) affine submodules

C_{s c} (M, g) \subset C_{s} (M) \subset C (M), C_{s c} (M, g) \subset C_{c} (M, g) \subset C (M), C_{m} (M, g) \subset C_{c} (M, g) .

Remark 1.

(i) Each connection

\nabla \in C (M)

may be uniquely written as

\nabla = \nabla^{0} + A

, where

A \in T_{2}^{1} (M)

relates to the torsion tensor field

T^{\nabla}

by the relation

T^{\nabla} (X, Y) = A (X, Y) - A (Y, X) .

We have the complete determination of the connection ∇ by the (1,2)-tensor field A, and, mutatis mutandis, the determination of the affine module

C (M)

by its direction, the real vector space

T_{2}^{1} (M)

. We may interpret the semi-Riemannian geometry of

(M, g, \nabla^{0})

as a reference point and “vary” it by affine geometries

(M, \nabla)

, acting through the “control” A. Moreover, once A is fixed,

X (M)

gets a structure of

F (M)

-algebra, called the deformation algebra of the pair (

\nabla, \nabla^{0}

), by the multiplication

X \cdot Y : = A (X, Y)

. Translations between the algebraic properties of these deformation algebras and the geometric properties of the ambient manifold were extensively studied (see, for example, in [11,12,13] and the references therein).

We must point out here alternative invariants, also studied in the literature: the “cubic forms”

C_{1} (X, Z, Y) : = (\nabla_{X} g) (Y, Z)

and

C_{2} (X, Z, Y) : = g (X, A (Y, Z))

; we shall not use them in our paper.

(ii) We have the following characterizations:

C_{m} (M, g) = {\nabla^{0} + A ∣ g (A (X, Y), Z) + g (A (X, Z), Y) = 0},

C_{c} (M, g) = {\nabla^{0} + A ∣ g (A (X, Y), Z) + g (A (X, Z), Y) =

= g (A (Z, Y), X) + g (A (Z, X), Y)},

C_{s c} (M, g) = {\nabla^{0} + A ∣ g (A (X, Z), Y) = g (A (Y, Z), X), A (X, Y) = A (Y, X)} .

(iii) A Riemannian Rinehart space is a Lie–Rinehart algebra endowed with a “Riemannian metric”, i.e., a musical (generalized) scalar product (see in [14] for details). This construction establishes a purely algebraic framework for many properties which are commonly studied in Riemannian geometry, by analytic and geometric tools.

In particular, on the Lie–Rinehart algebra

(F (M), X (M))

[14], we get a canonical structure of Riemannian Rinehart space, induced by the (generalized) scalar product

<, >

, canonically associated to the Riemannian metric g.

Fix a connection

\nabla = \nabla^{0} + A \in C (M)

, with the (fixed) control A. Define, as above, the multiplication

X \cdot Y : = A (X, Y)

. It follows that on

(F (M), X (M), <, >)

we get an additional algebra structure, which combines the properties of the deformation algebra

(X (M), \cdot)

with those from the Lie–Rinehart algebra

(F (M), X (M))

. We believe that these bi-algebras

(F (M), X (M), <, >, \cdot)

deserve a closer attention of their own.

In what we are concerned here, we restrain to the following.

Theorem 1.

Let

(M, g)

be a Riemannian manifold and

\nabla = \nabla^{0} + A

a connection on M. Then,

\nabla \in C_{c} (M, g)

if and only if the Riemannian Rinehart bi-algebra

(F (M), X (M), <, >, \cdot)

satisfies the compatibility condition

< X \cdot Y, Z > - < Z \cdot Y, X > = < Z \cdot X - X \cdot Z, Y > .

We read this formula as a mutual determinancy, which express the obstruction to “self-adjointness”, in the left side, through the obstruction to commutativity, in the right side.

Remark 2.

(i) Consider the affine transformation

Φ : C (M) \to C (M)

, which associates to each

\nabla \in C (M)

the affine connection

\nabla^{*} : = Φ (\nabla)

, given by

g (X, \nabla_{Z}^{*} Y) = Z g (X, Y) - g (\nabla_{Z} X, Y) .

(2)

The connection

\nabla^{*}

is called the dual of ∇ [1,7]; the relation of duality is an equivalence one, as Φ is an involution. Obviously,

\nabla^{0}

is (the only) self-dual connection, as the unique fixed point of Φ. The transformation Φ depends on the Riemannian metric only.

(ii) Suppose

\nabla = \nabla^{0} + A

and denote

A^{'},^{'} A \in T_{2}^{1} (M)

the adjoint operators (at the right and at the left, respectively) given by

g (X, A^{'} (Z, Y)) = g (Y, A (Z, X)), g (X,^{'} A (Y, Z)) = g (Y, A (X, Z)) .

We have also a direct relation between

A^{'}

and

^{'} A

, which allows us to study only one of these operators, namely,

g (X, A^{'} (Z, Y)) = g (Z,^{'} A (Y, X)) .

Obviously,

^{'} A (X, Y) = A^{'} (Y, X)

if and only if A is commutative.

We have

\nabla^{*} : = \nabla^{0} + A^{*}

, where

A^{*} : = - A^{'}

. The deformation algebras

(X (M), A)

and

(X (M), A^{*})

are equivalent, i.e., contain the same (algebraic and) geometric information. Obviously,

A^{'} = A

if and only if

\nabla^{0} = \frac{1}{2} (\nabla + \nabla^{*})

.

(iii) Another interesting deformation algebra is

(X (M), A + A^{'})

, which measures another obstruction for ∇ to coincide with

\nabla^{*}

(i.e.,

\nabla = \nabla^{*}

if and only if

A + A^{'} = 0

if and only if

\nabla_{X} g = 0

).

(iv) The tensor fields

A^{*}

,

A^{'}

, and

^{'} A

(and their associated deformation algebras, or their associated Riemannian Rinehart bi-algebras) act as controls over the affine modules of special connections previously studied. The information they carry with is, of course, redundant and may be translated and simplified, following the context.

(v) Let (

\nabla, \nabla^{*}

) be dual connections and

\tilde{\nabla} : = \frac{1}{2} \nabla + \frac{1}{2} \nabla^{*}

their mean connection. It is well known [7,10] that

\tilde{\nabla} \in C_{m} (M)

. We remark also that

\tilde{T} = \frac{1}{2} T - \frac{1}{2} T^{'}

.

It is interesting that we have also a strong converse statement (inspired by a suggestion in [7] (p. 51)): consider

\nabla^{1} \in C_{m} (M)

, with the torsion tensor field of the form

T^{1} = \frac{1}{2} B - \frac{1}{2} B^{'}

, for some skew-symmetric tensor field

B \in T_{2}^{1} (M)

. Let ∇ be a connection on M, with torsion B. Define

\nabla^{2} : = 2 \nabla^{1} - \nabla

. It follows that

T^{2} = - B^{'}

. Then,

\nabla^{2} = \nabla^{*}

and

\nabla^{1} = \frac{1}{2} \nabla + \frac{1}{2} \nabla^{*}

. In conclusion, to any connection

\hat{\nabla} \in C_{m} (M)

with the respective special torsion, we can associate an infinite family of dual connections, such that

\hat{\nabla}

is the mean connection for every element of this family. In particular, this works for the Levi–Civita connection

\nabla^{0}

, because

B^{0} = 0

in this case. (An elementary comparation: in order to identify a closed interval on the real line, we may specify both its ends, or we may specify one of its ends and its middle point.)

The next example shows that we may generalize the previous “arithmetic” mean.

Example 1.

Consider an arbitrary couple

(\nabla, \nabla^{*})

of conjugate connections. We define a 1-parameter family of connections

{\nabla^{(f)}}_{f \in F (M)}

, called the f-connections, such that

(\nabla^{(- f)}, \nabla^{(f)})

are dually coupled to the metric, where

\nabla^{(f)} : = \frac{1 + f}{2} \nabla + \frac{1 - f}{2} \nabla^{*} .

We get

\nabla^{(f)} = \nabla^{0} + A^{f}

, with

A^{f} : = \frac{1 + f}{2} A - \frac{1 - f}{2} A^{'} \in T_{2}^{1} (M)

and we obtain a family of deformation algebras

(X (M), A^{f})

.

In particular, we have the

α

-connections

{\nabla^{α}}_{α \in R}

. These connections generalize the classical ones (see for example [7,15]), which are symmetric.

In the general case, a short calculation shows that

(\nabla_{X}^{(f)} g) (Y, Z) = - f g ((A + A^{'}) (X, Y), Z)

(3)

and

2 T^{f} (X, Y) = (1 + f) [A (X, Y) - A (Y, X)] - (1 - f) [A^{'} (X, Y) - A^{'} (Y, X)] .

(4)

Conversely, start with a connection

\nabla^{1} \in C (M)

, satisfying (3) and (4) for some

A \in T_{2}^{1} (M)

. Consider a function f and the connection

\nabla : = \nabla^{0} + A

. Then, there exists a unique connection

\nabla^{2}

such that

\nabla^{*} = \nabla^{2}

,

\nabla^{(f)} = \nabla^{1}

and (

\nabla, \nabla^{*}

) are conjugate. For

f = 0

, we recover the construction in the Remark 2, (v). (An elementary comparation: in order to identify a closed interval on the real line, we may specify both its ends, or we may specify one of its ends and the point which divides the interval in some given “ratio” f).

Finally, we remark that Formula (3) shows the direct proportionality between the obstruction to the

\nabla^{(f)}

-parallelism of the metric g (on the left side) and the extent to which ∇ differs from

\nabla^{*}

(i.e.,

A + A^{'}

differs from 0), weighted through the “conformal factor”

(- f)

.

3. The Main Geometric Invariants Associated to Dual Connections

Let ∇ and

\nabla^{*}

be dual connections on a semi-Riemannian manifold (

M, g

), with the Levi–Civita connection

\nabla^{0}

. We denote by T, t, R,

R i c

,

F a r

, and

ρ

the torsion tensor, the “mean torsion” one form, the curvature tensor, the Ricci tensor, the Faraday tensor, and the “pseudo-scalar” curvature of ∇, respectively, defined by

T (X, Y) = \nabla_{X} Y - \nabla_{Y} X - [X, Y],

t (X) : = t r a c e (Y \to T (Y, X)),

R (X, Y) Z = \nabla_{X} \nabla_{Y} Z - \nabla_{Y} \nabla_{X} Z - \nabla_{[X, Y]} Z,

R i c (X, Y) : = t r a c e (Z \to R (Z, X) Y),

F a r (X, Y) : = t r a c e (Z \to R (X, Y) Z)

and

ρ = t r a c e R i c

. Similar geometric objects associated to

\nabla^{*}

and

\nabla^{0}

will be denoted with an upper ∗ or 0, respectively. Denote

E_{1}, \dots, E_{n}

a local orthonormal basis of vector fields on

(M, g)

.

Remark 3.

Using (2), we can determine the previous invariants of

\nabla^{*}

in terms of ∇ and g:

\begin{matrix} g (X, T^{*} (Z, Y)) = g (X, T (Z, Y)) + (\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z), \\ t^{*} (Y) = t (Y) + (d i v^{\nabla} g) (Y) - τ (Y), \end{matrix}

(5)

where we denoted

τ (Y) : = \sum_{i} (\nabla_{Y} g) (E_{i}, E_{i})

,

\begin{matrix} g (R^{*} (Y, Z) W, X) = - g (R (Y, Z) X, W), \\ R i c^{*} (Z, W) = - \sum_{i} g (R (E_{i}, Z) E_{i}, W), \\ ρ^{*} = - \sum_{i, j} g (R (E_{i}, E_{j}) E_{i}, E_{j}) . \end{matrix}

(6)

Remark 4.

We denote

A (X, Y) : = \nabla_{X} Y - \nabla_{X}^{0} Y

. Then, we can express the invariants of ∇ in terms of g and A:

T (X, Y) = A (X, Y) - A (Y, X),

R (X, Y) Z = R^{0} (X, Y) Z + (\nabla_{X}^{0} A) (Y, Z) - (\nabla_{Y}^{0} A) (X, Z) +

+ A (X, A (Y, Z)) - A (Y, A (X, Z)),

R i c (Y, Z) = R i c^{0} (Y, Z) + \sum_{i} [g (E_{i}, (\nabla_{E_{i}}^{0} A) (Y, Z)) + g (E_{i}, A (E_{i}, A (Y, Z))) -

- g (E_{i}, (\nabla_{Y}^{0} A) (E_{i}, Z)) - g (E_{i}, A (Y, A (E_{i}, Z)))] .

If we denote

A^{t} (X, Y) = A (Y, X)

, then we have the equivalent formula

R i c (Y, Z) = R i c^{0} (Y, Z) + d i v A (Y, Z) + t r a c e A_{A (Y, Z)}^{t} - t r a c e (\nabla_{Y}^{0} A) (Z, \cdot) - t r a c e A_{Y} \circ A_{Z}^{t} .

ρ = ρ^{0} + \sum_{i, j} [g (E_{i}, (\nabla_{E_{i}}^{0} A) (E_{j}, E_{j})) + g (E_{i}, A (E_{i}, A (E_{j}, E_{j}))) -

- g (E_{i}, (\nabla_{E_{j}}^{0} A) (E_{i}, E_{j})) - g (E_{i}, A (E_{j}, A (E_{i}, E_{j})))] .

Using coordinates associated to the given orthonormal basis on M, one has

R i c_{i k} = R i c_{i k}^{0} + \nabla_{l}^{0} A_{i k}^{l} - \nabla_{i}^{0} A_{l k}^{l} + A_{i k}^{l} A_{m l}^{m} - A_{p k}^{s} A_{i s}^{p} .

The Ricci tensor

R i c

is symmetric iff

\nabla_{l}^{0} A_{i k}^{l} - \nabla_{i}^{0} A_{l k}^{l} + A_{i k}^{l} A_{m l}^{m} - A_{p k}^{s} A_{i s}^{p} = \nabla_{l}^{0} A_{k i}^{l} - \nabla_{k}^{0} A_{l i}^{l} + A_{k i}^{l} A_{m l}^{m} - A_{p i}^{s} A_{i k}^{p} .

The following result is a simple consequence of the previous formulas and may be considered “folklore”.

Proposition 1.

Let

\nabla \in C (M)

. Then, (i)

\nabla \in C_{c} (M)

if and only if

T = T^{*}

; (ii)

\nabla \in C_{c} (M)

if and only if

\nabla^{*} \in C_{c} (M)

; (iii)

\nabla \in C_{s c} (M)

if and only if

\nabla^{*} \in C_{s c} (M)

; (iv) if

\bar{\nabla} = \frac{1}{2} (\nabla + \nabla^{*})

, then

\bar{\nabla} \in C_{c} (M)

; (v) ∇ and

\nabla^{*}

have the same (parameterized) geodesics

\Leftrightarrow A (X, X) + A^{'} (X, X) = 0, \forall X \in X (M) \Leftrightarrow

the tensor field

A + A^{'}

is skew-symmetric.(vi) Let

\nabla \in C_{s c} (M)

. If ∇ and

\nabla^{*}

have the same (parameterized) geodesics, then

A = A^{'} = 0

, i.e.,

\nabla = \nabla^{*} = \nabla^{0} .

Proposition 2.

Let

\nabla = \nabla^{0} + A

and

\nabla^{*} = \nabla^{0} - A^{'}

be dual connections on

(M, g)

. Then, the Bianchi identities for ∇ impose the following conditions uppon the control A:

Bianchi 1 \sum_{X, Y, Z}^{c} [(\nabla_{Y}^{0} A) (X, Y) - (\nabla_{X}^{0} A) (Z, Y) -

- A (X, A (Z, Y)) + A (A (X, Z), Y) - A (A (Y, X), Z) + A (Z, A (Y, X))] = 0,

Bianchi 2 \sum_{X, Y, Z}^{c} [A (X, R^{0} (Y, Z) W) + \nabla_{X}^{0} (\nabla_{Y}^{0} A) (Z, W) + A (X, (\nabla_{Y}^{0} A) (Z, W)) -

- \nabla_{X}^{0} (\nabla_{Z}^{0} A) (Y, W) - A (X, (\nabla_{Z}^{0} A) (Y, W)) + \nabla_{X}^{0} A (Y, A (Z, W)) +

+ A (X, A (Y, A (Z, W))) - \nabla_{X}^{0} A (Z, A (Y, W)) - A (X, A (Z, A (Y, W))) +

+ R^{0} (A (X, Y) - A (Y, X), Z) W + (\nabla_{A (X, Y) - A (Y, X)}^{0} A) (Z, W) - (\nabla_{Z}^{0} A) (A (X, Y) -

- A (Y, X), W) + A (A (X, Y) - A (Y, X), A (Z, W)) -

- A (Z, A (A (X, Y) - A (Y, X), W))] = 0 .

Similar conditions arise for

\nabla^{*}

, replacing A by

- A^{'}

.

Remark 5.

With the notations in the previous proposition, we deduce a consequence of combining the first Bianchi identity for ∇ and

\nabla^{*}

\sum_{X, Y, Z}^{c} [- g (R (X, Y) W, Z) - g (T (T^{*} (X, Y), Z), W) -

- (\nabla_{T^{*} (X, Y)} g) (Z, W) + (\nabla_{Z} g) (T^{*} (X, Y), W) -

- X (g (T (Y, Z), W) - X ((\nabla_{Y} g) (Z, W)) + X ((\nabla_{Z} g) (Y, W)) +

+ g (T (Y, Z), \nabla_{X} W) + (\nabla_{Y} g) (\nabla_{X} W, Z) - (\nabla_{Z} g) (\nabla_{X} W, Y) +

+ g (T (\nabla_{X}^{*} Y, Z), W) + (\nabla_{\nabla_{X}^{*} Y} g) (Z, W) - (\nabla_{X} g) (\nabla_{X}^{*}, Y, W) +

+ g (T (Y, \nabla_{X}^{*} Z), W) + (\nabla_{Y} g) (\nabla_{X}^{*} Z, W) - (\nabla_{\nabla_{X}^{*} Z} g) (Y, W)] = 0 .

The second Bianchi identity leads to a much more complicated relation of compatibility and we omit it.

Theorem 2.

Let

(\nabla, \nabla^{*})

be dual connections on the semi-Riemannian manifold

(M, g)

, with

\nabla = \nabla^{0} + A

and

\nabla^{*} = \nabla^{0} - A^{'}

and let

γ = γ (t)

be a geodesic of g. Then, the Jacobi fields J along γ are solutions of

\nabla_{\dot{γ}} \nabla_{\dot{γ}} J + R (J, \dot{γ}) \dot{γ} - A (T (J, \dot{γ}), \dot{γ}) = (\nabla_{J} A) (\dot{γ}, \dot{γ}) + A (\nabla_{\dot{γ}} J, \dot{γ}) - A (J, A (\dot{γ}, \dot{γ})) +

+ A (\dot{γ}, A (J, \dot{γ})) - A (\dot{γ}, A (\dot{γ}, J)) + \nabla_{\dot{γ}} A (\dot{γ}, J) - \nabla_{\dot{γ}} A (J, \dot{γ}) + A (J, \nabla_{\dot{γ}} \dot{γ}) + A (\dot{γ}, \nabla_{\dot{γ}} J) .

If, moreover, ∇ is symmetric, then

\nabla_{\dot{γ}} \nabla_{\dot{γ}} J + R (J, \dot{γ}) \dot{γ} = (\nabla_{J} A) (\dot{γ}, \dot{γ}) + 2 A (\nabla_{\dot{γ}} J, \dot{γ}) - A (J, A (\dot{γ}, \dot{γ})) + A (J, \nabla_{\dot{γ}} \dot{γ}) .

The assertion still holds if we replace ∇ by

\nabla^{*}

and A by

- A^{'}

.

Proof.

The Jacobi equation for

J \in X_{γ}

writes

\nabla_{\dot{γ}}^{0} \nabla_{\dot{γ}}^{0} J + R^{0} (J, \dot{γ}) \dot{γ} = 0 .

We use the identity

R^{0} (X, Y) Z = R (X, Y) Z - (\nabla_{X} A) (Y, Z) + (\nabla_{Y} A) (X, Z) +

+ A (X, A (Y, Z)) - A (Y, A (X, Z)) - A (T (X, Y), Z) .

Replacing

\nabla^{0}

and

R^{0}

as functions of ∇, R and A, we get the identity we were looking for. □

The previous theorem provides formulas for the transversal control of the geodesics behaviour, expressed in terms of the dual connections instead of the metric. Conversely, we may obtain formulas which express the Jacobi equation along the auto-parallel curves of ∇ or

\nabla^{*}

(i.e., ∇-“geodesics” or

\nabla^{*}

-“geodesics”), in terms of g,

\nabla^{0}

, and A or

- A^{'}

, respectively.

4. Existence and Characterizations of Statistical Structures

A triple

(M, g, \nabla)

is a statistical manifold if

\nabla \in C_{s c} (M, g)

. In this case, (

M, g, \nabla^{*}

) is a statistical manifold too. Alternatively, we denote

(M, g, \nabla, \nabla^{*})

instead of

(M, g, \nabla)

, in order to point out the implicit duality inside.

The centro-affine properties of

C_{s c} (M)

w.r.t.

\nabla^{0}

(together with the metric properties) constitute the geometrical core of the theory of statistical manifolds.

Remark 6.

Given the semi-Riemannian manifold (

M, g

), we point out these five (equivalent) characterizations of a statistical manifold (

M, g, \nabla

):

(I)—by relation (1) and

T^{\nabla} = 0

.

(II)—through

A \in T_{2}^{1} (M)

such that

\nabla = \nabla^{0} + A

, with

g (A (X, Z), Y) = g (A (Y, Z), X), A (X, Y) = A (Y, X) .

(III)—through

T^{\nabla} = 0

and

B \in T_{3}^{0} (M)

such that

(\nabla_{X} g) (Y, Z) = B (X, Y, Z), B (X, Y, Z) = B (Y, X, Z) .

(IV)—through the dual connection

\nabla^{*}

in (2), such that

T^{\nabla} = T^{\nabla^{*}} = 0

.

(V)—through the dual connection

\nabla^{*}

in (2), such that both A,

A^{'}

are symmetric.

The set of all the invariant statistical structures on a Lie group G will be characterized in Section 6. The Lie algebra

L (G)

will allow us to “count” easier “how many” statistical structures exist on G.

Example 2.

(classical statistical manifolds, i.e., for dual connections without torsion) Consider a fixed n-dimensional semi-Riemannian manifold (

M, g

). We have the canonical (and trivial) structure of statistical manifold (

M, g, \nabla^{0}

), with

{(\nabla^{0})}^{*} = \nabla^{0}

. The set of all the statistical structures on (

M, g

) is parameterized by

C_{s c} (M, g)

. This is a large set (see Section 6) and the (1,2)-type deformation tensor fields measure “how far” a statistical structure is from the canonical one. In what follows, we construct particular new statistical structures on (

M, g

), in a down-to-up hierarchical way.

(i) Let fix

ξ \in X (M)

and denote

ξ^{b} : = g (ξ,^{.})

its dual 1-form w.r.t. g. Define

A_{ξ} (X, Y) = ξ^{b} (X) Y + ξ^{b} (Y) X + g (X, Y) ξ

and

\nabla : = \nabla^{0} + A_{ξ}

. Then,

\nabla \in C_{s c} (M, g)

and

\nabla^{*} = \nabla^{0} - A_{ξ}

. Thus, on each semi-Riemannian manifold, there always exists an infinite family of (distinct) dual connections, each of them corresponding to a different statistical structure associated to (

M, g

).

(ii) Suppose M is parallelizable and consider a fixed basis

{E_{1}, \dots, E_{n}}

in

X (M)

. We denote

\overset{i}{\nabla} : = \nabla^{0} + A_{E_{i}}

, as defined in (i). We have n “independent” statistical structures (

M, g, \overset{i}{\nabla}

) with

i = \bar{1, n}

. Moreover, each affine combination of these connections provides a new statistical structure, a “mean” with specified weights, which may control the global measuring in a specific way (w.r.t. the fixed basis, of course).

(iii) If M is not parallelizable, we may consider (if any) a linearly independent set

{E_{1}, \dots, E_{k}}

in

X (M)

,

k < n

, and make a similar construction as in (ii).

(iv) Let fix

F \in T_{1}^{1} (M)

and denote

F^{'}

its adjoint w.r.t. g, i.e.,

g (F X, Y) = g (X, F^{'} Y)

. Fix

α

,

β \in T_{1}^{0} (M)

and denote

α^{#}

,

β^{#}

their dual vector fields w.r.t. g, i.e.,

g (α^{#}, X) = α (X)

and

g (β^{#}, X) = β (X)

. Fix two symmetric

ω

,

η \in T_{2}^{0} (M)

,

ξ \in X (M)

, define

A (X, Y) = α (X) Y + α (Y) X + β (X) F Y + β (Y) F X + ω (X, Y) ξ + η (X, Y) F ξ,

(7)

and

\nabla : = \nabla^{0} + A

. Then ∇ is a symmetric connection and

\nabla^{*} = \nabla^{0} - A^{'}

, where

A^{'} (X, Y) = α (X) Y + g (X, Y) α^{#} + β (X) F^{'} Y + g (F X, Y) β^{#} +

+ g (ξ, Y) \tilde{ω} (X) + g (ξ, F^{'} Y) \tilde{η} (X),

and

\tilde{ω}

,

\tilde{η} \in T_{1}^{1} (M)

are given by

g (\tilde{ω} (X), Y) = ω (X, Y)

and

g (\tilde{η} (X), Y) = η (X, Y)

.

We have

\nabla \in C_{s c} (M, g)

if and only if

α (Z) Y + β (Z) F^{'} Y + g (F Z, Y) β^{#} + ξ^{b} (Y) \tilde{ω} (Z) + ξ^{b} (F^{'} Y) \tilde{η} (Z) =

= α (Y) Z + β (Y) F^{'} Z + g (F Y, Z) β^{#} + ξ^{b} (Z) \tilde{ω} (Y) + ξ^{b} (F^{'} Z) \tilde{η} (Y) .

In particular, it follows that

(n - 1) α (Y) = 2 β (F^{'} Y) - β (F Y) - β (Y) t r a c e F^{'} +

+ ξ^{b} (Y) t r a c e \tilde{ω} + ξ^{b} (F Y) t r a c e \tilde{η} - ξ^{b} (\tilde{ω} (Y)) - ξ^{b} (F \tilde{η} (Y))

and

(n - 1) α^{#} = 2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} + t r a c e \tilde{ω} ξ + t r a c e \tilde{η} F^{'} ξ - {\tilde{ω}}^{'} (ξ) - {\tilde{η}}^{'} (F^{'} ξ) .

(8)

Relation (8) may be viewed as an equation in the unknowns

α, β, ξ, F, ω, η

, which always has solutions, as

α

depends explicitly on the variables from the right side.

We distinguish the following special cases:

(iv)

_{1}

Suppose

F = 0

. Then,

A (X, Y) = α (X) Y + α (Y) X + ω (X, Y) ξ

and (8) writes

(n - 1) α^{#} = t r a c e (\tilde{ω}) ξ - {\tilde{ω}}^{'} (ξ)

. In particular, for

ξ = α^{#}

and

ω = g

we get the examples from the family (iii).

(iv)

_{2}

Suppose

F = I d

. Then,

A (X, Y) = (α (X) + β (X)) Y + (α (Y) + β (Y)) X + (ω (X, Y) + η (X, Y)) ξ

and (8) writes

(n - 1) α^{#} = (1 - n) β^{#} + t r a c e (\tilde{ω}) ξ - {\tilde{ω}}^{'} (ξ) + t r a c e (\tilde{η}) ξ - {\tilde{η}}^{'} (ξ) .

(iv)

_{3}

If

ω = η = 0

, then

A (X, Y) = α (X) Y + α (Y) X + β (X) F Y + β (Y) F X .

We get

(n - 1) α^{#} = 2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} .

(iv)

_{4}

If

ξ = α^{#}, ω = g

, then

A (X, Y) = α (X) Y + α (Y) X + β (X) F Y + β (Y) F X + g (X, Y) α^{#} + η (X, Y) F α^{#} .

We get

2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} + t r a c e \tilde{η} F^{'} α^{#} - {\tilde{η}}^{'} (F^{'} α^{#}) = 0 .

(a) If, moreover,

η = - g

, then

2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} - (n - 1) F^{'} α^{#} = 0 .

In particular, if

α = β

, then

2 F α^{#} - t r a c e F^{'} α^{#} - n F^{'} α^{#} = 0 .

(b) If, moreover,

η = g

, then

2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} + (n - 1) F^{'} α^{#} = 0 .

In particular, if

α = β

, then

2 F α^{#} - t r a c e F^{'} α^{#} + (n - 2) F^{'} α^{#} = 0 .

(iv)

_{5}

If

ξ = α^{#}, ω = - g

, then

A (X, Y) = α (X) Y + α (Y) X + β (X) F Y + β (Y) F X - g (X, Y) α^{#} + η (X, Y) F α^{#} .

We get

2 (n - 1) α^{#} = 2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} + t r a c e \tilde{η} F^{'} α^{#} - {\tilde{η}}^{'} (F^{'} α^{#}) .

(a) If, moreover,

η = g

, then

2 (n - 1) α^{#} = 2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} + (n - 1) F^{'} α^{#} .

In particular, if

α = β

, then

2 (n - 1) α^{#} = 2 F α^{#} - t r a c e F^{'} α^{#} + (n - 2) F^{'} α^{#} .

(b) If, moreover

η = - g

, then

2 (n - 1) α^{#} = 2 F β^{#} - F^{'} β^{#} - t r a c e F^{'} β^{#} - (n - 1) F^{'} α^{#} .

In particular, if

α = β

, then

2 (n - 1) α^{#} = 2 F α^{#} - t r a c e F^{'} α^{#} - n F^{'} α^{#} .

All these examples show that, on every semi-Riemannian manifold, there always exist many families of (distinct) dual connections; the choice of the parameters

α, β, F, ω, η

allows a large flexibility and variability of the possible associated statistical models.

(v) Let

f \in F (M)

and

\nabla \in C_{s} (M)

such that

d f (R^{\nabla} (X, Y) Z) = 0

and

g : = H e s s_{f}^{\nabla}

is non-degenerated. Then (

M, g, \nabla

) is a statistical manifold. Here, we used (1) and

(\nabla_{X} H e s s_{f}^{\nabla}) (Y, Z) - (\nabla_{Y} H e s s_{f}^{\nabla}) (X, Z) = - R^{\nabla} (X, Y) Z (f) .

(We remark that the hypothesis is much weaker than imposing the curvature flatness of ∇.) The dual connection of ∇ is uniquely determined by

(\nabla_{X} d f) (\nabla_{Z}^{*} Y) = Z {(\nabla_{X} d f) Y} - (\nabla_{Y} d f) (\nabla_{Z} X) .

In particular, if f is a divergence function associated to a parameterized family of distributions of probability, then g is the Fisher metric associated to it.

Remark 7.

In [7], p. 180, the authors suggest some open problems of interest. The third one is the following:

“Let $(M, g)$ be a Riemannian space. We say that $(M, g)$ may be flattened if there exists a pair of affine connections ∇ and $\nabla^{*}$ such that $(M, g, \nabla, \nabla^{*})$ is dually flat. Show whether this is always possible. If not, find the invariant which characterizes those spaces which may be flattened.”

From (6) we see that

R^{\nabla} = 0

if and only if

R^{\nabla^{*}} = 0

. Suppose that

T^{\nabla} = T^{\nabla^{*}} = 0

. The existence of a dually flat structure

(M, g, \nabla, \nabla^{*})

is then equivalent, in this case, with the existence of an affine structure on M, which is a longstanding open problem in Differential Geometry. For example, there exist Lie groups which do not admit such a left invariant structure.

If we relax the hypothesis, and accept ∇ and

\nabla^{*}

have torsion, then we obtain the following characterization of the spaces which may be flattened.

Theorem 3.

(i) Let

(M, g, \nabla, \nabla^{*})

be a semi-Riemannian space with a pair of flat dual connections, not necessarily symmetric. Then M is parallelizable.

(ii) Conversely, suppose

(M, g)

is a parallelizable semi-Riemannian space. Then, there exists a pair of flat dual connections

(\nabla, \nabla^{*})

, compatible with g.

Proof.

(i) Because ∇ is flat, it follows that M is a parallelizable manifold. This is a purely affine differential result, which has nothing to do with the semi-Riemannian structure of the space. Furthermore, it does not involve some properties related to the torsion, for example the eventual symmetry of the connection.

(ii) Consider

E_{1}, \dots, E_{n}

a global basis of vector fields on

(M, g)

. Let

\nabla^{-}

and

\nabla^{+}

be the Cartan–Schouten connections on M, defined by

\nabla_{E_{i}}^{-} E_{j} = 0

and

\nabla_{E_{i}}^{+} E_{j} = [E_{i}, E_{j}]

, respectively, for every indices

i, j = \bar{1, n}

. Then,

\nabla_{X}^{-} Y = X (Y^{j}) E_{j}, \nabla_{X}^{+} Y = X (Y^{j}) E_{j} + X^{i} Y^{j} [E_{i}, E_{j}],

where

X = X^{i} E_{i}

,

Y = Y^{j} E_{j}

are arbitrary vector fields on M. One knows that

R^{\nabla^{-}} = R^{\nabla^{+}} = 0

; in general, both connections have non-null torsion. We define

{(\nabla^{-})}^{*}

and

{(\nabla^{+})}^{*}

, using (2). In the chosen frame, their coefficients are given, respectively, by

{(Γ^{-})}^{*}_{i k}^{s} = E_{i} (g_{j k}) g^{j s}, {(Γ^{+})}^{*}_{i k}^{s} = E_{i} (g_{j k}) g^{j s} - {[E_{i}, E_{j}]}^{l} g_{l k} g^{j s} .

Then,

(M, g, \nabla^{-}, {(\nabla^{-})}^{*})

and

(M, g, \nabla^{+}, {(\nabla^{+})}^{*})

are dually flat. It is possible that, in some cases, these two structures coincide. □

Remark 8.

The proof of the second part of the previous theorem suggests the following question: on which parallelizable semi-Riemannian manifold does there exist a dually flat structure which, moreover, has both connections with parallel torsion? The Lie groups endowed with left-invariant semi-Riemannian metrics are the first candidates, as then

\nabla^{-}

has

\nabla^{-}

-parallel torsion (see also Section 6).

5. Beyond the Beaten Path: Exotic Statistical-Like Manifolds

The classical statistical manifolds

(M, g, \nabla, \nabla^{*})

with symmetric dual connections were generalized to statistical manifolds with torsion, in the works of Kurose and Matsuzoe [9,16] and denoted under the acronym SMAT. They satisfy

(\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z) = - g (T (Z, Y), X) .

In the following, we define nine new similar families of generalized statistical manifolds (with torsion), denoted SMAT

_{i}

, for

i = \bar{1, 9}

.

Definition 1.

Let

(M, g)

be a semi-Riemannian manifold and

(\nabla, \nabla^{*})

dual connections. Let ξ be an arbitrary fixed vector field and

f_{1}

,

f_{2}

fixed functions on M. The structure

(M, g, \nabla, \nabla^{*})

is called statistical manifold with torsion of type i, for

i = \bar{1, 9}

, if

({SMAT}_{1}) (\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z) = g (R (Z, Y) X, ξ),

({SMAT}_{2}) (\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z) = {R i c (Z, Y) - R i c (Y, Z)} ξ^{b} (X),

({SMAT}_{3}) (\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z) = g (A (Z, Y) - A (Y, Z), X),

({SMAT}_{4}) (\nabla_{Z} g) (X, Y) - (\nabla_{Y} g) (X, Z) = f_{1} g (T (Z, Y), X) + f_{2} g (R (Z, Y) X, ξ),

({SMAT}_{5}) (d i v^{\nabla} g) (Y) = t r a c e \nabla_{Y} g,

({SMAT}_{6}) (d i v^{\nabla} g) (Y) = t r a c e \nabla_{Y} g - F a r (Y, ξ),

({SMAT}_{7}) (d i v^{\nabla} g) (Y) = t r a c e \nabla_{Y} g - R i c^{*} (Y, ξ),

({SMAT}_{8}) (d i v^{\nabla} g) (Y) = t r a c e \nabla_{Y} g + R i c (ξ, Y) - R i c (Y, ξ),

({SMAT}_{9}) (d i v^{\nabla} g) (Y) = t r a c e \nabla_{Y} g + f_{1} t (Y) - f_{2} R i c^{*} (Y, ξ) .

Remark 9.

(i) A necessary and sufficient condition for (SMAT

_{1}

) is

g (X, T^{*} (Z, Y) - T (Z, Y)) = - g (X, R^{*} (Z, Y) ξ) .

We rewrite it as

g (A (Z, X), Y) + g (X, A (Z, Y)) - g (A (Y, X), Z) -

- g (X, A (Y, Z)) = - g (R^{0} (Z, Y) X, ξ) - g ((\nabla_{Z}^{0} A) (Y, X), ξ) +

+ g ((\nabla_{Y}^{0} A) (Z, X), ξ) - g (A (Z, A (Y, X)), ξ) + g (A (Y, A (Z, X)) ξ) .

These manifolds generalize the statistical manifolds with symmetric and flat dual connections. A non-trivial example is for flat dual connections with the same non-null torsion tensor field.

(ii) A necessary and sufficient condition for (SMAT

_{2}

) is

g (X, T^{*} (Z, Y) - T (Z, Y)) = g (X, ξ) (R i c (Z, Y) - R i c (Y, Z)) .

We rewrite it as

- g (A (Z, X), Y) - g (X, A (Z, Y)) + g (A (Y, X), Z) + g (X, A (Y, Z)) =

= \sum_{i} (g (E_{i}, (\nabla_{E_{i}}^{0} A) (Z, Y) - (\nabla_{E_{i}}^{0} A) (Y, Z)) - g (E_{i}, (\nabla_{Z}^{0} A) (E_{i}, Y) - (\nabla_{Y}^{0} A) (E_{i}, Z) +

+ g (E_{i}, A (E_{i}, A (Z, Y) - A (Y, Z))) - g (E_{i}, A (Z, A (E_{i}, Y)) - A (Y, A (E_{i}, Z)))) ξ^{b} (X) .

(iii) A necessary and sufficient condition for (SMAT

_{3}

) is

g (X, T^{*} (Z, Y) - T (Z, Y)) = g (X, A (Z, Y) - A (Y, Z)) .

We rewrite it as

- g (A (Z, X), Y) + g (A (Y, X), Z) - 2 g (X, A (Z, Y)) + 2 g (X, A (Y, Z)) = 0 .

(iv) A necessary and sufficient condition for (SMAT

_{4}

) is

g (X, T^{*} (Z, Y) - T (Z, Y)) = f_{1} g (X, T (Z, Y)) + f_{2} g (X, - R^{*} (Z, Y) ξ) .

We rewrite it as

- g (A (Z, X), Y) - g (X, A (Z, Y)) + g (A (Y, X), Z) + g (X, A (Y, Z)) =

= f_{1} [g (A (Z, Y), X) - g (A (Y, Z), X)] + f_{2} [g (R^{0} (Z, Y) X, ξ) + g ((\nabla_{Z}^{0} A) (Y, X), ξ) -

- g ((\nabla_{Y}^{0} A) (Z, X), ξ) + g (A (Z, A (Y, X)), ξ) - g (A (Y, A (Z, X)), ξ)] .

(v) A necessary and sufficient condition for (SMAT

_{5}

) is

t = t^{*}

. We rewrite it as

\sum_{i} [g (A (E_{i}, E_{i}), Y) + g (E_{i}, A (E_{i}, Y) - 2 A (Y, E_{i}))] = 0 .

Moreover,

(v)

_{1}

If

\nabla \in C_{c} (M, g)

, then ∇ is SMAT

_{5}

.

(v)

_{2}

Let

\nabla = \nabla^{0} + A, A (X, Y) = α (X) Y - α (Y) X,

where α is 1-form. If ∇ is SMAT

_{5}

, then

A = 0 .

(vi) A necessary and sufficient condition for (SMAT

_{6}

) is

t^{*} (Y) = t (Y) - \sum_{i} g (E_{i}, R (Y, ξ) E_{i}) .

We rewrite

\sum_{i} [- g (A (E_{i}, E_{i}), Y) - g (E_{i}, A (E_{i}, Y))] = \sum_{i} [- 2 g (A (Y, E_{i}), E_{i})) -

- g (E_{i}, R^{0} (Y, ξ) E_{i}) - g (E_{i}, (\nabla_{Y}^{0} A) (ξ, E_{i})) + g (E_{i}, (\nabla_{ξ}^{0} A) (Y, E_{i})) -

- g (E_{i}, A (Y, A (ξ, E_{i}))) + g (E_{i}, A (ξ, A (Y, E_{i})))] .

(vii) A necessary and sufficient condition for (SMAT

_{7}

) is

t^{*} (Y) = t (Y) - \sum_{i} g (E_{i}, R^{*} (E_{i}, Y) ξ) .

We write it as

\sum_{i} [- g (A (E_{i}, E_{i}), Y) - g (E_{i}, A (E_{i}, Y))] =

= \sum_{i} [- 2 g (A (Y, E_{i}), E_{i})) + g (R^{0} (E_{i}, Y) E_{i} +

+ (\nabla_{E_{i}}^{0} A) (Y, E_{i}) - (\nabla_{Y}^{0} A) (E_{i}, E_{i}) + A (E_{i}, A (Y, E_{i})) - A (Y, A (E_{i}, E_{i})), ξ)] .

(viii) A necessary and sufficient condition for (SMAT

_{8}

) is

t^{*} (Y) = t (Y) + R i c (ξ, Y) - R i c (Y, ξ),

i.e.,

t^{*} (Y) = t (Y) + \sum_{i} g (E_{i}, R (E_{i}, ξ) Y - R (E_{i}, Y) ξ) .

We rewrite it as

\sum_{i} [- g (A (E_{i}, E_{i}), Y) - g (E_{i}, A (E_{i}, Y))] = \sum_{i} [- 2 g (A (Y, E_{i}), E_{i})) +

+ R i c^{0} (ξ, Y) + g (E_{i}, (\nabla_{E_{i}}^{0} A) (ξ, Y)) - g (E_{i}, (\nabla_{ξ}^{0} A) (E_{i}, Y)) +

+ g (A (E_{i}, A (ξ, Y)), E_{i}) - g (A (ξ, A (E_{i}, Y)), E_{i}) -

- R i c^{0} (Y, ξ) - g (E_{i}, (\nabla_{E_{i}}^{0} A) (Y, ξ)) + g (E_{i}, (\nabla_{Y}^{0} A) (E_{i}, ξ)) -

- g (A (E_{i}, A (Y, ξ)), E_{i}) + g (A (Y, A (E_{i}, ξ)), E_{i})] .

(ix) A necessary and sufficient condition for (SMAT

_{9}

) is

t^{*} (Y) = (1 + f_{1}) t (Y) - f_{2} \sum_{i} g (E_{i}, R^{*} (E_{i}, Y) ξ) .

We rewrite it as

\sum_{i} [- g (A (E_{i}, E_{i}), Y) - g (E_{i}, A (E_{i}, Y))] =

= \sum_{i} [- 2 (1 + f_{1}) g (A (Y, E_{i}), E_{i}) + g (R^{0} (E_{i}, Y) E_{i} +

+ (\nabla_{E_{i}}^{0} A) (Y, E_{i}) - (\nabla_{Y}^{0} A) (E_{i}, E_{i}) + A (E_{i}, A (Y, E_{i})) - A (Y, A (E_{i}, E_{i})), ξ)] .

Remark 10.

We have the following inclusions: (i) SMAT

_{3}

⊂ SMAT

_{4} (f_{1} = 1, f_{2} = 0)

; (ii) SMAT

_{1} \subset

SMAT

_{4} (f_{1} = 0, f_{2} = 1)

; (iii) SMAT

_{7} \subset

SMAT

_{9} (f_{1} = 0, f_{2} = 1)

; (iv)

C_{c} (M, g) \subset

SMAT

_{5}

; (v) SMAT

_{i} \cap C_{c} (M, g) \neq \emptyset

.

Examples of some SMAT

_{i}

’s will be given in Section 7.

6. Invariant Statistical Structures on Lie Groups

Let G be a n-dimensional Lie group and

L (G)

its Lie algebra. A left invariant statistical structure on G is defined by a left invariant semi-Riemannian metric g and a left invariant connection ∇ satisfying (1). A similar definition works for right invariant statistical structures. A statistical structure is called bi-invariant if it is simultaneously left and right invariant. Linearity of the tensorial relations allows simpler expressions of the characteristic properties, as acting on invariant vector fields. For example, for a left invariant statistical structure, relation (1) is equivalent to

g (\nabla_{Z} X, Y) + g (X, \nabla_{Z} Y) = g (\nabla_{Y} X, Z) + g (X, \nabla_{Y} Z),

(9)

for all

X, Y, Z \in L (G)

and (2) is equivalent to

g (\nabla_{Z}^{*} X, Y) + g (X, \nabla_{Z} Y) = 0,

(10)

for all

X, Y, Z \in L (G)

.

The simplest (and trivial) example of bi-invariant statistical structure is given by a bi-invariant semi-Riemannian metric together with its Levi-Civita connection

\nabla_{X}^{0} Y = \frac{1}{2} [X, Y]

, for all

X, Y \in L (G)

. The real “line”

{λ \nabla^{0} ∣ λ \in R}

contains only bi-invariant connections, so the dimension of the space of bi-invariant connections is at least 1.

On a n-dimensional abelian Lie group G, any left invariant geometrical object is also bi-invariant. As the set of symmetric left invariant connections may be identified with the set of symmetric type (1,2) tensors on

L (G)

, it follows that, in this case, there exist plenty of bi-invariant statistical structures on G, different from the (previous) trivial ones.

The situation changes drastically as soon as we quit the abelian realm.

Proposition 3.

Let g be a bi-invariant semi-Riemannian metric on a compact simple Lie group G. Any bi-invariant statistical structure (

G, g, \nabla

) is trivial, with the exception of

S U (n)

, for

n \geq 3

, which admits an infinite family corresponding to

\nabla_{X}^{α} Y = \frac{1}{2} [X, Y] + α {X Y + Y X - \frac{2}{n} t r (X Y) I} i,

(11)

for any real number α. (By i we denote the imaginary constant.)

Proof.

The Levi–Civita connection of g is

\nabla_{X}^{0} Y = \frac{1}{2} [X, Y] = \frac{1}{2} (X Y - Y X) .

Consider a symmetric bi-invariant connection ∇ on G such that the triple (

G, g, \nabla

) be a bi-invariant statistical structure. Then,

\nabla_{X} Y = \frac{1}{2} [X, Y] + A (X, Y)

, for all

X, Y \in L (G)

, where A is a symmetric bi-invariant type (1,2) tensor on

L (G)

.

In [17], it was proven that all the bi-invariant connections on G are trivial, except

S U (n)

(for

n \geq 3

), where there exists a family of connections, depending on two real parameters

ν

and

μ

, of the form

\nabla_{X}^{μ, ν} Y = μ [X, Y] + ν {X Y + Y X - \frac{2}{n} t r (X Y) I} i .

It follows that ∇ must satisfy (11) for any real number

α

. □

Corollary 1.

Let g be a bi-invariant semi-Riemannian metric on a compact Lie group G. Suppose

L (G) = C (G) \oplus L_{1} \oplus \dots \oplus L_{q}

, where the center

C (G)

has dimension p and the

L_{i}

’s are the simple ideals in

L (G)

. Let r be the number of

su (n)

’s (

n \geq 3

) in

L (G)

. Then, the dimension of the space of bi-invariant statistical structures (

G, g, \nabla

) is given by

\frac{1}{2} p^{2} (p + 1) + q (p + 1) + r .

Proof.

The dimension of the space of all the bi-invariant connections on G was determined in [17] to be

p^{3} + 3 p q + q + r .

For statistical structures one must restrain to symmetric bi-invariant connections only, which leads to the required number.

In particular, when G is simple, we have

p = 0

,

q = r = 1

and the dimension of the space of bi-invariant symmetric connections is 1 (as stated in Proposition 3). □

Corollary 2.

Let g be a bi-invariant semi-Riemannian metric on

U (n)

(

n \geq 3

). Then, any bi-invariant statistical structure (

U (n), g, \nabla

) is given by

\nabla_{X} Y = \frac{1}{2} [X, Y] + α (X Y + Y X) i + β {t r (X) Y + t r (Y) X} i +

+ γ t r (X Y) i I + ϵ t r (X) t r (Y) i I,

for arbitrary real numbers

α, β, γ, ϵ

.

Proof.

For

U (n)

, we have

p = 1

,

q = r = 1

and the dimension of the space of bi-invariant symmetric connections is 4 (as follows from Corollary 1).

A basis for the bi-invariant connections on

U (n)

is given [17] by

η_{1} (X, Y) = X Y - Y X, η_{2} (X, Y) = (X Y + Y X) i

η_{3} (X, Y) = i t r (X) Y, η_{4} (X, Y) = i t r (Y) X

η_{5} (X, Y) = i t r (X Y) I, η_{6} (X, Y) = i t r (X) t r (Y) I,

where I is the identity

n \times n

matrix. As statistical structures involve only symmetric connections, we see that from the “affine connections frame”

{η_{1}; η_{2}, η_{3} + η_{4}, η_{5}, η_{6}}

we get that

\nabla - η_{1}

may be uniquely expressed as a combination of

η_{2}, η_{3} + η_{4}, η_{5}, η_{6}

. (Remember that all the geometric objects here act on the Lie algebra.) We found the required general form of a symmetric bi-invariant connection ∇. □

Remark 11.(i) The group

U (1)

is isomorphic with

S^{1}

, so it admits a unique bi-invariant statistical structure (and that is the trivial one).

(ii) On

U (2)

, the space of bi-invariant statistical structures (

U (2), g, \nabla

) has only three dimensions, as we have [17] the following relation of linear dependence on

L (G L (2, C))

η_{2} = η_{3} + η_{4} + η_{5} - η_{6}

and thus

\nabla_{X} Y = \frac{1}{2} [X, Y] + β {t r (X) Y + t r (Y) X} i + γ t r (X Y) i I + ϵ t r (X) t r (Y) i I,

for arbitrary real numbers

β, γ, ϵ

.

(iii) All the symmetric bi-invariant connections on non-abelian compact Lie groups are non-flat, due to a result of Milnor [18].

Proposition 4.

Let G be a n-dimensional Lie group, g a left invariant semi-Riemannian metric on G. Then, the space of left invariant statistical structures (

G, g, \nabla

) has the dimension

\frac{1}{6} n (n + 1) (n + 2)

.

Proof.

Any left invariant connection ∇ may be written

\nabla_{X} Y = \frac{1}{2} [X, Y] + A (X, Y),

where A is a left invariant type (1,2) tensor on

L (G)

. As ∇ must be symmetric and subject to (7), it follows that

A (X, Y) = A (Y, X), g (A (X, Y), Z) = g (A (X, Z), Y),

for all

X, Y, Z \in L (G)

. A simple combinatorics counts the number of independent tensors A to be

n + 2 C_{n}^{2} + C_{n}^{3} = \frac{1}{6} n (n + 1) (n + 2),

which finishes the proof. □

Remark 12.

(i) Let G be a n-dimensional Lie group and

m : = \frac{1}{6} n (n + 1) (n + 2)

. As a consequence of Proposition 4, we deduce that the set of all the (semi-Riemannian!) left invariant statistical structures (

G, g, \nabla

) can be parameterized by the direct product of

R^{m}

with an open subset of

R^{\frac{1}{2} n (n + 1)}

(corresponding to the symmetric

n \times n

non-singular matrices).

(ii) The left invariant connection involved in the previously considered left invariant statistical structures is not supposed to be flat. In this context, flatness would be a very strong restriction, which might forbid the existence of such structures. Moreover, up to now, the existence of flat symmetric left invariant connections on Lie groups is an open problem.

7. Examples

In this section, we shall use the framework and notations adapted from [7] (Chapters 2 and 3) and [15], where more details may be found.

Consider

n, m

positive integers and M a connected m-dimensional differentiable manifold. The set

R^{n} \times M

is a parametric model for the domain of a family of probability distributions

p : R^{n} \times M \to R

,

p = p (x, ξ)

,

p (x, ξ) > 0

,

\int p (x, ξ) d x = 1

. All integrals have the domain

R^{n}

. For an arbitrary function

f : R^{n} \times M \to R

, we denote

E_{ξ} [f] : = \int f (x, ξ) p (x, ξ) d x,

with

ξ \to E_{ξ} [f]

a function from M to

R

. We write

ξ = (ξ^{1}, \dots ξ^{m})

when local coordinates on M are involved. In many applications, f depends on p and the values of the operator E measure some kind of entropy, thus the notation.

Let

l = l (x, ξ) : R^{n} \times M \to R

,

l (x, ξ) : = l n p (x, ξ)

be the log-likehood function,

S = S_{ξ} : M \to R

the Gibbs entropy function, given by

E_{ξ} [- l]

, i.e.,

S_{ξ} : = - \int l (x, ξ) p (x, ξ) d x .

and

G (ξ) : = {(g_{i j} (ξ))}_{i, j = \bar{1, m}}

the

m \times m

-matrix of the Fischer Riemannian metric, defined by

g_{i j} (ξ) : = \int \partial_{i} l (x, ξ) \partial_{j} l (x, ξ) p (x, ξ) d x,

where we denoted

\partial_{i} l : = \frac{\partial l}{\partial_{ξ^{i}}}

. We have [7]

g_{i j} = E [\partial_{i} l \partial_{j} l] = - E [\partial_{i j} l] = \int \frac{1}{p (x, ξ)} \partial_{i} p (x, ξ) \partial_{j} p (x, ξ) d x .

Let

α

be a fixed real number. Then, the connection

\nabla^{(α)}

from Example 1 has the following coefficients, calculated in a point

ξ \in M

:

{(Γ_{i j, k}^{(α)})}_{ξ} = E_{ξ} [(\partial_{i} \partial_{j} l + \frac{1 - α}{2} \partial_{i} l \partial_{j} l) \partial_{k} l] .

Here, the coefficients of

\nabla^{(α)}

, with three down indices, are defined by

Γ_{i j, k}^{(α)} : = g (\nabla_{\partial_{i}}^{(α)} \partial_{j}, \partial_{k}) .

The coefficients of the Levi–Civita connection of the metric g (also known as the Christoffel coefficients of the first kind) are

{(Γ_{i j, k}^{(0)})}_{ξ} = E_{ξ} [(\partial_{i} \partial_{j} l + \frac{1}{2} \partial_{i} l \partial_{j} l) \partial_{k} l] .

Whenever it is possible, we shall avoid writing the point

ξ

in formulas. For example,

Γ_{i j, k}^{(α)} = Γ_{i j, k}^{(0)} - \frac{α}{2} E [\partial_{i} l \partial_{j} l \partial_{k} l] .

Example 3.

Let ∇ be an arbitrary connection on M, given by

\nabla = \nabla^{0} + A

, with

A \in T_{2}^{1} (M)

. Denote

A_{i j, k} : = g (A (\partial_{i}, \partial_{j}), \partial_{k})

and

Γ_{i j, k} : = Γ_{i j, k}^{(0)} + A_{i j, k}

the coefficients (with three down indices) of a connection ∇.

We shall choose A in order to provide examples for SMAT

_{i}

’s.

(i) If

A_{i j, k} = E [- \frac{1}{2} \partial_{i} l \partial_{j} l \partial_{k} l]

, then

\nabla \in C_{s c} (M, g)

.

(ii) Let

f = f (x, ξ) : R^{n} \times M \to R

be another function providing an entropy function

F = F_{ξ} : M \to R

, given by

E_{ξ} [f]

, i.e.

F_{ξ} : = \int f (x, ξ) p (x, ξ) d x .

Many such choices are possible, as many entropy functions were suggested in the last decades (the Tsallis entropy, the von Neumann entropy, the Renyi entropy, etc.) and their various generalizations.

We shall combine the partial derivatives of l and f in

E []

, in order to get more examples. We consider a generic

A_{i j, k} = E [\partial_{i} \partial_{j} (a_{1} f + b_{1} l) \partial_{i} (a_{2} f + b_{2} l) + \partial_{i} \partial_{k} (a_{3} f + b_{3} l) \partial_{j} (a_{4} f + b_{4} l) +

+ \partial_{j} \partial_{k} (a_{5} f + b_{5} l) \partial_{i} (a_{6} f + b_{6} l) + c_{1} \partial_{i} f \partial_{j} f \partial_{k} f + c_{2} \partial_{i} l \partial_{j} l \partial_{k} l +

+ c_{3} \partial_{i} f \partial_{j} f \partial_{k} l + c_{4} \partial_{i} f \partial_{j} l \partial_{k} l + d_{1} \partial_{i} \partial_{j} \partial_{k} f + d_{2} \partial_{i} \partial_{j} \partial_{k} l],

where

a_{1}, \dots, a_{6}, b_{1}, \dots, b_{6}, c_{1}, \dots, c_{4}, d_{1}, d_{2}

are constants to be determined.

(1) If

\begin{matrix} a_{1} a_{2} + 2 a_{3} a_{4} - 3 a_{5} a_{6} = 0, b_{1} b_{2} + 2 b_{3} b_{4} - 3 b_{5} b_{6} = 0, \\ a_{1} b_{2} + 2 a_{3} b_{4} - 3 a_{5} b_{6} = 0, b_{1} a_{2} + 2 b_{3} a_{4} - 3 b_{5} a_{6} = 0, \end{matrix}

where

c_{i} = 0

,

i = \bar{1, 4}

,

d_{j} = 0

,

j = \bar{1, 2}

, then

(M, g, \nabla)

is SMAT

_{3}

.

(2) If

\begin{matrix} a_{1} a_{2} - a_{3} a_{4} + 2 a_{5} a_{6} = 0, 3 a_{3} a_{4} - a_{5} a_{6} = 0, a_{1} a_{2} + a_{5} a_{6} = 0, \\ b_{1} b_{2} - b_{3} b_{4} + 2 b_{5} b_{6} = 0, 3 b_{3} b_{4} - b_{5} b_{6} = 0, b_{1} b_{2} + b_{5} b_{6} = 0, \\ a_{1} b_{2} - a_{3} b_{4} + 2 a_{5} b_{6} = 0, 3 a_{3} b_{4} - a_{5} a_{6} = 0, a_{1} b_{2} + a_{5} b_{6} = 0, d_{i} = 0, 1 = \bar{1, 2}, \\ b_{1} a_{2} - b_{3} a_{4} + 2 b_{5} a_{6} = 0, 3 b_{3} a_{4} - b_{5} a_{6} = 0, b_{1} a_{2} + b_{5} a_{6} = 0, c_{i} = 0, i = \bar{1, 4} . \end{matrix}

then

(M, g, \nabla)

is SMAT

_{5}

.

For other families of SMAT

_{i}

’s, the calculations are similar but more tedious. We shall follow now another path, under some more restrictive assumptions.

Example 4.

With the previous notations, let

(M, g)

have

g_{i j} = δ_{i j}

, so

{(Γ^{0})}_{i j}^{k} = 0

,

Γ_{i j}^{k} = Γ_{i j, k} = A_{i j, k} .

This may occur, for example, for exponential families of distributions probabilities, i.e., [5,7]

p (x, ξ) = e x p {C (x) + ξ^{i} F_{i} (x) - ψ (x)},

with

ψ (x) = {(x^{1})}^{2} + \dots + {(x^{m})}^{2}

. Then, we have the following characterizations:

{SMAT}_{1} - Γ_{k i}^{j} - Γ_{k j}^{i} + Γ_{j i}^{k} + Γ_{j k}^{i} = (Γ_{j i}^{r} Γ_{k r}^{s} - Γ_{k i}^{r} Γ_{j r}^{s}) ξ_{s} .

Let us take

n = 2

and

ξ = \partial_{1} .

One has

- 2 Γ_{21}^{1} + Γ_{11}^{2} + Γ_{12}^{1} = Γ_{11}^{r} Γ_{2 r}^{1} - Γ_{21}^{r} Γ_{1 r}^{1}, - Γ_{11}^{2} + Γ_{21}^{1} = - Γ_{11}^{r} Γ_{2 r}^{1} + Γ_{21}^{r} Γ_{1 r}^{1},

- Γ_{12}^{2} + Γ_{22}^{1} = Γ_{22}^{r} Γ_{1 r}^{1} - Γ_{12}^{r} Γ_{2 r}^{1}, - Γ_{22}^{1} - Γ_{21}^{2} + Γ_{12}^{2} + Γ_{12}^{1} = Γ_{12}^{r} Γ_{2 r}^{1} - Γ_{22}^{r} Γ_{1 r}^{1} .

We obtain

Γ_{12}^{1} = Γ_{21}^{1} = Γ_{21}^{2} = a .

Then

T_{12}^{1} = 0 .

Let us take

T_{12}^{2} = 1 .

Therefore,

Γ_{12}^{2} = 1 + a .

If one considers

Γ_{11}^{1} = 0, Γ_{11}^{2} = a - a^{2}, Γ_{22}^{2} = \frac{a^{2} - a - 1}{a}, a \neq 0, Γ_{22}^{1} = 0,

then we have one example of connection with torsion for SMAT

_{1}

and SMAT

_{4}

.

{SMAT}_{2} - Γ_{k i}^{j} - Γ_{k j}^{i} + Γ_{j i}^{k} + Γ_{j k}^{i} = \sum_{l} E_{l}^{r} E_{l}^{s} ξ_{i} [(Γ_{k j}^{p} - Γ_{j k}^{p}) Γ_{s p}^{r} - Γ_{s j}^{p} Γ_{k p}^{r} + Γ_{s k}^{p} Γ_{j p}^{r}] .

Let

n = 2

and

ξ = \partial_{1} .

If

- Γ_{k i}^{j} - Γ_{k j}^{i} + Γ_{j i}^{k} + Γ_{j k}^{i} = 0,

we may consider

(Γ_{12}^{p} - Γ_{21}^{p}) Γ_{s p}^{2} - Γ_{s 1}^{p} - Γ_{2 p}^{r} + Γ_{s 2}^{p} Γ_{1 p}^{r} = 0, - Γ_{s 2}^{p} Γ_{2 p}^{r} + Γ_{s 2}^{p} Γ_{2 p}^{r} = 0 .

We get

- 2 Γ_{21}^{1} + Γ_{11}^{2} - Γ_{12}^{1} = 0, - Γ_{11}^{2} + Γ_{21}^{1} = 0,

2 Γ_{21}^{2} - Γ_{12}^{2} + Γ_{22}^{1} = 0, - Γ_{22}^{1} - Γ_{21}^{2} + Γ_{12}^{2} + Γ_{12}^{1} = 0,

Γ_{22}^{1} - Γ_{11}^{1} Γ_{21}^{2} - Γ_{11}^{2} Γ_{22}^{2} + Γ_{12}^{1} Γ_{11}^{2} + Γ_{12}^{2} Γ_{21}^{2} = 0,

Γ_{22}^{2} - Γ_{21}^{1} Γ_{21}^{1} - Γ_{21}^{2} Γ_{22}^{2} + Γ_{22}^{1} Γ_{11}^{2} + Γ_{22}^{2} Γ_{12}^{2} = 0 .

{SMAT}_{3} - Γ_{k i}^{j} - 2 Γ_{k j}^{i} + Γ_{j i}^{k} + 2 Γ_{j k}^{i} = 0 .

In dimension 2 we obtain

- 3 Γ_{21}^{1} + Γ_{11}^{2} + Γ_{12}^{1} = 0, - Γ_{11}^{2} + Γ_{21}^{1} = 0,

3 Γ_{21}^{2} - 2 Γ_{12}^{2} + Γ_{22}^{1} = 0, - Γ_{22}^{1} - 2 Γ_{21}^{2} + 2 Γ_{12}^{1} = 0 .

Let us take the torsion

T_{12}^{1} = 1, T_{12}^{2} = 0 .

We can consider

Γ_{21}^{2} = Γ_{12}^{2} = a, Γ_{12}^{1} = 1 + Γ_{21}^{1} = 1 + b, Γ_{11}^{2} = b, Γ_{21}^{1} = \frac{2 b + 1}{3}, Γ_{22}^{1} = - a .

If

a = 2 b + 2

we have one example of affine connection with torsion for SMAT

_{3}

and SMAT

_{4}

.

{SMAT}_{4} - Γ_{k i}^{j} - Γ_{k j}^{i} + Γ_{j i}^{k} + Γ_{j k}^{i} = f_{1} (Γ_{k j}^{i} - Γ_{j k}^{i}) + f_{2} ξ_{p} (Γ_{j i}^{r} Γ_{k r}^{p} - Γ_{k i}^{r} Γ_{j r}^{p}) .

{SMAT}_{5} \sum_{l} E_{l}^{r} E_{l}^{s} (Γ_{r s}^{j} + Γ_{s j}^{r} - 2 Γ_{j s}^{r}) = 0,

where

E_{i} = E_{i}^{j} \partial_{j} .

In dimension 2 one can consider

Γ_{11}^{2} = Γ_{12}^{1} = Γ_{21}^{1} = b, Γ_{21}^{2} = Γ_{12}^{2} = Γ_{22}^{1} = a, Γ_{11}^{1} = 0, Γ_{22}^{2} = 0

a solution for SMAT

_{5}

.

{SMAT}_{6} \sum_{l} E_{l}^{r} E_{l}^{s} [Γ_{r s}^{j} + Γ_{s j}^{r} - 2 Γ_{j s}^{r} + ξ^{p} (- Γ_{p s}^{q} Γ_{j q}^{r} + Γ_{j s}^{q} Γ_{p q}^{r})] = 0 .

In dimension 2, if

ξ = \partial_{1},

we may consider

Γ_{11}^{2} + Γ_{12}^{1} - 2 Γ_{21}^{1} = 0, Γ_{22}^{1} + Γ_{21}^{2} - 2 Γ_{12}^{2} = 0, Γ_{12}^{1} Γ_{21}^{2} + Γ_{22}^{1} Γ_{11}^{1} = 0,

T_{12}^{1} = 0, Γ_{12}^{2} - Γ_{22}^{1} - Γ_{12}^{1} Γ_{21}^{1} + Γ_{22}^{1} Γ_{11}^{1} - Γ_{12}^{2} Γ_{22}^{1} + Γ_{22}^{2} Γ_{12}^{1} = 0,

Γ_{21}^{1} = Γ_{11}^{2}, - Γ_{12}^{2} + Γ_{21}^{2} - Γ_{11}^{1} Γ_{21}^{2} + Γ_{21}^{1} Γ_{11}^{2} - Γ_{11}^{2} Γ_{22}^{2} + Γ_{21}^{2} Γ_{12}^{2} = 0 .

{SMAT}_{7} \sum_{l} E_{l}^{r} E_{l}^{s} [Γ_{r s}^{j} + Γ_{s j}^{r} - 2 Γ_{j s}^{r} + ξ_{p} (Γ_{j s}^{q} Γ_{r q}^{p} - Γ_{r s}^{q} Γ_{j q}^{p})] = 0 .

In dimension 2, if

ξ = \partial_{1},

one can consider

Γ_{11}^{2} + Γ_{12}^{1} - 2 Γ_{21}^{1} + Γ_{21}^{2} Γ_{12}^{1} - Γ_{11}^{2} Γ_{22}^{1} = 0, Γ_{21}^{1} - Γ_{11}^{2} + Γ_{11}^{2} Γ_{22}^{1} - Γ_{21}^{2} Γ_{12}^{1} = 0,

Γ_{21}^{1} + Γ_{21}^{2} - 2 Γ_{12}^{2} + Γ_{12}^{1} Γ_{21}^{1} - Γ_{22}^{1} Γ_{11}^{1} + Γ_{12}^{2} Γ_{22}^{1} - Γ_{22}^{2} Γ_{12}^{1} = 0,

Γ_{12}^{1} - Γ_{22}^{1} - Γ_{12}^{1} Γ_{21}^{1} + Γ_{22}^{1} Γ_{11}^{1} - Γ_{12}^{2} Γ_{22}^{1} + Γ_{22}^{2} Γ_{12}^{1} = 0, T_{12}^{1} = T_{12}^{2} = 0 .

{SMAT}_{8} \sum_{l} E_{l}^{r} E_{l}^{s} [Γ_{r s}^{j} + Γ_{s j}^{r} - 2 Γ_{j s}^{r} + ξ^{p} (Γ_{p j}^{q} Γ_{r q}^{s} - Γ_{r j}^{q} Γ_{p q}^{s} - Γ_{j p}^{q} Γ_{r q}^{s} + Γ_{r p}^{q} Γ_{j q}^{s})] = 0 .

{SMAT}_{9} \sum_{l} E_{l}^{r} E_{l}^{s} [Γ_{r s}^{j} + Γ_{s j}^{r} - 2 Γ_{j s}^{r} - 2 f_{1} Γ_{j s}^{r} + f_{2} ξ_{p} (Γ_{j s}^{q} Γ_{r q}^{p} - Γ_{r s}^{q} Γ_{j q}^{p}) = 0 .

8. Discussions

The paper tries to clarify some notions and results from Differential Geometry, which are motivated by models arising from Statistics, related to statistical manifolds and to dual connections. The main idea is to distinguish, at each level of understanding, which are the appropriate algebraic and/or geometric “controls” for the variability of the models. Thus, we pointed out the deformation algebras

(X (M), A)

and the Riemannian Rinehart bi-algebras

(F (M), X (M), <, >, \cdot)

, as algebraic invariants underlying behind the dual connections and statistical manifolds.

Second, we characterized the differentiable manifolds admitting dually flat statistical structures with torsion (Theorem 4.4) and proved several results which count the number of statistical manifold structures on compact Lie groups (Section 6).

Third, we define new families of dual connections and of statistical manifolds with and without torsion (including the families SMAT

_{i}

,

1 = \bar{1, 9}

), which impose new assumptions on the curvature and torsion tensor fields. In Section 7 we exemplify them, on particular manifolds of probability distributions.

Several research directions open: (i) the purely algebraic study of the Riemannian Rinehart bi-algebras and of the deformations algebras, associated to specific control tensor fields on statistical manifolds; (ii) the relevance of the

\nabla^{(f)}

—connections for statistics, with arbitrary (or specific) functions f, extending the studies when the function is constant; (iii) specific statistical applications for the SMAT

_{i}

’s structures; and (iv) optimization results on the space of the control tensors A.

Author Contributions

The authors contributed equally to the writing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We are grateful to the reviewers for their valuable remarks, which helped us in improving the clarity of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Amari, S. Differential-Geometrical Methods in Statistics; Lecture Notes in Statistics; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 1985. [Google Scholar]
Belarbi, O.A.; Meziane, A. Overview and perspectives on metric-affine gravity. J. Phys. Conf. Ser. 2021, 1766, 012007. [Google Scholar] [CrossRef]
Amari, S. Information Geometry and Its Applications; Springer: Tokyo, Japan, 2016; Volume 194. [Google Scholar]
Ay, N.; Jurgen, J.; Le, H.V.; Schwachhöfer, L. Information Geometry; Ergebnisse der Mathematik und ihrer Grenzgebiete, 64; Springer: Heidelberg, Germany; New York, NY, USA, 2017. [Google Scholar]
Calin, O.; Udriste, C. Geometric Modeling in Probability and Statistics; Springer: Heidelberg, Germany; New York, NY, USA, 2014. [Google Scholar]
Nielsen, F. An Elementary Introduction to Information Geometry. Entropy 2020, 22, 1100. [Google Scholar] [CrossRef] [PubMed]
Amari, S.; Nagaoka, H. Methods of Information Geometry; Transl. of Math. Monographs, Amer. Math. Society: Providence, RI, USA, 2000; Volume 191. [Google Scholar]
Kurose, T. Dual connections and affine geometry. Math. Z. 1990, 203, 115–121. [Google Scholar] [CrossRef]
Kurose, T. Statistical Manifolds Admitting Torsion. Geometry and Something; Fukuoka Univ.: Fukuoka-shi, Japan, 2007. (In Japanese) [Google Scholar]
Matsuzoe, H. Statistical manifolds and affine differential geometry. In Probabilistic Approach to Geometry; Advanced Studies in Pure Mathematics; Math. Soc. Japan: Tokyo, Japan, 2010; Volume 57, pp. 303–321. [Google Scholar]
Hirică, I.E.; Nicolescu, L. Associative deformation algebras on Weyl manifolds. Diff. Geom.-Dyn. Syst. 2006, 8, 112–119. [Google Scholar]
Nicolescu, L. Champs des vecteurs remarcables dans l’algebre de deformation; retrospective et perspective. Balk. J. Geom. Appl. 1996, 1, 47–60. [Google Scholar]
Nicolescu, L.; Pripoae, G.T. Structures algebriques engendres par certaines proprietes geometriques. An. Univ. Buc. 1986, 35, 58–64. [Google Scholar]
Pessers, V.; Van der Veken, J. Riemannian manifolds as Lie-Rinehart algebras. Int. J. Geom. Meth. Mod. Phys. 2016, 13, 1641003. [Google Scholar] [CrossRef]
Matsuzoe, H.; Takeuchi, J.; Amari, S. Equiaffine structures on statistical manifolds and Bayesian statistics. Diff. Geom. Appl. 2006, 24, 567–578. [Google Scholar] [CrossRef]
Matsuzoe, H. Information geometry of Bayesian statistics. AIP Conf. Proc. 2015, 279, 1641. [Google Scholar]
Laquer, H.T. Invariant affine connections on Lie groups. Trans. Am. Math. Soc. 1992, 331, 541–551. [Google Scholar] [CrossRef]
Milnor, J. Curvatures of left invariant metrics on Lie groups. Adv. Math. 1976, 21, 293–329. [Google Scholar] [CrossRef] [Green Version]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hirica, I.-E.; Pripoae, C.-L.; Pripoae, G.-T.; Preda, V. Affine Differential Geometric Control Tools for Statistical Manifolds. Mathematics 2021, 9, 1654. https://0-doi-org.brum.beds.ac.uk/10.3390/math9141654

AMA Style

Hirica I-E, Pripoae C-L, Pripoae G-T, Preda V. Affine Differential Geometric Control Tools for Statistical Manifolds. Mathematics. 2021; 9(14):1654. https://0-doi-org.brum.beds.ac.uk/10.3390/math9141654

Chicago/Turabian Style

Hirica, Iulia-Elena, Cristina-Liliana Pripoae, Gabriel-Teodor Pripoae, and Vasile Preda. 2021. "Affine Differential Geometric Control Tools for Statistical Manifolds" Mathematics 9, no. 14: 1654. https://0-doi-org.brum.beds.ac.uk/10.3390/math9141654

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Affine Differential Geometric Control Tools for Statistical Manifolds

Abstract

1. Introduction

2. Dual Connections and Controls over Some Affine Modules of Connections

3. The Main Geometric Invariants Associated to Dual Connections

4. Existence and Characterizations of Statistical Structures

5. Beyond the Beaten Path: Exotic Statistical-Like Manifolds

6. Invariant Statistical Structures on Lie Groups

7. Examples

8. Discussions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI