Next Article in Journal
The Cross-Sectional Intrinsic Entropy—A Comprehensive Stock Market Volatility Estimator
Next Article in Special Issue
Probabilistic Inference with Polymerizing Biochemical Circuits
Previous Article in Journal
Application of Coarse-Grained (CG) Models to Explore Conformational Pathway of Large-Scale Protein Machines
Previous Article in Special Issue
Minimal Developmental Computation: A Causal Network Approach to Understand Morphogenetic Pattern Formation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Family of Fitness Landscapes Modeled through Gene Regulatory Networks

by
Chia-Hung Yang
1,* and
Samuel V. Scarpino
1,2,3,4,5,6,*
1
Network Science Institute, Northeastern University, Boston, MA 02115, USA
2
Physics Department, Northeastern University, Boston, MA 02115, USA
3
Roux Institute, Northeastern University, Boston, MA 02115, USA
4
Institute for Experiential AI, Northeastern University, Boston, MA 02115, USA
5
Santa Fe Institute, Santa Fe, NM 87501, USA
6
Vermont Complex Systems Center, University of Vermont, Burlington, VT 05405, USA
*
Authors to whom correspondence should be addressed.
Submission received: 2 December 2021 / Revised: 11 April 2022 / Accepted: 26 April 2022 / Published: 29 April 2022
(This article belongs to the Special Issue Foundations of Biological Computation)

Abstract

:
Fitness landscapes are a powerful metaphor for understanding the evolution of biological systems. These landscapes describe how genotypes are connected to each other through mutation and related through fitness. Empirical studies of fitness landscapes have increasingly revealed conserved topographical features across diverse taxa, e.g., the accessibility of genotypes and “ruggedness”. As a result, theoretical studies are needed to investigate how evolution proceeds on fitness landscapes with such conserved features. Here, we develop and study a model of evolution on fitness landscapes using the lens of Gene Regulatory Networks (GRNs), where the regulatory products are computed from multiple genes and collectively treated as phenotypes. With the assumption that regulation is a binary process, we prove the existence of empirically observed, topographical features such as accessibility and connectivity. We further show that these results hold across arbitrary fitness functions and that a trade-off between accessibility and ruggedness need not exist. Then, using graph theory and a coarse-graining approach, we deduce a mesoscopic structure underlying GRN fitness landscapes where the information necessary to predict a population’s evolutionary trajectory is retained with minimal complexity. Using this coarse-graining, we develop a bottom-up algorithm to construct such mesoscopic backbones, which does not require computing the genotype network and is therefore far more efficient than brute-force approaches. Altogether, this work provides mathematical results of high-dimensional fitness landscapes and a path toward connecting theory to empirical studies.

1. Introduction

Since its introduction by Wright [1], the concept of fitness landscapes has grown and matured into a cornerstone of biology [2,3,4]. A fitness landscape consists of a space of genotypes that are mutually accessible through mutations and a fitness value associated with the phenotype each genotype encodes. In this context, fitness describes the evolutionary potential of each genotype, and the set of navigable genotypes on these landscapes is termed the genotype network [5]. Continuing with this metaphor, the evolution of a population can be depicted as a trajectory wandering on the fitness landscape. As a consequence, the topography of a fitness landscape sheds light on various evolutionary processes, including constraints on adaptation [6,7,8,9], speciation via genetic incompatibilities [10,11], (dis)advantages of sexual reproduction and recombination [12,13,14], the repeatability/reversibility (or not) of evolutionary trajectories [15,16,17,18], and the role of neutral networks—components of the genotype network with the same fitness—in epochal evolution [19,20,21,22,23,24].
Despite being introduced by Wright [1], Fisher’s 1930 geometric model of adaptation is the first mathematical model of evolution on what we now call fitness landscapes [15,25,26]. Later work by Kingman [27] and Kauffman and Levin [28] constructed what they termed a “house of cards” (HoC) model where fitness values for each genotype are drawn independently from a specified probability distribution. Building on the HoC model, Kauffman and Weinberger [29] introduced the NK model, which forces each locus to interact with a fixed number of other loci and where a genotype’s fitness becomes the sum of the fitness contributions of every interaction group. More recently, the “rough Mount Fuji model” [30,31] combines the HoC landscape with an additional field penalizing a genotype’s Hamming distance away from a referenced genotype with the optimal fitness. The dependence of a genotype’s fitness on that of neighboring genotypes is thought to be a key feature of empirical fitness landscapes.
Over the past three decades, the fitness landscapes for various organisms, including bacteria [32,33,34], fungi [35,36], and fruit flies [37], have been empirically reconstructed. While the number of genotypes included in these early landscapes was limited, modern sequencing techniques and high-throughput analyses have enabled the construction of many large landscapes. Notable studies have been conducted in HIV [38,39], yeast [40], E. coli [41], jellyfish [42], human cancers [43], human stem cells [44], and DNA/RNA networks [45,46,47,48,49]. Comprehensive landscapes for multiple eukaryotic species have also been analyzed based on the binding affinity of transcription factors [50] and after accounting for the ecological context the species experiences [51]. What emerged from these studies is a set of prominent topographical features conserved across diverged taxa [52].
Together, empirical and modeled fitness landscapes exhibit three key topographical features. First, fitness landscapes are more often “rugged” than smooth [52]. The degree of ruggedness can be assessed via a variety of measures, such as the roughness of the slope ratio [53,54] and the number of local fitness maxima [55], which are often strongly correlated with each other [4]. Empirical studies typically show moderate ruggedness in the observed fitness landscapes [32,33,34,36,50,56]. The degree of ruggedness in these empirical landscapes is less than the HoC model assumes and comparable to a fine-tuned NK model or rough Mount Fuji model [4]. Second, fitness landscapes reveal mutational trajectories from one genotype to another where the fitness is non-decreasing, which implies accessibility (typically to a fitness optimum) across the landscape [32,57,58,59,60]. Lastly, whereas the inaccessible region in the HoC model expands when distant from the fitness optimum [61], other models find accessible trajectories despite high genotypic dimensionality [62,63].
Due to the often pervasive interaction between loci, determining phenotype from genotype can have a high degree of computational complexity [64,65]. Many existing fitness landscape models have dealt with this complexity by strongly constraining the state-space of possible genotypic interactions and/or reducing the complexity of how information is processed when mapping genotype to phenotype. For example, studies have focused on the folded structure of short RNA sequences, where the resulting stability or affinity is a fitness proxy [66,67,68,69,70], networks of molecular/genetic pathways whose expression pattern or homeostasis determines fitness [71,72,73,74,75], and modular mutational effects at different loci in Fisher’s geometric model [76].
Here, we model the genotype–phenotype map using the pathway framework of gene regulatory networks (GRNs), where mechanistic knowledge of how phenotypes are computed from genotypes is encoded in the GRN (see [77,78] for a more formal introduction). To study the fitness landscapes induced by GRN evolution, we integrate the pathway framework into a family of fitness landscape models where the fitness value is uniquely determined by the phenotype corresponding to the regulatory outcome of a genotype. For a fitness landscape of GRNs, we first prove the existence of two key topographical features: (a) GRNs with the same phenotype are themselves connected in the underlying genotypic network, and (b) there exists accessible trajectories between all pairs of GRNs with similar phenotypes. Second, utilizing the idea of symmetries and automorphisms in the genotype network, we coarse-grain GRNs into groups with equivalent roles in the fitness landscapes and deduce an underlying mesoscopic structure with which we can predict the trajectory of evolution with minimal complexity. Lastly, using this coarse-graining, we develop a bottom-up algorithm for constructing the underlying fitness landscape of GRNs, which does not require computing the genotype network and is thus more efficient than the conventional brute-force approach.

2. Methods

Here, we introduce a family of fitness landscape models where the genotype-phenotype mapping is constructed from regulatory interactions. We first summarize a modeling framework of GRNs proposed in our previous work [77,78], termed the pathway framework, and then fitness landscape models of GRNs built upon the pathway framework.

2.1. Pathway Framework of GRNs

Genotypes in the pathway framework of GRNs contain all necessary information to construct a regulatory network [77,78]. More specifically, alleles at each locus include both a transcription activator and a protein product, which means the regulatory interactions among the loci can be deduced by connecting genes whose expression product corresponds to the activator of another. Compared to existing work on regulatory circuits—where mutations are modeled as rewiring a single interaction between genes [79,80]—the pathway framework considers a mutation as changing the activator/product of a gene. Lastly, the phenotype is determined by the set of loci reached in a regulatory cascade induced by external stimuli. These stimuli could be completely external to the individual or simply come from another regulatory network in the organism. For additional details on the pathway framework, see [77,78] and Figure 1 for illustration.
In this work, when building the family of fitness landscape models, we restrict the pathway framework with four assumptions. First, we consider a fixed set of genes underlying the genotypes, i.e., gene duplication and deletion events are excluded. Second, we assume a fixed underlying collection of proteins that can possibly exist in the organism. Third, we consider the case where a gene’s expression is activated by a specific protein, and it generates only one protein product. Fourth, we assume that the associated chemical state of each protein is modeled as a Boolean/binary variable (present or absent), and external environmental signals stimulate the existence of specific proteins in the organism. As a consequence, the Boolean state of a phenotype-related protein is determined by whether it is reached by a regulatory cascade starting from an initial stimulus.
While the above assumptions seem naive, as we will show in Section 3, this simplified model still predicts the topographical features observed in empirical landscapes (see Section 1). As a result of these assumptions, we are able to derive rigorous theoretical insights into GRN evolution and obtain fitness landscapes consistent with far more complicated models. We believe these assumptions are conservative with respect to the biology and a justified starting point for modeling fitness landscapes, and we discuss the implications of these assumptions and possible extensions to the model in Section 4.

2.2. Fitness Landscape of GRNs under the Pathway Framework

Let Γ and Ω be the fixed, underlying collection of loci and proteins, respectively. A genotype is represented by its GRN g such that every locus γ Γ is associated with a protein activator/product pair e g ( γ ) = ( u , v ) , u , v Ω . Equivalently, any GRN is a directed graph with | Ω | nodes labeled by the proteins Ω and | Γ | edges labeled by the loci Γ . In the rest of this paper, we will use the terminology “source/target” node of edge γ interchangeably to refer to the protein activator/product of locus γ . We also write G to be the set of all GRNs with the underlying loci Γ and proteins Ω .
The backbone of a fitness landscape of GRNs, i.e., the genotype network, is an undirected network of networks encoding the mutational relationship between the GRNs. Let G be the genotype network, and we denote its mega-nodes by V ( G ) = G and its edges by E ( G ) . There is an edge ( g 1 , g 2 ) E ( G ) between any GRNs g 1 , g 2 G when they only differ by the allele of a single locus γ , e g 1 ( γ ) e g 2 ( γ ) . In other words, g 1 and g 2 are connected in G when they can be transformed into each other through one edge rewiring.
Furthermore, we write x ω to be the binary state of protein ω Ω , where x ω = 1 indicates the presence of ω , and x ω = 0 designates its absence. We also partition Ω into three disjoint groups: (a) proteins Ω 0 whose presence is externally stimulated by the given environment, (b) proteins Ω ^ whose states influence the fitness value, and (c) the remaining ones, which we call the dummy proteins Ω since their specific identities are irrelevant to the external environment and the resultant phenotypes/fitness. (In this paper, we assume that the stimuli Ω 0 must be proteins that cannot be produced by expression, and we leave no constraint to the fitness-relevant and dummy proteins Ω ^ and Ω ).
A phenotype is then treated as a vector of zeros and ones, where each entry corresponds to the binary state of a protein in Ω ^ . The resultant phenotype x Ω ^ ( g ) of a GRN g is determined by the reachability in g: For any ω Ω ^ , x ω = 1 if and only if there is a stimulus ω 0 Ω 0 and a path from ω 0 to ω in g, which represents a chain of sequentially expressed genes that generates protein ω . Finally, the fitness f is simply a function of the phenotype x Ω ^ ( g ) .
Combined, a fitness landscape of GRNs is characterized by three key elements: the genotype network G, the external stimuli Ω 0 , and the fitness function of phenotype f (which implicitly identifies the fitness-relevant proteins Ω ^ ). The genotype network G serves as the skeleton of the fitness landscape, whereas the environment-dependent stimuli Ω 0 and fitness function f determine the phenotypes of GRNs and their selective advantages.

3. Results

In this work, we derive three theoretical insights into fitness landscape models using GRNs as the embedded genotype–phenotype mapping. First, we show that the resulting family of fitness landscapes must always contain two topographical properties: connectivity, i.e., GRNs with the same phenotype can be mutually reached via mutations, and accessibility, i.e., that any GRN can be reached from an arbitrary less-fit GRN (once certain similarity criterion is met). Second, we propose a mesoscopic coarse-graining for fitness landscapes, which is a more compact alternative to analyzing evolutionary processes than the original landscape. This mesoscopic backbone recognizes “symmetries” in the genotype network, and it aggregates GRNs with the same role in the fitness landscape into a single representative genotype. Third, we provide a bottom-up approach to algorithmically construct this mesoscopic backbone and demonstrate its efficiency over coarse-graining the genotype network using brute force.

3.1. Connectivity and Accessibility in a Fitness Landscape of GRNs

A fitness landscape model of GRNs features a handful of properties that have either been discovered in empirical fitness landscapes or investigated mathematically. First, its underlying space, i.e., the genotype network G, presents immense dimensionality. Second, the fitness function f is flexible and can effectively tune the ruggedness of the fitness landscape. For example, a highly rugged “holely” landscape can be modeled by a binary f such that any GRN g G has high fitness once some single protein ω Ω ^ is present, x ω = 1 , and otherwise, g has low/zero fitness. Because one can always find several mutational neighbors of g whose phenotype shows an opposite state x ω , the resultant fitness landscape is inevitably rugged. In what follows, we further show that fitness landscape models of GRNs must hold the characteristics of connectivity and accessibility.
Let y be a phenotype and denote by G y the set of all GRNs with phenotype y , i.e., x Ω ^ ( g ) = y for g G y , under the given external stimuli Ω 0 . We also write Ω y + to be the required-present proteins in the phenotype y , so x ω = 1 for ω Ω y + and x ω = 0 for any other ω Ω ^ Ω y + . Note that the number of required-present proteins | Ω y + | is bounded from above by the number of loci | Γ | since any present protein that is not a stimulus must be triggered by the expression of some locus.
We observe that some GRNs G ˜ y G y play a “central” role among GRNs with the same phenotype y . Specifically, for any g ˜ G ˜ y , all the edges in g ˜ point from the stimuli Ω 0 to the required-present proteins Ω y + , and each ω Ω y + is targeted by at least one edge in g ˜ . We demonstrate an example of such g ˜ in Figure 2a. These G ˜ y are deemed central because they can be reached by any GRN g G y through mutations among G y themselves: First, for every edge in g that points to an ω Ω y + , we rewire the edge such that it still points to ω but now from an ω 0 Ω 0 . Arbitrarily rewiring the remaining edges between Ω 0 and Ω y + then leads to some central GRN in G ˜ y (see Figure 2a).
In addition, if the phenotype y has strictly less required proteins than the number of loci, the central GRNs G y are mutually reachable by edge rewiring among G ˜ y . There is always a redundant edge whose rewiring makes no change to the phenotype, and it helps us rewire each edge to any desired source/target pair between Ω 0 and Ω y + (see Figure 2b), which subsequently creates a chain of mutations between any g ˜ , g ˜ G ˜ y . These results implicate that, for any phenotype y with | Ω y + | < | Γ | and any g 1 , g 2 G y , there is always a mutational trajectory between g 1 and g 2 that only traverses over GRNs in G y , especially through the central ones G ˜ y (see Figure 2c). In the extreme case where | Ω y + | = | Γ | , however, G y fragments into multiple connected components (detailed in Appendix A).
Next, we turn to accessibility between GRNs of different phenotypes y and y , where without loss of generality f ( y ) f ( y ) . We observe that, if | Ω y + Ω y + |     | Γ | + 1 , there are always two “peripheral” GRNs g ^ G y and g ^ G y , which only differ by one edge rewiring. To be more specific, there are two independent chains in g ^ , one of which begins with a stimulus ω 0 Ω 0 and sequentially connects the proteins required to be present in y but not in y , i.e., Ω y + Ω y + , while the other consecutively joins Ω y + Ω y + . The rest of the edges in g ^ merely point from Ω 0 to Ω y + Ω y + , and each ω Ω y + Ω y + is targeted by at least one edge (see example in Figure 3, left). The other GRN g ^ only differ from g by the first edge in the chain of Ω y + Ω y + , which is rewired such that it points from the stimulus ω 0 to the first node in the chain of Ω y + Ω y + (Figure 3, right).
Our observation suggests that there is a sequence of mutations with non-decreasing fitness from any GRN g G y to any GRN g G y , as long as | Ω y + Ω y + | | Γ | + 1 . In particular, when | Ω y + | < | Γ | , the mutational trajectory starting at g first traverses within G y to a peripheral GRN and then transitions into G y to reach g . An analogous trajectory exists even under the extreme scenario | Ω y + | = | Γ | (see Appendix A). We also note that if the number of fitness-relevant proteins is | Ω ^ | | Γ | + 1 , then the condition | Ω y + Ω y + | | Γ | + 1 is assuredly satisfied for any two phenotypes y and y . As a corollary, if | Ω ^ | | Γ | + 1 , the fitness optimum will always be accessible.

3.2. Mesoscopic Skeleton Derived from “Symmetries” in the Genotype Network of GRNs

Because the number of possible GRNs grows super-exponentially as the underlying loci and proteins expand, constructing the genotype network becomes extremely challenging beyond a small Γ and Ω . Here, we present a more compact skeleton of the fitness landscape of GRNs based on “symmetries” in the genotype network.
As the underlying space of a fitness landscape of GRNs, the genotype network G appears to contain redundant information. On the one hand, GRNs leading to the identical phenotype are deemed to have equal fitness. On the other hand, given any GRN, for example, the mega-node rounded by orange in Figure 4a, one can always find some other GRN such that their neighborhoods in G are locally similar, e.g., the mega-node rounded by blue. This simple demonstration suggests that the structure of the genotype network G is not arbitrary; instead, some structural symmetries exist.
In graph theory, symmetries in a network are formally described through the network’s automorphisms. An automorphism of a graph is a way to shuffle the labels of its nodes such that the graph remains identical before and after shuffling. For instance, in Figure 5b, exchanging nodes 2 and 3 generates the same network and is thus an automorphism, whereas exchanging nodes 2 and 4 is not because there is an edge from 2 to 3 after shuffling. Formally, an automorphism of the genotype network G is a permutation σ of all plausible GRNs G = V ( G ) such that, for any g 1 , g 2 G , ( σ ( g 1 ) , σ ( g 2 ) ) E ( G ) if and only if we also have ( g 1 , g 2 ) E ( G ) . (A permutation of G is a mapping σ : G G where no two GRNs are mapped to the same GRN, i.e., σ ( g 1 ) σ ( g 2 ) if g 1 g 2 for any g 1 , g 2 G .) Once two GRNs g and g are related through an automorphism σ of G, e.g., g = σ ( g ) , they share the same mega-node properties that are fully determined by the connections in the genotype network (see Proposition A1).
Furthermore, automorphisms partition the GRNs by their roles in the genotype network through the mathematical concept of equivalence classes. For a high-level and general description, imagine a set of elements and a group of operations acting on them. Each operation turns one element into another, and these two elements are related by the operation, which describes the similarity between them. An equivalence class consists of elements that are mutually related by any operation, and the set of elements is said to be partitioned into equivalence classes under the action of the operations (see Figure 5a for an illustrative example). For automorphisms Σ ( G ) of a graph G , the equivalence classes of nodes V ( G ) under the action of Σ ( G ) then gather nodes with a similar “structural position” in G (Figure 5c).
However, to reveal GRNs with identical roles in a fitness landscape, these automorphisms also need to preserve the phenotype. Denote by Σ x ( G ) the set of such automorphisms of G, i.e., for any σ Σ x ( G ) , and GRN g G , σ ( g ) and g have the same phenotype. The equivalence classes of mega-nodes V ( G ) under the action of phenotype-preserving automorphisms Σ x ( G ) then unite GRNs that (a) show similar mutational relationships with others and (b) lead to the same fitness due to their identical phenotype. We will mildly abuse the terminology to call them the equivalence classes of GRNs, which we denote by Θ , and each θ Θ is a set of GRNs related through Σ x ( G ) . Crucially, since the mutational relationship and the resultant phenotype are the two components that characterize a GRN in the fitness landscape, GRNs in a θ Θ are deemed equivalent semantically, and they can be reduced to an arbitrary representative among them. Therefore, the equivalence classes of GRNs provide an efficient way to depict the underlying space of the fitness landscape.
However, what exactly composes the phenotype-preserving automorphisms Σ x ( G ) of the genotype network? From a sufficiency direction, we show that there exist a few graphical operations on the GRNs that produce phenotype-preserving automorphisms. These graphical operations involve permuting/shuffling different sorts of elements in a GRN:
(i)
The identities of loci Γ , e.g., exchanging edge labels of loci A and B in Figure 4b;
(ii)
The identities of dummy proteins Ω , e.g., exchanging node labels of proteins 3 and 4 in Figure 4c.
Then, potentially rewiring a given edge (see details in Definitions A1 and A2):
(iii)
Change the source node of an edge from one stimulus to another stimulus and vice versa, e.g., in Figure 4d, moving an edge pointing from node 1 to node 3 to pointing from node 2. (Note that this operation is not necessarily equivalent to permuting the identities of stimuli since at most only the single focal edge will be affected.)
(iv)
Move a self-loop at one node to another node and vice versa, for example, re-allocating a self-loop at node 3 to node 4 in Figure 4e.
For the formal proofs, we point the reader to Theorem A1. Additionally, from a necessity direction, one can computationally obtain a partition Θ ^ of the GRNs G that is coarser than the equivalence classes Θ . (A partition P is coarser than another partition P if any group in P is included in some group in P .) Specifically, start with a partition Θ ^ 0 where GRNs with the same resultant phenotype are grouped together. We create a sequence of partitions of G through the following iterative procedure: Given the partition Θ ^ i , the next partition Θ ^ i + 1 is obtained by further dividing groups into Θ ^ i (if needed) such that for each group θ Θ ^ i and θ Θ ^ i + 1 , any two GRNs in θ have the same number of neighbors among θ . This iterative procedure is terminated when no further division is required, i.e., Θ ^ k + 1 = Θ ^ k for some integer k (see Figure 6a for an illustrative cartoon of the iterative procedure). We then have Θ ^ = Θ ^ k to be our desired partition of GRNs.
To see why the proposed iterative procedure generates a coarser partition Θ ^ than the equivalence classes Θ of GRNs, we stress that the equivalence classes under automorphisms always form an equitable partition. A partition P = { P i } i = 1 m of nodes of a graph is equitable [81] if for every P i , P j P , any two nodes u , v in group P i have the same number of neighbors in P j (Figure 6b). Since GRNs in an equivalence class θ Θ must have the same amount of neighbors for each different phenotype, we inductively show that any two GRNs g 1 , g 2 θ are never separated during the iterative procedure that generates Θ ^ (see Theorem A2). Therefore, any equivalence class θ Θ must be included in a computationally acquired group θ ^ Θ ^ .
Figure 7 demonstrates the coarser partition Θ ^ generated by the iterative procedure for an arbitrary toy example. The obtained Θ ^ contains 154 groups of GRNs, and the size of groups ranges from 2 to 96. We also count the number of different kinds of GRNs that can not be transformed through graphical operations (i) and (ii), and this number varies from 1 to 4 in our example Θ ^ . Moreover, for every group in Θ ^ , we observe that those different kinds of GRNs can be related by changing the stimulus that an edge is pointing from and re-allocating self-loops (e.g., see Figure 7b). Θ ^ is thus not simply a coarser partition than the equivalence classes; according to (i)–(iv), we know that groups in Θ ^ are exactly the equivalence classes Θ . This arguably general toy example implicates that there is no need for other graphical operations to determine the equivalence classes of GRNs.
As a result, we conjecture that all the phenotype-preserving automorphisms Σ x ( G ) of the genotype network can be generated by combining graphical operations (i) to (iv) on the GRNs. In other words, two GRNs g 1 and g 2 belong to the same equivalence class if and only if, after removing all the self-loops and merging stimuli Ω 0 into a single node, there exist permutations of loci Γ and dummy proteins Ω that jointly transform g 1 into g 2 . This condition reconciles with the concept of isomorphisms between graphs. Whereas an automorphism is a mapping of nodes such that a graph preserves itself, an isomorphism is a mapping of nodes that transform one graph into another. We will borrow the terminology and call the two permutations of Γ and Ω together a phenotype-preserving isomorphism from g 1 to g 2 .

3.3. Algorithmic Construction of the Mesoscopic Backbone of GRN Fitness Landscape

Next, we investigate algorithmic approaches to construct the mesoscopic backbone of a fitness landscape based on equivalence classes, where a representative GRN replaces all other GRNs in an equivalence class due to their identical role. In particular, the desired algorithm must (a) acquire the equivalence classes Θ from scratch and (b), for a representative GRN in any equivalence class, count the number of its mutational neighbors in other equivalence classes and also within the class it belongs to.
To avoid any confusion, we emphasize that, although drawing mutational connections between equivalence classes Θ can be achieved by grouping mega-nodes in the genotype network G, this naive exercise is unsuitable. First and foremost, grouping mega-nodes demands prior knowledge of the genotype network itself, but its construction is computationally heavy. Second, in contrast to coarse-graining nodes in a graph where the groups of nodes are pre-specified, listing all GRNs in an equivalence class requires examining pairs of GRNs and assuring a phenotype-preserving isomorphism between them after removing self-loops and merging stimuli. Determining the equivalence classes Θ from all the GRNs G = V ( G ) can thus be costly as well. These reasons again show the value of the equivalence classes Θ , which consolidate GRNs into their equivalent representatives.
Here, we present a bottom-up approach that enumerates each equivalence class of GRNs and simultaneously computes the number of mutational connections among them. To begin, recall from Section 2.2 that a mutation from a GRN g 1 G to another g 2 G corresponds to rewiring a single edge in g 1 , where g 1 may rewire a self-loop/non-self-loop edge to a self-loop/non-self-loop edge in g 2 . We observe that the number of non-self-loop edges in mutational neighbors g 1 and g 2 differ at most by one. We denote by Γ ( g ) the loci representing the non-self-loop edges in the GRN g, and | Γ ( g ) | the number of those non-self-loop edges. In other words, given equivalence classes θ , θ Θ and representative GRNs g θ and g θ , g has no mutational neighbors in θ if | | Γ ( g ) | | Γ ( g ) | | > 1 .
We can therefore build the mesoscopic backbone by incrementally examining each equivalence class with an increasing number of non-self-loop edges in the representative GRN. This strategy is envisioned in Figure 8, where the backbone can be viewed as “layers” of equivalence classes of GRNs. Let Θ k be the set of equivalence classes where for every θ Θ k , the representative GRN g θ has exactly k non-self-loop edges, | Γ ( g ) | = k . We start with layer Θ 0 , which consists of the only equivalence class with no non-self-loop edges. Then, with layers Θ 0 , Θ 1 , , Θ k and all the mutational connections among them, we will find the equivalence classes in the next layer Θ k + 1 and their mutational connections with layer Θ k and within themselves up until k = | Γ | , where all the edges are non-self-loops.
To be more precise, we introduce the concept of M + neighborhood: For any GRN g G , denote by M + ( g ) the mutational neighbors of g that have one more non-self-loop edge than g. M + neighborhoods are sufficient to capture the relationship between two mutational neighbors g and g :
  • If g has one more non-self-loop edge than g, then g M + ( g ) ;
  • If g has one less non-self-loop edge than g, then we have g M + ( g ) ;
  • If g has the same number of non-self-loop edges as g, and then they share a common mutational neighbor g , where the only different edge between g and g is rewired to a self-loop and thus g , g M + ( g ) .
The mutational connections between equivalence classes can hence be uncovered by examining the M + neighborhood of the representative GRNs. Moreover, the M + neighborhood of representative GRNs in layer Θ k reveals the equivalence classes in layer Θ k + 1 because any GRN must have a mutational neighbor with one less non-self-loop edge. All that remains is to join different M + neighbors into equivalence classes. In particular:
(A)
For an equivalence class θ Θ k and its representative GRN g θ , under what condition will g 1 , g 2 M + ( g ) belong to the same equivalence class in layer Θ k + 1 ?
(B)
For two distinct equivalence classes θ 1 , θ 2 Θ k and their representative GRNs g 1 θ 1 and g 2 θ 2 , under what condition will g 1 M + ( g 1 ) and g 2 M + ( g 2 ) belong to the same equivalence class in layer Θ k + 1 ?
For our ease of illustration, we hereafter choose the GRNs g, g 1 , g 2 , g 1 and g 2 such that only one stimulus node is incident to out-going edges.
To address (A), let g 1 , g 2 M + ( g ) belong to the same equivalence class, so there is a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. Recalling from Section 2.2, e g ( γ ) = ( u , v ) denotes that “the source–target pair of edge γ is ( u , v ) in GRN g.” Furthermore, we write e g 1 ( γ 1 ) = ( u 1 , v 1 ) and e g 2 ( γ 2 ) = ( u 2 , v 2 ) , where γ 1 and γ 2 are the non-self-loop edges “added” to g that forms g 1 and g 2 , respectively. A few observations follow:
  • There is an integer p such that π p ( γ 1 ) = γ 1 and ( π p ( u 1 ) , π p ( v 1 ) ) = ( u 1 , v 1 ) ;
  • There is another integer q < p such that π q ( γ 1 ) = γ 2 and ( π q ( u 1 ) , π q ( v 1 ) ) = ( u 2 , v 2 ) ;
  • e g 2 ( π k ( γ 1 ) ) = ( π k ( u 1 ) , π k ( v 1 ) ) for k = 1 , 2 , , q ;
  • e g 2 ( π k ( γ 1 ) ) ( π k ( u 1 ) , π k ( v 1 ) ) for k = q + 1 , q + 2 , , p ;
  • For any locus γ and non-self-loop source–target pair ( u , v ) such that ( γ , u , v ) ( π k ( γ ) , π k ( u 1 ) , π k ( v 1 ) ) for 0 k q 1 , we have e g ( π ( γ ) ) = ( π ( u ) , π ( v ) ) if and only if e g ( γ ) = ( u , v ) .
We detail the reasoning behind these observations in Lemma A1–A3. Critically, our fifth observation implies that, after self-loop removal, the isomorphism π between g 1 and g 2 is in fact a phenotype-preserving automorphism of a subgraph g ¯ of the GRN g. In addition, observations 3. and 4. show that those edges in g—but not in g ¯ —are sequentially mapped from one to another via this automorphism π , i.e., Γ ( g ) Γ ( g ¯ ) = π k ( γ 1 ) k = 1 q 1 , and they bridge the newly added edges γ 1 and γ 2 = π q ( γ 1 ) . We show that the converse is also true (see Theorem A3): After self-loop removal, if we find a phenotype-preserving automorphism π of a subgraph g ¯ of g where γ 1 is consecutively mapped to γ 2 through the edge differences Γ ( g ) Γ ( g ¯ ) , π is guaranteed a phenotype-preserving isomorphism from g 1 to g 2 .
The sufficient and necessary condition for two M + neighbors of g to be in the same equivalence class, intriguingly, lies in the phenotype-preserving automorphisms of subgraphs of the representative GRN g. Here, we demonstrate a few simple examples in Figure 9a. In the top row, an automorphism of g directly maps between the two additional edges ( u 1 , v 1 ) = ( 3 , 5 ) and ( u 2 , v 2 ) = ( 1 , 4 ) . In the middle row, the two edges ( u 1 , v 1 ) = ( 1 , 2 ) are consecutively mapped to ( u 2 , v 2 ) = ( 3 , 4 ) through edge ( 2 , 3 ) , and ( u 2 , v 2 ) is consecutively mapped back to ( u 1 , v 1 ) through the non-edge ( 4 , 1 ) , so we have q = 2 and p = 4 . As a mixture of both, in the bottom row, ( u 1 , v 1 ) = ( 2 , 5 ) is consecutively mapped to ( u 2 , v 2 ) = ( 3 , 6 ) through edge ( 1 , 4 ) , and this isomorphism is exactly an automorphism of a subgraph g ¯ of g where edge ( 1 , 4 ) is removed.
Switching gears to the remaining question (B), suppose that g 1 and g 2 are the representative GRN in two different equivalence classes where | Γ ( g 1 ) | = | Γ ( g 2 ) | and that g 1 M + ( g 1 ) and g 2 M + ( g 2 ) belong to the same equivalence class. Let γ 1 and γ 2 be the newly added edges to g 1 and g 2 that generate g 1 and g 2 , respectively, where e g 1 ( γ 1 ) = ( u 1 , v 1 ) and e g 2 ( γ 2 ) = ( u 2 , v 2 ) , and let π be a phenotype-preserving isomorphism from g 1 to g 2 after self-loop removal. We observe that applying the permutation π on g 1 transforms it into another GRN g ˜ 1 in the same equivalence class. Since g 1 simply has one less edge γ 1 than g 1 , and g ˜ 1 and g 2 only differ by a missing edge π ( γ 1 ) . Namely, we have g 2 M + ( g ˜ 1 ) with the additional edge e g 2 ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) . Moreover, since g 2 also belongs to the M + neighborhood of g 2 with the additional edge e g 2 ( γ 2 ) = ( u 2 , v 2 ) , by removing both the extra edges from g 2 , we find a GRN g such that g ˜ 1 , g 2 M + ( g ) .
We again present an illustrative example in Figure 9b. Here, a GRN g ˜ 1 in the equivalence class of g 1 can be found via the isomorphism π between g 1 and g 2 . We note that the newly added edge ( u 1 , v 1 ) = ( 4 , 1 ) is transformed into ( π ( u 1 ) , π ( v 1 ) ) = ( 3 , 4 ) in g 2 , which is missing in g ˜ 1 . Removing both ( π ( u 1 ) , π ( v 1 ) ) = ( 3 , 4 ) and ( u 2 , v 2 ) = ( 3 , 1 ) from g 2 produces a GRN g , which is a common neighbor of g 2 and g ˜ 1 with one less non-self-loop edge.
Our observation resolves the necessary condition of (B): For the representative GRNs of two different equivalence classes g 1 θ 1 and g 2 θ 2 , if their M + neighbor g 1 M + ( g 1 ) and g 2 M + ( g 2 ) belong to the same equivalence class, then we can always find two GRNs g ˜ 1 and g such that (a) g ˜ 1 falls into the equivalence class of g 1 , and (b) g ˜ 1 and g 2 are M + neighbors of g . Moreover, the converse is true as well (Theorem A4). Therefore, whether the M + neighborhood of g 1 and g 2 reveal a common equivalence class depends on the existence of a GRN g that both the equivalence classes θ 1 and θ 2 are rooted from.
Our strategy to build the mesoscopic backbone is now complete, and here, we detail our algorithm that incrementally generates the equivalence classes Θ of GRNs and establishes the mutational connections among them. Suppose that we have already built layers of equivalence classes Θ 0 , Θ 1 , , Θ k and determined the mutational connections among them. For each representative GRN g in layer Θ k and every g M + ( g ) , we will view g as the combination of g and an additional, non-self-loop edge e g ( γ ) = ( u , v ) , for which we write g = g ( γ , u , v ) . All such combinations form a collection of M + neighbors of the representative GRNs in layer Θ k , for which we abuse the notation M + ( Θ k ) .
We initially put each g M + ( Θ k ) into an individual group, and we define a collection of operations Φ that join groups of M + neighbors:
(I)
For every representative GRN g in Θ k and every phenotype-preserving automorphism σ of g, there is an operation ψ g , σ that joins together the groups of g 1 = g ( γ , u 1 , v 1 ) and g 2 = g ( γ , u 2 , v 2 ) , where u 1 , u 2 Ω 0 and v 2 = σ ( v 1 ) ;
(II)
For every representative GRN g in Θ k and every phenotype-preserving automorphism σ ¯ of each subgraph g ¯ of g such that the edge differences Γ ( g ) Γ ( g ¯ ) are sequentially connected via σ ¯ , there is an operation ϕ g , g ¯ , σ ¯ that joins together the groups of g 1 = g ( γ 1 , u 1 , v 1 ) and g 2 = g ( γ 2 , u 2 , v 2 ) , where automorphism σ ¯ consecutively transforms edge γ 1 into γ 2 through Γ ( g ) Γ ( g ¯ ) ;
(III)
For every representative GRN g in Θ k 1 and each g ˜ 1 = g ( γ 1 , u 1 , v 1 ) and g ˜ 2 = g ( γ 2 , u 2 , v 2 ) in two different equivalence classes θ 1 and θ 2 , such that we have phenotype-preserving isomorphisms π 1 / π 2 from g ˜ 1 / g ˜ 2 to the representative GRN g 1 / g 2 after self-loop removal, there is an operation φ g , g ˜ 1 , g ˜ 2 that joins together the groups of g 1 = g 1 ( π 2 ( γ 2 ) , π 2 ( u 2 ) , π 2 ( v 2 ) ) , and g 2 = g 2 ( π 1 ( γ 1 ) , π 1 ( u 1 ) , π 1 ( v 1 ) ) .
The resulting groups of M + neighbors, after applying the joining operations Φ , constitute the equivalence classes in the next layer Θ k + 1 . We hereafter denote by M Φ + ( θ ) the corresponding consequent group of an equivalence class θ Θ k + 1 . We then choose an arbitrary M + neighbor in M Φ + ( θ ) as the representative GRN of the equivalence class θ , such that only one stimulus node is incident to out-going edges in the chosen representative GRN.
The joining operations Φ further provide useful information to count the number of mutation neighbors that a representative GRN g θ in layer Θ k has among any equivalence class θ , which we will denote by A g ( θ ) . Let us first consider θ Θ k + 1 . For any g ˜ M Φ + ( θ ) , g ˜ is a mutational neighbor of g if it can be viewed as a combination of g and an arbitrary extra non-self-loop edge, and hence
A g ( θ ) = | M Φ + ( θ ) M + ( g ) | , for θ Θ k + 1 .
Note that, in this case, A g ( θ ) is easily acquired when building up the layer Θ k + 1 through Φ .
Second, for θ Θ k 1 , A g ( θ ) can be computed given A g ( θ ) , where g is the representative GRN of θ . Since the equivalence classes Θ generate an equitable partition of the genotype network G (see Section 3.2), we have A g ( θ ) × | θ | = A g ( θ ) × | θ | equal to the total number of mutational connections between θ and θ . Moreover, the size of the equivalence class θ is (see Appendix D)
| θ | = | Π | | Σ ( g ) | × n l ( | Γ | k ) × m s ( g ) × r ( g ) ,
where (a) we denote by Π the set of all permutations of dummy proteins Ω and denote by Σ ( g ) the set of automorphisms of the representative GRN g after self-loop removal that only permutes Ω ; (b) n l ( | Γ | k ) is the number of ways to allocate | Γ | k labeled self-loops among the proteins Ω ; (c) m s ( g ) is the number of ways to re-distribute the edges pointing from stimuli Ω 0 in g; and (d) r ( g ) is the number of ways to divide loci Γ into self-loops, non-self-loop edges pointing from stimuli, and others. As a result,
A g ( θ ) = A g ( θ ) × | Σ ( g ) | | Σ ( g ) | × n l ( | Γ | k + 1 ) m s ( g ) r ( g ) n l ( | Γ | k ) m s ( g ) r ( g ) , for θ Θ k 1 .
Third, we turn to the case where θ Θ k but θ θ . Recall that, if any g ˜ θ is a mutational neighbor of g, then there is a GRN g ˜ in layer Θ k 1 , where g , g ˜ M + ( g ˜ ) , and such g ˜ is unique up to arbitrary self-loop re-allocation. Additionally, the extra edge in g and g ˜ must correspond to the same locus, so
A g ( θ ) = θ Θ k 1 A g ( θ ) n l ( 1 ) × A g ( θ ) , for θ Θ k 1 , θ θ ,
in which we use g to be the representative GRN of equivalence class θ . Lastly, if θ = θ , we also need to include the scenario that the mutational neighbor g ˜ of g is generated by rewiring a self-loop to another self-loop. Therefore,
A g ( θ ) = ( | Γ | k ) × ( n l ( 1 ) 1 ) + θ Θ k 1 A g ( θ ) n l ( 1 ) × A g ( θ ) 1 .
In Algorithm 1, we summarize our proposed approach that constructs the mesoscopic backbone. It is apparent that the core of our algorithm is determining the joining operations Φ for a given layer Θ k . This task can be achieved by pre-computing the phenotype-preserving automorphisms of every representative GRN once it is chosen. In addition, since these joining operations reflect the mutational neighbors and the phenotype-preserving isomorphisms in previous layers, the type-(III) Φ for layer Θ k is generated as a composition of the already uncovered operations. Furthermore, the remaining Φ of type (II) consists of combinations of the uncovered joining operations and the newly computed automorphisms of representative GRNs in layer Θ k . As a result, the only prerequisite in our proposed algorithm is producing the phenotype-preserving automorphisms of a GRN.
Algorithm 1 Constructing the underlying space of a fitness landscape of GRNs
Require: The fixed underlying collections of loci Γ and proteins Ω of GRNs
Ensure: The representative GRN g θ of each equivalence class θ Θ , and its number of
mutational neighbors A g θ ( θ ) in any equivalence class θ Θ
1:
k 0                             ▹ initialization
2:
g θ 0 a GRN with no self-loop, where θ 0 is the only equivalence class in layer Θ 0
3:
Store the phenotype-preserving automorphisms Σ x ( g θ 0 ) .
4:
Compute A g θ 0 ( θ 0 ) via Equation (5).
5:
while k < | Γ | do                    ▹ incrementally find Θ
6:
    Construct and store the joining operations Φ for layer Θ k .
7:
     M Φ + grouping of M + ( Θ k ) acted by Φ
8:
     Θ k + 1 corresponds to the groups in M Φ + .
9:
    for all  θ Θ k + 1  do
10:
         g θ a GRN in M Φ + ( θ )           ▹ choose the representative GRN
11:
        Store the phenotype-preserving automorphisms Σ x ( g θ ) .
12:
    end for
13:
    for all  θ Θ k , θ Θ k + 1 do     ▹ count the number of mutational neighbors
14:
        Compute A g θ ( θ ) and A g θ ( θ ) via Equations (1) and (3).
15:
    end for
16:
    for all  θ 1 , θ 2 Θ k + 1  do
17:
        Compute A g θ 1 ( θ 2 ) via Equations (4) and (5).
18:
    end for
19:
     k k + 1
20:
end while
21:
Set any remaining, not computed A g θ ( θ ) to zero.

4. Conclusions

In this work, we integrate mechanistic knowledge of how phenotypes are computed from genotypes via regulatory interactions into fitness landscape models. The resulting family of fitness landscape models features flexibility for tunable ruggedness and accessibility among phenotypes. Furthermore, we introduce the concept of equivalence classes of GRNs, where GRNs of the same phenotype and with similar structural positions in the genotype network are coarse-grained into a group. These equivalence classes of GRNs lead to a compact and informative description of the fundamental space of a fitness landscape. Using this coarse-graining, we develop a bottom-up, efficient algorithm for constructing the underlying space of a fitness landscape based on the equivalence classes. Critically, this algorithm does not require pre-computing the genotype network and therefore permits the exploration of substantially larger GRNs.
Naively, ruggedness and accessibility would seem to be contradictory characteristics of a fitness landscape. Indeed, reciprocal sign epistasis has been shown to yield a strong influence on a landscape’s ruggedness and was regarded as an impediment to evolutionary accessibility when first introduced [2,32,55]. Nevertheless, recent studies suggest that fitness landscape models most closely aligned with empirical observations show that sign epistasis (and thus ruggedness) can co-exist with accessibility [63,82]. In addition to demonstrating that ruggedness and accessibility are not mutually exclusive, our model is compatible with three additional empirical observations. First, GRNs result in high dimensional genotype–phenotype maps [63]. Second, selection acts on the superposition of mutations and the background GRN rather than a few pairs of mutations [60]. Third, and perhaps most importantly, a GRN may experience a series of neutral mutations and then evolve into a nearby phenotype [3,8,83,84]. The accessibility induced in fitness landscapes of GRNs via neutral evolution agrees with the phenomenon of punctuated equilibrium/epochal evolution [23,85,86].
Our derived equivalence classes for GRNs provide a novel, mesoscopic, and optimally descriptive skeleton of a fitness landscape. Neither the genotypic space nor the phenotypic space alone fully characterize a fitness landscape; however, models with even a relatively simple genotype–phenotype map are computationally intensive because they must retain all plausible genotypes [70,73,74,75]. Intuitively, the complexity of a genotype–phenotype map can be reduced by combining similar phenotypes into high-level descriptors [87]. The equivalence classes of GRNs, on the other hand, serve as an intermediate level between the genotypic and phenotypic space, which provides an optimal coarse-graining that encodes all necessary information to predict the evolutionary trajectory on the fitness landscape.
We argue that our proposed algorithm for coarse-graining GRN fitness landscapes is more efficient than brute-force approaches. First, because we consolidate an equivalence class into a single representative GRN, our method is less costly in memory and requires fewer computations when finding mutational neighbors. Second, suppose all plausible GRNs were organized into layers by the number of non-self-loop edges (see Section 3.3), every layer would still super-exponentially contain many GRNs. Our algorithm instead finds the equivalence classes in each layer iteratively. To construct the ( k + 1 )-th layer, we only have to exhaust the representative GRNs in the k-th layer–along with any plausible additional non-self-loop edge(s)–this amount will be significantly fewer than the number of GRNs in the ( k + 1 )-th layer. Lastly, existing heuristics for graph automorphisms [88,89] can be used to produce the phenotype-preserving automorphisms of the representative GRNs, which is the only prerequisite when joining together different GRN–edge pairs. Because the set of automorphisms becomes more limited as the complexity of GRNs increases, we expect only a minor overhead in the joining procedure as compared to the exhaustive, brute-force approach.
Despite our model being constrained to the pathway framework of GRNs [77,78] and a few naive assumptions described in Section 2, we believe our methodology to be flexible and, in what follows, we outline some potential directions to extend the framework. First, when GRNs are modeled through more complex computation, e.g., with different logic gates connecting multiple expression activators/suppressors/products, those GRNs that only consist of naive interactions are never excluded. Thus, the current model represents a subset of the complete landscape built by more complex gene regulation. The derived connectivity and accessibility among the naive GRNs still hold, and we expect these topographical features to manifest for complex GRNs if mutations between the simple and complex expressions are permitted. Second, hypergraphs [90] could be used to describe the expression behavior of genes where multiple activators/products appear. Third, stable motif identification [91] and target control [92] for Boolean network models could be used to explore the phenotypes of mutational neighbors of a focal complex GRN. Lastly, our methodologies are likely applicable to other classes of genotype–phenotype maps [93,94]. In particular, once the mapping and the genotype network are determined, one can simply follow the proposed iterative procedure (Figure 6) to obtain a genotype partition coarser than the equivalence classes.
More broadly, this work showcases the potential of combining biological computation across different scales along the hierarchy of living systems. Computing biological functionality on the organism level with genotype–phenotype mapping provides a blueprint of the overall fitness landscape, where evolutionary processes occur/compute on the population level. Furthermore, several intriguing perspectives arise from the proposed mesoscopic backbone if we consider evolution to be a random walk on the fitness landscape. The process of evolution not only manifests genotypes with higher fitness values but also reveals genotypes whose mutational neighbors are more fit [19,23,78]; in other words, the prevalence of different genotypes would reflect the connection counts between equivalence classes of GRNs. In addition, these “connection counts” could become associated with an analogous theory of computation in evolution that addresses questions such as how likely a genotype in an equivalence class is to evolve into a specified phenotype, as well as how likely it is to “reset” to another genotype in the same equivalence class and recover its position in the fitness landscape.

Author Contributions

Conceptualization, C.-H.Y. and S.V.S.; Formal analysis, C.-H.Y.; Funding acquisition, S.V.S.; Investigation, C.-H.Y.; Methodology, C.-H.Y.; Project administration, S.V.S.; Supervision, S.V.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The authors affirm that all data necessary for confirming the conclusions of the article are present within the article, gures, and tables.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Central and Peripheral GRNs Where No Regulation Presents

Here, we show the connectivity of “central” and “peripheral” GRNs mentioned in Section 3.1 under the extreme scenario where no gene regulation appears, in particular, when | Ω y + | = | Γ | . We observe that, since | Ω y + | = | Γ | , each node ω Ω y + is incident to one and only one incoming edge, and thus, each central GRN g ˜ G ˜ y corresponds to a bijective mapping from Ω y + to edge labels Γ .
First, we show that if two central GRNs g ˜ 1 , g ˜ 2 G ˜ y correspond to different mappings between Ω y + and Γ , there is no mutational trajectory connecting them among G y . Start with assuming that such a mutational trajectory does exist. Due to the different associated mappings of g ˜ 1 and g ˜ 2 , there is an ω Ω ^ with distinct labels of the incident edge γ 1 and γ 2 in g ˜ 1 and g ˜ 2 , respectively. Because mutating g ˜ 1 into g ˜ 2 requires rewiring both γ 1 and γ 2 , there must exist a GRN g where either none or both the edges γ 1 and γ 2 point to ω . Nevertheless, g contradicts our observation following the constraint | Ω y + | = | Γ | so g G y . As a result, under this extreme scenario, G y fragment into multiple connected components when only mutations among themselves are considered.
Next, for any phenotype y for which | Ω y + Ω y + | | Γ | + 1 , we show that there is a mutational trajectory among G y connecting an arbitrary central GRN g ˜ G ˜ y and a peripheral GRN g ^ G y at the boundary of G y and G y . Specifically, take ω Ω y + Ω y + and its incident edge pointing from ω 0 Ω 0 . For each ω Ω y + Ω y + where ω ω , one can sequentially rewire the incident edge of ω to form a chain of Ω y + Ω y + initiated by ω 0 , which leads to a resultant GRN g ^ . Moreover, since | Ω y + | = | Γ | , we have | Ω y + Ω y + | = 1 . Recall from Section 3.1, this g ^ is indeed a peripheral GRN between G y and G y .

Appendix B. Phenotype-Preserving Automorphisms of the Genotype Network of GRNs

In this section, we demonstrate (a) why GRNs mapped by automorphisms of the genotype network G are equivalent, (b) four graphical operations that generate phenotype-preserving automorphisms of G, and (c) the correctness of our iterative procedure to obtain a coarser partition than the equivalence classes of GRNs.
Proposition A1.
Given an automorphism σ of the genotype network G and a mega-node function f G that depends on the adjacency matrix A of G, for any GRNs g 1 , g 2 G where g 2 = σ ( g 1 ) , f G ( g 2 ) = f G ( g 1 ) .
Proof. 
Take f G = f G σ . Since for any g 1 , g 2 V ( G ) , ( σ ( g 1 ) , σ ( g 2 ) ) E ( G ) if and only if ( g 1 , g 2 ) E ( G ) , the adjacency matrix A remains unchanged after permuting the mega-nodes through σ . As a result, we have f G = f G , and f G ( g 2 ) = f G ( g 1 ) = f G ( g 1 ) . □
Let π and π be a permutation of the loci Γ and the dummy proteins Ω , respectively. It is not hard to see that π and π also generate a permutation of the GRNs G . For g G , we abuse the notation π ( g ) to be the GRN mapped through the locus permutation π , where an edge with e g ( γ ) = ( u , v ) is transformed into e π ( g ) ( π ( γ ) ) = ( u , v ) . Similarly, in the GRN π ( g ) mapped through the dummy protein permutation π , an edge with e g ( γ ) = ( u , v ) is transformed into e g ( γ ) = ( π ( u ) , π ( v ) ) .
Furthermore, we have two more types of graphical operations on GRNs:
Definition A1.
For a locus γ and two stimuli ω , ω Ω 0 , ρ γ , ω , ω : G G transforms a GRN g into g such that edge γ becomes
e g ( γ ) = ( u , ω ) if e g ( γ ) = ( u , ω ) , e g ( γ ) = ( u , ω ) if e g ( γ ) = ( u , ω ) , unchanged otherwise .
Definition A2.
For a locus γ and two nodes ω and ω , ϱ γ , ω , ω : G G transforms a GRN g into g such that the edge γ becomes
e g ( γ ) = ( ω , ω ) if e g ( γ ) = ( ω , ω ) , e g ( γ ) = ( ω , ω ) if e g ( γ ) = ( ω , ω ) , unchanged otherwise .
Note both ρ γ , ω , ω and ϱ γ , ω , ω are permutations of G as well, where pairs of GRNs are mutually mapped from one to the other.
These four graphical operations introduced above are, more importantly, automorphisms of the genotype network G that also preserve the phenotype of GRNs:
Theorem A1.
The transformations π, π , ρ γ , ω , ω , and ϱ γ , ω , ω are phenotype-preserving automorphisms of G.
Proof. 
For g 1 , g 2 G , let Δ = γ Γ e g 1 ( γ ) e g 2 ( γ ) . Since π is a permutation of dummy proteins, it preserves Δ . Additionally, because ρ γ , ω , ω and ϱ γ , ω , ω can be viewed as permutations of the source–target pair of a single edge γ , they also preserve Δ . The permutation π of Γ , on the other hand, does not preserve Δ but maintains its size | Δ | . Since ( g 1 , g 2 ) E ( G ) if and only if | Δ | = 1 , the four transformations are automorphisms of G.
Furthermore, since π , π and ϱ γ , ω , ω simply change the labels of edges, labels of the intermediate nodes, and the location of a self-loop, they maintain any path from a stimulus ω 0 Ω 0 to a fitness-relevant protein ω ^ Ω ^ . ρ γ , ω , ω may alter the path between ω 0 and ω ^ , but the reachability from Ω 0 to ω ^ remains. Therefore, the four transformations also preserve the phenotype of GRNs. □
Finally, we turn to the computationally acquired partition that can be shown to be coarser than the equivalence classes of GRNs. Recall from Section 3.2 that our iterative procedure starts from a partition Θ ^ 0 where GRNs with the same phenotype are grouped together. Given the partition Θ ^ i , the next partition Θ ^ i + 1 is obtained by further dividing groups into Θ ^ i (if needed) such that for each group θ Θ ^ i and θ Θ ^ i + 1 , any two GRNs in θ have the same number of neighbors among θ in the genotype network G. The procedure terminates when a stationary partition Θ ^ is reached. We then have:
Theorem A2.
Every equivalence class θ Θ is included in a group θ ^ Θ ^ , θ θ ^ .
Proof. 
Recall that GRNs in an equivalence class have the same phenotype, so for each θ Θ , there is some θ 0 Θ 0 where θ θ 0 . Suppose that θ θ i for each θ Θ and some θ i Θ i . Since Θ forms an equitable partition, every g θ has the same number of neighbors in each θ Θ and thus also in each θ i Θ i . Consequently, no two GRNs in θ will be separated into two different groups in Θ i + 1 , and the theorem is proved by induction. □

Appendix C. Combining Mutational Neighbors into Equivalence Classes

In this section, we tackle the two questions raised in Section 3.3:
(A)
For an equivalence class θ Θ k and its representative GRN g θ , under what condition will g 1 , g 2 M + ( g ) belong to the same equivalence class in layer Θ k + 1 ?
(B)
For two distinct equivalence classes θ 1 , θ 2 Θ k and their representative GRNs g 1 θ 1 and g 2 θ 2 , under what condition will g 1 M + ( g 1 ) and g 2 M + ( g 2 ) belong to the same equivalence class in layer Θ k + 1 ?
Furthermore, recall that for ease of demonstration, we constrain the GRNs g, g 1 , g 2 , g 1 and g 2 where only one stimulus node is incident to out-going edges.
Definition A3.
A phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal is a permutation of loci Γ and dummy protein Ω such that for any locus γ and non-self-loop source–target pair ( u , v ) , e g 2 ( π ( γ ) ) = ( π ( u ) , π ( v ) ) if and only if e g 1 ( γ ) = ( u , v ) .
Starting with the question (A), we write e g 1 ( γ 1 ) = ( u 1 , v 1 ) and e g 2 ( γ 2 ) = ( u 2 , v 2 ) , where γ 1 and γ 2 are the non-self-loop edge newly rewired to generate g 1 and g 2 from g, respectively. A few observations appear when we assume g 1 and g 2 belong to the same equivalence class:
Lemma A1.
Suppose a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. There are two integers q < p such that π p ( γ 1 ) = γ 1 , ( π p ( u 1 ) , π p ( v 1 ) ) = ( u 1 , v 1 ) , and π q ( γ 1 ) = γ 2 , ( π q ( u 1 ) , π q ( v 1 ) ) = ( u 2 , v 2 ) .
Proof. 
Since π is a permutation of finite sets, it must have a finite period, i.e., an integer p such that π p ( γ 1 ) = γ 1 and ( π p ( u 1 ) , π p ( v 1 ) ) = ( u 1 , v 1 ) .
If π ( γ 1 ) = γ 2 and ( π ( u 1 ) , π ( v 1 ) ) = ( u 2 , v 2 ) , then we have q = 1 . Otherwise, it must map to a non-self-loop edge in g because γ 2 is the only additional non-self-loop edge in g 2 , i.e., e g ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) . Assume there is no integer q < p such that π q ( γ 1 ) = γ 2 and ( π q ( u 1 ) , π q ( v 1 ) ) = ( u 2 , v 2 ) . Then, π p 1 ( γ 1 ) is a non-self-loop edge in g with e g ( π p 1 ( γ 1 ) ) = ( π p 1 ( u 1 ) , π p 1 ( v 1 ) ) . However, since γ 1 is not a non-self-loop edge in g 2 , the fact that π p ( γ 1 ) = γ 1 and ( π p ( u 1 ) , π p ( v 1 ) ) = ( u 1 , v 1 ) contradicts π being an isomorphism from g 1 to g 2 after self-loop removal. Therefore, there is an integer q < p such that π q ( γ 1 ) = γ 2 and ( π q ( u 1 ) , π q ( v 1 ) ) = ( u 2 , v 2 ) . □
Lemma A2.
Suppose a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. For integers p and q in Lemma A1, e g 2 ( π k ( γ 1 ) ) = ( π k ( u 1 ) , π k ( v 1 ) ) for k = 1 , 2 , , q , and e g 2 ( π k ( γ 1 ) ) ( π k ( u 1 ) , π k ( v 1 ) ) for k = q + 1 , q + 2 , , p .
Proof. 
Since π is an isomorphism and γ 2 = π q ( γ 1 ) is the only additional non-self-loop edge in g 2 , we have π ( γ 1 ) , π 2 ( γ 1 ) , , π q 1 ( γ 1 ) to be non-self-loop edges in g. Thus, e g 2 ( π k ( γ 1 ) ) = ( π k ( u 1 ) , π k ( v 1 ) ) for k = 1 , 2 , , q . On the other hand, since γ 2 = π q ( γ 1 ) is not a non-self-loop edge in g 1 , the isomorphism π guaranteed that the source–target pairs ( π q + 1 ( u 1 ) , π q + 1 ( v 1 ) ) , ( π q + 2 ( u 1 ) , π q + 2 ( v 1 ) ) , , ( π p ( u 1 ) , π p ( v 1 ) ) do not match to edges in g 2 , in particular, e g 2 ( π k ( γ 1 ) ) ( π k ( u 1 ) , π k ( v 1 ) ) for k = q + 1 , q + 2 , , p . □
Lemma A3.
Suppose a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. Given integer q in Lemma A1, for a locus γ and non-self-loop source–target pair ( u , v ) where there is no 0 k q 1 such that ( γ , u , v ) = ( π k ( γ ) , π k ( u 1 ) , π k ( v 1 ) ) , e g ( π ( γ ) ) = ( π ( u ) , π ( v ) ) if and only if e g ( γ ) = ( u , v ) .
Proof. 
For ( γ , u , v ) ( π k ( γ ) , π k ( u 1 ) , π k ( v 1 ) ) k = 0 p 1 , since ( γ 1 , u 1 , v 1 ) and ( γ 2 , u 2 , v 2 ) are already excluded, and by Definition A3, we have e g ( π ( γ ) ) = ( π ( u ) , π ( v ) ) if and only if e g ( γ ) = ( u , v ) . Furthermore, for k = q , q + 1 , , p 1 , because the only additional non-self-loop edge in g 2 follows π q ( γ 1 ) = γ 2 and ( π q ( u 1 ) , π q ( v 1 ) ) = ( u 2 , v 2 ) , according to Lemma A2, we know that e g ( π k ( γ 1 ) ) ( π k ( u 1 ) , π k ( v 1 ) ) and e g ( π k + 1 ( γ 1 ) ) ( π k + 1 ( u 1 ) , π k + 1 ( v 1 ) ) . As a result, the statement in Lemma A3 is true for any locus γ and non-self-loop source–target pair ( u , v ) where ( γ , u , v ) ( π k ( γ ) , π k ( u 1 ) , π k ( v 1 ) ) k = 0 q 1 . □
The following theorem resolves the necessary and sufficient condition in question (A):
Theorem A3. 
Let g 1 , g 2 M + ( g ) with the additional non-self-loop edge e g 1 ( γ 1 ) = ( u 1 , v 1 ) and e g 2 ( γ 2 ) = ( u 2 , v 2 ) , respectively. g 1 and g 2 belong to the same equivalence class if and only if there exist two integers q < p and a phenotype-preserving automorphism σ of a subgraph g ¯ of g such that
i.     
( σ q ( γ 1 ) , σ q ( u 1 ) , σ q ) = ( γ 2 , u 2 , v 2 ) ;
ii.   
( σ p ( γ 1 ) , σ p ( u 1 ) , σ p ( v 1 ) ) = ( γ 1 , u 1 , v 1 ) ;
iii
σ k ( γ 1 ) k = 1 q 1 = Γ ( g ) Γ ( g ¯ ) and e g σ k ( γ 1 ) = σ k ( u 1 ) , σ k ( v 1 ) for k = 1 , 2 , , q 1 ;
iv
e g σ k ( γ 1 ) σ k ( u 1 ) , σ k ( v 1 ) for k = q , q + 1 , , p .
Proof. 
For one direction, suppose a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. According to Lemmas A1–A3, we see that π is a phenotype-preserving automorphism of a subgraph g ¯ of g where Γ ( g ) Γ ( g ¯ ) = π k ( γ 1 ) k = 1 q 1 . Taking σ = π satisfies all four conditions.
For the other direction, we show that such a phenotype-preserving automorphism σ is also a phenotype-preserving isomorphism from g 1 to g 2 after self-loop removal. First, regarding any locus γ and non-self-loop target pair ( u , v ) such that ( γ , u , v ) σ k ( γ 1 ) , σ k ( u 1 ) , σ k ( v 1 ) k = 1 p , we have e g 2 ( σ ( γ ) ) = ( σ ( u ) , σ ( v ) ) if and only if e g 1 ( γ ) = ( u , v ) because σ is an automorphism of g ¯ and γ 1 , γ 2 , and Γ ( g ) Γ ( g ¯ ) are already excluded. Second, for k = 0 , 1 , , q 1 , the conditions i.–iii. ensure that e g 1 σ k ( γ ) = σ k ( u ) , σ k ( v ) and e g 2 σ k + 1 ( γ ) = σ k + 1 ( u ) , σ k + 1 ( v ) . Lastly, for k = q , q + 1 , , p 1 , the conditions i., ii., and iv. indicate e g 1 σ k ( γ ) σ k ( u ) , σ k ( v ) and e g 2 σ k + 1 ( γ ) σ k + 1 ( u ) , σ k + 1 ( v ) . □
Next, the necessary and sufficient condition of question (B) is described in the theorem below:
Theorem A4. 
Let g 1 and g 2 be of two different equivalence classes, and let g 1 M + ( g 1 ) and g 2 M + ( g 2 ) with the additional non-self-loop edge e g 1 ( γ 1 ) = ( u 1 , v 1 ) and e g 2 ( γ 2 ) = ( u 2 , v 2 ) , respectively. g 1 and g 2 belong to the same equivalence class if and only if there exist GRNs g ˜ 1 and g such that
i.    
g ˜ 1 and g 1 belong to the same equivalence class;
ii.   
g ˜ 1 , g 2 M + ( g ) ;
iii
g 2 M + ( g ˜ 1 ) .
Proof. 
For one direction, suppose a phenotype-preserving isomorphism π from g 1 to g 2 after self-loop removal. Take g ˜ 1 = π ( g 1 ) , i.e., e g ˜ 1 ( π ( γ ) ) = ( π ( u ) , π ( v ) ) for each γ Γ ( g 1 ) with e g 1 ( γ ) = ( u , v ) , so π also becomes a phenotype-preserving isomorphism from g 1 to g ˜ 1 . Moreover, due to the isomorphism π , we note g 2 M + ( g ˜ 1 ) with the additional non-self-loop edge e g 2 ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) . Let g be a GRN obtained by rewiring edges π ( γ 1 ) and γ 2 from g 2 to two arbitrary self-loops. We have g ˜ 1 , g 2 M + ( g ) with the additional non-self-loop edge e g ˜ 1 ( γ 2 ) = ( u 2 , v 2 ) and e g 2 ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) , respectively. Observe that the GRNs g ˜ 1 and g satisfy the conditions i.–iii.
For the other direction, suppose there exist GRNs g ˜ 1 and g where the conditions i.–iii. hold. Let π be the phenotype-preserving isomorphism from g 1 to g ˜ 1 after self-loop removal such that the additional non-self-loop edge of g 2 M + ( g ) complies e g 2 ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) . Since g ˜ 1 , g 2 M + ( g ) , g 2 M + ( g 2 ) , and g 2 M + , we also have the addition non-self-loop edge in g 2 M + to follow e g 2 ( π ( γ 1 ) ) = ( π ( u 1 ) , π ( v 1 ) ) . Therefore, π is also a phenotype-preserving isomorphism from g 1 to g 2 after self-loop removal. □

Appendix D. Size of an Equivalence Class of GRNs

Here, given the representative GRN g of an equivalence class θ , we calculate the number of GRNs in θ in Equation (2). Observe that | θ | is proportional to (a) the number of GRNs to which there is a phenotype-preserving isomorphism from g, (b) the number of ways to arbitrarily allocate self-loops on g, and (c) the number of ways to arbitrarily rewire source nodes of edges among the stimuli Ω 0 .
First, for part (a), if we temporally ignore labels on the edges, the set of permutations over dummy proteins Ω is partitioned into groups of isomorphisms from g to different GRNs. In addition, the size of each group is exactly the number of automorphisms of g since the composition of an automorphism of g and an isomorphism from g to g also generates an isomorphism from g to g . Thus, there are in total | Π | | Σ ( g ) | different GRNs isomorphic to g, where Π is the set of permutations over Ω , and Σ ( g ) is the set of automorphisms of g that only permutes Ω .
Second, for part (b), every possible allocation distributes | Γ | | Γ ( g ) | self-loops over | Ω Ω 0 | nodes. Suppose that the labels of self-loops Γ Γ ( g ) are given, we have
n l ( | Γ Γ ( g ) | ) = | Ω Ω 0 | | Γ Γ ( g ) | .
Third, we write k s ( g ) as the number of incident edges to stimuli Ω 0 in g. Any possibility in part (c) chooses a source node among Ω 0 for each of the k s ( g ) incident edges. Providing that the labels of incident edges of the stimuli are already known, we have
m s ( g ) = | Ω 0 | k s ( g ) .
Combined, the size of the equivalence class θ becomes
| θ | = | Γ | ! | Γ Γ ( g ) | ! k s ( g ) ! | Π | | Σ ( g ) | n l ( | Γ Γ ( g ) | ) m s ( g ) = | Γ | ! | Γ Γ ( g ) | ! k s ( g ) ! | Π | | Σ ( g ) | | Ω Ω 0 | | Γ Γ ( g ) | | Ω 0 | k s ( g )
where the first fraction represents first selecting combinations of | Γ Γ ( g ) | and k s ( g ) labels for self-loops and edges incident to stimuli and then permuting the remaining labels, which also contributes to different phenotype-preserving isomorphisms from g but was previously omitted.

References

  1. Wright, S. The roles of mutation, inbreeding, crossbreeding, and selection in evolution. In Proceedings of the Sixth International Congress on Genetics, Ithaca, NY, USA, 24–31 August 1932; pp. 356–366. [Google Scholar]
  2. De Visser, J.A.G.M.; Krug, J. Empirical fitness landscapes and the predictability of evolution. Nat. Rev. Genet. 2014, 15, 480–490. [Google Scholar] [CrossRef] [PubMed]
  3. Fragata, I.; Blanckaert, A.; Louro, M.A.D.; Liberles, D.A.; Bank, C. Evolution in the light of fitness landscape theory. Trends Ecol. Evol. 2019, 34, 69–82. [Google Scholar] [CrossRef] [PubMed]
  4. Szendro, I.G.; Schenk, M.F.; Franke, J.; Krug, J.; De Visser, J.A.G. Quantitative analyses of empirical fitness landscapes. J. Stat. Mech. Theory Exp. 2013, 2013, P01005. [Google Scholar] [CrossRef] [Green Version]
  5. Wagner, A. The Origins of Evolutionary Innovations: A Theory of Transformative Change in Living Systems; Oxford University Press: Oxford, UK, 2011. [Google Scholar]
  6. Jain, K.; Krug, J. Deterministic and stochastic regimes of asexual evolution on rugged fitness landscapes. Genetics 2007, 175, 1275–1288. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Kryazhimskiy, S.; Tkačik, G.; Plotkin, J.B. The dynamics of adaptation on correlated fitness landscapes. Proc. Natl. Acad. Sci. USA 2009, 106, 18638–18643. [Google Scholar] [CrossRef] [Green Version]
  8. Draghi, J.A.; Parsons, T.L.; Wagner, G.P.; Plotkin, J.B. Mutational robustness can facilitate adaptation. Nature 2010, 463, 353–355. [Google Scholar] [CrossRef] [Green Version]
  9. Wu, N.C.; Dai, L.; Olson, C.A.; Lloyd-Smith, J.O.; Sun, R. Adaptation in protein fitness landscapes is facilitated by indirect paths. eLife 2016, 5, e16965. [Google Scholar] [CrossRef]
  10. Gavrilets, S. Fitness Landscapes and the Origin of Species; Princeton University Press: Princeton, NY, USA, 2004. [Google Scholar]
  11. Fraïsse, C.; Gunnarsson, P.A.; Roze, D.; Bierne, N.; Welch, J.J. The genetics of speciation: Insights from fisher’s geometric model. Evolution 2016, 70, 1450–1464. [Google Scholar] [CrossRef] [Green Version]
  12. de Visser, J.A.G.; Park, S.C.; Krug, J. Exploring the effect of sex on empirical fitness landscapes. Am. Nat. 2009, 174, S15–S30. [Google Scholar] [CrossRef] [Green Version]
  13. Otto, S.P. The evolutionary enigma of sex. Am. Nat. 2009, 174, S1–S14. [Google Scholar] [CrossRef] [Green Version]
  14. Watson, R.A.; Weinreich, D.M.; Wakeley, J. Genome structure and the benefit of sex. Evol. Int. J. Org. Evol. 2011, 65, 523–536. [Google Scholar] [CrossRef] [PubMed]
  15. Orr, H.A. The genetic theory of adaptation: A brief history. Nat. Rev. Genet. 2005, 6, 119–127. [Google Scholar] [CrossRef] [PubMed]
  16. Lobkovsky, A.E.; Wolf, Y.I.; Koonin, E.V. Predictability of evolutionary trajectories in fitness landscapes. PLoS Comput. Biol. 2011, 7, e1002302. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Salverda, M.L.; Dellus, E.; Gorter, F.A.; Debets, A.J.; Van Der Oost, J.; Hoekstra, R.F.; Taw, D.S.; de Visser, J.A.G. Initial mutations direct alternative pathways of protein evolution. PLoS Genet. 2011, 7, e1001321. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Bank, C.; Matuszewski, S.; Hietpas, R.T.; Jensen, J.D. On the (un) predictability of a large intragenic fitness landscape. Proc. Natl. Acad. Sci. USA 2016, 113, 14085–14090. [Google Scholar] [CrossRef] [Green Version]
  19. Van Nimwegen, E.; Crutchfield, J.P.; Huynen, M. Neutral evolution of mutational robustness. Proc. Natl. Acad. Sci. USA 1999, 96, 9716–9720. [Google Scholar] [CrossRef] [Green Version]
  20. Van Nimwegen, E.; Crutchfield, J.P. Metastable evolutionary dynamics: Crossing fitness barriers or escaping via neutral paths? Bull. Math. Biol. 2000, 62, 799–848. [Google Scholar] [CrossRef] [Green Version]
  21. Wilke, C.O. Adaptive evolution on neutral networks. Bull. Math. Biol. 2001, 63, 715–730. [Google Scholar] [CrossRef] [Green Version]
  22. Smith, T.; Husbands, P.; O’Shea, M. Neutral networks and evolvability with complex genotype-phenotype mapping. In European Conference on Artificial Life; Springer: Berlin/Heidelberg, Germany, 2001; pp. 272–281. [Google Scholar]
  23. Aguirre, J.; Catalán, P.; Cuesta, J.A.; Manrubia, S. On the networked architecture of genotype spaces and its critical effects on molecular evolution. Open Biol. 2018, 8, 180069. [Google Scholar] [CrossRef] [Green Version]
  24. Manrubia, S.; Cuesta, J.A.; Aguirre, J.; Ahnert, S.E.; Altenberg, L.; Cano, A.V.; Catalán, P.; Diaz-Uriarte, R.; Elena, S.F.; García-Martín, J.A.; et al. From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics. Phys. Life Rev. 2021, 38, 55–106. [Google Scholar] [CrossRef]
  25. Tenaillon, O. The utility of fisher’s geometric model in evolutionary genetics. Annu. Rev. Ecol. Evol. Syst. 2014, 45, 179–201. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Fisher, R.A. The Genetical Theory of Natural Selection; Oxford University Press: Oxford, UK, 1930. [Google Scholar]
  27. Kingman, J.F.C. A simple model for the balance between selection and mutation. J. Appl. Probab. 1978, 15, 1–12. [Google Scholar] [CrossRef]
  28. Kauffman, S.; Levin, S. Towards a general theory of adaptive walks on rugged landscapes. J. Theor. Biol. 1987, 128, 11–45. [Google Scholar] [CrossRef]
  29. Kauffman, S.A.; Weinberger, E.D. The nk model of rugged fitness landscapes and its application to maturation of the immune response. J. Theor. Biol. 1989, 141, 211–245. [Google Scholar] [CrossRef]
  30. Aita, T.; Uchiyama, H.; Inaoka, T.; Nakajima, M.; Kokubo, T.; Husimi, Y. Analysis of a local fitness landscape with a model of the rough mt. fuji-type landscape: Application to prolyl endopeptidase and thermolysin. Biopolym. Orig. Res. Biomol. 2000, 54, 64–79. [Google Scholar] [CrossRef]
  31. Neidhart, J.; Szendro, I.G.; Krug, J. Adaptation in tunably rugged fitness landscapes: The rough mount fuji model. Genetics 2014, 198, 699–721. [Google Scholar] [CrossRef] [Green Version]
  32. Weinreich, D.M.; Delaney, N.F.; DePristo, M.A.; Hartl, D.L. Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 2006, 312, 111–114. [Google Scholar] [CrossRef] [Green Version]
  33. Chou, H.-H.; Chiu, H.-C.; NDelaney, N.F.; Segrè, D.; Marx, C.J. Diminishing returns epistasis among beneficial mutations decelerates adaptation. Science 2011, 332, 1190–1192. [Google Scholar] [CrossRef] [Green Version]
  34. Khan, A.I.; Dinh, D.M.; Schneider, D.; Lenski, R.E.; Cooper, T.F. Negative epistasis between beneficial mutations in an evolving bacterial population. Science 2011, 332, 1193–1196. [Google Scholar] [CrossRef]
  35. De Visser, J.A.G.M.; Hoekstra, R.F.; van den Ende, H. Test of interaction between genetic markers that affect fitness in aspergillus niger. Evolution 1997, 51, 1499–1505. [Google Scholar] [CrossRef] [Green Version]
  36. Hall, D.W.; Agan, M.; Pope, S.C. Fitness epistasis among 6 biosynthetic loci in the budding yeast saccharomyces cerevisiae. J. Hered. 2010, 101 (Suppl. S1), S75–S84. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Whitlock, M.C.; Bourguet, D. Factors affecting the genetic load in drosophila: Synergistic epistasis and correlations among fitness components. Evolution 2000, 54, 1654–1660. [Google Scholar] [CrossRef] [PubMed]
  38. Hinkley, T.; Martins, J.; Chappey, C.; Haddad, M.; Stawiski, E.; Whitcomb, J.M.; Petropoulos, C.J.; Bonhoeffer, S. A systems analysis of mutational effects in hiv-1 protease and reverse transcriptase. Nat. Genet. 2011, 43, 487–489. [Google Scholar] [CrossRef] [PubMed]
  39. Kouyos, R.D.; Leventhal, G.E.; Hinkley, T.; Haddad, M.; Whitcomb, J.M.; Petropoulos, C.J.; Bonhoeffer, S. Exploring the complexity of the hiv-1 fitness landscape. PLoS Genet. 2012, 8, e1002551. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Hietpas, R.T.; Jensen, J.D.; Bolon, D.N.A. Experimental illumination of a fitness landscape. Proc. Natl. Acad. Sci. USA 2011, 108, 7896–7901. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Otwinowski, J.; Nemenman, I. Genotype to phenotype mapping and the fitness landscape of the E. coli lac promoter. PLoS ONE 2013, 8, e61570. [Google Scholar] [CrossRef] [Green Version]
  42. Sarkisyan, K.S.; Bolotin, D.A.; Meer, M.V.; Usmanova, D.R.; Mishin, A.S.; Sharonov, G.V.; Ivankov, D.N.; Bozhanova, N.G.; Baranov, M.S.; Soylemez, O.; et al. Local fitness landscape of the green fluorescent protein. Nature 2016, 533, 397–401. [Google Scholar] [CrossRef] [Green Version]
  43. Rogers, Z.N.; McFarland, C.D.; Winters, I.P.; Seoane, J.A.; Brady, J.J.; Yoon, S.; Curtis, C.; Petrov, D.A.; Winslow, M.M. Mapping the in vivo fitness landscape of lung adenocarcinoma tumor suppression in mice. Nat. Genet. 2018, 50, 483–486. [Google Scholar] [CrossRef]
  44. Watson, C.J.; Papula, A.L.; Poon, G.Y.P.; Wong, W.H.; Young, A.L.; Druley, T.E.; Fisher, D.S.; Blundell, J.R. The evolutionary dynamics and fitness landscape of clonal hematopoiesis. Science 2020, 367, 1449–1454. [Google Scholar] [CrossRef]
  45. Rowe, W.; Platt, M.; Wedge, D.C.; Day, P.J.; Kell, D.B.; Knowles, J. Analysis of a complete DNA–protein affinity landscape. J. R. Soc. Interface 2010, 7, 397–408. [Google Scholar] [CrossRef]
  46. Pitt, J.N.; Ferré-D’Amaré, A.R. Rapid construction of empirical rna fitness landscapes. Science 2010, 330, 376–379. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Jiménez, J.I.; Xulvi-Brunet, R.; Campbell, G.W.; Turk-MacLeod, R.; Chen, I.A. Comprehensive experimental fitness landscape and evolutionary network for small rna. Proc. Natl. Acad. Sci. USA 2013, 110, 14984–14989. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Li, C.; Qian, W.; Maclean, C.J.; Zhang, J. The fitness landscape of a trna gene. Science 2016, 352, 837–840. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Bendixsen, D.P.; Østman, B.; Hayden, E.J. Negative epistasis in experimental rna fitness landscapes. J. Mol. Evol. 2017, 85, 159–168. [Google Scholar] [CrossRef]
  50. Aguilar-Rodríguez, J.; Payne, J.L.; Wagner, A. A thousand empirical adaptive landscapes and their navigability. Nat. Ecol. Evol. 2017, 1, 0045. [Google Scholar] [CrossRef] [Green Version]
  51. Martin, C.H. Context dependence in complex adaptive landscapes: Frequency and trait-dependent selection surfaces within an adaptive radiation of caribbean pupfishes. Evolution 2016, 70, 1265–1282. [Google Scholar] [CrossRef]
  52. Boucher, J.I.; Cote, P.; Flynn, J.; Jiang, L.; Laban, A.; Mishra, P.; Roscoe, B.P.; Bolon, D.N.A. Viewing protein fitness landscapes through a next-gen lens. Genetics 2014, 198, 461–471. [Google Scholar] [CrossRef]
  53. Aita, T.; Iwakura, M.; Husimi, Y. A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 2001, 14, 633–638. [Google Scholar] [CrossRef] [Green Version]
  54. Carneiro, M.; Hartl, D.L. Adaptive landscapes and protein evolution. Proc. Natl. Acad. Sci. USA 2010, 107 (Suppl. S1), 1747–1751. [Google Scholar] [CrossRef] [Green Version]
  55. Poelwijk, F.J.; Tănase-Nicola, S.; Kiviet, D.J.; Tans, S.J. Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J. Theor. Biol. 2011, 272, 141–144. [Google Scholar] [CrossRef] [Green Version]
  56. Lozovsky, E.R.; Chookajorn, T.; Brown, K.M.; Imwong, M.; Shaw, P.J.; Kamchonwongpaisan, S.; Neafsey, D.E.; Weinreich, D.M.; Hartl, D.L. Stepwise acquisition of pyrimethamine resistance in the malaria parasite. Proc. Natl. Acad. Sci. USA 2009, 106, 12025–12030. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Lunzer, M.; Miller, S.P.; Felsheim, R.; Dean, A.M. The biochemical architecture of an ancient adaptive landscape. Science 2005, 310, 499–501. [Google Scholar] [CrossRef] [PubMed]
  58. Bridgham, J.T.; Carroll, S.M.; Thornton, J.W. Evolution of hormone-receptor complexity by molecular exploitation. Science 2006, 312, 97–101. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Poelwijk, F.J.; Kiviet, D.J.; Tans, S.J. Evolutionary potential of a duplicated repressor-operator pair: Simulating pathways using mutation data. PLoS Comput. Biol. 2006, 2, e58. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Poelwijk, F.J.; Kiviet, D.J.; Weinreich, D.M.; Tans, S.J. Empirical fitness landscapes reveal accessible evolutionary paths. Nature 2007, 445, 383–386. [Google Scholar] [CrossRef] [PubMed]
  61. Franke, J.; Klözer, A.; de Visser, J.A.G.; Krug, J. Evolutionary accessibility of mutational pathways. PLoS Comput. Biol. 2011, 7, e1002134. [Google Scholar] [CrossRef] [Green Version]
  62. Hegarty, P.; Martinsson, A. On the existence of accessible paths in various models of fitness landscapes. Ann. Appl. Probab. 2014, 24, 1375–1395. [Google Scholar] [CrossRef]
  63. Zagorski, M.; Burda, Z.; Waclaw, B. Beyond the hypercube: Evolutionary accessibility of fitness landscapes with realistic mutational networks. PLoS Comput. Biol. 2016, 12, e1005218. [Google Scholar] [CrossRef] [Green Version]
  64. Kell, D.B. Genotype–phenotype mapping: Genes as computer programs. Trends Genet. 2002, 18, 555–559. [Google Scholar] [CrossRef]
  65. Kondrashov, D.A.; Kondrashov, F.A. Topological features of rugged fitness landscapes in sequence space. Trends Genet. 2015, 31, 24–33. [Google Scholar] [CrossRef]
  66. Chan, H.S.; Bornberg-Bauer, E. Perspectives on protein evolution from simple exact models. Appl. Bioinform. 2002, 50, 121–144. [Google Scholar]
  67. Cowperthwaite, M.C.; Economo, E.P.; Harcombe, W.R.; Miller, E.L.; Meyers, L.A. The ascent of the abundant: How mutational networks constrain evolution. PLoS Comput. Biol. 2008, 4, e1000110. [Google Scholar] [CrossRef] [PubMed]
  68. Stich, M.; Lázaro, E.; Manrubia, S.C. Phenotypic effect of mutations in evolving populations of rna molecules. BMC Evol. Biol. 2010, 10, 46. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Palmer, M.E.; Moudgil, A.; Feldman, M.W. Long-term evolution is surprisingly predictable in lattice proteins. J. R. Soc. Interface 2013, 10, 20130026. [Google Scholar] [CrossRef] [Green Version]
  70. Bershtein, S.; Serohijos, A.W.R.; Shakhnovich, E.I. Bridging the physical scales in evolutionary biology: From protein sequence space to fitness of organisms and populations. Curr. Opin. Struct. Biol. 2017, 42, 31–40. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Perfeito, L.; Ghozzi, S.; Berg, J.; Schnetz, K.; Lässig, M. Nonlinear fitness landscape of a molecular pathway. PLoS Genet. 2011, 7, e1002160. [Google Scholar] [CrossRef] [Green Version]
  72. Chou, H.-H.; Delaney, N.F.; Draghi, J.A.; Marx, C.J. Mapping the fitness landscape of gene expression uncovers the cause of antagonism and sign epistasis between adaptive mutations. PLoS Genet. 2014, 10, e1004149. [Google Scholar] [CrossRef] [Green Version]
  73. Friedlander, T.; Prizak, R.; Barton, N.H.; Tkačik, G. Evolution of new regulatory functions on biophysically realistic fitness landscapes. Nat. Commun. 2017, 8, 216. [Google Scholar] [CrossRef] [Green Version]
  74. Cuypers, T.D.; Rutten, J.P.; Hogeweg, P. Evolution of evolvability and phenotypic plasticity in virtual cells. BMC Evol. Biol. 2017, 17, 60. [Google Scholar] [CrossRef] [Green Version]
  75. Yubero, P.; Manrubia, S.; Aguirre, J. The space of genotypes is a network of networks: Implications for evolutionary and extinction dynamics. Sci. Rep. 2017, 7, 13813. [Google Scholar] [CrossRef] [Green Version]
  76. Harmand, N.; Gallet, R.; Jabbour-Zahab, R.; Martin, G.; Lenormand, T. Fisher’s geometrical model and the mutational patterns of antibiotic resistance across dose gradients. Evolution 2017, 71, 23–37. [Google Scholar] [CrossRef] [PubMed]
  77. Yang, C.-H.; Scarpino, S.V. Reproductive barriers as a byproduct of gene network evolution. bioRxiv 2020. [Google Scholar] [CrossRef]
  78. Yang, C.-H.; Scarpino, S.V. The ensemble of gene regulatory networks at mutation-selection balance. bioRxiv 2021. [Google Scholar] [CrossRef]
  79. Ciliberti, S.; Martin, O.C.; Wagner, A. Innovation and robustness in complex regulatory gene networks. Proc. Natl. Acad. Sci. USA 2007, 104, 13591–13596. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  80. Payne, J.L.; Wagner, A. Latent phenotypes pervade gene regulatory circuits. BMC Syst. Biol. 2014, 8, 64. [Google Scholar] [CrossRef] [Green Version]
  81. Godsil, C.D. Compact graphs and equitable partitions. Linear Algebra Its Appl. 1997, 255, 259–266. [Google Scholar] [CrossRef] [Green Version]
  82. Das, S.G.; Direito, S.O.L.; Waclaw, B.; Allen, R.J.; Krug, J. Predictable properties of fitness landscapes induced by adaptational tradeoffs. eLife 2020, 9, e55155. [Google Scholar] [CrossRef]
  83. Wagner, A. Neutralism and selectionism: A network-based reconciliation. Nat. Rev. Genet. 2008, 9, 965–974. [Google Scholar] [CrossRef] [Green Version]
  84. Bendixsen, D.P.; Collet, J.; Østman, B.; Hayden, E.J. Genotype network intersections promote evolutionary innovation. PLoS Biol. 2019, 17, e3000300. [Google Scholar] [CrossRef] [Green Version]
  85. Hunt, G.; Hopkins, M.J.; Lidgard, S. Simple versus complex models of trait evolution and stasis as a response to environmental change. Proc. Natl. Acad. Sci. USA 2015, 112, 4885–4890. [Google Scholar] [CrossRef] [Green Version]
  86. Heasley, L.R.; Sampaio, N.M.V.; Argueso, J.L. Systemic and rapid restructuring of the genome: A new perspective on punctuated equilibrium. Curr. Genet. 2020, 67, 57–63. [Google Scholar] [CrossRef] [PubMed]
  87. Aguilar-Rodríguez, J.; Peel, L.; Stella, M.; Wagner, A.; Payne, J.L. The architecture of an empirical genotype-phenotype map. Evolution 2018, 72, 1242–1260. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  88. López-Presa, J.L.; Chiroque, L.F.; Anta, A.F. Novel techniques to speed up the computation of the automorphism group of a graph. J. Appl. Math. 2014, 2014, 934637. [Google Scholar] [CrossRef]
  89. Stoichev, S.D. New exact and heuristic algorithms for graph automorphism group and graph isomorphism. J. Exp. Algorithmics (JEA) 2019, 24, 1–27. [Google Scholar] [CrossRef] [Green Version]
  90. Battiston, F.; Cencetti, G.; Iacopini, I.; Latora, V.; Lucas, M.; Patania, A.; Young, J.-G.; Petri, G. Networks beyond pairwise interactions: Structure and dynamics. Phys. Rep. 2020, 874, 1–92. [Google Scholar] [CrossRef]
  91. Maheshwari, P.; Albert, R. A framework to find the logic backbone of a biological network. BMC Syst. Biol. 2017, 11, 122. [Google Scholar] [CrossRef] [Green Version]
  92. Yang, G.; Gómez Tejeda Zañudo, J.; Albert, R. Target control in logical models using the domain of influence of nodes. Front. Physiol. 2018, 9, 454. [Google Scholar] [CrossRef] [Green Version]
  93. Hu, T.; Tomassini, M.; Banzhaf, W. A network perspective on genotype–phenotype mapping in genetic programming. Genet. Program. Evolvable Mach. 2020, 21, 375–397. [Google Scholar] [CrossRef]
  94. Greenbury, S.F.; Louis, A.A.; Ahnert, S.E. The structure of genotype-phenotype maps makes fitness landscapes navigable. bioRxiv 2021. [Google Scholar] [CrossRef]
Figure 1. Cartoon illustration of the pathway framework of GRNs adapted from [77,78]. Under our four simplified assumptions, a GRN (genotype) consists of a fixed number of proteins as nodes and a constant number of directed edges depicting the activator/product pairs of genes. The phenotype is modeled as the Boolean states of proteins (colored), which are determined by their reachability from the external stimulus (lightning icon).
Figure 1. Cartoon illustration of the pathway framework of GRNs adapted from [77,78]. Under our four simplified assumptions, a GRN (genotype) consists of a fixed number of proteins as nodes and a constant number of directed edges depicting the activator/product pairs of genes. The phenotype is modeled as the Boolean states of proteins (colored), which are determined by their reachability from the external stimulus (lightning icon).
Entropy 24 00622 g001
Figure 2. Connectivity exists between all GRNs of the same phenotype. (a) Any GRN can be rewired/mutated into a “central” GRN (shown on the right). (b) A redundant edge (dark green) makes it feasible to turn any central GRN into another via edge rewiring. (c) There is a mutational trajectory between any GRNs of the same phenotype through the central GRNs.
Figure 2. Connectivity exists between all GRNs of the same phenotype. (a) Any GRN can be rewired/mutated into a “central” GRN (shown on the right). (b) A redundant edge (dark green) makes it feasible to turn any central GRN into another via edge rewiring. (c) There is a mutational trajectory between any GRNs of the same phenotype through the central GRNs.
Entropy 24 00622 g002
Figure 3. Example of peripheral GRNs connecting two different phenotypes—a peripheral GRN g ^ of phenotype y in this example. There is a chain that triggers the presence state of proteins Ω y + Ω y + = { 3 , 4 } . However, the other peripheral GRN g ^ of phenotype y contains a chain of proteins Ω y + Ω y + = { 5 , 6 } . g ^ and g ^ are mutational neighbors since they only differ by rewiring the dark green edge, i.e., the first edge in either chain.
Figure 3. Example of peripheral GRNs connecting two different phenotypes—a peripheral GRN g ^ of phenotype y in this example. There is a chain that triggers the presence state of proteins Ω y + Ω y + = { 3 , 4 } . However, the other peripheral GRN g ^ of phenotype y contains a chain of proteins Ω y + Ω y + = { 5 , 6 } . g ^ and g ^ are mutational neighbors since they only differ by rewiring the dark green edge, i.e., the first edge in either chain.
Entropy 24 00622 g003
Figure 4. The genotype network has symmetry such that multiple GRNs have similar local neighborhoods, as we demonstrate in (a) since the corresponding GRNs only differ by exchanging the role of loci A and B. More formally, these GRNs constitute an equivalence class under phenotype-preserving automorphisms, which can be found by graphical operations of (b) permuting loci, (c) permuting dummy proteins (circles), (d) exchanging edges pointing from two different stimuli (squares), and (e) exchanging self-loops at two different nodes.
Figure 4. The genotype network has symmetry such that multiple GRNs have similar local neighborhoods, as we demonstrate in (a) since the corresponding GRNs only differ by exchanging the role of loci A and B. More formally, these GRNs constitute an equivalence class under phenotype-preserving automorphisms, which can be found by graphical operations of (b) permuting loci, (c) permuting dummy proteins (circles), (d) exchanging edges pointing from two different stimuli (squares), and (e) exchanging self-loops at two different nodes.
Entropy 24 00622 g004
Figure 5. (a) As a minimal example, imagine an operation that rotates a geometric object 90 degrees clockwise. The rotation maps one object onto another (dashed arrows), and it leads to equivalence classes where objects are grouped by their symmetry under rotation (pink rectangles). (b) An automorphism of a graph is a permutation of nodes that retains the same graph. (c) Equivalence classes under graph automorphisms bring together nodes that have similar roles connection-wise in the graph.
Figure 5. (a) As a minimal example, imagine an operation that rotates a geometric object 90 degrees clockwise. The rotation maps one object onto another (dashed arrows), and it leads to equivalence classes where objects are grouped by their symmetry under rotation (pink rectangles). (b) An automorphism of a graph is a permutation of nodes that retains the same graph. (c) Equivalence classes under graph automorphisms bring together nodes that have similar roles connection-wise in the graph.
Entropy 24 00622 g005
Figure 6. (a) Consider a toy example genotype network of GRNs. (Here, we omit the exact content of GRNs.) Given the partition Θ ^ k , note that mega-nodes in a group (dashed orange rectangle) may share a different number of connections among other groups (blue shaded circles), and they are further divided to generate the next partition Θ ^ k + 1 . (b) Both the equivalence classes of GRNs and the stationary partition from our iterative procedure are equitable, e.g., each mega-node in group (1) has one connection among (1), another connection with (2), and none with other groups.
Figure 6. (a) Consider a toy example genotype network of GRNs. (Here, we omit the exact content of GRNs.) Given the partition Θ ^ k , note that mega-nodes in a group (dashed orange rectangle) may share a different number of connections among other groups (blue shaded circles), and they are further divided to generate the next partition Θ ^ k + 1 . (b) Both the equivalence classes of GRNs and the stationary partition from our iterative procedure are equitable, e.g., each mega-node in group (1) has one connection among (1), another connection with (2), and none with other groups.
Entropy 24 00622 g006
Figure 7. Example partition coarser than the equivalences classes of GRNs. We run the proposed iterative procedure with | Γ | = 3 and | Ω 0 | = | Ω | = | Ω ^ | = 2 , where stimuli Ω 0 are drawn as squares and the present/absent state of fitness-relevant proteins Ω ^ are colored by orange/blue. (a) The number of GRNs and the number of isomorphism classes of GRNs in each group of the obtained partition Θ ^ , where the dashed lines separate groups of different phenotypes and (b) isomorphism classes of GRNs in a group.
Figure 7. Example partition coarser than the equivalences classes of GRNs. We run the proposed iterative procedure with | Γ | = 3 and | Ω 0 | = | Ω | = | Ω ^ | = 2 , where stimuli Ω 0 are drawn as squares and the present/absent state of fitness-relevant proteins Ω ^ are colored by orange/blue. (a) The number of GRNs and the number of isomorphism classes of GRNs in each group of the obtained partition Θ ^ , where the dashed lines separate groups of different phenotypes and (b) isomorphism classes of GRNs in a group.
Entropy 24 00622 g007
Figure 8. Layering the GRNs by their number of non-self-loop edges. A GRN’s mutational neighbors must fall into the same or the adjacent layers. For ease of illustration, we only show the non-self-loop edges and neglect the protein states in GRNs.
Figure 8. Layering the GRNs by their number of non-self-loop edges. A GRN’s mutational neighbors must fall into the same or the adjacent layers. For ease of illustration, we only show the non-self-loop edges and neglect the protein states in GRNs.
Entropy 24 00622 g008
Figure 9. Sufficient conditions that two M + neighbors belong to an equivalence class. For illustration purposes, we only show the dummy proteins and omit the protein states, edge labels and self-loops in (a) three examples such that two M + neighbors of a GRN g are isomorphic, and (b) an example where the M + neighbors g 1 and g 2 of GRNs g 1 and g 2 in different equivalence classes are isomorphic.
Figure 9. Sufficient conditions that two M + neighbors belong to an equivalence class. For illustration purposes, we only show the dummy proteins and omit the protein states, edge labels and self-loops in (a) three examples such that two M + neighbors of a GRN g are isomorphic, and (b) an example where the M + neighbors g 1 and g 2 of GRNs g 1 and g 2 in different equivalence classes are isomorphic.
Entropy 24 00622 g009
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yang, C.-H.; Scarpino, S.V. A Family of Fitness Landscapes Modeled through Gene Regulatory Networks. Entropy 2022, 24, 622. https://0-doi-org.brum.beds.ac.uk/10.3390/e24050622

AMA Style

Yang C-H, Scarpino SV. A Family of Fitness Landscapes Modeled through Gene Regulatory Networks. Entropy. 2022; 24(5):622. https://0-doi-org.brum.beds.ac.uk/10.3390/e24050622

Chicago/Turabian Style

Yang, Chia-Hung, and Samuel V. Scarpino. 2022. "A Family of Fitness Landscapes Modeled through Gene Regulatory Networks" Entropy 24, no. 5: 622. https://0-doi-org.brum.beds.ac.uk/10.3390/e24050622

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop