CALINCA—A Novel Pipeline for the Identification of lncRNAs in Podocyte Disease

Talyan, Sweta; Filipów, Samantha; Ignarski, Michael; Smieszek, Magdalena; Chen, He; Kühne, Lucas; Butt, Linus; Göbel, Heike; Hoyer-Allo, K. Johanna R.; Koehler, Felix C.; Altmüller, Janine; Brinkkötter, Paul; Schermer, Bernhard; Benzing, Thomas; Kann, Martin; Müller, Roman-Ulrich; Dieterich, Christoph

doi:10.3390/cells10030692


by Sweta Talyan ^1,2,†, Samantha Filipów ^3,4,†, Michael Ignarski ^3,4, Magdalena Smieszek ^2, He Chen ^3,4, Lucas Kühne ^3,4, Linus Butt ^3,4, Heike Göbel ^5, K. Johanna R. Hoyer-Allo ^3,4, Felix C. Koehler ^3,4, Janine Altmüller ^6, Paul Brinkkötter ^3,4, Bernhard Schermer ^3,4, Thomas Benzing ^3,4, Martin Kann ^3,4, Roman-Ulrich Müller ^3,4,*,‡ and Christoph Dieterich ^1,2,*,‡

^1 German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany

^2 Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology and Department of Internal Medicine III, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany

^3 Department II of Internal Medicine and Center for Molecular Medicine, University of Cologne, Faculty of Medicine and University Hospital Cologne, 50931 Cologne, Germany

^4 Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, 50931 Cologne, Germany

^5 Institute for Pathology, Diagnostic and Experimental Nephropathology Unit, University of Cologne, Faculty of Medicine and University Hospital Cologne, 50931 Cologne, Germany

^6 Cologne Center for Genomics, University of Cologne, 50931 Cologne, Germany

^* Authors to whom correspondence should be addressed.

^† The authors have contributed equally.

^‡ These authors have contributed equally.

Cells 2021, 10(3), 692; https://doi.org/10.3390/cells10030692

Submission received: 24 February 2021 / Revised: 13 March 2021 / Accepted: 15 March 2021 / Published: 20 March 2021

(This article belongs to the Special Issue Long Noncoding RNAs in Disease)


Abstract: Diseases of the renal filtration unit, the glomerulus, are the most common cause of chronic kidney disease. Podocytes are the pivotal cell type for the function of this filter, and focal-segmental glomerulosclerosis (FSGS) is a classic example of a podocytopathy leading to proteinuria and glomerular scarring. Currently, no targeted treatment of FSGS is available. This lack of therapeutic strategies is explained by a limited understanding of the defects in podocyte cell biology leading to FSGS. To date, most studies in the field have focused on protein-coding genes and their gene products. However, more than 80% of all transcripts produced by mammalian cells are actually non-coding. Long non-coding RNAs (lncRNAs) are a relatively novel class of transcripts and have not been systematically studied in FSGS to date. Appropriate tools to facilitate lncRNA research for the renal scientific community are urgently required, because lncRNA analysis poses a number of challenges that classical pipelines optimized for coding RNA expression analysis do not address. Here, we present the bioinformatic pipeline CALINCA as a solution to this problem. CALINCA automatically analyzes datasets from murine FSGS models and quantifies both annotated and de novo assembled lncRNAs. In addition, the tool provides in-depth information on podocyte specificity of these lncRNAs, as well as evolutionary conservation and expression in human datasets, making this pipeline a crucial basis for lncRNA studies in FSGS.

Keywords: kidney; glomerulus; podocyte; focal-segmental glomerulosclerosis; FSGS; long non-coding RNA; lncRNA; RNAscope

1. Introduction

Chronic kidney disease (CKD) affects almost 10% of the global population and is one of the most important independent risk factors for cardiovascular morbidity and mortality [1]. In humans, each kidney contains about 1 million nephrons made up of the actual filtration unit (the glomerulus) and the tubular system, which determines the volume and composition of urine to be excreted. Glomerular disorders are the predominant cause of kidney diseases leading to end-stage renal failure. The glomerulus consists of a capillary convolute containing a three-layered filtration barrier: fenestrated endothelial cells are covered by a specialized basement membrane followed by podocytes, a post-mitotic epithelial cell type located on the urinary side of the barrier. Podocytes form primary and secondary foot processes and are crucial to the function of the filter [2]. Consequently, and due to their post-mitotic nature as well as their limited capacity for regeneration, podocyte disorders play a central role in most, if not all, glomerular diseases. Cytoskeletal rearrangements resulting in podocyte effacement are the common hallmark of podocyte injury, and the loss of podocytes results in glomerular scarring, i.e., glomerulosclerosis [3]. Importantly, proteinuria, which is the direct clinical consequence of podocyte injury, is directly associated with disease progression in both cardiac and renal disease [4,5]. Focal-segmental glomerulosclerosis (FSGS) is a classic example of a podocytopathy. However, FSGS is a histopathological pattern of injury resulting from multiple pathogenic mechanisms rather than a uniform disease. Furthermore, secondary FSGS caused by systemic diseases such as arterial hypertension has to be clearly separated from the primary disease. Consequently, it is not surprising that clinical trials including all FSGS patients have failed to provide targeted treatment options [6]. 
At the moment, the treatment of FSGS is still unspecific and based on immunosuppression and blockade of the renin-angiotensin-aldosterone system (RAAS). A better understanding of the molecular mechanisms underlying podocyte injury is urgently needed, not only to design tailored therapeutic strategies but also to allow for a subclassification of FSGS by pathophysiology. Here, both external factors and podocyte-intrinsic defects are expected to play a role (reviewed by D’Agati et al. [7]). Regarding podocyte cell biology, research has so far focused on proteins and protein-coding genes. However, only 2% of all human transcripts possess a coding potential [8]. Interestingly, the non-protein-coding component of the transcriptome shows greater tissue- and context-specific expression patterns than the coding genome and plays an important role in phenotypic variation between individuals and species [9]. On the one hand, this clearly shows that the primary research focus on coding RNAs, followed by the majority of scientists over decades, comes at the risk of entirely missing a crucial aspect of cellular and molecular biology. On the other hand, considering the lack of knowledge, it is clear that studying non-coding RNAs (ncRNAs) in FSGS bears great potential to identify novel disease pathways. Much ncRNA research at the beginning of the 21st century focused on microRNAs (miRNAs), with, e.g., highly interesting results regarding miR-193a in FSGS [10,11]. Meanwhile, various additional types of ncRNA molecules have been identified, including long non-coding RNAs (lncRNAs) and circular RNAs, and have been implicated in numerous cellular processes [12,13]. LncRNAs are defined as non-coding transcripts of more than 200 nucleotides and are transcribed by RNA polymerase II. 
However, beyond this definition, the term lncRNA refers to a group of transcripts that is heterogeneous regarding genomic organization and contains transcripts overlapping with other genes (both sense and antisense) as well as enhancer RNAs (eRNAs) and intergenic transcripts [14]. As a note of caution, when working with lncRNAs, it is important to keep in mind that some transcripts that are annotated as lncRNAs are in fact coding [15,16], an aspect that needs to be considered and addressed both bioinformatically and experimentally. LncRNAs generally show rather low sequence conservation, while recent evidence suggests that they show much higher functional, structural, and positional conservation across evolution [8,14,17]. Current estimates assume the existence of at least 20,000 lncRNAs in mammals ranging up to more than 200,000 in humans [18,19] (RNAcentral.org). Interestingly, a small number of lncRNAs had already been identified in the 1980s and 1990s. Both as to their function (e.g., X-chromosome inactivation by XIST or inhibition of Igf2 by H19) and as to their nature, these were regarded as rare and exotic exceptions back then [20,21]. However, little to nothing is known regarding the function of the vast majority of these transcripts. The establishment of novel techniques in the field of RNA biology, both regarding computational biology and functional wet-lab experiments, coupled with an increasing amount of data over the last 5–10 years has greatly changed this situation. Efficient studies on lncRNA function in biology and disease in a wide range of fields have thus become possible only recently. This work revealed a multitude of functions in cell biology including epigenetic regulation, transcriptional and posttranscriptional control, and adaptation of nuclear architecture but also modulation of the stability/function of protein interaction partners [22]. 
Based on these findings, the importance of lncRNAs in the development and disease of a variety of organ systems has been underlined by the recent literature [23,24,25,26,27,28,29,30,31,32]. The vast majority of publications on lncRNAs in the kidney have remained merely descriptive and characterized the lncRNA expression in a large variety of cell culture and animal models [33]. Diabetic nephropathy (DN) is one of the few examples in which significant progress has been made. As a specific and highly interesting example, the lncRNA TUG1 was shown to be repressed upon high glucose exposure and to modulate mitochondrial bioenergetics in diabetic nephropathy by recruiting PGC-1α to its own promoter [34]. Regarding FSGS, published data on the involvement of lncRNAs still remain extremely scarce and have only been provided by two studies from the same group up to now. In both studies, the authors used transcriptome analyses in human tissue to identify upregulated lncRNAs in FSGS patients [35,36]. In general, the lack of a simple and readily available tool to identify candidate lncRNAs involved in the disease based on existing expression data has hampered systematic studies in this field. Here, we present CALINCA—a pipeline solving this problem in FSGS by providing tools to identify podocyte-enriched lncRNAs that are differentially regulated in FSGS models and conserved in evolution.

2. Materials and Methods

2.1. RNAscope

Kidneys from three 12-week-old wildtype FVB/N mice were fixed in 4% formaldehyde and embedded in paraffin, and 4.5 µm sections were cut and placed on Superfrost® Plus glass slides (Thermo Scientific, Waltham, MA, USA). Samples were processed and stained using the commercially available in situ hybridization assay for FFPE samples, RNAscope 2.5 HD Assay BROWN (cat no. 322310, Advanced Cell Diagnostics (ACD), Inc., Newark, CA, USA), and RNAscope probes for the following lncRNA targets: Wt1os, 4921504A21Rik, Gm10824, and XLOC_024349 (ACD, Inc., Newark, CA, USA). Representative images were captured using the Slide Scanner Leica SCN400 system (Leica Biosystems, Wetzlar, Germany) and prepared in the Aperio ImageScope 12.4.3 software (Leica Biosystems, Wetzlar, Germany). More details as well as the probes are provided in the Supplementary Methods.

2.2. QPCR

For the detection of lncRNA expression in glomeruli and whole mouse kidneys, we performed quantitative RT-PCR analyses. Isolation of glomeruli with magnetic beads and qPCR were carried out as described previously [37,38]. Briefly, total RNA was extracted using the Direct-zol RNA Kit (Zymo Research, Irvine, CA, USA). The cDNA was synthesized using the High-Capacity cDNA Reverse Transcription Kit (Applied Biosystems, Waltham, MA, USA). The qPCR was performed using custom TaqMan PrimeTime assays (Integrated DNA Technologies, Coralville, IA, USA) and the 7900HT Fast Real-Time PCR System (Applied Biosystems, Waltham, MA, USA). More details as well as primer sequences are contained in the Supplementary Methods.

2.3. Animal Maintenance and Permissions

All animal experiments were conducted in accordance with European, national, and institutional guidelines and were permitted by the State Office of North Rhine-Westphalia, Department of Nature, Environment and Consumer Protection (LANUV NRW, Germany; animal approval AZ 81-02.04.2018.A325, AZ 2019.A085, and AZ 84-02_04_2014_A372). Experimental mice were kept in individually ventilated cages (Greenline GM500m Tecniplast, West Chester, PA, USA) at 22 °C and a humidity of 55% under a 12-h light/dark cycle with unlimited access to water and food in a specific-pathogen-free animal facility of the CECAD Research Center, University of Cologne, Germany.

2.4. Genetic Mouse Models

As described previously [2], podocinR231Q and podocinA286V mice were separately generated in our in vivo research facility (CECAD Research Center University of Cologne, Germany) using CRISPR-Cas9 based mutagenesis. Subsequently, podocinR231Q and podocinA286V mice were crossed to compound-heterozygosity. Genotyping was performed according to standard procedures using DNA isolated from ear biopsies. For podocinR231Q, DNA was visualized using gel electrophoresis following PCR-based amplification. For podocinA286V, DNA was amplified by PCR and analyzed by Sanger sequencing. The Wt1 heterozygous deletion model has been described extensively in the literature [37]. All mice were kept in a pure C57Bl/6 background.

2.5. Adriamycin Treatment

Mice were obtained from Janvier Labs (Le Genest-Saint-Isle, France). Adriamycin nephropathy was induced in 11-week-old, male BALB/cJRj-wildtype mice via injection of Adriamycin at a concentration of 12 mg/kg body weight [39]. Male BALB/cJRj-wildtype mice of the same age served as control animals. For injections, the mice were anesthetized with isoflurane. Adriamycin, solubilized in 0.9% sodium chloride, was then administered intravenously via a tail vein cannula. After Adriamycin application, the mice were housed individually and examined daily for weight loss, tail vein necrosis, and abnormal behavior. The day of injection was counted as Day 0. Animals were euthanized on Day 5 and the tissue was collected for glomeruli isolation.

2.6. Preparation of Glomeruli and Isolation of Podocytes

Preparation of glomeruli and FACS-sorting of podocytes were performed as previously described [40,41]. Briefly, Dynabeads M-450 (in Hank’s balanced salt solution, HBSS) were used to perfuse renal arteries after kidney dissection. Kidneys were minced and digested at 37 °C for 15 min (digestion solution: Collagenase II 300 U/mL (Worthington, Worthington, OH, USA), pronase E 1 mg/mL (Sigma-Aldrich, Darmstadt, Germany), and DNAse I 50 U/µL (Applichem, Darmstadt, Germany) in HBSS). The resulting suspension was sieved twice (100 µm) and glomeruli were separated using a magnetic particle concentrator. A glomerular single cell suspension was generated by incubation in the digestion solution at 37 °C for 40 min. After sieving through a 40 µm filter, GFP-positive cells were FACS-sorted on a BD FACSAria™ III cell sorter (Franklin Lakes, NJ, USA).

Total RNA extraction was performed using the RNeasy RNA extraction kit (Qiagen, Germantown, MD, USA) according to the manufacturer’s protocol. RNA integrity was assessed using the Tape Station system (Agilent, Santa Clara, CA, USA); only samples reaching an RNA integrity number ≥8 were used for RNA-seq.

2.7. RNA-Sequencing

Wildtype whole kidney and FACS-sorted podocyte RNA-seq datasets were used as previously published by our groups [37,42]. Briefly, separate libraries were prepared both after polyA-RNA enrichment and ribosomal RNA depletion and sequenced on a HiSeq4000 sequencer (Illumina, San Diego, CA, USA) with PE75 read length. For the glomerular RNA-seq datasets, libraries were prepared with the TruSeq (Illumina, San Diego, CA, USA) stranded ribo zero gold protocol. After library validation and quantification (Agilent 4200 tape station), equimolar amounts of the libraries were pooled. Pools were quantified using the Peqlab KAPA Library Quantification Kit (Roche, Basel, Switzerland) and the 7900HT Sequence Detection System (Applied Biosystems, Waltham, MA, USA) and sequenced on a NovaSeq6000 sequencer (Illumina, San Diego, CA, USA) with PE100 read length. More details are available in the Supplementary Methods.

2.8. Human Tissue

Human specimens were derived from healthy kidney tissue from a tumor nephrectomy after obtaining informed consent. All procedures were approved by the local institutional review board (#20-1206, Ethikkommission, Uniklinik Köln).

2.9. Data Analysis

2.9.1. Read Processing and Mapping

Illumina RNA-seq reads were pre-processed with Flexbar 3 [43] for quality clipping and adapter removal. We used Bowtie2 [44] and mouse reference transcripts (rRNA, tRNA) to subtract t/rRNA reads in silico. All the remaining reads were aligned against the mouse genome using STAR (2.6.0c) [45], guided by the EnsEMBL v90 reference annotation.

2.9.2. Transcript Assembly and Abundance Estimation

We used the following datasets to perform a transcriptome de novo assembly: FACS sorted Podocytes, Glomeruli wildtype samples, and whole kidney control samples. All of them were prepared with a ribo-zero library preparation strategy (see above). Our initial assembly was performed with Stringtie 1.3.5 [46] independently on each of the aforementioned tissue types. Subsequently, we merged the results and compared them against the Ensembl 90 reference using cufflinks 2.2.1 merge. Then, we used the resulting annotation file to estimate the RNA abundance with Stringtie 1.3.5 in the two podocyte-specific libraries (ribo-zero and polyA-selected RNA).

2.9.3. Selection of Potential lncRNA Candidates

Our initial selection step across our transcriptome assembly was based on transcript length ≥200 bp and RNA expression cutoff ≥1 FPKM.
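The initial selection step can be illustrated with a minimal Python sketch. It is not the CALINCA code itself: the `Transcript` record and its field names are hypothetical, and only the two cutoffs (length ≥200 nt, expression ≥1 FPKM) are taken from the text.

```python
from dataclasses import dataclass
from typing import Iterable, List

MIN_LENGTH_NT = 200  # transcript length cutoff stated in the text
MIN_FPKM = 1.0       # expression cutoff stated in the text


@dataclass
class Transcript:
    """Hypothetical record for one assembled transcript."""
    transcript_id: str
    length_nt: int
    fpkm: float


def passes_initial_filter(t: Transcript) -> bool:
    # Both cutoffs are inclusive (>=), as written in the Methods.
    return t.length_nt >= MIN_LENGTH_NT and t.fpkm >= MIN_FPKM


def select_candidates(transcripts: Iterable[Transcript]) -> List[Transcript]:
    """Keep transcripts that are long enough and sufficiently expressed."""
    return [t for t in transcripts if passes_initial_filter(t)]
```

In this sketch a transcript of exactly 200 nt at exactly 1 FPKM is retained, whereas one at 199 nt or 0.99 FPKM is dropped.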

Then, we used TransDecoder 5.5 (https://github.com/TransDecoder) to identify candidate coding regions (ORF in sense orientation ≥50 aa in length) within all the selected transcript sequences generated. Non-coding sequences were classified based on a dynamic ORF length cutoff [47], TransDecoder log-likelihood score, and start codon prediction (position specific scoring matrix). The longest ORF was always selected, when candidates were contained one within the other.
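To make the ORF criterion concrete, the following Python sketch scans the three sense-strand reading frames for the longest complete ORF (ATG to stop) and compares it with the 50 aa minimum. This is a deliberately simplified stand-in, not TransDecoder: it ignores the log-likelihood score, the dynamic length cutoff, and the start codon scoring matrix, and the function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
MIN_ORF_AA = 50  # minimum ORF length from the text


def longest_sense_orf_aa(seq: str) -> int:
    """Length in amino acids (stop codon excluded) of the longest
    complete ATG..stop ORF in the three sense-strand frames."""
    seq = seq.upper().replace("U", "T")
    best = 0
    for frame in range(3):
        start = None  # position of the first ATG since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                # Using the first ATG yields the longest of nested ORFs.
                start = i
            elif start is not None and codon in STOP_CODONS:
                best = max(best, (i - start) // 3)
                start = None
    return best


def has_candidate_orf(seq: str, min_aa: int = MIN_ORF_AA) -> bool:
    """True if the transcript carries a sense ORF of at least min_aa."""
    return longest_sense_orf_aa(seq) >= min_aa
```

Because the scan always starts an ORF at the first in-frame ATG after a stop codon, nested candidates sharing a stop codon collapse to the longest one, which mirrors the rule stated above.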

2.9.4. LncRNA Candidate Downstream Analysis

Sequence or gene order conservation with the human genome: We detected conserved gene orders (syntenic regions) based on the EnsEMBL protein coding gene annotation in human and mouse. Briefly, our Cyntenator software (latest version from November 2021) was used to compute all local gene order alignments based on gene coordinates and BLASTP scores [48]. NcRNAs adjacent to protein-coding genes were predicted as candidate orthologs. Nucleotide-level sequence conservation was detected by aligning mouse lncRNA transcript sequences against all human transcripts (n = 200,310) annotated in the reference GTF annotation (GRCh38.90.gtf). We only retained sequence alignments with an identity of more than 80% and an alignment length of more than 100 bp. These criteria were used to assign additional ortholog pairs and are motivated by an approach for ultraconserved sequence element detection [1]. We assessed the expression of predicted candidate human orthologs using four non-tumour whole kidney GTEX samples.

Protein Homology Search: All complete ORFs (i.e., with proper start and stop codons) which passed the dynamic ORF length cutoff were subjected to an additional BLASTX-based protein homology search using Swiss-Prot (cutoff: 50 bits).
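The nucleotide-level conservation filter can be sketched as follows. The text does not name the alignment tool or its output format, so this example assumes, purely for illustration, tab-separated hits whose first four columns are query ID, subject ID, percent identity, and alignment length (the leading columns of the common BLAST tabular layout). The function name and transcript IDs are hypothetical.

```python
from typing import Iterable, Set, Tuple

MIN_IDENTITY_PCT = 80.0  # identity must exceed this value (from the text)
MIN_ALIGN_LEN = 100      # alignment length must exceed this value (from the text)


def conserved_pairs(tabular_lines: Iterable[str]) -> Set[Tuple[str, str]]:
    """Return (mouse_transcript, human_transcript) pairs whose alignment
    passes both thresholds. Assumed columns: query, subject, %identity,
    alignment length; any further columns are ignored."""
    pairs = set()
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.rstrip("\n").split("\t")
        query, subject = fields[0], fields[1]
        identity, length = float(fields[2]), int(fields[3])
        # Strict inequalities: "more than 80%" and "more than 100 bp".
        if identity > MIN_IDENTITY_PCT and length > MIN_ALIGN_LEN:
            pairs.add((query, subject))
    return pairs
```

A hit at exactly 80% identity or exactly 100 bp is rejected here because the Methods specify "more than" for both criteria.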

Differential gene expression in FSGS disease models: We used edgeR (v3.24.3, [49]) to assess lncRNA gene locus expression changes. Briefly, we bundled all predicted lncRNA candidates into a new gene annotation set (excluding all protein-coding transcripts) to obtain read counts. Then, we used a two-factor design (age, condition) for the WT1-ko and NPHS2-ko models and a conventional one-factor design for the Adriamycin model. All loci with an FDR < 0.05 in any comparison were retained as significant. Moreover, we performed an analysis of tissue-specific expression using the tissue specificity index (TSI [50]). A TSI score of ≥0.80 is used to predict tissue-specific enrichment.
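As an illustration of the tissue specificity call, the sketch below assumes one common definition of the TSI: the expression in the most highly expressing tissue divided by the summed expression over all tissues. The exact formula used by the pipeline is given in reference [50] and is not restated in the text, so this definition, the function names, and the example values are assumptions. High values indicate specificity, with 0.8 as the threshold applied in the Results.

```python
from typing import Dict, Optional, Tuple

TSI_THRESHOLD = 0.8  # threshold applied in the Results (TSI >= 0.8)


def tissue_specificity_index(
    expression: Dict[str, float]
) -> Tuple[Optional[str], float]:
    """Return (top tissue, TSI), with TSI assumed to be max(x) / sum(x)."""
    total = sum(expression.values())
    if total <= 0:
        return None, 0.0  # not expressed anywhere: no specificity call
    top = max(expression, key=expression.get)
    return top, expression[top] / total


def is_specific_for(expression: Dict[str, float], tissue: str,
                    threshold: float = TSI_THRESHOLD) -> bool:
    """True if `tissue` is the top tissue and the TSI reaches the threshold."""
    top, tsi = tissue_specificity_index(expression)
    return top == tissue and tsi >= threshold
```

For example, hypothetical FPKM values of 9.0 in podocytes and 0.5 each in glomeruli and whole kidney give a TSI of 0.9 and a podocyte-specific call, while an even distribution over three tissues gives about 0.33.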

2.9.5. Re-Analysis of scRNA-seq Datasets

We obtained scRNA-seq control datasets from Kidney Glomeruli (Chung et al. [51]). Specifically, NCBI SRA accessions ctrl1: SRR11300654 and SRR11300655, ctrl2: SRR11300658 and SRR11300659, ctrl3: SRR11300660 and SRR11300661. We used the 10x Cell Ranger 3.0.2 software to process the data. We employed Seurat 3.2.2 [52] with default settings to generate UMAP plots and to identify podocytes based on cell-specific marker gene expression: Wt1, Nphs1, Nphs2, and Mafb. Only cells that express all four markers were labelled as podocytes. We tested for podocyte-specific gene expression with a Wilcoxon test using two groups: Podocytes vs. other cell types. All p-values are reported on calinca.dieterichlab.org and in Supplementary Table S1.
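The marker-based labelling rule can be written down in a few lines. The sketch below is plain Python rather than the Seurat workflow used in the study; the input layout (a dictionary of per-cell gene counts) and the function names are hypothetical, and "expressed" is taken to mean a count above zero, which is an assumption.

```python
from typing import Dict, List, Tuple

PODOCYTE_MARKERS = ("Wt1", "Nphs1", "Nphs2", "Mafb")  # markers from the text

Counts = Dict[str, Dict[str, int]]  # cell barcode -> {gene: count}


def label_cells(counts_by_cell: Counts) -> Dict[str, str]:
    """Label a cell 'podocyte' only if all four markers are detected."""
    labels = {}
    for cell, counts in counts_by_cell.items():
        all_markers = all(counts.get(g, 0) > 0 for g in PODOCYTE_MARKERS)
        labels[cell] = "podocyte" if all_markers else "other"
    return labels


def split_expression(gene: str, counts_by_cell: Counts,
                     labels: Dict[str, str]) -> Tuple[List[int], List[int]]:
    """Split one gene's counts into the two groups compared in the test."""
    podo = [c.get(gene, 0) for cell, c in counts_by_cell.items()
            if labels[cell] == "podocyte"]
    other = [c.get(gene, 0) for cell, c in counts_by_cell.items()
             if labels[cell] == "other"]
    return podo, other
```

The two lists returned by `split_expression` could then be passed to a rank-based two-group test (for instance `scipy.stats.mannwhitneyu`) to obtain a p-value analogous to the Wilcoxon comparison described above.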

2.10. Transcriptome Data Availability

For the podocyte RNA-seq datasets, raw and processed sequencing data have been deposited in GEO (GSE64063) (accessible at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE64063). The primary data of the wildtype kidney RNA-seq datasets are available at https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7982. The glomerular RNA-seq data were uploaded to GEO (BioProject ID PRJNA715735).

3. Results

3.1. Bioinformatic Pipeline Design to Identify lncRNAs Involved in FSGS

In order to allow for the identification of lncRNA candidates involved in FSGS with a potential role in human disease, two aspects needed to be solved. Firstly, appropriate transcriptome datasets generated using comparable methodology were required. Secondly, the analysis workflow could not be based on the available tools for coding RNAs considering the specific challenges regarding lncRNAs—i.e., exclusion of coding potential, determination of evolutionary conservation, and identification of novel transcripts. The selection of sample types for RNA-seq was based on the following considerations. Dysregulation of lncRNAs in FSGS should not be based on a single model only and be determined before overt glomerular scarring. Consequently, we sequenced RNA obtained from glomeruli from three different mouse models. Wt1 heterozygous deletion mutants (Wt1^+/−) and Nphs2 compound heterozygous mice (Pod^R231Q/A286V) were analyzed as genetic models at two timepoints each (4 and 12 weeks of age). Both models reliably lead to proteinuria due to FSGS and have been characterized extensively by our group in the past [1,2]. Additionally, Adriamycin treatment was used as a pharmacological model to induce podocyte disease, as described in the literature [3]. To confirm the induction of podocyte disease, urinary albumin-to-creatinine ratios were measured on the fifth day after Adriamycin injection (Supplementary Figure S1). To address lncRNA cell-type specificity whole kidneys, glomeruli and FACS-sorted podocytes from wildtype mice were analyzed in parallel (Figure 1, see also CALINCA flowchart on calinca.dieterichlab.org). To allow for optimal usage of these datasets, a bioinformatic pipeline (CALINCA) was set up addressing the key challenges associated with lncRNA transcript analyses (Figure 1). After read processing/mapping and transcript assembly for both annotated and novel lncRNAs, candidates are stringently examined for protein coding potential. 
Cell-type specificity is analyzed using the tissue specificity index (TSI) [50] and conservation in humans is determined based both on sequence and synteny. The CALINCA website (calinca.dieterichlab.org) provides interactive tools to create both user-defined tables and graphs to make all aspects of the pipeline accessible. Here, in addition to the points set forth in Figure 1, podocyte-specific expression can be interrogated using published scRNA-seq data and an insight on the expression of lncRNA candidates in humans is provided using GTEX kidney cortex RNA-seq data.

3.2. Characteristics of lncRNA Expression in the Kidney

An analysis of the datasets described revealed renal expression of 48,055 lncRNA transcripts after removal of transcripts containing ORFs above the defined length cutoff or showing an FPKM <1. About two thirds of these transcripts had previously been annotated, whilst one third comprises novel lncRNAs (Figure 2A, see also https://calinca.dieterichlab.org). Of these transcripts, 21,514 are conserved in humans by sequence and 35,620 by synteny. In the context of conservation, we introduced a second step to remove putatively coding transcripts using a protein homology filter. As expected, this reduces the number of transcripts conserved by sequence (by about 32%) and by gene order conservation (by 19%). Podocytes express 20,942 lncRNA transcripts, which are derived from 13,199 genes (Figure 2A). We found 1500 lncRNAs that show high podocyte-specific expression (TSI ≥ 0.8). The putative orthologues of 464 of these 1500 lncRNAs were found to be expressed in human kidney cortex data obtained from GTEX without de novo assembly, i.e., allowing only for a retrieval of annotated lncRNAs. A set of 879 podocyte-specific lncRNA transcripts remains after filtering for DNA sequence or gene order conservation and protein homology. Moreover, we computed the tissue specificity index for 15,155 novel transcripts for which we had enough sequencing coverage. Our analysis indicated that expression of these novel transcripts is distributed across all three renal compartments analyzed, with a tendency towards the highest TSI in podocytes (39.7% podocytes, 34.9% glomeruli, 25.4% kidney (non-glomerular), Figure 2B). Figure 2C shows the overlap between the different transcript features, highlighting a small set of 334 transcripts that meet all listed criteria. Intriguingly, novel lncRNAs harbor more exons and tend to be longer than the annotated transcripts (Figure 2D).

3.3. Dysregulation of lncRNAs in FSGS Models

Since our main focus was on lncRNAs dysregulated in podocytes, we next concentrated on podocyte-expressed and conserved lncRNA gene loci in the three FSGS models. To this end, we established a lncRNA-only genome annotation and computed lncRNA gene locus abundance based on this annotation. A detailed list of dysregulation over all lncRNA transcripts is provided in Supplementary Table S1 and can be visualized in a user-defined manner on the CALINCA website (https://calinca.dieterichlab.org). Of the 9789 lncRNA gene loci tested, 379, 2270, and 1833 are differentially regulated in the Wt1^+/−, Pod^R231Q/A286V, and Adriamycin models, respectively (FDR < 0.05; Figure 3A, Supplementary Table S1). Eighty-nine lncRNA gene loci are significantly dysregulated in all three models. We further dissected this analysis to uncover coherently regulated lncRNA gene loci (see bottom of Figure 3A). We see a high degree of coherent gene regulation for the model pairs Pod^R231Q/A286V/Adriamycin and Wt1^+/−/Pod^R231Q/A286V. We integrated our findings from Figure 2C with the set of dysregulated candidates in Figure 3A. Two hundred and forty-one out of the 757 high-confidence lncRNA loci overlap with the candidates contained in our three models (Figure 3B). In addition, we independently tested the cell-type specific expression in a published 3′ end scRNA-seq dataset (Chung et al. [51]) (Figure 3C). We could detect the expression of 203 lncRNA gene loci (out of the 241 candidates), strongly supporting our prediction of podocyte-specific expression for the majority of lncRNAs based on bulk RNA-seq (ribo zero) data. Figure 3D highlights this finding for two known loci with high TSI values for podocyte-specificity (Wt1os and 4921504A21Rik) in UMAP projections. An examination of podocyte-specific expression of all lncRNAs is provided on https://calinca.dieterichlab.org.
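The cross-model comparison amounts to set operations on per-model result tables. The Python sketch below shows one way to compute the loci that are significant in every model and, among those, the loci regulated in the same direction. The input layout (model name mapped to locus, log2 fold change, and FDR), the function names, and the example values are hypothetical; only the FDR < 0.05 cutoff is taken from the text.

```python
from typing import Dict, Set, Tuple

# model name -> {locus: (log2 fold change, FDR)}
Results = Dict[str, Dict[str, Tuple[float, float]]]

FDR_CUTOFF = 0.05  # significance threshold from the text


def significant_in_all(results: Results,
                       fdr_cutoff: float = FDR_CUTOFF) -> Set[str]:
    """Loci with FDR below the cutoff in every model."""
    sig_sets = [
        {locus for locus, (_, fdr) in table.items() if fdr < fdr_cutoff}
        for table in results.values()
    ]
    return set.intersection(*sig_sets) if sig_sets else set()


def coherently_regulated(results: Results, loci: Set[str]) -> Set[str]:
    """Subset of `loci` with the same sign of change in all models.
    A fold change of exactly zero is counted as 'down' in this sketch."""
    coherent = set()
    for locus in loci:
        directions = {table[locus][0] > 0 for table in results.values()}
        if len(directions) == 1:
            coherent.add(locus)
    return coherent
```

Applied to three hypothetical tables for the Wt1^+/−, Pod^R231Q/A286V, and Adriamycin models, `significant_in_all` yields the analogue of the three-way intersection in Figure 3A, and `coherently_regulated` narrows it to loci that change in the same direction throughout.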

3.4. Experimental Validation of FSGS lncRNA Candidates

To confirm the validity of the results obtained by CALINCA, we used three independent approaches for a subset of the lncRNAs found to be enriched in podocytes, conserved in humans, and dysregulated in at least one of our three FSGS models. Firstly, all of these candidates were examined in a published dataset containing transcriptomes of FACS-sorted glomerular cells and comparing podocytes to all non-podocyte cells (Boerries et al.) [2]. Fifty-five percent of the 1500 podocyte-enriched lncRNA transcripts identified by our TSI analysis could be retrieved as podocyte-specific in these data as well (information on this analysis for each specific transcript is provided on https://calinca.dieterichlab.org). Secondly, we selected six lncRNAs for further confirmation of enrichment in glomeruli compared to whole-kidney RNA samples. Three out of six (Wt1os, 4921504A21Rik, XLOC_024349) could be confirmed as podocyte-specific in both the dataset of FACS-sorted glomerular cells from Boerries et al. and the scRNA-seq dataset from Chung et al. [3]. Two of the remaining three (Gm26759, Gm10824) were confirmed by at least one of these additional analyses, and only Gm28876 was not found to be podocyte-enriched in either of the two (see https://calinca.dieterichlab.org). Using qPCR, five out of six lncRNAs were significantly enriched in glomeruli, whereas Gm10824 showed only a trend towards glomerular enrichment (Figure 4A, Supplementary Figure S2). Thirdly, visualization of lncRNA expression using in situ hybridization appeared to be an important step to validate both expression and cell-type specificity. We chose RNAscope technology [53] for this purpose due to the importance of sensitivity and specificity considering the low expression of most lncRNAs compared to coding genes. RNAscope stainings were performed for four of the lncRNA candidates. 
For three of the lncRNAs most strongly enriched in glomeruli, as shown by qPCR (Wt1os, 4921504A21Rik, XLOC_024349), the RNAscope signal is indeed limited to glomeruli (Figure 4B, Supplementary Figure S3). Importantly, XLOC_024349, a novel lncRNA identified by CALINCA, is one of these candidates, which supports the validity of the de novo assembly (Supplementary Figure S3). In line with the qPCR results, expression of Gm10824 was detected in tubular epithelial cells, even though at lower levels compared to glomeruli (Supplementary Figure S3). Importantly, the human orthologue of Wt1os, WT1-AS, is also specifically expressed in glomerular cells when examined by RNAscope in human tissue (Figure 4C, Supplementary Figure S4).

4. Discussion

The development of novel treatment strategies for FSGS as well as other glomerular diseases has been hampered by a lack of knowledge regarding causative factors and molecular mechanisms underlying podocyte pathophysiology. This shortcoming also limits the possibility to classify FSGS by pathobiology, an extremely important goal in a situation in which many different causes lead to the same histopathological picture. Since lncRNAs have not been studied systematically in FSGS to date, this field bears an extraordinary potential for the identification of novel pathomechanisms and therapeutic targets. However, lncRNA research comes with a couple of challenges—especially in the context of human disease. This includes bona-fide definition of truly non-coding transcripts and de novo detection of previously not annotated lncRNAs. Even more importantly—especially in translational research—evolutionary conservation has to be addressed and relies on criteria that differ much from coding genes since sequence conservation is often limited. Consequently, the available tools cannot easily be transferred from coding RNAs to ncRNAs. Additionally, when comparing disease to healthy states as well as different cell types, transcriptome data generation needs to be homogenous regarding tissue preparation and sequencing technology.

CALINCA addresses these challenges by integrating a multi-step bioinformatic pipeline with a series of RNA-seq datasets from FSGS models and healthy kidneys, including FACS-sorted podocytes. However, the focus on this specific disease and tissue type is primarily determined by the RNA-seq data used as input; in general, the code underlying CALINCA can easily be adapted to other disease scenarios in different tissues.

Our work now provides the first comprehensive atlas of lncRNAs that fulfil several important criteria to be considered key candidates in FSGS. These candidates are specific to or enriched in podocytes, the key cell type in disorders of the glomerular filter. In addition, the lack of coding potential is ensured by several steps in the algorithm, and dysregulation in FSGS is examined in several mouse models. CALINCA also identifies novel lncRNAs. Importantly, the algorithm not only confirms conservation by sequence and synteny but also considers human expression data to ensure that these genes are indeed transcribed in humans. This is a crucial point, since research in disease models should be limited to conserved transcripts, an aspect associated with much uncertainty regarding lncRNAs in the past. Here, we confirm the podocyte-specific expression of one of these lncRNAs, WT1-AS, in human tissue using RNAscope [53]. This approach is very helpful for demonstrating cell-type-specific expression and provides single-molecule sensitivity, highly specific staining of the target RNA, and improved detection of degraded RNA. Importantly, RNAscope can be automated and used for RNA-protein co-expression analyses, making it a potential entry point towards the use of lncRNA quantification in human diagnostics [54].

Making large-scale RNA-seq datasets available to the scientific community in a fashion that allows for simple and efficient interrogation is extremely important for making full use of the power of such data. This is especially true when implementing a novel pipeline for new RNA biotypes such as lncRNAs. The CALINCA website (calinca.dieterichlab.org) provides such a tool, allowing for both global and transcript-specific analysis of our datasets. Importantly, the generation of user-defined tables is accompanied by the possibility to generate graphs for the visualization of the selected analyses. Using this option shows that dysregulated lncRNAs contain a much higher proportion of podocyte-specific transcripts in models using podocyte-specific interventions (such as Wt1^+/− and Pod^R231Q/A286V) compared to Adriamycin toxicity, which directly affects all cells of the glomerulus (calinca.dieterichlab.org, histogram examples 1–4 and boxplot examples 1–3). To provide another example, CALINCA can also be used to examine glomerular lncRNAs previously described in glomerular disease. As mentioned above, most data are available in the context of diabetic nephropathy. Consequently, we checked CALINCA regarding four examples from this context. Tug1, an important player in the metabolic response of podocytes in diabetes mellitus [34], is indeed also expressed in podocytes in our data and, in line with current knowledge, conserved in humans. This is also the case for Malat1, a mediator of cellular damage in diabetic glomerulopathy [55]. In addition, however, Malat1 is dysregulated in the Pod^R231Q/A286V model (up at 4 weeks, down at 12 weeks) and may consequently play a role in podocyte biology beyond diabetes mellitus. Neat1 mediates diabetic damage through glomerular activation of Akt/mTOR signaling. CALINCA shows Neat1 to be significantly enriched in podocytes. Interestingly, all three FSGS models lead to a dysregulation of Neat1, suggesting that this lncRNA may be a central player in podocyte pathobiology. Furthermore, Pvt1 has been implicated in diabetic nephropathy as a mediator of podocyte apoptosis, based on knockdown experiments [56]. However, in our datasets, Pvt1 is not expressed in podocytes, which suggests that podocyte expression of this lncRNA may actually be exclusive to the setting of high glucose. Taken together, these examples show how CALINCA can be used to extend our knowledge on lncRNAs in glomerular disease and to design future studies in this field.

Of course, our current study does not yet address lncRNA function in disease. Here, further work, primarily using knockout mouse models, will be required.

5. Conclusions

Taken together, CALINCA examines compartment-specific expression of conserved lncRNAs in the healthy kidney and will be a beneficial tool to study lncRNA involvement in renal (patho)physiology in general. By including data from several FSGS models, CALINCA is a powerful tool to identify conserved lncRNAs in FSGS. This novel pipeline will facilitate lncRNA research not only in FSGS but also in other glomerular and tubular renal diseases. To allow for dynamic, user-friendly access, the analyses are provided through an interactive website (https://calinca.dieterichlab.org).

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4409/10/3/692/s1: Supplementary Methods; Figure S1: Graphical representation of urine albumin-to-creatinine ratio levels in Adriamycin treated mice vs. control; Figure S2: Graphical representation of qPCR data shown in Figure 4A; Figure S3: Localization of Wt1os (A), 4921504A21Rik (B), Gm10824 (C), and XLOC_024349 (D) by RNAscope; Figure S4: Localization of WT1-AS, the human homolog of Wt1os; Table S1: Excel spreadsheet containing the output of all analyses of the RNA-seq data performed.

Author Contributions

Conceptualization, R.-U.M. and C.D.; methodology, S.T. and C.D.; validation, S.F., M.I., and R.-U.M.; formal analysis, S.T., S.F., M.I., and C.D.; investigation, S.F., M.I., H.G., P.B., L.B., L.K., J.A., H.C., F.C.K. and K.J.R.H.-A.; resources, R.-U.M. and C.D.; data curation, C.D.; writing—original draft preparation, R.-U.M. and C.D.; writing—review and editing, S.F., M.I., R.-U.M., and C.D.; visualization, S.T., S.F., M.I., M.S., and C.D.; supervision, M.I., B.S., R.-U.M., and C.D.; project administration, P.B., T.B., M.K., B.S., R.-U.M., and C.D.; funding acquisition, R.-U.M. and C.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Deutsche Forschungsgemeinschaft (CRU 329, DI1501/9-1 to C.D. and MU 3629/3-1 to R.-U.M.). Thomas Benzing and Bernhard Schermer received funding from the Deutsche Forschungsgemeinschaft (CRU 329). R.-U.M. received additional funding through the Nachwuchsgruppen.NRW program of the Ministry of Science North Rhine-Westphalia.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

RNA-seq data are available as specified in Section 2.10 (Transcriptome Data Availability), and the analyses are provided at https://calinca.dieterichlab.org.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Bikbov, B.; Purcell, C.A.; Levey, A.S.; Smith, M.; Abdoli, A.; Abebe, M.; Adebayo, O.M.; Afarideh, M.; Agarwal, S.K.; Agudelo-Botero, M.; et al. Global, Regional, and National Burden of Chronic Kidney Disease, 1990–2017: A Systematic Analysis for the Global Burden of Disease Study 2017. Lancet 2020, 395, 709–733.
2. Butt, L.; Unnersjö-Jess, D.; Höhne, M.; Edwards, A.; Binz-Lotter, J.; Reilly, D.; Hahnfeldt, R.; Ziegler, V.; Fremter, K.; Rinschen, M.M.; et al. A Molecular Mechanism Explaining Albuminuria in Kidney Disease. Nat. Metab. 2020, 2, 461–474.
3. Brinkkoetter, P.T.; Ising, C.; Benzing, T. The Role of the Podocyte in Albumin Filtration. Nat. Rev. Nephrol. 2013, 9, 328–336.
4. Cravedi, P.; Remuzzi, G. Pathophysiology of Proteinuria and Its Value as an Outcome Measure in Chronic Kidney Disease. Br. J. Clin. Pharmacol. 2013, 76, 516–523.
5. Agrawal, V.; Marinescu, V.; Agarwal, M.; McCullough, P.A. Cardiovascular Implications of Proteinuria: An Indicator of Chronic Kidney Disease. Nat. Rev. Cardiol. 2009, 6, 301–311.
6. Hogan, J.; Mohan, P.; Appel, G.B. Diagnostic Tests and Treatment Options in Glomerular Disease: 2014 Update. Am. J. Kidney Dis. Off. J. Natl. Kidney Found. 2014, 63, 656–666.
7. D’Agati, V.D.; Kaskel, F.J.; Falk, R.J. Focal Segmental Glomerulosclerosis. N. Engl. J. Med. 2011, 365, 2398–2411.
8. Djebali, S.; Davis, C.A.; Merkel, A.; Dobin, A.; Lassmann, T.; Mortazavi, A.; Tanzer, A.; Lagarde, J.; Lin, W.; Schlesinger, F.; et al. Landscape of Transcription in Human Cells. Nature 2012, 489, 101–108.
9. Barrett, S.P.; Salzman, J. Circular RNAs: Analysis, Expression and Potential Functions. Dev. Camb. Engl. 2016, 143, 1838–1847.
10. Harvey, S.J.; Jarad, G.; Cunningham, J.; Goldberg, S.; Schermer, B.; Harfe, B.D.; McManus, M.T.; Benzing, T.; Miner, J.H. Podocyte-Specific Deletion of Dicer Alters Cytoskeletal Dynamics and Causes Glomerular Disease. J. Am. Soc. Nephrol. 2008, 19, 2150–2158.
11. Gebeshuber, C.A.; Kornauth, C.; Dong, L.; Sierig, R.; Seibler, J.; Reiss, M.; Tauber, S.; Bilban, M.; Wang, S.; Kain, R.; et al. Focal Segmental Glomerulosclerosis Is Induced by MicroRNA-193a and Its Downregulation of WT1. Nat. Med. 2013, 19, 481–487.
12. Sebastian, M.; Marvin, J.; Antigoni, E.; Francesca, T.; Janna, K.; Agnieszka, R.; Luisa, M.; Sebastian, D.M.; Lea, H.G.; Mathias, M.; et al. Circular RNAs Are a Large Class of Animal RNAs with Regulatory Potency. Available online: https://pubmed.ncbi.nlm.nih.gov/23446348/ (accessed on 29 September 2020).
13. Maxmen, A. RNA: The Genome’s Rising Stars. Nature 2013, 496, 127–129.
14. Derrien, T.; Johnson, R.; Bussotti, G.; Tanzer, A.; Djebali, S.; Tilgner, H.; Guernec, G.; Martin, D.; Merkel, A.; Knowles, D.G.; et al. The GENCODE v7 Catalog of Human Long Noncoding RNAs: Analysis of Their Gene Structure, Evolution, and Expression. Genome Res. 2012, 22, 1775–1789.
15. Matsumoto, A.; Pasut, A.; Matsumoto, M.; Yamashita, R.; Fung, J.; Monteleone, E.; Saghatelian, A.; Nakayama, K.I.; Clohessy, J.G.; Pandolfi, P.P. MTORC1 and Muscle Regeneration Are Regulated by the LINC00961-Encoded SPAR Polypeptide. Nature 2017, 541, 228–232.
16. Flower, C.T.; Chen, L.; Jung, H.J.; Raghuram, V.; Knepper, M.A.; Yang, C.-R. An Integrative Proteogenomics Approach Reveals Peptides Encoded by Annotated LincRNA in the Mouse Kidney Inner Medulla. Physiol. Genom. 2020.
17. Ulitsky, I. Evolution to the Rescue: Using Comparative Genomics to Understand Long Non-Coding RNAs. Nat. Rev. Genet. 2016, 17, 601–614.
18. Harrow, J.; Frankish, A.; Gonzalez, J.M.; Tapanari, E.; Diekhans, M.; Kokocinski, F.; Aken, B.L.; Barrell, D.; Zadissa, A.; Searle, S.; et al. GENCODE: The Reference Human Genome Annotation for The ENCODE Project. Genome Res. 2012, 22, 1760–1774.
19. Zhao, Y.; Li, H.; Fang, S.; Kang, Y.; Wu, W.; Hao, Y.; Li, Z.; Bu, D.; Sun, N.; Zhang, M.Q.; et al. NONCODE 2016: An Informative and Valuable Data Source of Long Non-Coding RNAs. Nucleic Acids Res. 2016, 44, D203–D208.
20. Penny, G.D.; Kay, G.F.; Sheardown, S.A.; Rastan, S.; Brockdorff, N. Requirement for Xist in X Chromosome Inactivation. Nature 1996, 379, 131–137.
21. Pachnis, V.; Brannan, C.I.; Tilghman, S.M. The Structure and Expression of a Novel Gene Activated in Early Mouse Embryogenesis. EMBO J. 1988, 7, 673–681.
22. Kopp, F.; Mendell, J.T. Functional Classification and Experimental Dissection of Long Noncoding RNAs. Cell 2018, 172, 393–407.
23. Grote, P.; Wittler, L.; Hendrix, D.; Koch, F.; Währisch, S.; Beisaw, A.; Macura, K.; Bläss, G.; Kellis, M.; Werber, M.; et al. The Tissue-Specific LncRNA Fendrr Is an Essential Regulator of Heart and Body Wall Development in the Mouse. Dev. Cell 2013, 24, 206–214.
24. Klattenhoff, C.A.; Scheuermann, J.C.; Surface, L.E.; Bradley, R.K.; Fields, P.A.; Steinhauser, M.L.; Ding, H.; Butty, V.L.; Torrey, L.; Haas, S.; et al. Braveheart, a Long Noncoding RNA Required for Cardiovascular Lineage Commitment. Cell 2013, 152, 570–583.
25. Abdelmohsen, K.; Panda, A.; Kang, M.-J.; Xu, J.; Selimyan, R.; Yoon, J.-H.; Martindale, J.L.; De, S.; Wood, W.H.; Becker, K.G.; et al. Senescence-Associated LncRNAs: Senescence-Associated Long Noncoding RNAs. Aging Cell 2013, 12, 890–900.
26. Gupta, S.K.; Piccoli, M.T.; Thum, T. Non-Coding RNAs in Cardiovascular Ageing. Ageing Res. Rev. 2014, 17, 79–85.
27. Leucci, E.; Vendramin, R.; Spinazzi, M.; Laurette, P.; Fiers, M.; Wouters, J.; Radaelli, E.; Eyckerman, S.; Leonelli, C.; Vanderheyden, K.; et al. Melanoma Addiction to the Long Non-Coding RNA SAMMSON. Nature 2016, 531, 518–522.
28. Wang, P.; Xu, J.; Wang, Y.; Cao, X. An Interferon-Independent LncRNA Promotes Viral Replication by Modulating Cellular Metabolism. Science 2017, 358, 1051–1055.
29. Fang, Y.; Xu, Y.; Wang, R.; Hu, L.; Guo, D.; Xue, F.; Guo, W.; Zhang, D.; Hu, J.; Li, Y.; et al. Recent Advances on the Roles of LncRNAs in Cardiovascular Disease. J. Cell. Mol. Med. 2020.
30. Vangoor, V.R.; Gomes-Duarte, A.; Pasterkamp, R.J. Long Non-Coding RNAs in Motor Neuron Development and Disease. J. Neurochem. 2020.
31. Acharya, S.; Salgado-Somoza, A.; Stefanizzi, F.M.; Lumley, A.I.; Zhang, L.; Glaab, E.; May, P.; Devaux, Y. Non-Coding RNAs in the Brain-Heart Axis: The Case of Parkinson’s Disease. Int. J. Mol. Sci. 2020, 21, 6513.
32. Chen, W.; Yang, J.; Fang, H.; Li, L.; Sun, J. Relevance Function of Linc-ROR in the Pathogenesis of Cancer. Front. Cell Dev. Biol. 2020, 8, 696.
33. Ignarski, M.; Islam, R.; Müller, R.-U. Long Non-Coding RNAs in Kidney Disease. Int. J. Mol. Sci. 2019, 20, 3276.
34. Long, J.; Badal, S.S.; Ye, Z.; Wang, Y.; Ayanga, B.A.; Galvan, D.L.; Green, N.H.; Chang, B.H.; Overbeek, P.A.; Danesh, F.R. Long Noncoding RNA Tug1 Regulates Mitochondrial Bioenergetics in Diabetic Nephropathy. J. Clin. Investig. 2016, 126, 4205–4218.
35. Han, R.; Hu, S.; Qin, W.; Shi, J.; Zeng, C.; Bao, H.; Liu, Z. Upregulated Long Noncoding RNA LOC105375913 Induces Tubulointerstitial Fibrosis in Focal Segmental Glomerulosclerosis. Sci. Rep. 2019, 9.
36. Hu, S.; Han, R.; Shi, J.; Zhu, X.; Qin, W.; Zeng, C.; Bao, H.; Liu, Z. The Long Noncoding RNA LOC105374325 Causes Podocyte Injury in Individuals with Focal Segmental Glomerulosclerosis. J. Biol. Chem. 2018, 293, 20227–20239.
37. Kann, M.; Ettou, S.; Jung, Y.L.; Lenz, M.O.; Taglienti, M.E.; Park, P.J.; Schermer, B.; Benzing, T.; Kreidberg, J.A. Genome-Wide Analysis of Wilms’ Tumor 1-Controlled Gene Expression in Podocytes Reveals Key Regulatory Mechanisms. J. Am. Soc. Nephrol. JASN 2015, 26, 2097–2104.
38. Bartram, M.P.; Höhne, M.; Dafinger, C.; Völker, L.A.; Albersmeyer, M.; Heiss, J.; Göbel, H.; Brönneke, H.; Burst, V.; Liebau, M.C.; et al. Conditional Loss of Kidney MicroRNAs Results in Congenital Anomalies of the Kidney and Urinary Tract (CAKUT). J. Mol. Med. Berl. Ger. 2013, 91, 739–748.
39. Wang, Y.M.; Wang, Y.; Harris, D.C.H.; Alexander, S.I.; Lee, V.W.S. Adriamycin Nephropathy in BALB/c Mice. Curr. Protoc. Immunol. 2015, 108, 15.28.1–15.28.6.
40. Boerries, M.; Grahammer, F.; Eiselein, S.; Buck, M.; Meyer, C.; Goedel, M.; Bechtel, W.; Zschiedrich, S.; Pfeifer, D.; Laloë, D.; et al. Molecular Fingerprinting of the Podocyte Reveals Novel Gene and Protein Regulatory Networks. Kidney Int. 2013, 83, 1052–1064.
41. Takemoto, M.; Asker, N.; Gerhardt, H.; Lundkvist, A.; Johansson, B.R.; Saito, Y.; Betsholtz, C. A New Method for Large Scale Isolation of Kidney Glomeruli from Mice. Am. J. Pathol. 2002, 161, 799–805.
42. Johnsen, M.; Kubacki, T.; Yeroslaviz, A.; Späth, M.R.; Mörsdorf, J.; Göbel, H.; Bohl, K.; Ignarski, M.; Meharg, C.; Habermann, B.; et al. The Integrated RNA Landscape of Renal Preconditioning against Ischemia-Reperfusion Injury. J. Am. Soc. Nephrol. JASN 2020, 31, 716–730.
43. Roehr, J.T.; Dieterich, C.; Reinert, K. Flexbar 3.0—SIMD and Multicore Parallelization. Bioinforma. Oxf. Engl. 2017, 33, 2941–2942.
44. Langmead, B.; Salzberg, S.L. Fast Gapped-Read Alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359.
45. Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast Universal RNA-Seq Aligner. Bioinform. Oxf. Engl. 2013, 29, 15–21.
46. Pertea, M.; Pertea, G.M.; Antonescu, C.M.; Chang, T.-C.; Mendell, J.T.; Salzberg, S.L. StringTie Enables Improved Reconstruction of a Transcriptome from RNA-Seq Reads. Nat. Biotechnol. 2015, 33, 290–295.
47. Dinger, M.E.; Pang, K.C.; Mercer, T.R.; Mattick, J.S. Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities. PLoS Comput. Biol. 2008, 4, e1000176.
48. Rödelsperger, C.; Dieterich, C. CYNTENATOR: Progressive Gene Order Alignment of 17 Vertebrate Genomes. PLoS ONE 2010, 5, e8861.
49. McCarthy, D.J.; Chen, Y.; Smyth, G.K. Differential Expression Analysis of Multifactor RNA-Seq Experiments with Respect to Biological Variation. Nucleic Acids Res. 2012, 40, 4288–4297.
50. Kryuchkova-Mostacci, N.; Robinson-Rechavi, M. A Benchmark of Gene Expression Tissue-Specificity Metrics. Brief. Bioinform. 2017, 18, 205–214.
51. Chung, J.-J.; Goldstein, L.; Chen, Y.-J.J.; Lee, J.; Webster, J.D.; Roose-Girma, M.; Paudyal, S.C.; Modrusan, Z.; Dey, A.; Shaw, A.S. Single-Cell Transcriptome Profiling of the Kidney Glomerulus Identifies Key Cell Types and Reactions to Injury. J. Am. Soc. Nephrol. 2020, 31, 2341–2354.
52. Stuart, T.; Butler, A.; Hoffman, P.; Hafemeister, C.; Papalexi, E.; Mauck, W.M.; Hao, Y.; Stoeckius, M.; Smibert, P.; Satija, R. Comprehensive Integration of Single-Cell Data. Cell 2019, 177, 1888–1902.e21.
53. Wang, F.; Flanagan, J.; Su, N.; Wang, L.-C.; Bui, S.; Nielson, A.; Wu, X.; Vo, H.-T.; Ma, X.-J.; Luo, Y. RNAscope: A Novel in Situ RNA Analysis Platform for Formalin-Fixed, Paraffin-Embedded Tissues. J. Mol. Diagn. JMD 2012, 14, 22–29.
54. Anderson, C.M.; Zhang, B.; Miller, M.; Butko, E.; Wu, X.; Laver, T.; Kernag, C.; Kim, J.; Luo, Y.; Lamparski, H.; et al. Fully Automated RNAscope In Situ Hybridization Assays for Formalin-Fixed Paraffin-Embedded Cells and Tissues. J. Cell. Biochem. 2016, 117, 2201–2208.
55. Hu, M.; Wang, R.; Li, X.; Fan, M.; Lin, J.; Zhen, J.; Chen, L.; Lv, Z. LncRNA MALAT1 Is Dysregulated in Diabetic Nephropathy and Involved in High Glucose-Induced Podocyte Injury via Its Interplay with β-Catenin. J. Cell. Mol. Med. 2017, 21, 2732–2747.
56. Liu, D.-W.; Zhang, J.-H.; Liu, F.-X.; Wang, X.-T.; Pan, S.-K.; Jiang, D.-K.; Zhao, Z.-H.; Liu, Z.-S. Silencing of Long Noncoding RNA PVT1 Inhibits Podocyte Damage and Apoptosis in Diabetic Nephropathy by Upregulating FOXA1. Exp. Mol. Med. 2019, 51, 1–15.

Figure 1. The CALINCA pipeline. Experimental design and stepwise visualization of the CALINCA pipeline developed for this study. RNA-seq data generated from podocytes, glomeruli, and whole kidneys of wildtype mice alongside glomeruli from three focal-segmental glomerulosclerosis (FSGS) models (Wt1^+/−, Nphs2^R231Q/A286V, Adriamycin) were analyzed according to the depicted CALINCA workflow. For a more detailed overview of the datasets, see “CALINCA Flowchart” on https://calinca.dieterichlab.org. Briefly, long non-coding RNA (lncRNA) expression in renal compartments is quantified after read processing and mapping. In addition to the quantification of annotated lncRNAs, novel transcripts are detected using a reference-guided de novo assembly. The lncRNA candidates are defined based on open reading frame (ORF) cutoffs. These candidates are then checked for tissue specificity using the tissue specificity index (TSI) method and for evolutionary conservation based on synteny and sequence.
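
The tissue specificity index named in the Figure 1 caption can be illustrated with a minimal sketch. It assumes the TSI definition used in the benchmark by Kryuchkova-Mostacci and Robinson-Rechavi [50], i.e., the expression in the top compartment divided by the summed expression over all compartments; the function name, compartment labels, and expression values are hypothetical and are not taken from the CALINCA code.

```python
from typing import Mapping, Tuple


def tissue_specificity_index(expression: Mapping[str, float]) -> Tuple[str, float]:
    """Return (compartment with highest expression, TSI) for one transcript.

    Assumed definition: TSI = max(x_i) / sum(x_i), where x_i is the
    non-negative expression of the transcript in compartment i.
    """
    total = sum(expression.values())
    if total <= 0:
        raise ValueError("transcript is not expressed in any compartment")
    top = max(expression, key=expression.get)
    return top, expression[top] / total


# Hypothetical expression values (e.g., TPM) for a single transcript.
example = {"podocyte": 90.0, "glomerulus": 8.0, "whole_kidney": 2.0}
top_compartment, tsi = tissue_specificity_index(example)
print(top_compartment, round(tsi, 2))  # podocyte 0.9
```

Under this definition, a value close to 1 indicates expression restricted to a single compartment, while a value of 1/n (for n compartments) indicates uniform expression.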

Figure 2. CALINCA identifies 879 conserved podocyte lncRNAs. (A) Our workflow to define the set of podocyte-specific lncRNAs (see Methods). A final set of 879 conserved and podocyte-specific lncRNAs is defined. (B) Tissue specificity of novel podocyte-expressed lncRNA candidates. We identified 15,155 out of 20,942 expressed transcripts as novel (i.e., not represented in the reference annotation). The tissue specificity is defined by the maximal TSI value. (C) Stratification of lncRNA candidates by expression and conservation (see the colored tags in panel A). The overlap between different lncRNA candidate properties is shown as set intersections. (D) Transcript length and number of exons stratified by annotation status (known/novel) for the final set of 879 conserved and podocyte-enriched lncRNA transcripts.

Figure 3. Dysregulation of lncRNAs in three mouse models of FSGS and validation of cell-type specificity using scRNA-seq data. (A) Venn diagram of differential lncRNA candidate gene loci expression across three disease models (top). Additional details on co-regulation across disease models are given in the bar chart at the bottom. * The two-factor model (age, condition) is used for Podocin and WT1. (B) Overlap of differential gene expression with the 757 conserved, podocyte-specific lncRNA candidate loci from Figure 2D. (C) Re-assessment of differentially regulated candidate loci from Figure 2D with regard to podocyte specificity using single-cell data from Chung et al. JASN 2020. We could identify 203 out of 241 lncRNA loci as expressed in 3′ scRNA data, with the majority being podocyte-specific. (D) Uniform Manifold Approximation and Projection graphs (UMAPs) of two examples of highly podocyte-specific and conserved lincRNAs in scRNA-seq data (Chung et al. JASN 2020): Wt1os and 4921504A21Rik.

Figure 4. qPCR and RNAscope validate podocyte-specific lncRNAs as defined by CALINCA. (A) Table showing six lncRNAs dysregulated in at least one of the FSGS models, the glomerular expression of which was validated by qPCR and/or RNAscope. For the Wt1^+/− and Pod^R231Q/A286V models, a transcript is classified as differentially expressed (DE/Yes) if it is significantly regulated (adjusted p-value < 0.05) at one or more of the time points (4 weeks, 12 weeks) or in the two-factor model. The same cutoff applies to the Adriamycin model, which has a single time point only. For the full table, refer to Supplementary Table S1 or calinca.dieterichlab.org. A visualization of the qPCR data is provided in Supplementary Figure S2. (B) Representative images of glomeruli and tubules analyzed with custom-designed RNAscope probes for the lncRNAs Wt1os and 4921504A21Rik. Both lncRNAs are specifically detected in glomerular cells only, whilst the positive control shows a signal in both glomeruli and tubuli. Additional images as well as the results for Gm10824 and XLOC_024349 are provided in Supplementary Figure S3. Target lncRNAs and controls were detected with the RNAscope 2.5 HD-brown assay on formalin-fixed, paraffin-embedded (FFPE) mouse kidney tissue sections. Probe binding is visualized as punctate brown dots. Counterstain: Hematoxylin (blue). Scale bar: 60 µm. (C) The human ortholog of Wt1os (WT1-AS) is expressed in glomerular cells. Representative image of a human glomerulus analyzed with a custom-designed RNAscope probe for WT1-AS. The lncRNA was detected with the RNAscope 2.5 HD-brown assay on an FFPE human kidney tissue section. Probe binding is visualized as punctate brown dots. Counterstain: Hematoxylin (blue). Scale bar: 90 µm. More images as well as the images showing lack of expression in tubuli are provided in Supplementary Figure S4.
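
The DE/Yes classification rule stated in the Figure 4 caption can be written down as a short sketch. The cutoff (adjusted p-value < 0.05) and the contrasts (4 weeks, 12 weeks, two-factor model, or a single time point for Adriamycin) come from the caption; the function name, dictionary keys, and p-values below are hypothetical illustrations and do not reproduce the authors' implementation or data.

```python
from typing import Mapping, Optional

ALPHA = 0.05  # adjusted p-value cutoff stated in the Figure 4 caption


def is_differentially_expressed(
    adjusted_p: Mapping[str, Optional[float]], alpha: float = ALPHA
) -> bool:
    """Return True if any available contrast has an adjusted p-value below alpha.

    Contrasts without a test result are passed as None and ignored.
    """
    return any(p is not None and p < alpha for p in adjusted_p.values())


# Hypothetical adjusted p-values for one transcript in a genetic model
# (two time points plus the two-factor model) and in the Adriamycin model.
genetic_model = {"4_weeks": 0.20, "12_weeks": 0.03, "two_factor": None}
adriamycin_model = {"single_time_point": 0.07}

print(is_differentially_expressed(genetic_model))     # True
print(is_differentially_expressed(adriamycin_model))  # False
```

Because the rule is an "any contrast" criterion, a transcript that is significant at only one time point is still reported as DE for that model.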

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Talyan, S.; Filipów, S.; Ignarski, M.; Smieszek, M.; Chen, H.; Kühne, L.; Butt, L.; Göbel, H.; Hoyer-Allo, K.J.R.; Koehler, F.C.; et al. CALINCA—A Novel Pipeline for the Identification of lncRNAs in Podocyte Disease. Cells 2021, 10, 692. https://doi.org/10.3390/cells10030692

