4.7 Article

De novo high-accuracy transcriptomes from long-read sequencing reveals a wide variety of novel splice variants in copepodids and adult female salmon lice (Lepeophtheirus salmonis)

Journal

FRONTIERS IN MARINE SCIENCE
Volume 10, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fmars.2023.1167402

Keywords

salmon lice; transcriptome; long-read sequencing; full-length mRNA; PacBio Iso-seq; Illumina sequencing; aquaculture; RNA sequencing

Ask authors/readers for more resources

In this study, a full-length transcriptome of the ectoparasitic salmon louse was generated using a combination of long-read and short-read sequencing. A total of 31,092 high-accuracy full-length transcripts were identified, with over half being specific to either the copepodid or adult life stage. This resource can be applied in expression studies, SNP mining, and exploring gene networks and enrichment analysis following expression studies.
Former transcriptome studies of the ectoparasitic salmon louse (Lepeophtheirus salmonis) are based on short-read sequencing and in silico predictions, with the disadvantage of inadequately describing splice variants and insufficient differentiation between duplicated genes. In the present study, a de novo full-length transcriptome (TSA accession GKKU00000000) was generated using single-molecule long-read RNA-sequencing (PacBio IsoSeq platform) corrected by short reads (Illumina platform) from the same RNA samples. The two samples, cephalothorax of an adult female and her copepodid offspring, were analyzed separately to facilitate comparison and identification of transcripts unique to each life stage. Each transcript has been supported by two or more full-length non-chimeric reads and at least three short reads, ensuring high-sequence accuracy. A total of 31,092 unique high-accuracy full-length transcripts with an open reading frame > 150 bp, originating from 10,034 unique loci of the salmon louse genome, were identified. More than half of the transcripts are life-stage specific, exclusively present in either the copepodid or adult sample. Approximately one-third of the transcripts were full splice matches with predicted protein coding transcripts presented in NCBI, thus validating these. More than half of the transcripts constituted novel isoforms with at least one new splicing site. We conclude that the full-length transcriptomes represent a versatile reference resource of transcripts. Suitable applications include expression studies, SNP mining, and studies on the biological effects of differences in gene (or isoform) expression between copepodids and adult females. The additional functional annotation of 88% of transcripts allows for identification of gene families of particular interest and for exploration of gene networks and enrichment analysis following expression studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available