January 11, 2022 - The paper "Phased, chromosome-scale genome assemblies of tetraploid potato reveals a complex genome, transcriptome, and predicted proteome landscape underpinning genetic diversity" by Hoopes et al. has been published in Molecular Plant.
The genome assembly and annotation files are available below. The entire data set for the paper is available on Data Dryad at https://doi.org/10.5061/dryad.3n5tb2rhw.
February 17, 2023 - The Buell Lab at the University of Georgia is pleased to make available an updated set of genome annotation for the Atlantic genome assembly (ATL_v3). ATL_v3 was annotated as described in (Hoopes et al. 2022 ) with new RNA-seq libraries and Oxford Nanopore (ONT) cDNA libraries. The methods for processing the new RNA-seq and ONT cDNA libraries as are follows:
RNA-seq libraries were processed for genome annotation by first cleaning with Cutadapt(Martin 2011, v2.10) using a minimum length of 100 nt and quality cutoff of 10 then aligning the cleaned reads to the respective genome using HISAT2(Kim et al. 2019, v2.1.0). Oxford Nanopore (ONT) cDNA reads were processed with Pychopper (v2.5.0; github.com/epi2me-labs/pychopper) and trimmed reads greater than 500 nt were aligned to the respective genome using minimap2(H. Li 2018, v2.17-r941) with a maximum intron length of 5,000 nt. The aligned RNA-seq and ONT cDNA reads were each assembled using Stringtie (Kovaka et al. 2019, v2.2.1) and transcripts less than 500 nt were removed.
Note: the previous Atlantic v2.0 assembly and annotation can be found at Data Dryad at https://doi.org/10.5061/dryad.3n5tb2rhw.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
The representative working gene models are a subset of the working gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
High confidence representative gene models are a subset of the high confidence gene model set. Each representative gene model is the isoform with the longest CDS at each locus.
The set of working gene models contains all loci and isoforms from the annotation pipeline and may include artifacts such as partial gene models.
![]() |
![]() |
![]() |
![]() |