Index of /download/gff/C_albicans_SC5314

Icon  Name                                                                                Last modified      Size  Description
[DIR] Parent Directory - [   ] 5prime_utr_intron_A22.gff 08-Jul-2016 15:01 12K [   ] A19_ForcheSNPs.gff 31-Aug-2007 13:56 82K [   ] A22_ForcheSNPs.gff 08-Jul-2016 15:01 127K [   ] A22_Historic_Assemblies.gff 08-Jul-2016 15:01 14M [   ] A22_Jones_PMID_15123810_Polymorphisms.gff 08-Jul-2016 15:01 15M [   ] A22_Jones_PMID_15123810_Polymorphisms.vcf 08-Jul-2016 15:03 11M [   ] A22_Unannotated_transcripts_Bruno_et_al.gff 08-Jul-2016 15:02 262K [   ] A22_Unannotated_transcripts_Sellam_et_al.gff 08-Jul-2016 15:02 736K [   ] A22_Unannotated_transcripts_Tuch_et_al.gff 08-Jul-2016 15:02 534K [   ] Assem19mapping.gff 12-Apr-2007 12:23 1.8M [   ] Assem21mapping.gff 26-Aug-2010 13:17 7.5M [TXT] C_albicans_SC5314_A19_current_features.gff 07-Feb-2016 07:08 10M [   ] C_albicans_SC5314_A19_current_features_with_chromosome_sequences.gff.gz 07-Feb-2016 07:08 9.3M [TXT] C_albicans_SC5314_A19_current_intergenic.gff 07-Feb-2016 07:08 2.1M [   ] C_albicans_SC5314_A21.gtf 03-Sep-2015 06:42 2.1M [TXT] C_albicans_SC5314_A21_current_features.gff 07-Feb-2016 07:06 6.2M [   ] C_albicans_SC5314_A21_current_features_with_chromosome_sequences.gff.gz 07-Feb-2016 07:06 4.9M [TXT] C_albicans_SC5314_A21_current_intergenic.gff 07-Feb-2016 07:06 1.3M [TXT] C_albicans_SC5314_A22_current_features.gff 23-Apr-2017 07:03 13M [   ] C_albicans_SC5314_A22_current_features_with_chromosome_sequences.gff.gz 23-Apr-2017 07:03 9.7M [TXT] C_albicans_SC5314_A22_current_intergenic.gff 23-Apr-2017 07:03 2.6M [   ] C_albicans_SC5314_haplotype_variations.gff 08-Jul-2016 16:22 18M [TXT] C_albicans_SC5314_version_A19-s01-m04-r05_features.gff 07-Feb-2016 07:08 10M [   ] C_albicans_SC5314_version_A19-s01-m04-r05_features_with_chromosome_sequences.gff.gz 07-Feb-2016 07:08 9.3M [TXT] C_albicans_SC5314_version_A19-s01-m04-r05_intergenic.gff 07-Feb-2016 07:08 2.1M [TXT] C_albicans_SC5314_version_A21-s02-m09-r10_features.gff 07-Feb-2016 07:06 6.2M [   ] C_albicans_SC5314_version_A21-s02-m09-r10_features_with_chromosome_sequences.gff.gz 07-Feb-2016 07:06 4.9M [TXT] C_albicans_SC5314_version_A21-s02-m09-r10_intergenic.gff 07-Feb-2016 07:06 1.3M [   ] C_albicans_SC5314_version_A22-s07-m01-r03_features.gtf 21-Jul-2016 09:10 4.3M [TXT] C_albicans_SC5314_version_A22-s07-m01-r26_features.gff 23-Apr-2017 07:03 13M [   ] C_albicans_SC5314_version_A22-s07-m01-r26_features_with_chromosome_sequences.gff.gz 23-Apr-2017 07:03 9.7M [TXT] C_albicans_SC5314_version_A22-s07-m01-r26_intergenic.gff 23-Apr-2017 07:03 2.6M [   ] Jones_PMID_15123810_Polymorphisms.gff 16-Feb-2012 14:19 9.2M [   ] Unannotated_transcripts_Bruno_et_al.gff 11-Oct-2010 14:29 132K [   ] Unannotated_transcripts_Sellam_et_al.gff 17-Sep-2010 14:18 380K [   ] Unannotated_transcripts_Tuch_et_al_2010.gff 11-Oct-2010 14:29 270K [DIR] archive/ 23-Apr-2017 07:03 -
This directory contains the downloadable CGD files in the Generic
Feature Format (GFF).  These files describe features in CGD, including 
chromosomes, ORFs, CDSs, introns, sequence gaps, intergenic regions, etc.

Please see http://www.sequenceontology.org/gff3.shtml for a detailed description 
of the Generic Feature Format (GFF).

The notation "version_A21_sXX-mYY-rZZ" in the filenames indicates the genome version
to which data in the file corresponds. Detailed explanation about the genome
version notation can be found at: http://www.candidagenome.org/help/SequenceHelp.shtml#versions
Information pertaining to each version update for C. albicans SC5314 Assembly 21 can be found at:
http://www.candidagenome.org/cgi-bin/genomeVersionHistory.pl?seq_source=C.%20albicans%20SC5314%20Assembly%2021

GFF files with "current" in their names are provided as stable filenames for
automated downloads. They are identical to (technically, symbolic links to) the
corresponding versioned files.

GFF files for previous genome versions are available in the archive sub-directory.

The following Assembly 21 files are updated weekly:

C_albicans_SC5314_version_A21-sXX-mYY-rZZ_features.gff
    This file contains the current CGD annotation based on Assembly 21 of the
    C. albicans SC5314 genome sequence.

C_albicans_SC5314_version_A21-sXX-mYY-rZZ_features_with_chromosome_sequences.gff.gz
    This file contains the current CGD annotation and the current genomic sequence
    of all chromosomes of the genome sequence. The annotations in this file and the
    previous file are the same. The chromosome sequences are specified 
    in the "##FASTA" section at the end of this file according to GFF3 file format 
    specifications (see http://www.sequenceontology.org/gff3.shtml).

C_albicans_SC5314_version_A21-sXX-mYY-rZZ_intergenic.gff
    This file lists the intergenic regions between coding regions in the
    chromosomes. This file also contains lengths of these intergenic sequences
    and GC and AT content of each intergenic region (percent GC and percent AT).

The following Assembly 19 files are updated only when major changes to the underlying
sequence or gene models are incorporated.  For a record of the changes made
to Assembly 19 subsequent to the divergence of Assemblies 19 and 21, see:
http://www.candidagenome.org/cgi-bin/genomeVersionHistory.pl?seq_source=C.%20albicans%20SC5314%20Assembly%2019

C_albicans_SC5314_version_A19-sXX-mYY-rZZ_features.gff
    This file contains CGD annotation from Assembly 19 of the C. albicans
    genome sequence. It is primarily archival and is rarely updated.  It
    was converted to the canonical GFF3 format in October 2012.

C_albicans_SC5314_version_A19-sXX-mYY-rZZ_features_with_chromosome_sequences.gff.gz
    This file contains CGD annotation from Assembly 19 of the C. albicans genome
    sequence and the genomic sequence of all the contigs based on Assembly 19.
    The annotations in this file and the previous file are the same.  The contig
    sequences are specified in the "##FASTA" section at the end of this file
    according to GFF3 file format specifications (see http://www.sequenceontology.org/gff3.shtml).

The following files map special features or historic assemblies to current assemblies.
The mappings are only updated following major sequence updates to the current assemblies.
These files are not converted to the canonical GFF format due to the historic nature
of the data represented.

Assem19mapping.gff
    This file contains mappings of historic assemblies to Assembly 19 supercontigs.
    BLAST analysis was performed to map Contigs and ORF sequences from each of the
    older assemblies to the Assembly 19 supercontigs.  For further details on the
    analysis procedure and separate mapping files for individual assemblies, please
    see http://candidagenome.org/download/mapping_historic_assemblies/

Assem21mapping.gff
    This file contains mappings of historic assemblies to Assembly 21 chromosomes.
    BLAST analysis was performed to map Contigs and ORF sequences from each of the
    older assemblies to the Assembly 21 chromsomes.  For further details on the
    analysis procedure and separate mapping files for individual assemblies, please
    see http://candidagenome.org/download/mapping_historic_assemblies/

A19_ForcheSNPs.gff
    This file contains all the SNP locations from Forche A, Magee PT, Magee BB,
    May G "Genome-wide single-nucleotide polymorphism map for Candida albicans."
    Eukaryotic Cell. 2004 Jun;3(3):705-14. SNP locations were mapped to Assembly 19
    contigs using the original marker sequences.

Jones_PMID_15123810_Polymorphisms.gff
    This file contains all polymorphisms discussed in Jones et al. (2004) "The Diploid
    Genome of Candida albicans." PNAS 101:7329-7334.  Polymorphism locations were mapped
    to Assembly 21 using 50 bp flanking sequence on both sides of each polymorphism to
    locate exact matches using BLAST.  Locations for "Deletion" type polymorphism indicates
    the region that is deleted, including the start and stop coordinates. Locations for
    "Insertion" type polymorphism indicate that an insertion has been made in the homolog
    sequence immedeatly AFTER the location specified. Locations for "Substitution" type
    polymorphisms indicate the site of a single nucleotide substitution.

Unannotated_transcripts_Bruno_et_al_2010.gff
    This file contains novel transcriptionally active regions detected in high-throuhgput
    sequencing of cDNA (RNA-seq) under several environmental conditions, described in
    Bruno VM, Wang Z, Marjani SL, Euskirchen GM, Martin J, Sherlock G, Snyder M (2010)
    "Comprehensive annotation of the transcriptome of the human fungal pathogen Candida
    albicans using RNA-seq." Genome Res 20(10):1451-8

Unannotated_transcripts_Sellam_et_al.gff
    This file contains novel, unannotated transcripts detected in tiling microarray experiments
    from Sellam A, Hogues H, Askew C, Tebbji F, van het Hoog M, Lavoie H, Kumamoto CA, Whiteway M,
    Nantel A "Experimental annotation of the human pathogen Candida albicans coding and noncoding  
    transcribed regions using high-resolution tiling arrays." Genome Biol 2010; 11(7):R71.

Unannotated_transcripts_Tuch_et_al_2010.gff
    This file contains novel, unannotated transcriptionally active regions detected by strand-specific
    sequencing of RNA from white and opaque cells, described in Tuch BB, Mitrovich QM, Homann OR,
    Hernday AD, Monighetti CK, De La Vega FM, Johnson AD (2010) "The transcriptomes of two heritable
    cell types illuminate the circuit governing their differentiation." PLoS Genet 6(8)