Index of /download/sequence/C_glabrata_CBS138/current
Name Last modified Size Description
Parent Directory -
EMBL_format/ 2024-12-02 02:39 -
C_glabrata_CBS138_current_other_features_genomic.fasta.gz 2024-12-01 07:05 29K
C_glabrata_CBS138_version_s05-m03-r28_other_features_genomic.fasta.gz 2024-12-01 07:05 29K
C_glabrata_CBS138_current_other_features_no_introns.fasta.gz 2024-12-01 07:05 29K
C_glabrata_CBS138_version_s05-m03-r28_other_features_no_introns.fasta.gz 2024-12-01 07:05 29K
C_glabrata_CBS138_current_other_features_plus_intergenic.fasta.gz 2024-12-01 07:05 199K
C_glabrata_CBS138_version_s05-m03-r28_other_features_plus_intergenic.fasta.gz 2024-12-01 07:05 199K
C_glabrata_CBS138_current_other_features_genomic_1000.fasta.gz 2024-12-01 07:05 265K
C_glabrata_CBS138_version_s05-m03-r28_other_features_genomic_1000.fasta.gz 2024-12-01 07:05 265K
C_glabrata_CBS138_current_not_feature.fasta.gz 2024-12-01 07:04 1.4M
C_glabrata_CBS138_version_s05-m03-r28_not_feature.fasta.gz 2024-12-01 07:04 1.4M
C_glabrata_CBS138_current_orf_trans_all.fasta.gz 2024-12-01 07:05 2.0M
C_glabrata_CBS138_version_s05-m03-r28_orf_trans_all.fasta.gz 2024-12-01 07:05 2.0M
C_glabrata_CBS138_current_orf_coding.fasta.gz 2024-12-01 07:05 3.0M
C_glabrata_CBS138_version_s05-m03-r28_orf_coding.fasta.gz 2024-12-01 07:05 3.0M
C_glabrata_CBS138_current_orf_genomic.fasta.gz 2024-12-01 07:05 3.0M
C_glabrata_CBS138_version_s05-m03-r28_orf_genomic.fasta.gz 2024-12-01 07:05 3.0M
C_glabrata_CBS138_current_chromosomes.fasta.gz 2024-12-01 07:04 3.9M
C_glabrata_CBS138_version_s05-m03-r28_chromosomes.fasta.gz 2024-12-01 07:04 3.9M
C_glabrata_CBS138_current_orf_plus_intergenic.fasta.gz 2024-12-01 07:05 5.8M
C_glabrata_CBS138_version_s05-m03-r28_orf_plus_intergenic.fasta.gz 2024-12-01 07:05 5.8M
C_glabrata_CBS138_current_orf_genomic_1000.fasta.gz 2024-12-01 07:05 6.5M
C_glabrata_CBS138_version_s05-m03-r28_orf_genomic_1000.fasta.gz 2024-12-01 07:05 6.5M
This directory contains the current version of Candida glabrata CBS138
genomic sequences.
C. glabrata CBS138 was originallysequenced by Genolevures (DuJon et al., 2004,
Nature 430:35-44). The genome was later re-assembled, leveraging long-read
sequencing to correct errors in repetitive regions (Xu et al., 2020,
Mol Microbiol 113:1209-1224). Sequence and annotation obtained by CGD from NCBI.
The notation "version_sXX-mYY-rZZ" in the filenames indicates the genome version
to which data in the file corresponds. Detailed explanation about the genome
version notation can be found at: http://www.candidagenome.org/help/SequenceHelp.shtml#versions
Information pertaining to each version update for C. glabrata CBS138 can be found at:
http://www.candidagenome.org/cgi-bin/genomeVersionHistory.pl?seq_source=C.%20glabrata%20CBS138
Sequence files with "current" in their names are provided as stable filenames for
automated downloads. They are identical to (technically, symbolic links to) the
corresponding versioned sequence files.
These files are updated weekly:
* Chromosomal sequence:
C_glabrata_CBS138_version_sXX-mYY-rZZ_chromosomes.fasta.gz
* Sequence with no introns for all ORFs:
C_glabrata_CBS138_version_sXX-mYY-rZZ_orf_coding.fasta.gz
* Sequence with introns for all ORFs:
C_glabrata_CBS138_version_sXX-mYY-rZZ_orf_genomic.fasta.gz
* Sequence with introns for all ORFs, plus flanking 1000 bp upstream and downstream:
C_glabrata_CBS138_version_sXX-mYY-rZZ_orf_genomic_1000.fasta.gz
* Sequences with introns for all ORFs, plus upstream and downstream intergenic sequence:
C_glabrata_CBS138_version_sXX-mYY-rZZ_orf_plus_intergenic.fasta.gz
* Translation of all ORFs:
C_glabrata_CBS138_version_sXX-mYY-rZZ_orf_trans_all.fasta.gz
* Sequence of non-ORF features (tRNA, rRNA, repeat regions, etc.) with any introns removed:
C_glabrata_CBS138_version_sXX-mYY-rZZ_other_features_no_introns.fasta.gz
* Sequence of non-ORF features including introns:
C_glabrata_CBS138_version_sXX-mYY-rZZ_other_features_genomic.fasta.gz
* Sequence of non-ORF features including introns and flanking 1000 bp upstream and downstream:
C_glabrata_CBS138_version_sXX-mYY-rZZ_other_features_genomic_1000.fasta.gz
* Sequence of non-ORF features including introns and upstream and downstream intergenic sequence:
C_glabrata_CBS138_version_sXX-mYY-rZZ_other_features_plus_intergenic.fasta.gz
* Sequence between annotated chromosomal features:
C_glabrata_CBS138_version_sXX-mYY-rZZ_not_features.fasta.gz
Note: this file contains genomic DNA sequences between (and excluding) the
following feature types:
Protein Coding Sequence
tRNA
rRNA
repeat regions
#################################################################################
The files in this directory are in FASTA format.
All files are gzip compressed. There are several freely available
software options for decompressing gzipped files. The software
and other useful information is available on these web sites:
- WinZip (http://www.winzip.com/)
- Stuffit (http://www.stuffit.com/)
- Gzip (http://www.gzip.org/
and the gzip user's manual:
http://www.math.utah.edu/docs/info/gzip_toc.html
Additional sequence documentation is found on the CGD web site at:
http://www.candidagenome.org/help/SequenceHelp.shtml
------------------------------------------------