Index of /download/genome_alignments

Icon  Name                          Last modified      Size  Description
[DIR] Parent Directory - [   ] wgd_clade_tree.par 19-Dec-2017 09:55 128 [   ] wgd_clade.maf.gz 19-Dec-2017 11:51 39M [   ] saccharomycetales_tree.par 19-Dec-2017 09:52 179 [   ] saccharomycetales_B.maf.gz 19-Dec-2017 09:41 62M [   ] saccharomycetales.maf.gz 19-Dec-2017 09:40 62M [   ] parapsilosis_group_tree.par 19-Dec-2017 09:56 51 [   ] parapsilosis_group.maf.gz 19-Dec-2017 09:44 12M [   ] nakaseomyces_candida_tree.par 19-Dec-2017 09:54 75 [   ] nakaseomyces_candida.maf.gz 19-Dec-2017 09:43 19M [   ] glabrata_group_tree.par 19-Dec-2017 09:54 43 [   ] glabrata_group.maf.gz 19-Dec-2017 09:42 11M [   ] ctg_clade_tree.par 19-Dec-2017 09:51 142 [   ] ctg_clade_B.maf.gz 19-Dec-2017 09:38 44M [   ] ctg_clade.maf.gz 19-Dec-2017 09:37 44M [   ] candida_lodderomyces_tree.par 19-Dec-2017 09:51 96 [   ] candida_lodderomyces_B.maf.gz 19-Dec-2017 09:35 27M [   ] candida_lodderomyces.maf.gz 19-Dec-2017 09:34 27M [   ] albicans_group_tree.par 19-Dec-2017 09:50 44 [   ] albicans_group_B.maf.gz 19-Dec-2017 09:33 12M [   ] albicans_group.maf.gz 19-Dec-2017 09:32 12M [DIR] C_parapsilosis_CDC317/ 19-Dec-2017 13:23 - [DIR] C_glabrata_CBS138/ 19-Dec-2017 13:19 - [DIR] C_dubliniensis_CD36/ 19-Dec-2017 13:16 - [DIR] C_albicans_SC5314/ 19-Dec-2017 13:11 -
The /download/genome_alignments/ directory and subdirectories contain 
whole-genome multiple sequence alignments, phylogenetic trees, and 
conservation scores for Candida-related species.

NOTE: Separate alignments were produced for the two haplotypes 
[A and B] of C. albicans SC5314 Assembly 22. Filenames for alignments
for the B haplotype contain a "_B" tag.

This (top-level) directory contains the unprocessed alignments and 
guide trees. Alignments were produced with the MULTIZ/TBA package 
(Blanchette et al, Genome Res. 2004 14(4):708; PMID 15060014) 
from the Miller lab: http://www.bx.psu.edu/miller_lab/
The sources for the genomic DNA sequences used are listed below. All-by-all 
pairwise alignments were produced using the all_bz program, from which 
multiple alignments were made using the tba program. Default parameters 
were used for both programs. The resulting alignments are in MAF format, 
described here: https://genome.ucsc.edu/FAQ/FAQformat.html#format5

Guide trees are rooted and have no branch lengths. Phylogenies are based
on Fitzpatrick et al. (BMC Evolutionary Biology 2006, 6:99; PMID 17121679),
Wang et al. (BMC Evolutionary Biology 2009, 9:195; PMID 19664262), and
Gabaldon et al. (BMC Genomics 2013, 14:623; PMID 24034898). Trees are in 
Newick format, described here: 
http://evolution.genetics.washington.edu/phylip/newicktree.html

The four species-specific subdirectories contain alignments projected 
with that species as the "reference". The subdirectories also contain 
conservation scores based on those reference species alignments:
C_albicans_SC5314/
C_dubliniensis_CD36/
C_glabrata_CBS138/
C_parapsilosis_CDC317/

The files in the present directory are:

albicans_group.maf.gz
	Alignment of the genomes of 3 closely-related species: C. albicans SC5314, 
	C. dubliniensis CD36, and C. tropicalis MYA-3404
albicans_group_B.maf.gz
	As albicans_group.maf.gz, but using the B haplotype of C. albicans SC5314
albicans_group_tree.par
	Guide tree for albicans_group.maf.gz
candida_lodderomyces.maf.gz
	Alignment of the genomes of 6 related species: C. albicans SC5314, 
	C. dubliniensis CD36, C. tropicalis MYA-3404, C. parapsilosis CDC317,
	C. orthopsilosis Co 90-125, and L. elongisporus NRLL YB-4239
candida_lodderomyces_B.maf.gz
	As candida_lodderomyces.maf.gz, but using the B haplotype of C. albicans SC5314
candida_lodderomyces_tree.par
	Guide tree for candida_lodderomyces.maf.gz
ctg_clade.maf.gz
	Alignment of the genomes of 9 species in the "CTG clade": C. albicans SC5314, 
	C. dubliniensis CD36, C. tropicalis MYA-3404, C. parapsilosis CDC317,
	C. orthopsilosis Co 90-125, L. elongisporus NRLL YB-4239, C. guilliermondii ATCC 6260,
	D. hansenii CBS767, and C. lusitaniae ATCC 42720
ctg_clade_B.maf.gz
	As ctg_clade.maf.gz, but using the B haplotype of C. albicans SC5314
ctg_clade_tree.par
	Guide tree for ctg_clade.maf.gz
glabrata_group.maf.gz
	Alignment of the genomes of 3 closely-related species: C. glabrata CBS138, 
	C. bracarensis CBS 10154, and C. nivariensis CBS 9983
glabrata_group_tree.par
	Guide tree for glabrata_group.maf.gz
nakaseomyces_candida.maf.gz
	Alignment of the genomes of 5 related species: C. glabrata CBS138, 
	C. bracarensis CBS 10154, C. nivariensis CBS 9983, N. bacillisporus CBS 7720,
	and C. castellii CBS 4332
nakaseomyces_candida_tree.par
	Guide tree for nakaseomyces_candida.maf.gz
parapsilosis_group.maf.gz
	Alignment of the genomes of 3 closely-related species: C. parapsilosis CDC317,
	C. orthopsilosis Co 90-125, and L. elongisporus NRLL YB-4239
parapsilosis_group_tree.par
	Guide tree for parapsilosis_group.maf.gz
saccharomycetales.maf.gz
	Alignment of the genomes of 12 budding yeast species: C. albicans SC5314, 
	C. dubliniensis CD36, C. parapsilosis CDC317, L. elongisporus NRLL YB-4239, 
	D. hansenii CBS767, C. glabrata CBS138, C. bracarensis CBS 10154,  
	N. bacillisporus CBS 7720, S. cerevisiae S288C, S. castellii NRRL-Y12630,
	A. gossypii ASM9102, and Y. lipolytica CLIB122
saccharomycetales_B.maf.gz
	As saccharomycetales.maf.gz, but using the B haplotype of C. albicans SC5314
saccharomycetales_tree.par
	Guide tree for saccharomycetales.maf.gz
wgd_clade.maf.gz
	Alignment of the genomes of 9 species in the "whole-genome duplication clade":
	C. glabrata CBS138, C. bracarensis CBS 10154, C. nivariensis CBS 9983, 
	N. bacillisporus CBS 7720, C. castellii CBS 4332, S. cerevisiae S288C,
	S. mikatae IFO1815, S. bayanus 623-6C, and S. castellii NRRL-Y12630
wgd_clade_tree.par
	Guide tree for wgd_clade.maf.gz

Sequence Sources
	Genome sequences were downloaded in November 2017 from the following sources:
	CGD (http://www.candidagenome.org/download/sequence/)
		C. albicans SC5314 Assembly 22, haplotypes A and B
		C. dubliniensis CD36
		C. tropicalis MYA-3404
		C. parapsilosis CDC317
		C. orthopsilosis Co 90-125
		L. elongisporus NRLL YB-4239
		C. guilliermondii ATCC 6260
		D. hansenii CBS767
		C. lusitaniae ATCC 42720
		C. glabrata CBS138
	SGD
		S. cerevisiae S288C (https://downloads.yeastgenome.org/sequence/S288C_reference/)
		Other Saccharomyces species (https://downloads.yeastgenome.org/sequence/fungi/)
			S. mikatae IFO1815
			S. bayanus 623-6C
			S. castellii NRRL-Y12630
	NCBI Genome
		C. bracarensis CBS 10154 (https://www.ncbi.nlm.nih.gov/Traces/wgs/?val=CAPU01)
		C. nivariensis CBS 9983 (https://www.ncbi.nlm.nih.gov/Traces/wgs/?val=CAPV01)
		N. bacillisporus CBS 7720 
			(ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/001/046/975/GCA_001046975.1_ASM104697v1)
		C. castellii CBS 4332 (https://www.ncbi.nlm.nih.gov/Traces/wgs/?val=CAPW01)
		Yarrowia lipolytica CLIB122
			(ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/002/525/GCA_000002525.1_ASM252v1)
	Ensembl Fungi
		A. gossypii ASM9102v1
			(ftp://ftp.ensemblgenomes.org/pub/fungi/release-37/fasta/ashbya_gossypii/dna/)

-----------------------------------------------

Alignment files are gzip compressed. There are several freely available
software options for decompressing gzipped files using Windows.  The
software and other useful information is available on these web sites:
- WinZip (http://www.winzip.com/)
- Stuffit (http://www.stuffit.com/)
- Gzip (http://www.gzip.org/
   
and the gzip user's manual:
http://www.math.utah.edu/docs/info/gzip_toc.html

------------------------------------------------