CGD Help: JBrowse

Contents

JBrowse Overview
Note on C. albicans Assembly 22 Haplotypes
Launching JBrowse
Organization of the JBrowse Window

Top Menu Bar
Navigation and Search
Main Browser Window
Track Selecton Menu

Creating Custom Tracks

Load Your Own Data
Search Tracks
Combination Tracks

JBrowse Overview

The JBrowse genome browser allows CGD users to view and explore datasets from large-scale genomics experiments. Developed by the Generic Model Organism Database project (GMOD), JBrowse is a widely used application that is fast, intuitive, and compatible with most web browsers. JBrowse allows users to quickly view large-scale sequence data in a genomic context, at multiple zoom-levels of resolution, from base pairs in individual sequence reads to read-density summaries across large genomic regions. The display also includes parallel tracks of annotated sequence features, with links to the corresponding CGD Locus Summary Pages. Quantitative tracks graphically display comparative information, such as relative expression level or sequence conservation. JBrowse also allows users to load their own datasets and analysis tracks, for display in the context of genomic features, or for comparison with datasets and tracks provided by CGD.

For additional information and help with JBrowse see:

2016 publication by Buels et al.
GMOD's JBrowse documentation
JBrowse Configuration Guide

Note on C. albicans Assembly 22 Haplotypes

Please note that the two haplotypes of C. albicans SC5314 Assembly 22 are displayed separately, and that high-throughput sequence data have been aligned by CGD to each haplotype separately. This is primarily to reduce alignment errors for short-read sequences, but also allows sequence variation to be viewed in the context of each haplotype.

Launching JBrowse

JBrowse can be launched from any CGD feature's Locus Summary Page by clicking on the JBrowse logo about halfway down the page:

The JBrowse window will be centered upon that feature.

Alternately, mouse-over "JBrowse" on the top menu bar of any CGD page, then select and click your organism of interest in the drop-down menu. The browser window will be centered on an arbitrary location of the genome.

Organization of the JBrowse Window

Top Menu Bar

The top menu bar has 4 items:

JBrowse, shows version information with links to GMOD
File, load data and customize tracks, as described below
View, highlight features of interest, or resize quantitative data tracks
Help, display version information and general browser help

Navigation and Search

Beneath the top menu are the navigation controls. The arrow and magnification buttons allow panning and zooming of the display. The chromosome pull-down menu to the right of the buttons changes the chromosome displayed. The box to the right of that displays the current location, but can also be used to search by location or feature name simply by clicking and entering text. Please note that best search results will be obtained by entering the systematic name of the target feature.

Just above the buttons and search box is a linear representation of the current chromosome, with a draggable red box over the current location. The current location can be changed by clicking and dragging the red box. The size of the box -- and of the corresponding region displayed -- can also be changed by clicking on and moving either side of the box.

Main Browser Window

The main browser window displays data tracks of various types. In the far left of each track is a label with the track name, and a pull-down menu with technical information about the track. The position of a track can be changed by clicking on the track label and dragging it up or down to the desired position in the window.

The main types of tracks are:

Sequence Tracks. Nucleotide sequence of both strands, as well as 6-frame translated amino acid sequences are shown. Note that the window may need to be zoomed-in considerably for sequence tracks to be displayed:

Feature Tracks. All transcribed features annotated at CGD are displayed in Feature Tracks. Features encoded on the "W" strand are displayed in red, and features on the "C" strand in blue. Clicking on features brings up an information window, and clicking on the feature name in the window opens the CGD Locus Summary Page for that feature.

Alignment Tracks. Genomic alignments of high-throughput sequence data from a variety of experiment types can be displayed: DNA-Seq, RNA-Seq, ChIP-Seq, etc. Note that the window may need to be zoomed in to view alignments, and that scrolling may be necessary to view all the aligned reads. Also note that browser speed and performance may be affected if more than one alignment track is displayed. For each alignment, CGD also provides two quantitative tracks, summarizing read coverage and read density. These may be preferred if more than one dataset are to be viewed or compared.

Quantitative Tracks. Various types of quantitative data may be displayed, for example the number or density of aligned reads, expression levels, or degree of conservation (see below). Quantitative data can be displayed on either log scale (default for expression data) or linear scale (default for conservation scores). This can be changed by the user in the information pull-down menu at the far left of the track.
- Conservation Tracks. Evolutionary conservation of genomic features is shown in four quantitative tracks, representing increasing levels of evolutionary diversity:
  In general, the top level (derived from an alignment of 3 closely related species) will show the highest levels of conservation, including intergenic regions, while the lowest level (derived from an alignment of 12 distantly-related budding yeasts) will show the least conservation, and include only the most highly-conserved genes and features. Exceptions are generally due to low information in the underlying alignments, such as the absence of a homologous region in a closely related species. Conservation scores were produced with the phyloP program (Siepel et al, 2005), based on alignments produced by the program TBA (Blanchette et al, 2004). Conservation scores and their underlying alignments and phylogenetic models are available for download in the CGD downloads directory.

Track Selection Menu

To choose which tracks to display, click on the "Select Tracks" tab on the upper-left side of the main JBrowse window to bring up the selection menu:

The many tracks available are listed in the right panel of the menu (shown below). Select tracks by clicking the check-box to the left of the item description. The left panel of the menu gives a number of filtering criteria to help narrow down the track choices. For example, you may filter by experimental technique (RNA-Seq, DNA-Seq, etc.), experimental condition (oxidative stress, pH, control, etc.), or source publication (either by first author or PubMed ID). Select "Currently Active" under "My Tracks" to show only the tracks that are currently selected and displayed in the main window. The "Clear All Filters" button above the right panel removes any filters applied from the left panel (but does not de-select tracks), returning the display to all available tracks. You can also find tracks using the search box above the right panel: for example, searching with the text "stress" shows tracks for both oxidative stress and nitrosative stress. Clicking the "Back to Browser" button above the right panel executes any track changes you made and returns to the main JBrowse window.

Creating Custom Tracks

Load your own data

You can load your own data in GFF3, BAM, Wiggle/BigWig or VCF formats, to display as tracks in CGD JBrowse. Data can be either in files on your local system or from a remote web server. From the File menu in the top menu bar, select "Open track file or URL", and add local files to the left box, or URLs to files from web servers to the right box. Note that in order to display correctly, the data must be annotated with the sequence coordinates of the current CGD genome version.

Search Tracks

You can display all instances of a particular sequence or motif as a separate track. From the File menu in the top menu bar, select "Add sequence search track", and enter the query sequence (either plain or as regular expression). You can search using either nucleotide or amino acid sequence, and you can search on either or both strands.

Combination Tracks

You can combine information from separate tracks into a custom "combination" track. For example, you may want to create a Search Track for a given sequence motif, as above, but only display the instances that occur in genes displayed in the "Transcribed Features" track. From the File menu in the top menu bar, select "Add combination track". A new "empty" track is created. Click on the information labels of the tracks you wish to combine, drag them over to the information label of the new combination track, and release to combine (the label will appear red while you are dragging it, and turn green when it is in position over the combination track). After the different tracks have been added, a menu pops up to select details of the combination (intersection, union, etc.). For the above example, you would select "Intersection".


Return to CGD	Send a Message to the CGD Curators