PSAT User's Manual
5. Utilizing Tool Options
Modifying the input options can affect the results that are returned by the tool. Knowing how to utilize the different options can help you take full advantage of the tool for performing a useful comparative analysis.
Modifying BLAST Alignment Score Thresholds
- Scroll to the top of the Genomic Neighborhoods page and select the Edit Settings link in the header of the homoologs table.
- You should now be back to the input page, and the fields should be pre-populated with the current settings.
- Select the first BLAST alignment score thresholds option (e-value < 1e-172, bit score > 600, % identity > 90) and resubmit the form.
- The Genomic Neighborhoods page should reappear with different results. Since you selected to use stricter BLAST scores, the query returned fewer results. As you might expect, homologs were only found in the other Francisella tularensis genomes when using the stricter criteria
Specifying a Subset of Comparison Genomes
- Select to Edit Settings again.
- Edit the Comparison Genomes to include the Salmonella, Shewanella, and Vibrio genomes (select multiple items using Ctrl-Click or Apple-Click).
- As expected, the Genomic Neighborhoods page shows that no matches were found for these comparison genomes. Select to edit the settings again so that we can try using more relaxed criteria.
- Select the third, least strict BLAST alignment score thresholds option and resubmit the form.
- The Genomic Neighborhoods page should reappear again, but now with 16 results. Since you selected to use less strict BLAST scores, the query returned results this time. Note that the results are ordered by homolog clustering score and are grouped by bacterial genus.
Limiting the Genomic Regions Displayed in the Graphic
- Assume that you are interested in comparing the genomic neighborhoods of homologs in the Shewanella genomes only. Deselect all the homologs in the table by clicking on the toggle checkbox icon
. Then reselect each homolog in the Shewanella genome individually.
- Click the Update Graphic button. Note that only the Shewanella genomes now appear in the graphic.
Finding Genes by Product Description
- Select to Edit Settings again.
- Remove 'pilQ' from the Gene name text box and enter cytochrome into the Description text box.
- Resubmit the form.
- A list of 11 genes should be displayed with Product including the string 'cytochrome'.
- Select the gene cydB in the list of results.
- Note that 20 hits total were found in 16 genomes (indicating there may be duplicated regions in some genomes).
- Click the 'Genome' column header at the top of the results table to sort by genome. Deselect all checkboxes for all hits by clicking the toggle checkbox icon at the top of the results table.
- Scroll down to Escherichia_coli_K12 results and note that there were 2 hits for this genome. Select the checkboxes for these 2 hits and scroll down to select the Update Image button.
- View the genomic neighborhoods graphic for the 2 results
Viewing Top Hits Only
- Scroll back up to the top of the results page and click on the view top hits link in the header of the results table. The results will now only show the top hit for each comparison genome (the hit with the lowest e-value will be considered the 'top' hit).
Filtering Homologs Based on Homolog Cluster Score
- Select to Edit Settings again.
- Change the display setting to Only show hits with a homolog clustering score of at least '1' and resubmit the form.
- Note that the number of bottom hits i
- Browse the table of results, noting the scores for each hit. You will notice that some scores have a value of 1. These results are not filtered based on clustering of genes and so may contain more spurious hits.
You have now completed the tutorial. Continue on to the next section to view some examples of how the tool can be used to help with genome annotation and genomic comparison studies.
|