Supplementary material for Secondary structure prediction and threading

This is the supplementary material for the "Secondary structure prediction and threading" lesson in Master in Molecular Medicine; Module IV: Functional Genomics; coordinator Hanspeter Herzel.

Software required

ClustalX: multiple sequence alignment tool (Des Higgins / Julie Thompson)

Java scripts should be allowed in your browser.
  • Firefox
    • Go to Tools/Options/Content
      • enable Java
      • enable JavaScript
  • Explorer
    • Go to Internet options/Security/Custom level/Scripting/Scripting of Java applets (enable)

Verify that you can run BiasViz http://cbdm.mdc-berlin.de/tools/biasviz/

Exercises

Remember to ask the teacher if you get lost! His job is to help you! :-)

1. Predicting the domains of a protein sequence

2A. Predicting the transmembrane helices of protein sequences

2B. Predicting the transmembrane helices of protein sequences

3A. Collecting homologs and making a multiple sequence alignment

  • Obtain the sequence of this fly protein of unknown function from here: 
  • Use BLAST to find similar sequences in the NCBI database:
  • Scroll down to the Alignments section then tick at least 8 hits from different organisms in the results list and then click on the Get selected sequences link at the bottom of the page. Try to include some mammals too!
  • Look for the option Display and select FASTA. Look for the option Send to and select Text. Save the output to a file. Name it like “a.txt”. You will likely get lost here. Just ask the teacher!
  • Input that file using the alignment program ClustalX and align the sequences (menu Alignment, option Do complete alignment). Save alignment in FASTA format (menu File, option Save sequences as, tick option FASTA). You should a file named like “a.fasta”.
    • In the alignment representation the asterisks above the alignment represent conserved amino acids and the colors conserved aminoacid properties.
    • Do you notice some conserved features that could indicate function?

 

3B. Studying secondary structure using a multiple sequence alignment

  • Input your alignment a.fasta to Jpred 
    • http://www.compbio.dundee.ac.uk/www-jpred/advanced.html (use the Choose file option. Select type of input: Multiple Alignment, FASTA and tick option Skip searching PDB before prediction. Then hit the button Make prediction at the bottom of the page. (If you see that there are three or more jobs in the queue just delete the job and try again later / Go to next exercise)
  • If you get an output hit button Start Jalview and examine the predictions. 
    • Find out the predicted structure for a proline rich region.

3C. Studying amino acid biased regions using a multiple sequence alignment
  • Input your alignment to BiasViz to identify conserved amino acid biased regions
  • Identify amino acid biased regions.
    • Try hydrophobic residues and window 10. What is the result suggesting (compare to secondary structure prediction)?
    • Can you verify your hypothesis with another tool we have used before?
    • Can you see other biased regions?