Skip to content

Commit 097bcc5

Browse files
committed
new chapter on protein disorder
1 parent 1a7d69b commit 097bcc5

2 files changed

Lines changed: 125 additions & 1 deletion

File tree

README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ BioJava Tutorial
44
A brief introduction into [BioJava](https://github.com/biojava/biojava).
55
-----
66

7-
The goal of this tutorial is to provide an educational introduction into some of the features that are provided by BioJava.
7+
The goal of this tutorial is to provide an educational introduction into some of the features that are provided by BioJava.
88

99
At the moment this tutorial is still under development. Please check the [BioJava Cookbook](http://biojava.org/wiki/BioJava:CookBook3.0) for a more comprehensive collection of examples about what is possible with BioJava and how to do things.
1010

@@ -22,6 +22,8 @@ Book 3: [The Structure Modules](structure/README.md), everything related to work
2222

2323
Book 4: [The Genomics Module](genomics/README.md), working with genomic data.
2424

25+
Book 5: [The Protein-Disorder Module](protein-disorder/README.md), predicting protein-disorder.
26+
2527
## License
2628

2729
The content of this tutorial is available under the [CC-BY](http://creativecommons.org/licenses/by/3.0/) license.

protein-disorder/README.md

Lines changed: 122 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,122 @@
1+
The Protein-Disorder Module of BioJava
2+
=====================================================
3+
4+
A tutorial for the protein-disorder module of [BioJava](http://www.biojava.org)
5+
6+
## About
7+
<table>
8+
<tr>
9+
<td>
10+
11+
</td>
12+
<td>
13+
The <i>protein-disorder module</i> of BioJava provide an API that allows to
14+
<ul>
15+
<li>predict protein-disorder using the JRONN algorithm</li>
16+
</ul>
17+
18+
19+
</td>
20+
</tr>
21+
</table>
22+
23+
## How can I predict disordered regions on a protein sequence?
24+
-----------------------------------------------------------
25+
26+
BioJava provide a module *biojava-protein-disorder* for prediction
27+
disordered regions from a protein sequence. Biojava-protein-disorder
28+
module for now contains one method for the prediction of disordered
29+
regions. This method is based on the Java implementation of
30+
[RONN](http://www.strubi.ox.ac.uk/RONN) predictor.
31+
32+
This code has been originally developed for use with
33+
[JABAWS](http://www.compbio.dundee.ac.uk/jabaws). We call this code
34+
*JRONN*. *JRONN* is based on the C implementation of RONN algorithm and
35+
uses the same model data, therefore gives the same predictions. JRONN
36+
based on RONN version 3.1 which is still current in time of writing
37+
(August 2011). Main motivation behind JRONN development was providing an
38+
implementation of RONN more suitable to use by the automated analysis
39+
pipelines and web services. Robert Esnouf has kindly allowed us to
40+
explore the RONN code and share the results with the community.
41+
42+
Original version of RONN is described in [Yang,Z.R., Thomson,R.,
43+
McMeil,P. and Esnouf,R.M. (2005) RONN: the bio-basis function neural
44+
network technique applied to the detection of natively disordered
45+
regions in proteins. Bioinformatics 21:
46+
3369-3376](http://bioinformatics.oxfordjournals.org/content/21/16/3369.full)
47+
48+
Examples of use are provided below. For more information please refer to
49+
JronnExample testcases.
50+
51+
Finally instead of an API calls you can use a [ command line
52+
utility](/wikis/BioJava:CookBook3:ProteinDisorderCLI "wikilink"), which is
53+
likely to give you a better performance as it uses multiple threads to
54+
perform calculations.
55+
56+
Example 1: Calculate the probability of disorder for every residue in the sequence
57+
----------------------------------------------------------------------------------
58+
59+
```java
60+
FastaSequence fsequence = new FastaSequence("name",
61+
"LLRGRHLMNGTMIMRPWNFLNDHHFPKFFPHLIEQQAIWLADWWRKKHC" +
62+
"RPLPTRAPTMDQWDHFALIQKHWTANLWFLTFPFNDKWGWIWFLKDWTPGSADQAQRACTWFFCHGHDTN");
63+
64+
float[] rawProbabilityScores = Jronn.getDisorderScores(fsequence);
65+
```
66+
67+
Example 2: Calculate the probability of disorder for every residue in the sequence for all proteins from the FASTA input file
68+
-----------------------------------------------------------------------------------------------------------------------------
69+
70+
```java
71+
final List<FastaSequence> sequences = SequenceUtil.readFasta(new FileInputStream("src/test/resources/fasta.in"));
72+
Map<FastaSequence, float[]> rawProbabilityScores = Jronn.getDisorderScores(sequences); ```
73+
74+
Example 3: Get the disordered regions of the protein for a single protein sequence
75+
----------------------------------------------------------------------------------
76+
77+
```java
78+
FastaSequence fsequence = new FastaSequence("Prot1", "LLRGRHLMNGTMIMRPWNFLNDHHFPKFFPHLIEQQAIWLADWWRKKHC" +
79+
               "RPLPTRAPTMDQWDHFALIQKHWTANLWFLTFPFNDKWGWIWFLKDWTPGSADQAQRACTWFFCHGHDTN" +
80+
               "CQIIFEGRNAPERADPMWTGGLNKHIIARGHFFQSNKFHFLERKFCEMAEIERPNFTCRTLDCQKFPWDDP");
81+
82+
Range[] ranges = Jronn.getDisorder(fsequence);
83+
```
84+
85+
Example 4: Calculate the disordered regions for the proteins from FASTA file
86+
----------------------------------------------------------------------------
87+
88+
```java
89+
final List<FastaSequence> sequences = SequenceUtil.readFasta(new FileInputStream("src/test/resources/fasta.in"));
90+
Map<FastaSequence, Range[]> ranges = Jronn.getDisorder(sequences);
91+
92+
```
93+
94+
95+
### Author:
96+
97+
[Andreas Prli&#263;](https://github.com/andreasprlic)
98+
99+
## Please cite
100+
101+
**BioJava: an open-source framework for bioinformatics in 2012**<br/>
102+
*Andreas Prlic; Andrew Yates; Spencer E. Bliven; Peter W. Rose; Julius Jacobsen; Peter V. Troshin; Mark Chapman; Jianjiong Gao; Chuan Hock Koh; Sylvain Foisy; Richard Holland; Gediminas Rimsa; Michael L. Heuer; H. Brandstatter-Muller; Philip E. Bourne; Scooter Willis* <br/>
103+
[Bioinformatics (2012) 28 (20): 2693-2695.](http://bioinformatics.oxfordjournals.org/content/28/20/2693.abstract) <br/>
104+
[![doi](http://img.shields.io/badge/doi-10.1093%2Fbioinformatics%2Fbts494-blue.svg?style=flat)](http://bioinformatics.oxfordjournals.org/content/28/20/2693.abstract) [![pubmed](http://img.shields.io/badge/pubmed-22877863-blue.svg?style=flat)](http://www.ncbi.nlm.nih.gov/pubmed/22877863)
105+
106+
## License
107+
108+
The content of this tutorial is available under the [CC-BY](http://creativecommons.org/licenses/by/3.0/) license.
109+
110+
[view license](../license.md)
111+
112+
113+
114+
<!--automatically generated footer-->
115+
116+
---
117+
118+
Navigation:
119+
[Home](../README.md)
120+
| Book 3: The Protein Structure modules
121+
122+
Prev: [Book 4: The Genomics Module](../genomics/README.md)

0 commit comments

Comments
 (0)