STAR (Site Targeted Amino acid Recombination) is a tool for predicting recombination sites from amino acid sequence as deemed useful by the protein design algorithm SCHEMA. Unlike SCHEMA, the tool allows the user to explore structural building blocks in amino acid sequences with unknown structure.

STAR accepts amino acid sequences, presented in the FASTA format, and predicts a normalised SCHEMA score of each residue in the sequence.

Paste or upload your sequence below.


 Upload FASTA file:

 

OR

 
 Paste FASTA sequence here:

DATA

The single-sequence S-score is defined to be
and describes, for each sequence position i, the number of contacts within the window (i-w, i+w) that would be broken if the recombination site is positioned at i. ckl=1 if residues k and l are within 4.5 Ångstrom of one another, and ckl = 0 otherwise. w is set to 14 residues.
and the normalised S-score is
(where max is a constant pre-set to 637).  

STAR is trained to predict the normalised S-scores from a diverse and non-redundant range of protein structures. The data set consists of 945 structures taken from the PDB. The data including the normalised S-score (per above) can be found here.

Example FASTA files: 1BLS, 1AMO and 1CDD.


REFERENCES

Model development and evaluation is reported in

Bauer, D. C., Bodén, M., Thier, R. and Gillam, E. M. STAR: Predicting recombination sites from amino acid sequence. BMC Bioinformatics, 7:437. Open Access

Bauer, D. C., Bodén, M., Thier, R. and Yuan, Z. Predicting structural disruption of proteins caused by crossover. In Proceedings of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, San Diego, pp. 514-520, 2005. PDF

STAR uses the continuum secondary structure predictor reported in

Bodén, M. Yuan, Z. and Bailey, T. L. Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures. BMC Bioinformatics, 7:68. Open Access


Contact: d.bauer@imb.uq.edu.au or mikael@itee.uq.edu.au