SPIDER: software for protein identification from sequence tags with de novo sequencing error

Author(s): Han Y, Ma B, Zhang K

Abstract

For the identification of novel proteins using MS/MS, de novo sequencing software computes one or several possible amino acid sequences (called sequence tags) for each MS/MS spectrum. Those tags are then used to match, accounting amino acid mutations, the sequences in a protein database. If the de novo sequencing gives correct tags, the homologs of the proteins can be identified by this approach and software such as MS-BLAST is available for the matching. However, de novo sequencing very often gives only partially correct tags. The most common error is that a segment of amino acids is replaced by another segment with approximately the same masses. We developed a new efficient algorithm to match sequence tags with errors to database sequences for the purpose of protein and peptide identification. A software package, SPIDER, was developed and made available on Internet for free public use. This paper describes the algorithms and features of the SPIDER software.

Similar Articles

Protein identification by mass profile fingerprinting

Author(s): James P, Quadroni M, Carafoli E, Gonnet G

Modular Peptide Synthetases Involved in Nonribosomal Peptide Synthesis

Author(s): Marahiel MA, Stachelhaus T, Mootz HD

Improving peptide fragmentation by N-terminal derivatization with high proton affinity

Author(s): Miyashita M, Hanai Y, Awane H, Yoshikawa T, Miyagawa H

Distortionless Enhancement of NMR Signalsby Polarization Transfer

Author(s): Doddrell DM, Pegg DT, Bendall MR

t-kagaku

Author(s): http://www

Curtin-Hammett and steric effects in HOBt acylation regiochemistry

Author(s): Brink BD, DeFrancisco JR, Hillner JA, Linton BR

Termination of the structural confusion between plipastatin A1 and fengycin IX

Author(s): Honma M, Tanaka K, Konno K, Tsuge K, Okuno T, et al.