ToxSci Advance Access originally published online on December 7, 2005
Toxicological Sciences 2006 90(1):252-258; doi:10.1093/toxsci/kfj068
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The Value of Short Amino Acid Sequence Matches for Prediction of Protein Allergenicity



* Monsanto Company, Product Characterization Center, Global Regulatory Affairs, St. Louis, Missouri 63167; and
Dow AgroSciences, Biotechnology Regulatory Science, Indianapolis, Indiana 46268
Received October 6, 2005; accepted November 20, 2005
Typically, genetically engineered crops contain traits encoded by one or a few newly expressed proteins. The allergenicity assessment of newly expressed proteins is an important component in the safety evaluation of genetically engineered plants. One aspect of this assessment involves sequence searches that compare the amino acid sequence of the protein to all known allergens. Analyses are performed to determine the potential for immunologically based cross-reactivity where IgE directed against a known allergen could bind to the protein and elicit a clinical reaction in sensitized individuals. Bioinformatic searches are designed to detect global sequence similarity and short contiguous amino acid sequence identity. It has been suggested that potential allergen cross-reactivity may be predicted by identifying matches as short as six to eight contiguous amino acids between the protein of interest and a known allergen. A series of analyses were performed, and match probabilities were calculated for different size peptides to determine if there was a scientifically justified search window size that identified allergen sequence characteristics. Four probability modeling methods were tested: (1) a mock protein and a mock allergen database, (2) a mock protein and genuine allergen database, (3) a genuine allergen and genuine protein database, and (4) a genuine allergen and genuine protein database combined with a correction for repeating peptides. These analyses indicated that searches for short amino acid sequence matches of eight amino acids or fewer to identify proteins as potential cross-reactive allergens is a product of chance and adds little value to allergy assessments for newly expressed proteins.
Key Words: bioinformatics; allergens; sequence comparisons; allergy assessment.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Z. H. Zhang, J. L. Y. Koh, G. L. Zhang, K. H. Choo, M. T. Tammi, and J. C. Tong AllerTool: a web server for predicting allergenicity and allergic cross-reactivity in proteins Bioinformatics, February 15, 2007; 23(4): 504 - 506. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Soeria-Atmadja, T. Lundell, M. G. Gustafsson, and U. Hammerling Computational detection of allergenic proteins attains a new level of accuracy with in silico variable-length peptide extraction and machine learning Nucleic Acids Res., August 29, 2006; 34(13): 3779 - 3793. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Saha and G. P. S. Raghava AlgPred: prediction of allergenic proteins and mapping of IgE epitopes. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W202 - W209. [Abstract] [Full Text] [PDF] |
||||

