Academia.eduAcademia.edu

Outline

GeneGrid: Grid Service Based Virtual Bioinformatics Laboratory

www-unix.gridforum.org

Abstract

GeneGrid is a collaborative industrial R&D project initiated by the Belfast e-Science Centre, under the UK e-Science Programme, with commercial partners involved in the research and development of antibodies and drugs. GeneGrid provides a platform for scientists, especially biologists, to access their collective skills, experiences and results in a secure, reliable and scalable manner through the creation of a 'Virtual Bioinformatics Laboratory'. It enables the seamless integration of a myriad of heterogeneous applications and datasets that span multiple administrative domains and locations across the globe, and present these to the scientist through a simple user friendly interface. This paper presents how the grid services of GeneGrid are involved in the integration of bioinformatics applications as well as in the creation and execution of in silico experiments. A real use case scenario is also presented, involving the identification of novel members belonging to a protein family, for demonstrating the capabilities of GeneGrid. Experiences from the adoption of standards such as OGSA and the integration of third party programs, are also presented.

References (16)

  1. References
  2. P. Donachy, T.J. Harmer, R.H. Perrott et al, "Grid Based Virtual Bioinformatics Laboratory", Proceedings of the UK e-Science All Hands Meeting (2003), 111-116
  3. I. Foster, C. Kesselman, et al., "The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration", Open Grid Service Infrastructure WG, Global Grid Forum ( 2002)
  4. P.V. Jithesh, N. Kelly, D.R. Simpson, et al "Bioinformatics Application Integration and Management in GeneGrid: Experiments and Experiences", Proceedings of UK e- Science All Hands Meeting (2004), 563-570
  5. P.V. Jithesh, N. Kelly, Paul Donachy et al "GeneGrid: Grid Based Solution for Bioinformatics Application Integration and Experiment Execution", IEEE Symposium on Computer Based Medical Systems, Dublin (2005).
  6. S. Tuecke, K. Czajkowski, I. Foster et al., Open Grid Services Infrastructure (OGSI) Version 1.0. Global Grid Forum Draft Recommendation, (6/27/2003).
  7. S.F. Altschul, et al, "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs," Nucleic Acids Res., vol. 25, pp. 3389-3402, Sep 1. 1997.
  8. A. Crogh et al, "Predicting transmembrane topology," J.Mol.Biol., vol. 305, pp. 567- 580, Jan. 2001.
  9. J.D. Bendtsen, H. Nielsen, G. von Heijne and S. Brunak, "Improved prediction of signal peptides: SignalP 3.0," J.Mol.Biol., vol. 340, pp. 783-795, Jul 16. 2004.
  10. J.D. Thompson, D.G. Higgins and T.J. Gibson, "CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice," Nucleic Acids Res., vol. 22, pp. 4673-4680, (1994).
  11. S.R. Eddy, "Profile hidden Markov Models," Bioinformatics, 14, 755-763 (1998)
  12. N. Kelly, P.V. Jithesh, D.R. Simpson et al, "Bioinformatics Data and the Grid: The GeneGrid Data Manager", Proceedings of UK e-Science All Hands Meeting (2004), 571-578
  13. R. Apweiler, et al, "UniProt: the Universal Protein knowledgebase," Nucleic Acids Res., 32, D115-9, 2004.
  14. C. Kanz, P. Aldebert, N. Althorpe et al, "The EMBL Nucleotide Sequence Database," Nucleic Acids Res., vol. 33 Database Issue, pp. D29-33, Jan 1. 2005.
  15. J. Novotny, M. Russell, O. Wehrens, "GridSphere: An Advanced Portal Framework", Proceedings of EuroMicro Conference (2004), 412-419
  16. P.R. Crocker Siglecs: sialic acid binding immunoglobulin-like lectins in cell-cell interactions and signaling. Curr. Opin. Struct. Biol., 12, 609-615 (2002).