Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Playing “Hide and Seek” - The Hidden Genome

Speaker  Michal Linial

Affiliation  Dept of Biological Chemistry, The Surarsky Center for Computational biology, The Hebrew University of Jerusalem, Israel

Host  Yael Kalai

Duration  01:19:07

Date recorded  30 August 2012

The overwhelming increase in sequencing methodology resulted in the accumulation of millions of DNA sequences. These sequences are collected from thousands of genomes that (ideally) sample the ‘tree of life’. I will briefly discuss the ‘minimal set of instructions’ by which a linear sequence is transformed into a functional protein. What happen when the statistical noise is too high, thus classical procedures to predict protein sequences fail? I will focus on the challenge of identifying short proteins that remain buried in the genomic data. For illustration, I will take you for a ‘treasure hunt’ for short proteins.

Many short proteins share fuzzy features that are common to most animal venom. I will discuss the limitation in using classical tools that are based on string comparison, or pattern finding to identify short proteins. For this task, statistical machine learning methods were useful in identifying hidden bioactive sequences in several genomes. Evidently, such sequences are attractive candidates for novel therapy. The test case of short proteins illustrates the importance of a cycle that starts by a biological hypothesis, then uses a computational formulation and finalizes by an experimental validation. Finally, I will discuss our genomes with respect to our ‘partners’ (viruses, bacteria). Once the interaction of these genomes is considered, the source for the dynamic nature of human evolution becomes evident.

Related publications:

  • Rappoport N, Karsenty S, Stern A, Linial N, Linial M. (2012) Nucl. Acids Res. 40:D313-D320.
  • Rappoport N, Linial M. (2012) PLoS Comput Biol. 8:e1002364.
  • Naamati G, Askenazi M, Linial M. (2010) Bioinformatics 26:i482-i488.
  • Naamati G, Askenazi M, Linial M (2009) Nucl. Acids Res. 37:W363-368.
  • Kaplan N, Morpurgo N, Linial M. (2007) J Mol Biol. 369:553-566.
©2012 Microsoft Corporation. All rights reserved.
> Playing “Hide and Seek” - The Hidden Genome