Human Interactome Project
New HI-2012 Prepublication Human Interactome Dataset is available for immediate search and download
We are systematically mapping the human binary interactome. The long-term goal is to generate and analyze high-quality yeast two-hybrid (Y2H) interactions at high throughput for all pairwise combinations of predicted gene products for which at least one Gateway-cloned ORF is available (Human ORFeome website). All individual datasets, starting with our most recent unpublished data, are described below with hyperlinks to the prepublication data or the relevant publications.
1 Data from a new phase of the human interactome project are now available prepublication to registered users. Our Human Interactome 2012 Prepublication dataset (HI-2012 Prepublication) contains ~14,000 novel binary interactions derived from yeast two-hybrid screens over a search space corresponding to a matrix of ~13,000 x ~13,000 proteins. This search space represents ~35% of the complete search space, assuming a total of ~22,000 protein-coding genes and limiting the scope to one variant per gene. This is an early release of data from these high-throughput screens (HI-2012 Prepublication); data analysis is ongoing.
2 To demonstrate the effectiveness of a novel Stitch-seq interactome mapping protocol, a search space of ~6,000 x ~6,000 ORFs, overlapping the Space-I and Space-II search spaces, was screened by Y2H (Yu et al Nat Methods 2011). Stitch-seq combines PCR stitching with next-generation sequencing and increases the efficiency and cost-effectiveness of Y2H screening. The resulting high-quality human interactome dataset, ‘Human Interactome produced with Next-Generation Sequencing’ (HI-NGS), contains 1,166 interactions among proteins encoded by 1,147 human genes. This published data is freely available to the research community. (HI-2011)
3 An estimation of the coverage and size of the human interactome (Venkatesan et al Nat Methods 2009) involved four repeat Y2H screens of 1,822 DB-X fusion proteins (or baits, representing 1,744 unique genes) against 1,796 AD-Y proteins (or preys, representing 1,752 genes), corresponding to a 5% subset of the Space-I search space. This HI-Repeat dataset contains 239 high-quality Y2H interactions, of which 107 were found in a single screen and 132 in multiple screens (63 in two screens, 44 in three screens and 25 in four screens). This published data is freely available to the research community. (HI-2009)
4 Our first attempt at mapping the human interactome (Rual et al Nature 2005) screened a space (Space-I) of 8,107 ORFs corresponding to 7,194 genes, mapping 2,754 high-quality Y2H interactions. This dataset increased the number of known binary interactions by nearly 70% over those curated in protein interaction databases at that time. This published data is freely available to the research community. (HI-2005)
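
The search-space and interaction counts quoted in the dataset descriptions above can be checked with simple arithmetic. The following Python sketch is illustrative only and is not part of the project's own tooling. The function and variable names are hypothetical. It also assumes that the Space-I search space is the full 8,107 x 8,107 ORF matrix, which the text implies but does not state explicitly.

```python
def square_space_fraction(screened: int, total: int) -> float:
    """Fraction of all pairwise combinations covered by a screened x screened matrix."""
    return (screened / total) ** 2


# HI-2012: ~13,000 x ~13,000 proteins out of ~22,000 protein-coding genes
# (one variant per gene, as stated in the text).
hi2012_fraction = square_space_fraction(13_000, 22_000)

# HI-Repeat: 1,822 baits x 1,796 preys relative to Space-I.
# Assumption: Space-I is the full 8,107 x 8,107 ORF matrix.
hi_repeat_fraction = (1_822 * 1_796) / (8_107 ** 2)

# HI-Repeat interactions, keyed by the number of screens in which each was found.
found_in_screens = {1: 107, 2: 63, 3: 44, 4: 25}
total_interactions = sum(found_in_screens.values())
multi_screen = sum(n for k, n in found_in_screens.items() if k > 1)

print(f"HI-2012 share of the complete search space: {hi2012_fraction:.1%}")
print(f"HI-Repeat share of Space-I: {hi_repeat_fraction:.1%}")
print(f"HI-Repeat interactions: {total_interactions} ({multi_screen} in multiple screens)")
```

Under these assumptions the two fractions come out close to the ~35% and 5% figures given above, and the per-screen counts sum to the 239 and 132 totals reported for HI-Repeat.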