C. elegans ORFeome version 3.1 : increasing the coverage of ORFeome resources with improved gene predictions
The first version of the Caenorhabditis elegans ORFeome cloning project, based on release WS9 of Wormbase (August 1999), provided experimental verifications for approximately 55% of predicted protein-encoding open reading frames (ORFs). The remaining 45% of predicted ORFs could not be cloned, possibly as a result of mispredicted gene boundaries. Since the release of WS9, gene predictions have improved continuously. To test the accuracy of evolving predictions, we attempted to PCR-amplify from a highly representative worm cDNA library and Gateway-clone approximately 4200 ORFs missed earlier and for which new predictions are available in WS100 (May 2003). In this set we successfully cloned 63% of ORFs with supporting experimental data ("touched" ORFs), and 42% of ORFs with no supporting experimental evidence ("untouched" ORFs). Approximately 2000 full-length ORFs were cloned in-frame, 13% of which were corrected in their exon/intron structure relative to WS100 predictions. In total, approximately 12,500 C. elegans ORFs are now available as Gateway Entry clones for various reverse proteomics (ORFeome v3.1). This work illustrates why the cloning of a complete C. elegans ORFeome, and likely the ORFeomes of other multicellular organisms, needs to be an iterative process that requires multiple rounds of experimental validation together with gradually improving gene predictions.
Medienart: |
Artikel |
---|
Erscheinungsjahr: |
2004 |
---|---|
Erschienen: |
2004 |
Enthalten in: |
Zur Gesamtaufnahme - volume:14 |
---|---|
Enthalten in: |
Genome research - 14(2004), 10B vom: 06. Okt., Seite 2064-9 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Lamesch, Philippe [VerfasserIn] |
---|
Themen: |
Caenorhabditis elegans Proteins |
---|
Anmerkungen: |
Date Completed 16.11.2004 Date Revised 26.04.2024 published: Print Citation Status MEDLINE |
---|
Förderinstitution / Projekttitel: |
|
---|
PPN (Katalog-ID): |
NLM151605491 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM151605491 | ||
003 | DE-627 | ||
005 | 20240426232037.0 | ||
007 | tu | ||
008 | 231223s2004 xx ||||| 00| ||eng c | ||
028 | 5 | 2 | |a pubmed24n1388.xml |
035 | |a (DE-627)NLM151605491 | ||
035 | |a (NLM)15489327 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Lamesch, Philippe |e verfasserin |4 aut | |
245 | 1 | 0 | |a C. elegans ORFeome version 3.1 |b increasing the coverage of ORFeome resources with improved gene predictions |
264 | 1 | |c 2004 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a Date Completed 16.11.2004 | ||
500 | |a Date Revised 26.04.2024 | ||
500 | |a published: Print | ||
500 | |a Citation Status MEDLINE | ||
520 | |a The first version of the Caenorhabditis elegans ORFeome cloning project, based on release WS9 of Wormbase (August 1999), provided experimental verifications for approximately 55% of predicted protein-encoding open reading frames (ORFs). The remaining 45% of predicted ORFs could not be cloned, possibly as a result of mispredicted gene boundaries. Since the release of WS9, gene predictions have improved continuously. To test the accuracy of evolving predictions, we attempted to PCR-amplify from a highly representative worm cDNA library and Gateway-clone approximately 4200 ORFs missed earlier and for which new predictions are available in WS100 (May 2003). In this set we successfully cloned 63% of ORFs with supporting experimental data ("touched" ORFs), and 42% of ORFs with no supporting experimental evidence ("untouched" ORFs). Approximately 2000 full-length ORFs were cloned in-frame, 13% of which were corrected in their exon/intron structure relative to WS100 predictions. In total, approximately 12,500 C. elegans ORFs are now available as Gateway Entry clones for various reverse proteomics (ORFeome v3.1). This work illustrates why the cloning of a complete C. elegans ORFeome, and likely the ORFeomes of other multicellular organisms, needs to be an iterative process that requires multiple rounds of experimental validation together with gradually improving gene predictions | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, U.S. Gov't, P.H.S. | |
650 | 7 | |a Caenorhabditis elegans Proteins |2 NLM | |
650 | 7 | |a DNA, Complementary |2 NLM | |
650 | 7 | |a Proteome |2 NLM | |
700 | 1 | |a Milstein, Stuart |e verfasserin |4 aut | |
700 | 1 | |a Hao, Tong |e verfasserin |4 aut | |
700 | 1 | |a Rosenberg, Jennifer |e verfasserin |4 aut | |
700 | 1 | |a Li, Ning |e verfasserin |4 aut | |
700 | 1 | |a Sequerra, Reynaldo |e verfasserin |4 aut | |
700 | 1 | |a Bosak, Stephanie |e verfasserin |4 aut | |
700 | 1 | |a Doucette-Stamm, Lynn |e verfasserin |4 aut | |
700 | 1 | |a Vandenhaute, Jean |e verfasserin |4 aut | |
700 | 1 | |a Hill, David E |e verfasserin |4 aut | |
700 | 1 | |a Vidal, Marc |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Genome research |d 1996 |g 14(2004), 10B vom: 06. Okt., Seite 2064-9 |w (DE-627)NLM085678031 |x 1549-5469 |7 nnns |
773 | 1 | 8 | |g volume:14 |g year:2004 |g number:10B |g day:06 |g month:10 |g pages:2064-9 |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 14 |j 2004 |e 10B |b 06 |c 10 |h 2064-9 |