| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 21
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Welcome to the “Genome Comparison” project. Your contribution will provide valuable information to researchers around the world, studying proteins function and their roles in disease processes while also contributing to discoveries in medicine and biology. The project is finding, in a consistent and comprehensive manner, all possible similarities between predicted proteins from all of the known genome sequences decoded to date. We thank all of the members of World Community Grid for their valuable contribution to this exciting project form Brasil.
|
||
|
|
teletran
Senior Cruncher Joined: Jul 27, 2005 Post Count: 378 Status: Offline |
Thanks for offering this project. I just added it to my profile. Hope to get my first work unit soon :)
---------------------------------------- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
*whoop* new project
gig'em crunchers |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
coool new project:)
----------------------------------------very usefull genetics also like hpf notlike some useless projects like climate prediction :) it all serves a greater purpose http://www.sens.org/index.html -- This post has been edited to comply with forum rules. - nelsoc [Edit 4 times, last edit by Former Member at Nov 19, 2006 4:46:06 PM] |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
In conjunction with our 2 year anniversary..... already 2000 WU's done
---------------------------------------- ![]()
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
A simple question which needs a simple answer: What's the difference between this project and SIMAP?
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
nice! and very small requirements too....
----------------------------------------[Edit 1 times, last edit by Former Member at Nov 17, 2006 10:43:12 PM] |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Halfcard, if considering that some 10 reports were actually received on HPF2 problems and to put it in perspective yesterday's 11,541 HPF2 Work units completion or < 0.087% on 1 days production, you are seeding the entirely wrong message.
----------------------------------------thx for being considerate.
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Nov 17, 2006 11:09:51 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
A simple question which needs a simple answer: What's the difference between this project and SIMAP? (Added: The answer was posted in http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=10012#79080 ) Let me ask the same question in a more general setting. Here is a brief article on BLAST which explains how it replaced the FASTA algorithm due to superior speed: http://www.the-scientist.com/article/display/15682/ Here is some information about PUMA2, a system for High-throughput analysis of (currently 1034) genomes which uses BLAST: GADU – Puma2: http://compbio.mcs.anl.gov/ 2005 TeraGrid News: http://www.teragrid.org/news/news05/gadu.html 2006 Nucleic Acid Research PUMA2: http://nar.oxfordjournals.org/cgi/content/full/34/suppl_1/D369 SIMAP ( http://boinc.bio.wzw.tum.de/boincsimap/ ) is described as follows: What is SIMAP? SIMAP is a database of protein similarities and protein domains. It contains about all currently published protein sequences and is continuously updated. Protein similarities are computed using the FASTA algorithm which provides optimal speed and sensitivity. Protein domains are calculated using the InterPro methods and databases. SIMAP is to our knowledge the only project that combines comprehensive coverage with respect to all known proteins and incremental update capabilities. The Fiocruz Genome Comparison Project will produce a database comparing the genes from many genomes with each other using SSEARCH ( http://helix.nih.gov/docs/gcg/ssearch.html ). Here is a brief description I found: SSEARCH - Performs a rigorous Smith-Waterman alignment between a protein sequence and another protein sequence or a protein database, or with DNA sequence to another DNA sequence or a DNA library (very slow). I notice that each of these projects (Genome Comparison, PUMA2, SIMAP) uses a different algorithm (SSEARCH, BLAST, FASTA). What other similarities and differences do they have? (Please note - I'm not complaining. I think that important scientific work ought to be done using many different methods to discover as much as possible and catch errors.) Added: For the benefit of people reading this thread who are trying to learn more, I will post the URL for the Genome Comparison project description: http://www.dbbm.fiocruz.br/GenomeComparison It begins: The Genome Comparison Project: Improving protein functional annotation in databases Over the years, a rather large body of secondary information (structural, functional, similarities to other entries and a variety of cross-references) has been attached to protein database entries. Once such information is entered, it rarely gets updated or corrected. Thus, annotation of predicted protein function is often incomplete, uses non-standardized nomenclature or can be incorrect when inferred from previous, incorrectly annotated sequences. Additionally, many proteins are composed of several structural and/or functional domains (modules comprising distinct evolutionary, functional and structural units), which can be overlooked by automated annotation procedures. Moreover, the comparative information available today is huge when compared to the early days of genomics. The main objective of the Genome Comparison Project is to perform a complete pairwise comparison between all predicted protein sequences, obtaining similarity indices that will be used, together with standardized Gene Ontology (http://www.geneontology.org), as a reference repository for the annotator community, providing an invaluable data source for biologists. The sequence similarity comparison program used in the Genome Comparison Project is called SSEARCH (W.R. Pearson [1991] Genomics 11:635-650), a freely available implementation of the Smith-Waterman rigorous algorithm (T. F. Smith and M. S. Waterman [1981] J. Mol. Biol. 147:195-197) (algorithm is an organized procedure for performing a given type of calculation or solving a given type of problem), which finds the mathematically best local alignment between pairs of sequences. As a result, precise annotation, correction of inconsistencies, and assignment of possible functions to hypothetical proteins of unknown function will be possible. Moreover, proteins with multiple domains and functional elements will be correctly spotted. Even distant relationships will be detected. Added even later: I have been thinking about Genome Comparison and SIMAP. Instead of just waiting for an answer, I'll throw out my most recent idea for public display. ( I must like to hear myself talk. I must have been insufferable back in the third grade. Come to think of it, I remember being surprised back in high school when I could not convince a schoolmate that there were no flying saucers and gave up the argument. One of the bystanders exclaimed that it was great to finally see me lose an argument. I had not thought of my arguing in that sort of way. After all - - I was always right! Just insufficiently persuasive some times. Ummm. . . . Perhaps I achieved my present state of perfection more slowly than I like to think. )SIMAP compares proteins and Genome Comparison compares genes. Although there is a very close relationship between genes and the proteins that they code for, it is not a direct one-to-one relationship. The genetic code ( http://en.wikipedia.org/wiki/Genetic_code ) has several alternative ways to code for each amino acid in a protein. Even worse, there are some species that have modified the genetic code so that they produce non-standard amino acids in response to a genetic sequence. Just looking at the protein that is produced from a gene loses a lot of information that is present in the gene. So evolutionary relationships should be tracked using genes as the Fiocruz Genome Comparison project does rather than by using proteins from SIMAP. The functions performed by a protein are more likely to be the same in different organisms if they are closely related rather than distantly related. This must seem obvious to anyone in the biological sciences. But I studied math and physics. ![]() Added second thoughts: But according to the project description SSEARCH runs on the derived protein sequence, so it does not appear to use any genetic information directly. I guess I will have to wait for the explanation. [Edit 6 times, last edit by Former Member at Dec 1, 2006 1:09:30 AM] |
||
|
|
depriens
Senior Cruncher The Netherlands Joined: Jul 29, 2005 Post Count: 350 Status: Offline Project Badges:
|
Great to see this new project boarding! Hopefully everything will go well and we will be updated from time to time by the project scientist(s).
---------------------------------------- Even better is the fact that it has such friendly system requirements. I have an old AMD Athlon 700MHz / 128MB RAM which couldn't run the other projects. So I have set a profile to crunch FCG exclusively! And it's running like a charm! ![]() Of course it will be crunched on all my other machines as well! For the rest; I'm with Morphy375 and lawrencehardin. Thanks! ![]() |
||
|
|
|