World Community Grid - View Thread - Welcome to the Uncovering Genome Mysteries project

World Community Grid Forums

Category: Completed Research

Forum: Uncovering Genome Mysteries

Thread: Welcome to the Uncovering Genome Mysteries project

Quick Go »

No member browsing this thread

Thread Status: Active
Thread Type: Sticky Thread
Total posts in this thread: 44

[ ]

Author

This topic has been viewed 17259 times and has 43 replies

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: Welcome to the Uncovering Genome Mysteries project

Thanks, we are excited to be part of the WCG.

[Oct 18, 2014 4:19:39 AM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: Welcome to the Uncovering Genome Mysteries project

Hello,

The database will be freely available, yes.
The project compares predicted protein sequences, mostly from environmental metagenomic samples, contributing to annotation, and studies on metabolic pathways from micro-organisms.
Comparing non-coding sequences (DNA) can be done within a restricted dataset, but has other purposes. The totality of known DNA sequences is now far too large for such overall comparisons.

[Oct 29, 2014 9:35:58 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

Hello Crystal,

Simap has indeed the purpose of mapping protein similarities essentially from the public reference protein sequences and especially domains, resulting in a specific database/repository with very valuable research tools. Back in 2006/2007, the Genomecomparison project focussed on protein datasets from whole genomes, and the use of the rigorous ssearch algoritm for enhanced statistical confidence for inter-genome distance calculations and other applications, while Simap ran a much faster (directional hit detection) Blast implementation, in later years redoing the calculations similar to Genome Comparison.
In the Uncovering Genome Mysteries project, we are concentrating much more on metagenomic sequences from environmental samples, containing mostly as yet unknown organisms. The project involves a very large dataset and aims at the discovery of new enzymatic functions, unusual metabolic pathways, and also pretends to shed more light on the ecological relationships and interactions between micro-organisms in specific niches.

[Oct 29, 2014 9:56:04 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: Welcome to the Uncovering Genome Mysteries project

Hi gb,

We´ll do our best to follow discussions and interact with the WCGrid contributors!

[Oct 29, 2014 10:01:38 PM]

Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1414
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

90 day badge for Discovering Dengue Drugs - Together

1 year badge for Nutritious Rice for the World

90 day badge for The Clean Energy Project

2 year badge for Help Fight Childhood Cancer

90 day badge for Influenza Antiviral Drug Search

2 year badge for Help Cure Muscular Dystrophy - Phase 2

2 year badge for Discovering Dengue Drugs - Together - Phase 2

2 year badge for The Clean Energy Project - Phase 2

2 year badge for Computing for Clean Water

2 year badge for Drug Search for Leishmaniasis

2 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

20 year badge for Mapping Cancer Markers

2 year badge for Uncovering Genome Mysteries

20 year badge for Outsmart Ebola Together

20 year badge for FightAIDS@Home - Phase 2

20 year badge for Smash Childhood Cancer

5 year badge for Microbiome Immunity Project

10 year badge for Africa Rainfall Project

50 year badge for OpenPandemics - COVID-19


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

Thanks Wim for your insight answer.

Since Sept/Oct 2009 SIMAP (we volunteers wink

) also calculated millions and milions of sequences from environmental genomes. You surely know or even met Prof. Thomas Rattei (now from Vienna University) and are aware of the treasures you may find in the SIMAP-database.

"Heel veel succes met jullie UGM ontdekkingsreis".

CP

[Oct 30, 2014 8:47:48 AM]

Antonius_Block
Cruncher
Joined: Sep 10, 2011
Post Count: 3
Status: Offline
Project Badges:

45 day badge for Human Proteome Folding - Phase 2

14 day badge for Help Fight Childhood Cancer

14 day badge for Help Cure Muscular Dystrophy - Phase 2

45 day badge for The Clean Energy Project - Phase 2

14 day badge for Computing for Clean Water

45 day badge for Drug Search for Leishmaniasis

14 day badge for GO Fight Against Malaria

90 day badge for Outsmart Ebola Together

45 day badge for FightAIDS@Home - Phase 2

14 day badge for Microbiome Immunity Project

14 day badge for Africa Rainfall Project

90 day badge for OpenPandemics - COVID-19


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

I hope you won't be looking for a "cure" for autism. We get enough of that insulting condescension from the anti-vaxxers and dumbass neurotypicals who call it a disease.

[Nov 2, 2014 5:17:47 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

i'm pretty sure, there is no special intent to find a cure for autism, as there is nothing mentioned in this direction.

[Nov 2, 2014 6:59:13 PM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

I hope you won't be looking for a "cure" for autism. We get enough of that insulting condescension from the anti-vaxxers and dumbass neurotypicals who call it a disease.

Until now, you are the only one insulting.

[Nov 3, 2014 7:06:15 AM]

numbermaniac
Cruncher
Australia
Joined: Mar 28, 2014
Post Count: 46
Status: Offline
Project Badges:

45 day badge for Uncovering Genome Mysteries

14 day badge for Outsmart Ebola Together


Re: ï»¿Welcome to the Uncovering Genome Mysteries project

How many proteins are compared in each workunit? It seems to me about 18,000 but I'm just curious.

[Dec 9, 2014 8:41:51 AM]

seippel
Former World Community Grid Tech
Joined: Apr 16, 2009
Post Count: 392
Status: Offline
Project Badges:

14 day badge for Nutritious Rice for the World

10 year badge for Mapping Cancer Markers

180 day badge for Uncovering Genome Mysteries

2 year badge for Outsmart Ebola Together

1 year badge for FightAIDS@Home - Phase 2

180 day badge for Smash Childhood Cancer

2 year badge for Microbiome Immunity Project

180 day badge for Africa Rainfall Project

2 year badge for OpenPandemics - COVID-19


Re: Ã¯Â»Â¿Welcome to the Uncovering Genome Mysteries project

The answer is that the number of proteins in each work unit can vary widely. Each work unit consists of two file of proteins which are compared to each other (every protein in file A is compared to every protein in file B). Shorter proteins take less time to compare than longer proteins. The work unit generation program does some estimating to determine how proteins should be in each file to achieve the targetted runtime. So if the proteins being compared are short, it will compensate by adding more proteins to the file (and vice versa). From a quick sampling, the biggest number of proteins I saw in one of the two work unit files was 116k proteins, but this was just a sampling.

Seippel

[Dec 9, 2014 9:23:28 PM]

[ ]