Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 10
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1485 times and has 9 replies Next Thread
xterminatordedust
Cruncher
Joined: Nov 17, 2004
Post Count: 12
Status: Offline
Reply to this Post  Reply with Quote 
Rosetta@home

http://boinc.bakerlab.org/rosetta/

You can now participate in rosetta@home , a nw project who use the Rosetta software and the BOINC plateforme. This project will develop the rosetta software and to study new simulation methods to help develop cure for diseases such HIV, cancer, malaria.....

This project will make available all results: "everything will be publicy available".

It is important to help this project too, which is compementary with WCG and HPF.

http://boinc.bakerlab.org/rosetta/forum_thread.php?id=7 (it will be used for HPF)
----------------------------------------
[Edit 1 times, last edit by xterminatordedust at Sep 17, 2005 12:12:18 PM]
[Sep 17, 2005 12:09:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

Here is a related article at http://www.innovations-report.com/html/reports/studies/report-49357.html

Computers Make Big Strides in Predicting Protein Structure

Computers can predict the detailed structure of small proteins nearly as well as experimental methods, at least some of the time, according to new studies by Howard Hughes Medical Institute researchers.

The findings, which were reported in the September 16, 2005, issue of the journal Science, provide a glimmer of hope that scientists eventually may be able to determine the structure of proteins from their genomic sequences, a problem that has seemed insurmountable.

"For more than 40 years, people have known the amino acid sequence of a protein specifies its three-dimensional structure, but no one has been able to translate the sequence into an accurate structure," said senior author David Baker, an HHMI researcher at the University of Washington. "The reason this research is exciting is that we’re showing progress in predicting the structure from the sequence. It’s not that the problem is solved, but that there is hope."

Proteins are biological machines, and scientists need to determine their structures to understand how the proteins work. Now, scientists determine structures exclusively by measuring the atomic characteristics of proteins in the lab. In contrast, "in this case, we never touched a test tube," Baker said. "We gave it to a computer and said, ’go.’"

In the study, a sophisticated computer program folded 17 short strings of amino acids into 100,000 possible variations. When the researchers compared the best predictions to the actual structures solved earlier by other scientists using experimental techniques, they had the same success rate as the best hitters in major league baseball.

"For about one-third of our benchmark set of small proteins, we generated relatively high-resolution structure predictions, with parts of the structures predicted to near-atomic resolution," said first author Philip Bradley, a postdoctoral fellow in Baker’s lab. "For us, it is a real step forward to achieve structures that are in some way comparable to what you can get by experiments."

The encouraging results come from a refinement of a sophisticated computer modeling program called Rosetta, first developed several years ago in Baker’s lab. The program works on the premise that proteins collapse into their lowest energy state, like a ball that rolls down a hill until it comes to rest on level ground. The energies of hundreds of thousands of possible shapes generated by the computer are computed, and the lowest energy shape is selected as the prediction.

The prediction process happens in two steps, Bradley said. The first stage uses an approximate model which allows rapid calculation of the energy and so can be carried out rapidly, while the second uses a very detailed model for which the energy calculations take much longer but are much more accurate. A large scale search through possible structures is carried out in the first stage, and promising locations are then explored in detail in the second stage.

The first stage takes advantage of the fact that all amino acids have identical sections, which form the protein backbone. The computer adds a fuzzy picture of the protruding side chains that give each amino acid its unique identity. The sequence of side chains ultimately gives each protein its characteristic shape by the environment and neighbors they prefer.

Then the computer randomly twists, loops, and bends each amino acid sequence into 100,000 different shapes based on the preferred location of the amino acids. Some amino acids tend to dive toward the watery world of the protein surface while others take cover inside the protein. The computer also accounts for the social habits of the 20 amino acids; some want to be close to each other and others like their distance.

In stage two, Rosetta replaces the fuzzy picture of the side chains with detailed, physically realistic models with all the atoms represented. From the positions of the atoms in the sidechains and the protein backbone, the computer then uses a detailed physical chemistry based force field which favors close packing of atoms and hydrogen bonding to more accurately compute the energy of the structure.

"What seems to be critical is the packing of the molecule," Baker said. "The protein fits together perfectly with no holes in the middle, and no atoms on top of each other. It’s about as densely packed as it could be. It’s like a three-dimensional jigsaw puzzle."

The researchers upped their odds of finding the right match by repeating the two-step process with 50 homologs of the proteins from other genomes, such as a mouse or fly. The protocol was first tested on a blind annual prediction test considered to be the highest standard for removing bias from protein structure prediction models.

"We can’t compute the energies perfectly, but the biggest problem is the search through possible shapes," Baker said. "Where we were not getting the right answer on the computer, it was almost always the case that the actual structure had the lowest energy, so we would have succeeded if we had explored this part of the space."

In a related paper published in the August issue of the journal Proteins, Baker and his colleagues reported that similar approaches can be used to predict the structures of protein complexes. "For the first time, computational methods are able, for a subset of cases, to produce really accurate models," he said.

Baker compares the computer simulations of the proteins to the problem of trying to find the lowest point on the surface of the Earth for the first time. A simple way to find the lowest place on the planet is to send out as many explorers as possible. The more explorers there are the more likely one of them is to stumble onto the shoreline of the Dead Sea - the Earth’s lowest point on land not covered by water. Each of the thousands of computer simulations is like one explorer.

Although the 33 percent success rate reported in the Science paper might be good enough to secure hall-of-fame status for a baseball player, Baker is quick to point out that it is not yet reliable enough for biology. Better models will depend on both smarter exploration strategies and more computer power. "If methods stayed where we are, we wouldn’t solve the problem," Baker said. "On the other hand, we would do better with 10 times more computer time."

It takes less than one minute for a protein to fold into its correct shape in cells, but one oft-repeated estimate predicts it would take longer than the age of the universe for a computer to sample all the possible confirmations of a folded protein. Baker’s lab already receives help from supercomputing centers in San Diego and Illinois.

More help will soon be on its way from many of the 5,000 freshman entering University of Washington this fall. Using software developed to assist the Search for Extraterrestrial Intelligence (SETI) project, the students can put their computers to work at night while they are sleeping to search the atomic landscape for the lowest energy structure of proteins.

To improve protein structure prediction further, Baker’s group has also started a distributed computing project that they are hoping will be aided by members of the public. The project, called Rosetta@home, is a scientific research project that uses internet-connected computers to predict and design protein structures, and protein-protein and protein-ligand interactions. The goal is to develop methods that accurately predict and design protein structures and complexes, an endeavor that may ultimately help researchers develop cures for human diseases such as cancer, HIV/AIDS, and malaria. More information is available online at http://boinc.bakerlab.org/rosetta.

More information:
boinc.bakerlab.org/rosetta
www.hhmi.org
[Sep 17, 2005 4:53:32 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Rosetta and David Baker

Here is a very interesting article in the Sep 2005 issue of NIH Findings:
http://www.nigms.nih.gov/news/findings/sept05/business.html

It has a lot of pictures, so I will not copy it.
[Sep 17, 2005 5:11:24 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Rosetta@Home Research Page

Here is the URL to the Research Page at Rosetta@Home:
http://boinc.bakerlab.org/rosetta/rah_research.php
[Sep 17, 2005 5:41:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Impx
Cruncher
Joined: Sep 7, 2005
Post Count: 9
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Rosetta@home

How does this differ to what we are attempting here? Are they complimentory or are they similar projects ? Any duplicate effort? Or different objectives ?

I did attempt to read the post but if someone could shorten down the explanation to explain the differences between the 2 projets that would be greatly appreciated.

Thanks. :)
[Sep 18, 2005 8:57:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

Hello Impx,

I will give you my opinion, which may be totally wrong. Rosetta is a Swiss-knife type of program that can handle proteins several different ways. We are currently using a low-resolution method of folding proteins which treats an entire residue (amino acid) in the protein as a group. There is another method in Rosetta which can refine an approximate structure by calculating the forces generated by each atom. This is much more computationally intensive but hopefully it produces a more accurate protein structure. We have been referring to a possible project using this high-resolution method as HPF2, to distinguish it from our current project using the low-resolution method, which we sometimes refer to as HPF1. We will have more information about HPF2 after the Advisory Board considers the matter at the end of September. We are running HPF1 on every unknown human protein that is small enough to try to fold with our current version of Rosetta. The current expectation is that, if we commit to HPF2, the target protein list will be much smaller and will have to pass scrutiny according to guidance from the Advisory Board.

I would not expect any duplication of effort unless Dr. Baker decided to compare the output of two different versions of Rosetta. Looking at http://vspx27.stanford.edu/DCcomparison.html I see that Folding@Home is running on the equivalent of about 200,000 CPUs while the World Community Grid is just over 20,000 full-time equivalent CPUs this Sunday. Folding@Home is simulating folding a protein atom-by-atom. Protein folding requires vast computer resources. I doubt that any scientist is going duplicate any effort unless there is a valid need for a cross-check.

Keep in mind; I am just offering my opinion.
mycrofth
[Sep 18, 2005 10:29:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

mycrofth, thanks a lot for your explanation. I was expecting some overlap between the results of Predictor@Home, WCG, and Rosetta due to the nature of these projects.
[Sep 18, 2005 11:02:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

The message board at Rosetta@home has 2 posts on it that explain the purpose of this project at http://boinc.bakerlab.org/rosetta/forum_thread.php?id=4 and at http://boinc.bakerlab.org/rosetta/forum_thread.php?id=7

They both say that the purpose of Rosetta@home is to help Dr. David Baker at the University of Washington develop and test better algorithms for Rosetta. It takes some major computer power to make test runs trying to improve Rosetta. So this is very similar to Predictor@home at http://predictor.scripps.edu/ at the Scripps Research Institute.

I refrained from pointing this out, not knowing if the posters were reliable sources or not, but back-channel info agrees with these posts. So that's the way to bet.

mycrofth
[Sep 19, 2005 4:41:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

How does the human proteome folding project (HPF) on the world community grid (WCG) relate to Rosetta@home?

We are working with the bakerlab on HPF on the WCG and think of the two grids as very different parts of the solution of getting function out of fold prediction: one using Rosetta and one improving Rosetta (Dbaker is involved in both projects). I spoke with David Baker this mourning and we talked about the rolls for the two grids. We also talked about differentiating the two grids at the last annual Rosetta code developers meeting (RosettaCON-2005). It is important to differentiate Rosetta@home from the HPF project (the one currently running on the grid), so I’ll take a few lines to explain each from the perspective of motivation. The two grids HPF and Rosetta@home are not competing grids and we would like to see them both thrive.

Rosetta@home is run by the bakerlab (David Kim) as a way to accelerate development of the Rosetta code. With the focus on all-atom refinement and protein design even their benchmarks (to see if they broke the code or improved the code) are taking a large amount of time. Thus, Rosetta@home is primarily for testing new versions of the Rosetta code and making Rosetta better. Also the robetta server might be hooked up to this project as well. This ideally is a smaller grid than the WCG that meets the spikes in the compute demands of the robetta server and the bakerlab, which are modest compared to the HPF projects computational needs. Rosetta@home will not produce databases that will in turn be used by biologists but if helpd to make the code better, which in turn helps efforts, like HPF, that use the code to give biologists usable fold and function predictions.

Conversely, our project, HPF on the WCG, aims to use stable versions of Rosetta to make predictions that can be presented to biologists and biomedical researchers in comprehensive databases with intuitive front ends. HPF on the WCG can be thought of as the production phase of the project, where we produce function annotation for many genomes and then distribute this product to biologists. Due to the large number of proteins we’re folding (comprehension is essential) we need a great deal more computer time that the problems Rosetta@home as designed to carry out.

Hope this helps…
[Sep 19, 2005 9:26:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Rosetta@home

Here is an article dated 21 Sep 2005 about folding proteins and Rosetta@home in the Telegraph at http://www.telegraph.co.uk/connected/main.jht...d/2005/09/21/ecfori21.xml

Cracked: the puzzle of protein origami
By Roger Highfield
(Filed: 21/09/2005)

Scientists are close to cracking the code of life with an advance that has far-reaching implications for the understanding, diagnosis and treatment of disease. They have known for more than 40 years how the genetic code held in DNA describes the sequence of molecular building blocks - amino acids - that make up proteins, the complex molecules used to build and run our bodies.

But it has not been possible to predict the shape of proteins, which is crucial for understanding how these biological machines work. Solving this three-dimensional puzzle has been one of the grand challenges of biology, one that could speed the development of new drugs to treat cancer, heart disease and all other ailments.

Now Prof David Baker, Dr Philip Bradley and Dr Kira Misura of the University of Washington, Seattle, report in the latest issue of the journal Science that they have managed the feat to high accuracy for a handful of small proteins, providing the first hope that it is possible.

The hard part has been working out the way in which the chain of amino acids described by the DNA code folds up to make the three-dimensional shape of a protein, which is fundamental to the way the protein works in the body.

The basic idea of protein origami is simple: work out each possible shape the string of amino acids could adopt and then work out the energy of each one. The shape with the lowest energy should be the most stable and thus the most likely.

The problem is that trying all the possible conformations is impossible - the number of permutations rapidly spins out of control. To make the problem tractable, Prof Baker and his colleagues devised a two-step search method to work out the shape of 17 short strings of amino acids.

A sophisticated computer program called Rosetta randomly twists, loops and bends a simple representation of the amino acid chain into 100,000 different shapes based on the preferred location of the 20 amino acids: some like water and others don't, for example.

Then, atomically detailed models are built based on the most promising shapes found in this "low-resolution" search and used as the starting points for a high-resolution energy search of possible structures.

"For about one third of our benchmark set of small proteins, we generated relatively high-resolution structure predictions, with parts of the structures predicted to near-atomic resolution," said Dr Bradley.

One successful target was a protein from a competition, the Sixth Critical Assessment of Techniques for Protein Structure Prediction, which has run for a decade. The method "showed the best accuracy by far", said the competition's co-organiser Dr Tim Hubbard of the Wellcome Trust Sanger Institute in Hinxton, near Cambridge. He hailed the achievement as "very significant".

The new work provides the first real hope that scientists eventually may be able, with sufficient computer power, to determine the structure of proteins from their genomic sequences. To improve protein structure prediction further, Prof Baker yesterday invited Telegraph readers with internet access to take part in his Rosetta@home project, on www.boinc.bakerlab.org/rosetta.

The goal is to develop methods that accurately predict and design protein structures and complexes, an endeavour that may ultimately help researchers develop cures for human diseases such as cancer, Aids and malaria. Dr Hubbard called it "one of the holy grails of computational biology".
[Sep 25, 2005 9:44:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread