World Community Grid Forums
Thread Status: Active Total posts in this thread: 8
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not sure if this issue has been raised previously, but here it is:
----------------------------------------Workunit Status Project Name: Human Proteome Folding - Phase 2 Created: 07/07/2012 20:09:07 Name: qf593_00081 Minimum Quorum: 15 Replication: 19 The large number of copies sent out for this workunit is due to the unique nature of this project. We encourage you to read the FAQs about this project for more information. Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit qf593_ 00081_ 0-- 640 Valid 2012/7/9 16:45:39 2012/7/11 15:28:44 5.85 130.0 / 150.4 qf593_ 00081_ 2-- 640 Valid 2012/7/9 16:32:45 2012/7/11 13:58:36 5.62 128.5 / 150.4 qf593_ 00081_ 5-- 640 Valid 2012/7/9 16:40:18 2012/7/11 12:47:52 6.68 161.6 / 150.4 qf593_ 00081_ 6-- 640 Valid 2012/7/9 16:29:48 2012/7/11 03:04:42 6.09 134.1 / 150.4 qf593_ 00081_ 15-- 640 Valid 2012/7/9 16:31:37 2012/7/11 02:02:58 6.97 168.2 / 150.4 qf593_ 00081_ 18-- 640 Valid 2012/7/9 16:37:18 2012/7/10 23:27:51 6.13 167.4 / 150.4 qf593_ 00081_ 1-- 640 Valid 2012/7/9 16:30:35 2012/7/10 22:54:06 8.21 148.0 / 150.4 qf593_ 00081_ 17-- 640 Valid 2012/7/9 16:29:18 2012/7/10 21:27:00 21.48 143.1 / 150.4 qf593_ 00081_ 4-- 640 Valid 2012/7/9 16:27:32 2012/7/10 19:30:58 3.70 149.8 / 150.4 qf593_ 00081_ 12-- 640 Valid 2012/7/9 16:27:50 2012/7/10 14:57:16 5.53 135.5 / 150.4 qf593_ 00081_ 11-- 640 Valid 2012/7/9 16:29:04 2012/7/10 11:18:39 3.17 100.0 / 150.4 <--mine qf593_ 00081_ 9-- 640 Valid 2012/7/9 16:27:48 2012/7/10 08:21:56 6.22 170.1 / 150.4 qf593_ 00081_ 3-- 640 Valid 2012/7/9 16:41:36 2012/7/10 08:13:16 6.60 156.7 / 150.4 qf593_ 00081_ 14-- 640 Valid 2012/7/9 16:29:59 2012/7/10 07:55:35 7.83 154.4 / 150.4 qf593_ 00081_ 10-- 640 Valid 2012/7/9 16:34:29 2012/7/10 02:17:43 4.87 153.3 / 150.4 qf593_ 00081_ 8-- 640 Invalid 2012/7/9 16:40:47 2012/7/9 16:41:28 0.00 0.0 / 240.1 qf593_ 00081_ 13-- 640 Invalid 2012/7/9 16:28:27 2012/7/9 16:28:54 0.00 0.0 / 230.9 qf593_ 00081_ 16-- - In Progress 2012/7/9 16:46:28 2012/7/19 16:46:28 0.00 0.0 
/ 0.0 qf593_ 00081_ 7-- - In Progress 2012/7/9 16:32:07 2012/7/19 16:32:07 0.00 0.0 / 0.0 Result Log Result Name: qf593_ 00081_ 8-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> NFO:[07:40:34] Start AutoGrid... autogrid: autogrid4: Successful Completion. INFO:[07:41:26] End AutoGrid... Beginning AutoDock... INFO: Setting num_generations: 27000 About to enter main loop...(dockings already completed: 100) Finished Docking number 0 ________________________________________________________________________________ autodock4: Successful Completion on "World Community Grid device" ________________________________________________________________________________ called boinc_finish called boinc_finish called boinc_finish [07:43:29] Number of tasks = 36 [07:43:29] Starting task 34,CPU time is 17707.501908. Unable to open PDBQT File. [07:43:29] ./ZINC49657359.pdbqt size = 0 0 ../../projects/www.worldcommunitygrid.org/gfam.x3EBI_B_w2WATs_PfAM1.pdbqt size = 8925 0 Unable to update graphics data. [07:43:30] ERROR: Vina exited with exit code = 1 VINA Error: Error: could not open ".\ZINC49657359.pdbqt" for reading. Retrying task. [07:43:35] Starting task 34,CPU time is 17707.501908. Unable to open PDBQT File. [07:43:35] ./ZINC49657359.pdbqt size = 0 0 ../../projects/www.worldcommunitygrid.org/gfam.x3EBI_B_w2WATs_PfAM1.pdbqt size = 8925 0 Unable to update graphics data. [07:43:36] ERROR: Vina exited with exit code = 1 VINA Error: Error: could not open ".\ZINC49657359.pdbqt" for reading. 07:43:36 (7388): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish <Log too long; snipped here (something similar to the remaining part of the log)> called boinc_finish called boinc_finish called boinc_finish called boinc_finish [11:31:44] Number of tasks = 27 [11:31:44]:ERROR: Unable to open output file gfam.x3BWKb_PfFP3_ZINC08755218_529273731_out.pdbqt. 
11:31:44 (7740): called boinc_finish [11:31:47] Number of tasks = 34 [11:31:47]:ERROR: Unable to open output file gfam.x3BWKb_PfFP3_ZINC23076791_1931820608_out.pdbqt. 11:31:47 (1920): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish </stderr_txt> ]]> I recently switched temporarily to get some HPF2 WUs to test my new machine. ALL three of these WUs the machine got have two of these rogue copies in them. The remaining two WUs haven't got validated yet, so dunno if those rogue copies will get free credits too. Edit: added indication for my machine's copy. Edit 2: another WU just got validated. And yes, same free shiny credits: Workunit Status Project Name: Human Proteome Folding - Phase 2 Created: 07/07/2012 20:09:07 Name: qf593_00075 Minimum Quorum: 15 Replication: 19 The large number of copies sent out for this workunit is due to the unique nature of this project. We encourage you to read the FAQs about this project for more information. 
Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit qf593_ 00075_ 9-- 640 Valid 2012/7/9 16:33:59 2012/7/11 17:10:13 6.05 37.4 / 152.7 qf593_ 00075_ 19-- 640 Valid 2012/7/9 16:32:01 2012/7/11 04:16:22 5.39 149.2 / 152.7 qf593_ 00075_ 7-- 640 Valid 2012/7/9 16:27:09 2012/7/11 03:57:13 6.31 157.0 / 152.7 qf593_ 00075_ 3-- 640 Valid 2012/7/9 16:38:58 2012/7/11 02:25:21 5.39 127.1 / 152.7 qf593_ 00075_ 11-- 640 Valid 2012/7/9 16:26:56 2012/7/11 02:02:58 7.07 172.8 / 152.7 qf593_ 00075_ 18-- 640 Valid 2012/7/9 16:29:02 2012/7/10 21:50:42 8.15 148.5 / 152.7 qf593_ 00075_ 2-- 640 Valid 2012/7/9 16:33:36 2012/7/10 21:45:07 6.01 160.3 / 152.7 qf593_ 00075_ 15-- 640 Valid 2012/7/9 16:51:52 2012/7/10 21:32:36 4.84 152.3 / 152.7 qf593_ 00075_ 4-- 640 Valid 2012/7/9 16:31:41 2012/7/10 18:20:27 17.56 143.1 / 152.7 qf593_ 00075_ 17-- 640 Valid 2012/7/9 16:50:23 2012/7/10 15:33:26 4.92 154.1 / 152.7 qf593_ 00075_ 8-- 640 Valid 2012/7/9 16:29:04 2012/7/10 14:31:07 3.20 101.4 / 152.7 <--mine qf593_ 00075_ 16-- 640 Valid 2012/7/9 16:31:37 2012/7/10 12:07:46 6.29 153.1 / 152.7 qf593_ 00075_ 0-- 640 Valid 2012/7/9 16:28:36 2012/7/10 08:46:20 5.65 161.9 / 152.7 qf593_ 00075_ 14-- 640 Error 2012/7/9 16:28:01 2012/7/9 16:31:05 0.00 137.8 / 0.0 qf593_ 00075_ 12-- 640 Invalid 2012/7/9 16:26:22 2012/7/9 16:26:45 0.00 0.0 / 243.7 qf593_ 00075_ 1-- 640 Invalid 2012/7/9 16:26:05 2012/7/9 16:26:25 0.00 0.0 / 234.4 qf593_ 00075_ 13-- - In Progress 2012/7/9 16:28:24 2012/7/19 16:28:24 0.00 0.0 / 0.0 qf593_ 00075_ 6-- - In Progress 2012/7/9 16:29:58 2012/7/19 16:29:58 0.00 0.0 / 0.0 qf593_ 00075_ 5-- - In Progress 2012/7/9 16:27:21 2012/7/19 16:27:21 0.00 0.0 / 0.0 qf593_ 00075_ 10-- - In Progress 2012/7/9 16:30:31 2012/7/19 16:30:31 0.00 0.0 / 0.0 [Edit 2 times, last edit by Former Member at Jul 11, 2012 5:15:59 PM] |
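For anyone who wants to pick such copies out of a result listing without eyeballing it, here is a minimal sketch. It assumes the rows have been copied by hand into tuples (only a handful from the qf593_00081 listing above are included), and the `Result` type and `rogue_copies` helper are made-up names for illustration, not anything provided by WCG or BOINC.

```python
from typing import NamedTuple


class Result(NamedTuple):
    name: str
    status: str
    cpu_hours: float
    claimed: float
    granted: float


# A few rows copied by hand from the qf593_00081 listing (not the full table).
ROWS = [
    Result("qf593_00081_0", "Valid", 5.85, 130.0, 150.4),
    Result("qf593_00081_11", "Valid", 3.17, 100.0, 150.4),
    Result("qf593_00081_8", "Invalid", 0.00, 0.0, 240.1),
    Result("qf593_00081_13", "Invalid", 0.00, 0.0, 230.9),
    Result("qf593_00081_16", "In Progress", 0.00, 0.0, 0.0),
]


def rogue_copies(rows):
    """Return results that report no CPU time yet were granted credit."""
    return [r for r in rows if r.cpu_hours == 0.0 and r.granted > 0.0]


for r in rogue_copies(ROWS):
    print(f"{r.name}: {r.status}, granted {r.granted}")
```

In-progress copies are skipped because their granted credit is still 0.0, so only the two Invalid copies with a non-zero grant are reported.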
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
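Update: The last of the three WUs has also been validated, and the same situation appears:

| Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time (hours) | Claimed / Granted BOINC Credit |
|---|---|---|---|---|---|---|
| qf593_00042_6-- | 640 | Invalid | 2012/7/9 16:28:04 | 2012/7/9 16:28:54 | 0.00 | 0.0 / 228.8 |
| qf593_00042_17-- | 640 | Invalid | 2012/7/9 16:17:51 | 2012/7/9 16:18:13 | 0.00 | 0.0 / 237.9 |

|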
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks Moonian,
I always appreciate people carefully reporting the inexplicable. At least we know that something strange is happening. I don't know when we will solve it though.

Lawrence |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Got one of this project's WUs again recently due to the HCC shortage. And it seems that the "cheaters" are still at large (this time, the HCC app is used to "run" this WU):
(Edit: Since the WU hasn't been validated yet, it's not certain whether this copy will eventually get a Valid status, but at the very least it shouldn't have received a Pending Validation status.)

Project Name: Human Proteome Folding - Phase 2
Created: 11/19/2012 21:13:27
Name: qj390_00107
Minimum Quorum: 15
Replication: 19
The large number of copies sent out for this workunit is due to the unique nature of this project. We encourage you to read the FAQs about this project for more information.

| Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time / Elapsed Time (hours) | Claimed / Granted BOINC Credit |
|---|---|---|---|---|---|---|
| qj390_00107_21-- | 640 | Pending Validation | 2012/11/22 07:06:39 | 2012/11/22 15:23:56 | 3.75 | 87.8 / 0.0 |
| qj390_00107_20-- | 640 | Pending Validation | 2012/11/21 17:00:43 | 2012/11/22 03:34:53 | 2.85 | 93.8 / 0.0 |
| qj390_00107_19-- | 640 | Pending Validation | 2012/11/21 12:01:59 | 2012/11/22 20:08:08 | 3.41 | 100.2 / 0.0 |
| qj390_00107_12-- | 640 | Error | 2012/11/21 06:09:50 | 2012/11/21 08:10:54 | 0.00 | 0.1 / 0.0 |
| qj390_00107_10-- | 640 | Pending Validation | 2012/11/21 06:08:44 | 2012/11/22 06:51:05 | 7.77 | 101.5 / 0.0 |
| qj390_00107_1-- | 640 | Error | 2012/11/21 06:07:48 | 2012/11/22 06:59:01 | 0.01 | 0.2 / 0.0 |
| qj390_00107_2-- | 640 | Pending Validation | 2012/11/21 06:07:45 | 2012/11/21 09:34:59 | 2.66 | 73.9 / 0.0 |
| qj390_00107_5-- | 640 | Pending Validation | 2012/11/21 06:07:43 | 2012/11/21 19:04:13 | 3.94 | 94.0 / 0.0 |
| qj390_00107_9-- | 640 | Pending Validation | 2012/11/21 06:07:42 | 2012/11/21 15:36:30 | 7.04 | 79.9 / 0.0 |
| qj390_00107_15-- | 640 | Pending Validation | 2012/11/21 06:07:41 | 2012/11/21 15:08:07 | 3.20 | 90.8 / 0.0 |
| qj390_00107_16-- | - | In Progress | 2012/11/21 06:07:37 | 2012/12/1 06:07:37 | 0.00 | 0.0 / 0.0 |
| qj390_00107_4-- | - | In Progress | 2012/11/21 06:07:37 | 2012/12/1 06:07:37 | 0.00 | 0.0 / 0.0 |
| qj390_00107_3-- | 640 | Pending Validation | 2012/11/21 06:07:37 | 2012/11/22 04:13:03 | 3.64 | 98.0 / 0.0 |
| qj390_00107_14-- | - | In Progress | 2012/11/21 06:07:34 | 2012/12/1 06:07:34 | 0.00 | 0.0 / 0.0 |
| qj390_00107_6-- | - | In Progress | 2012/11/21 06:07:28 | 2012/12/1 06:07:28 | 0.00 | 0.0 / 0.0 |
| qj390_00107_7-- | 640 | Error | 2012/11/21 06:07:26 | 2012/11/21 16:43:08 | 0.00 | 0.1 / 0.0 |
| qj390_00107_0-- | - | In Progress | 2012/11/21 06:07:24 | 2012/12/1 06:07:24 | 0.00 | 0.0 / 0.0 |
| qj390_00107_8-- | 640 | Pending Validation | 2012/11/21 06:07:24 | 2012/11/21 16:32:11 | 4.77 | 143.9 / 0.0 |
| qj390_00107_18-- | 640 | Pending Validation | 2012/11/21 06:07:22 | 2012/11/22 01:39:08 | 5.43 | 93.1 / 0.0 |
| qj390_00107_17-- | - | In Progress | 2012/11/21 06:07:19 | 2012/12/1 06:07:19 | 0.00 | 0.0 / 0.0 |
| qj390_00107_13-- | - | In Progress | 2012/11/21 06:07:16 | 2012/12/1 06:07:16 | 0.00 | 0.0 / 0.0 |
| qj390_00107_11-- | 640 | Pending Validation | 2012/11/21 06:07:15 | 2012/11/21 06:07:33 | 0.00 | 0.0 / 0.0 |

Result Name: qj390_00107_11--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Number of Images defined in image list is 2
07:02:30 (7064): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:07:35 (6664): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:14:22 (1328): called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:16:40 (6904): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:18:37 (6292): called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:20:24 (6612): called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:21:05 (6676): called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:22:51 (6644): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:26:21 (6772): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:29:10 (4868): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
Number of Images defined in image list is 2
07:33:00 (4600): called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
<snipped; stuff above repeated on an irregular basis>
called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish called boinc_finish
</stderr_txt>
]]>

[Edit 1 times, last edit by Former Member at Nov 23, 2012 5:23:39 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Let us clarify. There are no "cheaters" [sorry, I missed the humorous inflection], unless server 700, which is the one computing the "claim" off confused statistics, is deemed to be one. The client "claim" is thoroughly ditched these days. ;>) The overall science app speed for the project, the task duration and the device performance are tracked, and a credit "claim" is cooked from those. If the task ran in ZR, that claim will usually (not always) also be the grant; otherwise the grant is computed by the same rules as under server 601, with outlier/average handling or an adjustment to some median [I am seeing HFCC with the grant differing from the claim even when run alone]. Sample:

Minimum Quorum: 1
Replication: 1

| Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time / Elapsed Time (hours) | Claimed / Granted BOINC Credit |
|---|---|---|---|---|---|---|
| HFCC_target-10_01572186_target-10_0000_0-- | 640 | Valid | 11/21/12 07:36:23 | 11/23/12 07:15:21 | 7.40 | 142.5 / 141.2 |

Lifting one corner of the cover: the credit system is thoroughly incapable of understanding that result runtimes vary substantially within a single science app [the non-determinism]. Trying to understand the current [Berkeley] credit system requires the ingestion of some psychedelics, so I will not be discussing this any further.

A seeming mix-up of logs, with the log of engine X showing up for science Y, has been reported before [once or twice]. I don't remember the tech response; I think there was one, which makes it searchable somehow.

[Edit 1 times, last edit by Former Member at Nov 23, 2012 7:41:44 AM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just an update: The WU previously mentioned has been validated, and the WU copy in question has had its status changed from Pending Validation to Error (and, of course, no free shiny credits for you!):

| Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time / Elapsed Time (hours) | Claimed / Granted BOINC Credit |
|---|---|---|---|---|---|---|
| qj390_00107_11-- | 640 | Error | 2012/11/21 06:07:15 | 2012/11/21 06:07:33 | 0.00 | 0.0 / 0.0 |

Have the Techs intervened manually? Or is there a new mechanism to deal with this kind of WU when the validator runs (i.e. a WU copy can still get an Error instead of Invalid even if it was previously at Pending Validation status)? |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
PVal/PVer results have been able to convert to Error before.
Rest assured that with 1.2-1.4 million results coming through daily, the last thing the techs would do is intervene in the validation of a single result. BTW, at times some clients manage to *not* report their processing time. The servers are still able to compute the credit from the data in the result header, but in this case the red marked results were not on the client for longer than 20 seconds. The HPF2 will in all likelihood have been the infamous /711 |
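How briefly those copies sat on the client can be checked roughly from the Sent and Return timestamps quoted earlier in the thread. The following is only a sketch: the `turnaround_seconds` helper is a made-up name, the timestamps are typed in by hand in `YYYY/M/D HH:MM:SS` form, and it measures sent-to-return wall time, which is not necessarily the same thing as time spent on the client.

```python
from datetime import datetime

# Assumed timestamp layout, matching the listings quoted in this thread.
FMT = "%Y/%m/%d %H:%M:%S"


def turnaround_seconds(sent: str, returned: str) -> int:
    """Seconds between the Sent Time and the Return Time of one result."""
    delta = datetime.strptime(returned, FMT) - datetime.strptime(sent, FMT)
    return int(delta.total_seconds())


# (result name, sent, returned) copied by hand from the posts above.
SUSPECTS = [
    ("qf593_00081_8", "2012/7/9 16:40:47", "2012/7/9 16:41:28"),
    ("qf593_00081_13", "2012/7/9 16:28:27", "2012/7/9 16:28:54"),
    ("qj390_00107_11", "2012/11/21 06:07:15", "2012/11/21 06:07:33"),
]

for name, sent, returned in SUSPECTS:
    print(f"{name}: {turnaround_seconds(sent, returned)} s")
```

For these three rows the gap comes out at 41, 27 and 18 seconds, so every one of them was back on the server in well under a minute.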
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Ah, I see. Thanks for the info, Rob. |
||