Deadlines are all screwed up.

Message boards : Number crunching : Deadlines are all screwed up.


Sandman192

Joined: 22 Sep 07
Posts: 16
Credit: 2,018,819
RAC: 0
Message 95195 - Posted: 23 Apr 2020, 9:14:34 UTC

I get work, and it runs 24 hours a day with my CPU at 100%. Work doesn't even get canceled after being 2 days behind; it just keeps crunching all day long. That's 4 GHz at 100% with no gaming at all during that time, and apparently that is not fast enough. This is only happening with Rosetta's work. Prime work, some of which takes 15 days to finish on CPUs, still finishes on time even after I suspend it for over 2 days to play games.

BOINC checks how long the work takes, how fast my CPU is, and when the deadline is, yet it still grabs the work and never finishes it on time. Resetting does nothing.

4/23/2020 12:14:15 AM | Rosetta@home | Result r4d_2598_fold_SAVE_ALL_OUT_918965_80_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result r3x_8820_fold_SAVE_ALL_OUT_920503_80_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 8mx1zm8r_26HBAAB4HM3HBBE26H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_2_SAVE_ALL_OUT_IGNORE_THE_REST_1th5wo8t_924136_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 3mz0ij2n_26HBAAB4HM2HBAAB26H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_1_SAVE_ALL_OUT_IGNORE_THE_REST_4vt5at1a_924135_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_2_SAVE_ALL_OUT_IGNORE_THE_REST_8xu0gk4n_924136_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 9uq0kw3c_25HBBAAB1HM2HGABAB25H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result polar_protein_test1_SAVE_ALL_OUT_IGNORE_THE_REST_1sy7xu3a_924137_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 6vh6ar1a_24HGABAB2HM0HBBAAB24H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 6at9xn7w_24HBGBB2HM0HBBE24H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 0ep2st7r_23HBAB1HM0HGBBB23H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_1_SAVE_ALL_OUT_IGNORE_THE_REST_0pu4wc7d_924135_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_2_SAVE_ALL_OUT_IGNORE_THE_REST_0ne7zu3d_924136_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_2_SAVE_ALL_OUT_IGNORE_THE_REST_0nr6vb2s_924136_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_2_SAVE_ALL_OUT_IGNORE_THE_REST_3fd9md3r_924136_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1p9m_1_SAVE_ALL_OUT_IGNORE_THE_REST_2vj9ex5l_924135_1_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 8qt3pd5a_26HBBBB4HM0HBBBB26H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_21_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1bqu_1_SAVE_ALL_OUT_IGNORE_THE_REST_7sk1an0s_924127_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 7vw2yn8x_26HBBBB4HM4HGB26H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_21_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result Mini_Protein_binds_IL6R_COVID-19_1bqu_2_SAVE_ALL_OUT_IGNORE_THE_REST_3ds7dz6i_924128_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result 6ky5uo8q_26HBAB4HM0HBAB26H_build_COVID-19_binder_build1_SAVE_ALL_OUT_924149_21_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Result polar_protein_test1_SAVE_ALL_OUT_IGNORE_THE_REST_4zx0ph3o_924137_2_0 is no longer usable
4/23/2020 12:14:15 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:14:48 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:15:03 AM | Rosetta@home | No tasks sent
4/23/2020 12:15:03 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 12:15:03 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:25:20 AM | Rosetta@home | No tasks sent
4/23/2020 12:25:20 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 12:25:20 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:40:19 AM | Rosetta@home | Task hgfp_loop2_788_fold_SAVE_ALL_OUT_915548_202_1 exited with zero status but no 'finished' file
4/23/2020 12:40:19 AM | Rosetta@home | If this happens repeatedly you may need to reset the project.
4/23/2020 12:40:19 AM | Rosetta@home | Task hgfp_split2_206_fold_SAVE_ALL_OUT_915949_202_1 exited with zero status but no 'finished' file
4/23/2020 12:40:19 AM | Rosetta@home | If this happens repeatedly you may need to reset the project.
4/23/2020 12:40:19 AM | Rosetta@home | Task hgfpsplit2_39_fold_SAVE_ALL_OUT_916387_202_1 exited with zero status but no 'finished' file
4/23/2020 12:40:19 AM | Rosetta@home | If this happens repeatedly you may need to reset the project.
4/23/2020 12:40:38 AM | Rosetta@home | No tasks sent
4/23/2020 12:40:38 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 12:40:38 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:40:51 AM | Rosetta@home | No tasks sent
4/23/2020 12:40:51 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 12:40:51 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 12:47:33 AM | Rosetta@home | No tasks sent
4/23/2020 12:47:33 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 12:47:33 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 1:11:49 AM | Rosetta@home | No tasks sent
4/23/2020 1:11:49 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 1:11:49 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 1:53:44 AM | Rosetta@home | No tasks sent
4/23/2020 1:53:44 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 1:53:44 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 1:53:58 AM | Rosetta@home | No tasks sent
4/23/2020 1:53:58 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 1:53:58 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:00:06 AM | Rosetta@home | No tasks sent
4/23/2020 2:00:06 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:00:06 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:00:27 AM | Rosetta@home | No tasks sent
4/23/2020 2:00:27 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:00:27 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:10:28 AM | Rosetta@home | No tasks sent
4/23/2020 2:10:28 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:10:28 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:10:44 AM | Rosetta@home | No tasks sent
4/23/2020 2:10:44 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:10:44 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:11:20 AM | Rosetta@home | No tasks sent
4/23/2020 2:11:20 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:11:20 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:11:35 AM | Rosetta@home | No tasks sent
4/23/2020 2:11:35 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:11:35 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:13:00 AM | Rosetta@home | No tasks sent
4/23/2020 2:13:00 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:13:00 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:13:13 AM | Rosetta@home | No tasks sent
4/23/2020 2:13:13 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:13:13 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:26:04 AM | Rosetta@home | No tasks sent
4/23/2020 2:26:04 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:26:04 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:37:45 AM | Rosetta@home | No tasks sent
4/23/2020 2:37:45 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:37:45 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:37:59 AM | Rosetta@home | No tasks sent
4/23/2020 2:37:59 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:37:59 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:45:16 AM | Rosetta@home | No tasks sent
4/23/2020 2:45:16 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:45:16 AM | Rosetta@home | Project requested delay of 7 seconds
4/23/2020 2:58:43 AM | Rosetta@home | No tasks sent
4/23/2020 2:58:43 AM | Rosetta@home | Tasks won't finish in time: BOINC runs 98.7% of the time; computation is enabled 99.6% of that
4/23/2020 2:58:43 AM | Rosetta@home | Project requested delay of 7 seconds

And from the Rosetta site: 51 errors. I didn't list them all, but you get the message.

Task | Work unit | Computer | Sent | Time reported or deadline | Status | Run time | CPU time | Credit | Application
1157657352 | 1041264977 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157657355 | 1041264983 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157657487 | 1041264982 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157657488 | 1041264984 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157657288 | 1041264849 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157657041 | 1041264622 | 3817060 | 22 Apr 2020, 11:16:55 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513482 | 1041141208 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513483 | 1041141207 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513487 | 1041141211 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513242 | 1041140968 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513257 | 1041141009 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513530 | 1041141260 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513536 | 1041141255 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513033 | 1041140796 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513314 | 1041141047 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513331 | 1041141065 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513376 | 1041141107 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513383 | 1041141110 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513391 | 1041141118 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513412 | 1041141143 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1157513170 | 1041140925 | 3817060 | 22 Apr 2020, 7:39:57 UTC | 23 Apr 2020, 5:14:46 UTC | Cancelled by server | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182658 | 1038298238 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154221849 | 1038330905 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta Mini v3.78 windows_x86_64
1154221850 | 1038330890 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta Mini v3.78 windows_x86_64
1154182433 | 1038298008 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182705 | 1038298285 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182706 | 1038298287 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182707 | 1038298289 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182196 | 1038297849 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182708 | 1038298291 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182711 | 1038298297 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182715 | 1038298305 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182781 | 1038298286 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182782 | 1038298288 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:44 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182785 | 1038298294 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182786 | 1038298296 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182787 | 1038298298 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182792 | 1038298308 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182546 | 1038298141 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
1154182547 | 1038298143 | 3817060 | 18 Apr 2020, 19:42:31 UTC | 21 Apr 2020, 19:42:45 UTC | Not started by deadline - canceled | 0.00 | 0.00 | --- | Rosetta v4.15 windows_x86_64
Grant (SSSF)

Joined: 28 Mar 20
Posts: 1681
Credit: 17,854,150
RAC: 22,647
Message 95197 - Posted: 23 Apr 2020, 9:35:39 UTC - in response to Message 95195.  
Last modified: 23 Apr 2020, 9:46:28 UTC

Resetting does nothing.
Oh, it does something- it makes it worse, because it throws out all the information it has got up to that point and has to start all over again from scratch.

1. Rosetta Tasks have a 3 day deadline (at present). So if you have a cache set longer than that, you will run into problems until the estimated completion times are correct. And the more projects you do, and the bigger the cache, the longer that will take to occur.
2. If you run more than a couple of projects, then you don't even need a cache. And having a cache just means it takes longer for things to settle down.
3. The default runtime for a Rosetta Task is 8 hours. If you set a longer Target CPU time, then it takes longer for things to sort themselves out.
4. The more you do with your system, the less time there is for Rosetta (and any other project) to actually process work.

One of your Tasks
Run time 1 day 21 hours 16 min 27 sec
CPU time 1 day 11 hours 51 min 43 sec
A lightly used system may have a minute's difference between CPU time & Run time for each hour of CPU time.
So for that Task, the difference should be about 36 min; you've got nearly 9.5 hours. So your system is spending a lot of time doing things other than processing BOINC work.
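
The gap between run time and CPU time quoted above can be checked with a short calculation. This is only a sketch: the two durations are the figures from the post, and the "one minute per hour of CPU time" allowance is the rule of thumb stated there, not a BOINC constant.

```python
from datetime import timedelta

# Figures quoted in the post for one task.
run_time = timedelta(days=1, hours=21, minutes=16, seconds=27)
cpu_time = timedelta(days=1, hours=11, minutes=51, seconds=43)

# Rule of thumb from the post: about 1 minute of overhead per hour of CPU time.
cpu_hours = cpu_time.total_seconds() / 3600
expected_gap = timedelta(minutes=cpu_hours)
actual_gap = run_time - cpu_time

# Share of wall-clock time the task actually spent on the CPU.
cpu_share = cpu_time / run_time

print(f"expected gap: about {expected_gap.total_seconds() / 60:.0f} min")  # about 36 min
print(f"actual gap:   {actual_gap}")                                       # 9:24:44
print(f"CPU share:    {cpu_share:.0%}")                                    # 79%
```

In other words, the task was on the CPU for roughly four fifths of the time it was nominally running, which is far below what a lightly used machine would show.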



This is what you need-
In your Account, Preferences, Preferences for this project- Rosetta@home preferences
Target CPU run time (not selected)
Make sure it is "Not selected", that way it will use the project default (8 hours).
Update Preferences to save the changes.


In your Account, Preferences, When and how BOINC uses your computer- Computing preferences
Computing
   Usage limits	
                                   Use at most 100% of the CPUs
                                   Use at most 100% of CPU time

   When to suspend	
           Suspend when computer is on battery (not selected)
               Suspend when computer is in use (not selected)
 Suspend GPU computing when computer is in use (not selected)
   'In use' means mouse/keyboard input in last 3 minutes
  Suspend when no mouse/keyboard input in last --- minutes
     Suspend when non-BOINC CPU usage is above --- %
                          Compute only between ---

   Other	
                                Store at least 0.05 days of work
                     Store up to an additional 0.02 days of work
                    Switch between tasks every 60 minutes
     Request tasks to checkpoint at most every 60 seconds

   Disk
                              Use no more than 20 GB
                                Leave at least 2 GB free
                              Use no more than 60 % of total

   Memory
          When computer is in use, use at most 95 %
      When computer is not in use, use at most 95 %
 Leave non-GPU tasks in memory while suspended (not selected)
                   Page/swap file: use at most 75 %
Update Preferences to save the changes.
Then on the BOINC Manager, on the Project tab select Rosetta, Update.


Given the number of projects you are attached to, and the size of your cache, it will take at least a week for these settings to have a significant impact on the BOINC Manager being able to make a start on meeting your Resource share settings.
It will, however, stop you from getting more work than you can possibly return, while still keeping the system busy processing BOINC work (at least when it's not busy doing other things).
Grant
Darwin NT
Raistmer

Joined: 7 Apr 20
Posts: 49
Credit: 797,293
RAC: 0
Message 95203 - Posted: 23 Apr 2020, 11:04:00 UTC - in response to Message 95197.  
Last modified: 23 Apr 2020, 11:04:26 UTC

Another way could be to change "not selected" to something smaller than 8 hours.
The minimum is 2 hours.
The lower the value, the less efficient the computation of those tasks will be. But if it allows tasks to complete in time, overall efficiency for that particular host could improve.
So one can gradually decrease that value until no more deadlines are missed.
Sid Celery

Joined: 11 Feb 08
Posts: 2125
Credit: 41,228,659
RAC: 10,982
Message 95250 - Posted: 23 Apr 2020, 22:54:04 UTC - in response to Message 95195.  
Last modified: 23 Apr 2020, 22:55:02 UTC

I get work, and it runs 24 hours a day with my CPU at 100%. Work doesn't even get canceled after being 2 days behind; it just keeps crunching all day long. That's 4 GHz at 100% with no gaming at all during that time, and apparently that is not fast enough. This is only happening with Rosetta's work. Prime work, some of which takes 15 days to finish on CPUs, still finishes on time even after I suspend it for over 2 days to play games.

BOINC checks how long the work takes, how fast my CPU is, and when the deadline is, yet it still grabs the work and never finishes it on time. Resetting does nothing.

No one likes to hear it's their fault rather than someone else's, but the 3-day deadlines are the deadlines. Your settings are, frankly, catastrophic.

You've increased your runtime to 36 hours from the default 8hrs and I don't know how many days cache you've set, but it's not consistent with how many projects you run on that PC - even if it's only Rosetta.

Boinc does the scheduling, not Rosetta. Keep the cache at a maximum of 1 day total until Boinc works out how long tasks run for you, and return your runtime to 8hrs (either explicitly, or by leaving it unset so that it follows any future change to the default).

Change those two things, leave it untouched for at least a week, and everything will work out fine.
Sandman192

Joined: 22 Sep 07
Posts: 16
Credit: 2,018,819
RAC: 0
Message 95281 - Posted: 24 Apr 2020, 8:19:41 UTC

For the people who said:
Another way could be to change "not selected" to something smaller than 8 hours.

Then why would Rosetta give you an option to set it for 1 day 12 hours? If a 4 GHz CPU can't crunch it on time, then why is that an option? That would mean almost no one can, because only a handful of computers run at 5 GHz (if it was a speed issue to begin with). Plus, BOINC knows how fast a computer is and compensates for the long work, even for slow computers, or they just won't get the work.

Oh, it does something- it makes it worse, because it throws out all the information it has got up to that point and has to start all over again from scratch.

I mean resetting does nothing to fix the strange "won't finish on time" problem.

I don't know how many days cache you've set or Given the number of projects you are attached to, and the size of your cache

Cache is irrelevant since it does not affect whether work finishes on time. I have had no problems for years with work not finishing on time. That includes some WUs from projects other than Rosetta that took 15 days to finish. That was until Rosetta came with this option for changing the "Target CPU run time". That's with 10 days of work plus 10 more of additional work. Again, NO problems for years until now.

Number of projects I'm attached to? I gave you the number of errors, not the number of projects. All you see is how many got errors because they wouldn't finish. How do you know how many I'm attached to? I only have Rosetta set to grab work. Anyway, that is still irrelevant.

Boinc does the scheduling, not Rosetta

You're right. But Rosetta DOES add the timelines to their work that BOINC has to rely on.

You've increased your runtime to 36 hours from the default 8hrs

Yes, because I want it that way. It's an option. If they're giving us an option that we can't use, then why give it to us in the first place?


Ok, right now I'm running 1 day 12 hours and it seems it will finish on time tomorrow. So I don't know why only 51 of them failed to finish on time. After that, all I have in my cache is 6 1/2 hour deadline work for Rosetta.
Grant (SSSF)

Joined: 28 Mar 20
Posts: 1681
Credit: 17,854,150
RAC: 22,647
Message 95288 - Posted: 24 Apr 2020, 9:05:25 UTC - in response to Message 95281.  
Last modified: 24 Apr 2020, 9:22:29 UTC

Another way could be to change "not selected" to something smaller than 8 hours.

Then why would Rosetta give you an option to set it for 1 day 12 hours? If a 4 GHz CPU can't crunch it on time, then why is that an option? That would mean almost no one can, because only a handful of computers run at 5 GHz (if it was a speed issue to begin with). Plus, BOINC knows how fast a computer is and compensates for the long work, even for slow computers, or they just won't get the work.
The reason there is an option is that some people will do 10 projects and some people will do one. Some people will set the Resource share to be even, others will have it entirely in favour of 1 project. An option that is suitable for just 1 project is almost never remotely suitable if you have more than just a couple of projects.
Just because it's an option doesn't mean you need to change it; the default is 8 hours, because that is what the project would prefer people used.
If you choose to use longer time, then you may need to change other settings in order to make it possible.



Oh, it does something- it makes it worse, because it throws out all the information it has got up to that point and has to start all over again from scratch.
I mean resetting does nothing to fix the strange "won't finish on time" problem.
For the very reasons I pointed out. As BOINC does work (or fails to finish it), it records that information and makes changes based on it to (eventually) stop the problem from occurring. But if you keep throwing out that information, it has to start all over again from scratch.
So you just keep prolonging the problems you are having.



Cache is irrelevant since it does not affect whether work finishes on time. I have had no problems for years with work not finishing on time. That includes some WUs from projects other than Rosetta that took 15 days to finish. That was until Rosetta came with this option for changing the "Target CPU run time". That's with 10 days of work plus 10 more of additional work. Again, NO problems for years until now.
Your excessive cache is the reason you are having the problem. Rosetta has very short deadlines- they are 3 days for the present Tasks. It has no history for how long it takes you to complete work, so the Estimated completion times aren't even close to accurate. So you get more work than you could even possibly do. So you miss deadlines.
It is that simple.

The point of a cache is so you don't run out of work. As you have dozens of projects you are attached to, there is no chance of all of those projects being down or not having work at the same time. So there is no need for any sort of cache at all. Even 0.5 days & 0.01 extra days would be more than is necessary, but at least it would stop you from missing deadlines.



How do you know how many I'm attached to? I only have Rosetta set to grab work. Anyway, that is still irrelevant.
It is not irrelevant; it is the reason you are having problems: lots of projects + excessive cache = lots of problems while the BOINC Manager sorts things out. I saw how many projects you are attached to when I went to see how many errors your system was having.



You've increased your runtime to 36 hours from the default 8hrs
Yes, because I want it that way. It's an option. If they're giving us an option that we can't use, then why give it to us in the first place?
They also give the option of 2 hours. It's an option, why not select it?
As I pointed out earlier, the option is there because there are many different projects. What is good for a project with short deadlines is not necessarily good for a project with long deadlines. What might be good for a single project is not good if there are lots of projects.
What might be good for a system with certain options selected, won't be a suitable choice to use if other options are selected.



Ok, right now I'm running 1 day 12 hours and it seems it will finish on time tomorrow. So I don't know why only 51 of them failed to finish on time. After that, all I have in my cache is 6 1/2 hour deadline work for Rosetta.
Because some Tasks will finish earlier than their Target CPU time, and some will take longer (up to 10 hours).

Because of the number of projects you run, the excessive cache you use, and the change from the default Target CPU run time, the simple fact of the matter is that the Estimated times are wrong, and will stay that way for ages. Until they finally match your selected Target CPU run time, you will continue to miss deadlines.




EDIT- and I notice you didn't address the fact that your system does not spend 100% of its time processing work.
Grant
Darwin NT
Sid Celery

Joined: 11 Feb 08
Posts: 2125
Credit: 41,228,659
RAC: 10,982
Message 95303 - Posted: 24 Apr 2020, 13:14:34 UTC - in response to Message 95281.  

Then why would Rosetta give you an option to set it for 1 day 12 hours? If a 4 GHz CPU can't crunch it on time, then why is that an option? That would mean almost no one can, because only a handful of computers run at 5 GHz (if it was a speed issue to begin with). Plus, BOINC knows how fast a computer is and compensates for the long work, even for slow computers, or they just won't get the work.

36hrs runtime is fine, but only at the right time.
It seems that when a new version of Rosetta comes out, whatever the runtime set, the tasks come down on the assumed basis of (someone said) 4.5hr runtimes. This is bad enough when the default 8hr runtime is set. You say you have a 10+10hr cache set, which is usually no problem - Boinc will initially bring down what it thinks are 50 x 4.5hr tasks for your 12 cores - but instead of completing them in 20hrs, if your runtime is actually set at 36hrs, they'll take 150hrs to complete, long after the 3-day deadlines. This is why so many of your tasks got cancelled and will continue to until Boinc recognises and factors in your real runtime. At default 8hr runtime they'll take 33 hours, which is within deadline. A quick calculation reveals, with the size of cache you have, a 16hr runtime is the maximum you can set to complete all tasks within deadline. And when Boinc eventually reflects 16hr runtimes, only then can you increase your runtime again to get nearer your preference.
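
The arithmetic in that paragraph can be reproduced with a few lines. This is a rough sketch using only the figures quoted in the post (12 cores, about 50 cached tasks, 3-day deadline); it averages the work evenly over all cores, as the post does, and ignores other projects, idle time and the fact that tasks run in whole batches. The function names are made up for illustration.

```python
# Assumed figures, all taken from the post above.
CORES = 12
TASKS_IN_CACHE = 50          # what BOINC fetched for the cache
DEADLINE_HOURS = 3 * 24      # 3-day Rosetta deadline


def hours_to_clear(runtime_hours, tasks=TASKS_IN_CACHE, cores=CORES):
    """Wall-clock hours to finish every cached task, averaged over all cores."""
    return tasks * runtime_hours / cores


def max_runtime_hours(tasks=TASKS_IN_CACHE, cores=CORES, deadline=DEADLINE_HOURS):
    """Longest per-task runtime that still clears the cache before the deadline."""
    return deadline * cores / tasks


for runtime in (4.5, 8, 16, 36):
    total = hours_to_clear(runtime)
    verdict = "ok" if total <= DEADLINE_HOURS else "misses deadline"
    print(f"{runtime:>4} h runtime -> {total:6.1f} h to clear cache ({verdict})")

print(f"upper bound on runtime: {max_runtime_hours():.2f} h")
```

Under these assumptions the 36hr setting needs 150 hours against a 72 hour deadline, the 8hr default needs about 33 hours, and the upper bound comes out a little above 17 hours, which is consistent with the 16hr figure given above.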

I don't know how many days cache you've set or Given the number of projects you are attached to, and the size of your cache

Cache is irrelevant since it does not affect whether work finishes on time. I have had no problems for years with work not finishing on time. That includes some WUs from projects other than Rosetta that took 15 days to finish. That was until Rosetta came with this option for changing the "Target CPU run time". That's with 10 days of work plus 10 more of additional work. Again, NO problems for years until now.

On cache, see above.
On what works with other projects, Rosetta's requirements are independent of any other project and every other project is independent of Rosetta. And your own settings have to work with all of them.
I may have misread what you've written. Are you saying you have a 10 DAY minimum cache plus 10 DAYS more?
Then it's very simple. Rosetta deadlines are 3 days. If you set 10+10 days cache, every task will fail to meet deadline the minute you download them and Boinc will have to run every Rosetta task high priority to meet deadline, while every other task will cancel for failing to meet deadline. Those settings are completely incompatible with Rosetta, independent of anything that works with any other project. It may also mean no other project's tasks on each host will ever get a chance to run at all.
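
The mismatch between a cache measured in days and a 3-day deadline can be put in one comparison. This is a deliberately simplified model, not BOINC's actual scheduler: it assumes the client tries to hold the minimum buffer plus the additional buffer, and that every task in the buffer is due a fixed number of days after download. The function name is hypothetical.

```python
def cache_fits_deadline(min_days, extra_days, deadline_days):
    """True if the whole work buffer could be worked through before the deadline.

    Simplified model: the client holds min_days + extra_days of work,
    and every task in that buffer is due deadline_days after download.
    """
    return (min_days + extra_days) <= deadline_days


print(cache_fits_deadline(10, 10, 3))      # 10+10 day cache from the thread -> False
print(cache_fits_deadline(0.05, 0.02, 3))  # small cache suggested earlier -> True
```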

Number of projects I'm attached to? I gave you the number of errors, not the number of projects. All you see is how many got errors because they wouldn't finish. How do you know how many I'm attached to? I only have Rosetta set to grab work. Anyway, that is still irrelevant.

You're right, I don't. But you do, so make the adjustments that will work.

You've increased your runtime to 36 hours from the default 8hrs

Yes, because I want it that way. It's an option. If they're giving us an option that we can't use, then why give it to us in the first place?

You can make any choices you want, but 1) not immediately when a new program version arrives, and/or 2) if you make incompatible choices, they're your choices to be incompatible.

No one likes to hear it's their fault rather than someone else's, but the 3-day deadlines are the deadlines. Your settings are, frankly, catastrophic.


©2024 University of Washington
https://www.bakerlab.org