Message boards : Number crunching : minirosetta 2.15
Author | Message |
---|---|
Yifan Song Volunteer moderator Project developer Project scientist Send message Joined: 26 May 09 Posts: 62 Credit: 7,322 RAC: 0 |
minirosetta is updated to add new protocols for symmetrical oligomers and membrane proteins. |
Murasaki Send message Joined: 20 Apr 06 Posts: 303 Credit: 511,418 RAC: 0 |
Here is some more information about Oligomers and Membrane proteins from Wikipedia for those who are interested. If I hang around these forums for another 20 years I might learn enough Biochemistry to consider a pre-retirement career change. |
Warped Send message Joined: 15 Jan 06 Posts: 48 Credit: 1,788,185 RAC: 0 |
Does this version ignore the limit of 100 models per workunit? I have a workunit which has reached 300 models and another has done 200. Both are only about 20% complete. Warped |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
The 100 model limit is only for certain protocols that are expected to complete models very rapidly. The main reason (to my knowledge anyway) for the imposition of that limit was due to large upload file sizes. Do you happen to know how large the uploads are getting? Size is shown in the transfers tab. But you'd have to catch one before it is sent (suspend network activity for a short time until one completes would be a simple way to orchestrate keeping one around). Rosetta Moderator: Mod.Sense |
Warped Send message Joined: 15 Jan 06 Posts: 48 Credit: 1,788,185 RAC: 0 |
Thanks for the response, Mod.Sense. My concern was that I had workunits which needed to be aborted. I have some time yet before my first 2.15 task completes as I have selected a 10-hour run time option. I'll try to catch the upload but may miss it. |
Warped Send message Joined: 15 Jan 06 Posts: 48 Credit: 1,788,185 RAC: 0 |
The upload is only 231KB, which is insignificant. |
diederiks Send message Joined: 13 Oct 05 Posts: 2 Credit: 740,392 RAC: 0 |
Today i had me first WU https://boinc.bakerlab.org/rosetta/workunit.php?wuid=333863670 with V2.15, i see 1,1GB memmory beeing used, is this normal? And if so, why does is say 512MB minimal memmory requirment on the site? I have 4 GB machine but with 2 other WU form other projects that exceed 1GB memory requirments, i have to start watching these mmemory requirments to stil do normal work with the machine. |
AtHomer Send message Joined: 26 Jan 10 Posts: 13 Credit: 7,145,229 RAC: 0 |
I have this WU which uses about 900 MB of RAM, but cpu usage is down to 0%: T0611_t4_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22276_368_0 Yesterday I had a WU with the exact same behaviour, eating lots of RAM, but no cpu use, so it must have crashed or something... Pausing and resuming does not fix the problem by the way. |
Jochen Send message Joined: 6 Jun 06 Posts: 133 Credit: 3,847,433 RAC: 0 |
This one crashed yesterday after 40 minutes: T0528_t4_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22246_78_0 When it crashed the memory consumption was 1.5 GB. The message says 'Directory not found'.
|
jumpo64 Send message Joined: 23 Mar 06 Posts: 1 Credit: 334,274 RAC: 0 |
Yeah, I have one Rosetta processing using 1.5 Gb of RAM at 2.3% completion and another using 1.15 Gb of RAM at 16% completion. All together Rosetta is using 70% of my 8 gigs of RAM. Granted I have a 6-core processor and therefore 6 threads running, but still, that's a lot of RAM. The biggest RAM hogs currently, are named T0523_t4_rs_stg0_lrlxjcst_t000_casp9_SAVE_ALL_OUT_22242_375_0 and T0520_t4_rs_stg0_lrlxjcst_t000_casp9_SAVE_ALL_OUT_22239_376_0 I was away for the weekend. Looking back at my results I show over 20 results that ended as "Compute Error" since 2.15, most of which came Friday or after. Never had one Compute Error in any work unit before that. |
Ohelig Send message Joined: 2 May 10 Posts: 2 Credit: 84,515 RAC: 0 |
I also appear to be having problems with WU's starting with T05**. They end up using almost 1.3GB of RAM. |
Ademers Send message Joined: 20 Oct 09 Posts: 2 Credit: 131,161 RAC: 0 |
I also appear to be having problems with WU's starting with T05**. They end up using almost 1.3GB of RAM. I think i have a problem with the T0549_t4_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22255_997 he run since 21 hours and reach 0.099% when i look at the properties, the calculating time is only 25 second !!! |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
T0605_t2_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22177_296_1 ERROR: Error in traceback: pointer doesn't go anywhere! ERROR:: Exit from: ....srccoresequenceAligner.cc line: 79 BOINC:: Error reading and gzipping output datafile: default.out called boinc_finish T0605_t2_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22177_1921_0 RROR: Error in traceback: pointer doesn't go anywhere! ERROR:: Exit from: ....srccoresequenceAligner.cc line: 79 BOINC:: Error reading and gzipping output datafile: default.out called boinc_finish T0528_tj_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_21880_4843_2 <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The system cannot find the path specified. (0x3) - exit code 3 (0x3) </message> |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Ademers, double check the status shown in BOINC for that task. Does BOINC say it is "running"? And does the task manager show some other, higher priority task consuming your available CPU? But otherwise that sounds like another issue that we see crop up once and a while. Best way to move it along seems to be to exit (not close) and restart BOINC. Rosetta Moderator: Mod.Sense |
Ademers Send message Joined: 20 Oct 09 Posts: 2 Credit: 131,161 RAC: 0 |
Good, I exit from BOINC and restart and the application is at 1.8% in 7 minutes and continue to go up !!! Thank you Mod.Sense |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
This one failed after 20sec. https://boinc.bakerlab.org/rosetta/workunit.php?wuid=336639766 T0524_t3_rs_stg0_lrlxjcst_t000__casp9_SAVE_ALL_OUT_22207_1756_0 <core_client_version>6.2.14</core_client_version> <![CDATA[ <message> process got signal 11 </message> <stderr_txt> |
Brian Priebe Send message Joined: 27 Nov 09 Posts: 16 Credit: 33,020,247 RAC: 0 |
I too am seeing an unusually high number of errors on 3 different machines (and 3 different operating systems) for Rosetta 2.15. 16 WU's in the last few days failed on various errors: "The system cannot find the path specified. (0x3) - exit code 3 (0x3)" "Reason: Access Violation (0xc0000005) at address 0x00581B5C write attempt to address 0x00000024" "Incorrect function. (0x1) - exit code 1 (0x1)" (many different root causes per detailed error messages in the log. <ERROR: Error in traceback: pointer doesn't go anywhere!> occurred multiple times.) "Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x759AB727" |
ingebrigtsen685 Send message Joined: 2 Jun 09 Posts: 1 Credit: 6,015,983 RAC: 140 |
I am repeatedly getting this message since the upgrade: "Microsoft Visual C++ Runtime Library Runtime Error ....Bakerlab.orgminirosetta_2.15_windows_intelx86.exe This application has requested the Runtime to terminate it in an unusual way. Please contact the application's support team for more information." It is difficult to remove the message which sometimes locks up the computer. What can be done to prevent this? |
Mad_Max Send message Joined: 31 Dec 09 Posts: 209 Credit: 26,062,938 RAC: 16,868 |
+1 to problems with "Txxxx_" tasks on minirosetta 2.15. Some of them crash and others consume very higt amount of RAM (like 800-1400 Mb per task) I think crashes was due to lack of memory too - when two such tasks run concurrently (have 2 Gb of RAM on 2 core CPU) |
dlsqbinder Send message Joined: 23 Nov 05 Posts: 3 Credit: 371,859 RAC: 0 |
I too have recently seen messages indicating shortage of virtual memory, so have suspended Rosetta. |
Message boards :
Number crunching :
minirosetta 2.15
©2024 University of Washington
https://www.bakerlab.org