300+ TeraFLOPS sustained!

Author	Message
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0	Message 79861 - Posted: 8 Apr 2016, 20:37:05 UTC - in response to Message 79860. If you are curious, you might install a VM and then install Mint on it. You can compare the performance of the 32-bit windows binary with the 64-bit Linux version. The last time I tried this, the VM was about 10% faster. Thanks. I know it is hard to measure on Rosetta, which is why I asked. ID: 79861 · Rating: 0 · rate: / Reply Quote

Chilean Send message Joined: 16 Oct 05 Posts: 711 Credit: 26,694,507 RAC: 0	Message 79864 - Posted: 9 Apr 2016, 12:19:34 UTC - in response to Message 79860. Last modified: 9 Apr 2016, 12:20:15 UTC While we are on the subject, I am presently on Win7 64-bit. But I could go to Linux Mint 18 when it comes out. Is there an advantage? 8-} hit POST instead of PREVIEW ... If you are curious, you might install a VM and then install Mint on it. You can compare the performance of the 32-bit windows binary with the 64-bit Linux version. The last time I tried this, the VM was about 10% faster. Why isn't there a 64-bit Windows version again? ID: 79864 · Rating: 0 · rate: / Reply Quote

rjs5 Send message Joined: 22 Nov 10 Posts: 274 Credit: 23,730,845 RAC: 0	Message 79865 - Posted: 9 Apr 2016, 18:25:02 UTC - in response to Message 79864. Last modified: 9 Apr 2016, 18:26:43 UTC While we are on the subject, I am presently on Win7 64-bit. But I could go to Linux Mint 18 when it comes out. Is there an advantage? 8-} hit POST instead of PREVIEW ... If you are curious, you might install a VM and then install Mint on it. You can compare the performance of the 32-bit windows binary with the 64-bit Linux version. The last time I tried this, the VM was about 10% faster. Why isn't there a 64-bit Windows version again? I think 64-bit support was added to Visual Studio 2005. Windows 32-bit was all Rosetta could build before 2005. Rosetta would still have to build a 32-bit Windows version for all the 32-bit systems running Rosetta and I suspect that early 64-bit Windows versions were not faster, so why bother. That "2005 finding" probably still circulates in developer circles today. The much more interesting question is: "How do you tell if one Rosetta binary "A" is faster or slower than another Rosetta binary "B"?" That is a VERY tough problem to solve and past decisions have been wrong. I still think that a 64-bit binary should be faster than a 32-bit version, but my empirical results have not shown that for Rosetta. ID: 79865 · Rating: 0 · rate: / Reply Quote

Chilean Send message Joined: 16 Oct 05 Posts: 711 Credit: 26,694,507 RAC: 0	Message 79868 - Posted: 10 Apr 2016, 14:30:21 UTC - in response to Message 79865. While we are on the subject, I am presently on Win7 64-bit. But I could go to Linux Mint 18 when it comes out. Is there an advantage? 8-} hit POST instead of PREVIEW ... If you are curious, you might install a VM and then install Mint on it. You can compare the performance of the 32-bit windows binary with the 64-bit Linux version. The last time I tried this, the VM was about 10% faster. Why isn't there a 64-bit Windows version again? I think 64-bit support was added to Visual Studio 2005. Windows 32-bit was all Rosetta could build before 2005. Rosetta would still have to build a 32-bit Windows version for all the 32-bit systems running Rosetta and I suspect that early 64-bit Windows versions were not faster, so why bother. That "2005 finding" probably still circulates in developer circles today. The much more interesting question is: "How do you tell if one Rosetta binary "A" is faster or slower than another Rosetta binary "B"?" That is a VERY tough problem to solve and past decisions have been wrong. I still think that a 64-bit binary should be faster than a 32-bit version, but my empirical results have not shown that for Rosetta. Is it possible to get a "test" WU (or bunch of them) that have a certain limit to the amount of models it'll produce and a fixed seed number, so that everytime you execute it, it run thru the same energy pathway, creating the same models, etc. Then run this "test" WU thru different binaries... and compare. ID: 79868 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79869 - Posted: 10 Apr 2016, 16:40:08 UTC Yes rjs5, I was going to make the same suggestion as Chilean. Creating a benchmark WU for various types, and running each for a given number of models in a given environment and comparing CPU and wall-clock time would seem to be the only means of pinning down performance. Have you been able to create such WUs that you can reuse time and again? I should think you could do so just by capturing the downloads. And DK could help you get a complete set of all of the various types. Remind him that BOINC server has means of assigning tasks to specific hosts (although, come to think of it, that may require settings they would not be using, perhaps they could send to you via Ralph). Rosetta Moderator: Mod.Sense ID: 79869 · Rating: 0 · rate: / Reply Quote

rjs5 Send message Joined: 22 Nov 10 Posts: 274 Credit: 23,730,845 RAC: 0	Message 79871 - Posted: 10 Apr 2016, 17:50:22 UTC - in response to Message 79869. Yes rjs5, I was going to make the same suggestion as Chilean. Creating a benchmark WU for various types, and running each for a given number of models in a given environment and comparing CPU and wall-clock time would seem to be the only means of pinning down performance. Have you been able to create such WUs that you can reuse time and again? I should think you could do so just by capturing the downloads. And DK could help you get a complete set of all of the various types. Remind him that BOINC server has means of assigning tasks to specific hosts (although, come to think of it, that may require settings they would not be using, perhaps they could send to you via Ralph). I asked David for just that when I started my performance work. He provided me two workloads. I like the jobs to run long enough to spend lots of time in the compute sections so the measurements are representative of the job. It took a couple of emails to make my "fixed amount of work" request clear. Rosetta is a "fixed time" project. You tell Rosetta how long you want each job to take and the job terminates when the time is reached OR the task reaches completion. This makes it virtually IMPOSSIBLE for the Rosetta volunteer to really tell the difference between machines. I still haven't been able to figure out what is a "good" or "bad" Rosetta job result. 8-) I think the "fixed time" model is just wrong ... just my opinion as a performance engineer. I would prefer the jobs be issued without the timer .... as long as they finish. It would not take long to figure out the Rosetta pf* jobs are long and the rb* jobs shorter. That said, if anyone has a Rosetta workload that is typical of their work AND can be configured to run about 1 hour, I would be happy to add it to my performance analysis work. Just message me and we can figure out how to transfer the files. ID: 79871 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79873 - Posted: 10 Apr 2016, 20:59:51 UTC Yes, you'd also either have to run a fixed number of models, or double check how many models your benchmark ran (I'd think DK has ways to do that too). The current settings were setup to help users control when their machines were running, not as a performance benchmark. Ultimately, a user compares machines by granted credit, and RAC. But those are relative to BOINC's benchmarks and to other user's results. So, yes, a moving target so far as trying to measure performance traits of one run to another. Rosetta Moderator: Mod.Sense ID: 79873 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2155 Credit: 12,998,581 RAC: 8,218	Message 79876 - Posted: 13 Apr 2016, 7:17:03 UTC - in response to Message 79871. I like the jobs to run long enough to spend lots of time in the compute sections so the measurements are representative of the job. It took a couple of emails to make my "fixed amount of work" request clear. Rosetta is a "fixed time" project. You tell Rosetta how long you want each job to take and the job terminates when the time is reached OR the task reaches completion. This makes it virtually IMPOSSIBLE for the Rosetta volunteer to really tell the difference between machines. I don't think they'll make big changes to rosetta infrastructure now, with the beginning of CASP12. ID: 79876 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2155 Credit: 12,998,581 RAC: 8,218	Message 79877 - Posted: 13 Apr 2016, 7:18:08 UTC - in response to Message 79871. That said, if anyone has a Rosetta workload that is typical of their work AND can be configured to run about 1 hour, I would be happy to add it to my performance analysis work. Just message me and we can figure out how to transfer the files. I like testing new sw. What do you need? A VM with linux? ID: 79877 · Rating: 0 · rate: / Reply Quote

rjs5 Send message Joined: 22 Nov 10 Posts: 274 Credit: 23,730,845 RAC: 0	Message 79879 - Posted: 13 Apr 2016, 14:59:33 UTC - in response to Message 79877. That said, if anyone has a Rosetta workload that is typical of their work AND can be configured to run about 1 hour, I would be happy to add it to my performance analysis work. Just message me and we can figure out how to transfer the files. I like testing new sw. What do you need? A VM with linux? thanks much! I was soliciting workloads from developers/researchers too. 8-) When isolating performance problems, you start with the program to optimize. That program will expose bottlenecks in the code path but not in areas of Rosetta that are not executed. If a Rosetta developer or user supplies me with his sample workload, then the bottlenecks affecting his work will be exposed and possibly optimized. He gets results faster. ID: 79879 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79880 - Posted: 14 Apr 2016, 4:00:02 UTC Yes, I should think you need a task for each of the various protocols. Ideally a task that is created to terminate after a fixed number of models are completed (or least ends before the runtime preference). I'm not certain how many protocols there are. Rosetta Moderator: Mod.Sense ID: 79880 · Rating: 0 · rate: / Reply Quote

Timo Send message Joined: 9 Jan 12 Posts: 185 Credit: 45,662,635 RAC: 0	Message 79889 - Posted: 18 Apr 2016, 21:31:22 UTC Last modified: 18 Apr 2016, 21:31:50 UTC ... this may be due to some backed-up submissions, but it's still cool to see... compute power clocking in at 407 TeraFLOP/S today! Almost halfway to a PetaFLOP.. ID: 79889 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2155 Credit: 12,998,581 RAC: 8,218	Message 79892 - Posted: 19 Apr 2016, 12:28:59 UTC - in response to Message 79889. ... this may be due to some backed-up submissions, but it's still cool to see... compute power clocking in at 407 TeraFLOP/S today! Almost halfway to a PetaFLOP.. I don't know if it is a new Charity Engine campaign or these are optimizations... ID: 79892 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79893 - Posted: 19 Apr 2016, 13:31:46 UTC Optimizations, as they are made, do not really reflect in the project TFlops estimate. This because in the end, the estimate is based on the BOINC benchmark and number of hours running. Not trying to be negative about optimization. Just pointing out that you will not see it in this way. Granted credit is based on the number of models completed, and yes, with optimizations, you'll tend to complete more models per unit time... but so will everyone else. So you're credit and therefore TFlops impact should only prove that the BOINC benchmark differs from a Rosetta workload. Yes, there have been over 1000 new users per day for a couple of weeks now, and over 2000 today. The only time I've seen numbers like that was when Charity Engine was actively serving Rosetta tasks. This past week has also been some of the only times I've seen the server status page show over a million tasks in progress. I saw 1.6m one day. Then an hour later it was down to 300,000. Another hour later, was probably over a million again. That's a LOT of work moving around. Rosetta Moderator: Mod.Sense ID: 79893 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79894 - Posted: 19 Apr 2016, 13:33:57 UTC Last modified: 19 Apr 2016, 13:34:27 UTC Check that, over 3,500 new hosts today! Rosetta Moderator: Mod.Sense ID: 79894 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2155 Credit: 12,998,581 RAC: 8,218	Message 79896 - Posted: 19 Apr 2016, 14:24:15 UTC - in response to Message 79893. Yes, there have been over 1000 new users per day for a couple of weeks now, and over 2000 today. The only time I've seen numbers like that was when Charity Engine was actively serving Rosetta tasks. This past week has also been some of the only times I've seen the server status page show over a million tasks in progress. I saw 1.6m one day. Then an hour later it was down to 300,000. Another hour later, was probably over a million again. That's a LOT of work moving around. :-O Ready for Casp!!! :-) ID: 79896 · Rating: 0 · rate: / Reply Quote

Chilean Send message Joined: 16 Oct 05 Posts: 711 Credit: 26,694,507 RAC: 0	Message 79899 - Posted: 20 Apr 2016, 1:51:12 UTC - in response to Message 79893. Optimizations, as they are made, do not really reflect in the project TFlops estimate. This because in the end, the estimate is based on the BOINC benchmark and number of hours running. Not trying to be negative about optimization. Just pointing out that you will not see it in this way. Granted credit is based on the number of models completed, and yes, with optimizations, you'll tend to complete more models per unit time... but so will everyone else. So you're credit and therefore TFlops impact should only prove that the BOINC benchmark differs from a Rosetta workload. Yes, there have been over 1000 new users per day for a couple of weeks now, and over 2000 today. The only time I've seen numbers like that was when Charity Engine was actively serving Rosetta tasks. This past week has also been some of the only times I've seen the server status page show over a million tasks in progress. I saw 1.6m one day. Then an hour later it was down to 300,000. Another hour later, was probably over a million again. That's a LOT of work moving around. So, the only way to "see" whatever speed-up comes from new application optimizations is our RAC? ID: 79899 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 79901 - Posted: 20 Apr 2016, 3:48:23 UTC Last modified: 20 Apr 2016, 3:50:20 UTC No, host RAC suffers the same fate as the project TFlops (which is essentially the aggregation of all of the host RACs). This is part of why rjs5 has been saying it is difficult to estimate how a given improvement might actually effect overall efficiency. It's sorta like a factory that makes candy bars. And the same number of bars come off the line each day, only now several varieties are slightly larger than they used to be. But that's really hard to tell because on different days, the production consists of differing ratios of any of the ten kinds of candy bars the factory makes. Optimizations will be a great thing, and will certainly help the science behind the project, but you won't see a blip on a chart. In fact, part of what rjs5 has been asking for is a computational run, that can be repeated, to MAKE a chart, so there is something to compare to after making some changes. But even then all you can say is that one sample set, for a specific type of candy bar, is now showing an x% increase in average size. But it is possible that the change has adverse side effects on the other candy bar types, or on system environments that differ from that used for the measurements. The answer to any performance question is always "it depends". Rosetta Moderator: Mod.Sense ID: 79901 · Rating: 0 · rate: / Reply Quote

sgaboinc Send message Joined: 2 Apr 14 Posts: 282 Credit: 208,966 RAC: 0	Message 79920 - Posted: 24 Apr 2016, 3:01:23 UTC Last modified: 24 Apr 2016, 3:01:48 UTC 400Tflops! Server Status as of 24 Apr 2016 2:38:57 UTC [ Scheduler running ] Total queued jobs: 2,347,476 In progress: 692,514 Successes last 24h: 372,980 Users (last day ) : 807,306 (+2475) Hosts (last day ) : 1,721,021 (+2585) Credits last 24h : 41,980,059 Total credits : 42,338,375,367 TeraFLOPS estimate: 419.801 if this results partly the effort from ris thanks ! :D ID: 79920 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2155 Credit: 12,998,581 RAC: 8,218	Message 79929 - Posted: 25 Apr 2016, 7:53:40 UTC - in response to Message 79920. 400Tflops! if this results partly the effort from ris thanks ! :D Now over 450!! ID: 79929 · Rating: 0 · rate: / Reply Quote