Posts by Georgi Vidinski
1) Message boards : News : New CUDA application for Windows x64 Release (Message 6624)
Posted 29 Apr 2020 by Georgi Vidinski
Hi guys,

Yes, there was an issue with addressing multiple CUDA devices. It was fixed at the beginning of April.

I talked to Daniel, who has already confirmed that the application is now distributed correctly across multiple CUDA devices.

Thanks.
Georgi
2) Message boards : Problems and bug reports : nVidia 1660 TI fails tasks instantly (Message 6573)
Posted 19 Apr 2020 by Georgi Vidinski
Hi Dingo,

It would be interesting for us if you could provide some more information on how you measured the execution time "before" the new cuda102 application, given that the old one doesn't support the GTX 1660 Ti.

Should I make my statement here, regarding the speed of the applications, sticky?

Georgi
3) Message boards : Number crunching : AMD Windows GPU application? (Message 6561)
Posted 15 Apr 2020 by Georgi Vidinski
Hi guys,

As I said in one of my posts here:

Creating an OpenCL application to support AMD GPUs has been on our roadmap for years. Even now there is ongoing development on it. Although there has been some turbulence over the years, with the work postponed and picked up again, we are doing our best to make it happen. Once we have a solid PoC application, we will be glad to provide it to our contributors.


Cool things take time.

Thanks.
Georgi
4) Message boards : Problems and bug reports : Any task using SSE2 and SSE3 versions of app fails instantly (Message 6560)
Posted 15 Apr 2020 by Georgi Vidinski
Hi Ferdinand-QVC,

Thank you for your feedback. It seems the server has flagged that particular computer as unsuitable for the newer version. From now on it will receive the older application that is already proven to work on it.

Thanks.
Georgi
5) Message boards : Number crunching : New plan_class versions (Message 6542)
Posted 5 Apr 2020 by Georgi Vidinski
Hi guys,

Jan Vaclavik is absolutely right. After the new versions of the Period search applications were introduced, new plan classes were needed as well. The server now needs to calculate estimated run times for all of those applications. That's why it is sending different applications to the same host.

mmonnin, don't worry. All estimated times will settle at their actual levels pretty soon. And you got it right: there are now specific versions for all major OS builds. That's the correct way.

I've posted some information about the execution times, and about how the server decides what to send and where, in this thread.

Georgi
6) Message boards : Windows : NVIDIA GPU: Upgrade to the latest driver... (Message 6541)
Posted 5 Apr 2020 by Georgi Vidinski
Hi Beyond,

Thank you for your feedback.

On the one hand, the completion time of a WU is not a constant value. Many factors have an impact on it. The number of light curves (LCs), or the amount of data per LC, is one of them. The actual size of the WU is not decisive. A large WU can be done in a small number of iterations, while a small one could take twice as many iterations or more. There is no rule. It takes what it takes.

On the other hand, there are rules on the server. A very complicated algorithm works behind the scenes and may decide to send heavier WUs to a proven faster host while at the same time sending lighter ones to slower machines. Its strategy changes constantly, based on many factors and on changes in host behavior. If you are interested, BOINC has detailed documentation available online.

And last but not least, it is all about precision. This is not a race.
Old applications suffer from bugs, so for now we will keep them only for backward compatibility, for some reasonable time, until users update their software.
Keep in mind that old applications could now generate some faulty output results.

This is why we encourage our contributors to upgrade to the latest software platforms and driver packages if possible.

Thanks.
Georgi
7) Message boards : Windows : NVIDIA GPU: Upgrade to the latest driver... (Message 6532)
Posted 3 Apr 2020 by Georgi Vidinski
Hi Beyond,

You can give it a try now, as we've made some changes to the project.
It would be interesting to know whether it is working or not.

I am not going to argue with you about the error rate. I'll not quote your words either. I find such an approach to be of no use.
Anyway.
Yes, we have had some issues over the last few days, and hopefully they will be resolved one after another. Sometimes, when we try to achieve better results or need to implement support for new hardware or a new data structure, issues come along as well. We are trying to fix them as fast as possible, so any help from our contributors is more than welcome.
I asked you if you could give us some more detailed info on what you measured. Instead you gave us a patronizing speech.
Correct me if I'm wrong, but I'm getting mixed feelings here. If you feel better with some other project, please, nobody is forced to stay.

Here we are all one big team - Scientists, Administrators, Developers, Contributors - and we are helping each other.

Now to the problem. Every application has its own specifics. Period_search is no exception, and in order to work correctly it needs to be controlled by the Host system. That can lead to some CPU usage - usually a single logical processor - which is totally normal and which depends on the CPU or GPU vendor, architecture, generation, software libraries and, last but not least, the specifics of the code base.

Have a good day.
Georgi
8) Message boards : Problems and bug reports : Any task using SSE2 and SSE3 versions of app fails instantly (Message 6529)
Posted 3 Apr 2020 by Georgi Vidinski
Hi Ferdinand-QVC,
Thank you for your patience and your feedback.

May I ask you to try again? We've made some changes in the project. This time everything should be OK.
You will first need to do a "Reset project" from the "Project" tab in BOINC Manager.
Then you may also need to do an "Update" once or twice.

Thanks,
Georgi
9) Message boards : Windows : NVIDIA GPU: Upgrade to the latest driver... (Message 6526)
Posted 1 Apr 2020 by Georgi Vidinski
Hi Beyond,

The code inside the new CUDA102WIN application is basically the same as in the old CUDA55 application. Still, there are some necessary bugfixes, which do not affect the execution plan though.
Of course you can always switch back to the old version, but without those bugfixes applied your error rate will be significantly higher. The second major change comes from the new NVIDIA CUDA libraries. Anyone interested in what was improved and fixed can check the NVIDIA release notes and changelog.

When the new application was finished, it was analyzed in depth with the latest tools from the NVIDIA Nsight package before it was released.
Since then CUDA102WIN has done a lot of work, great work actually, on many configurations. Still, there could be exceptions caused by external factors - overclocking, bad heat dissipation etc.

One other thing I can suggest is to downgrade from the latest 445.75 Game Driver to the latest 442.19 Studio Driver and try again.

Can you provide some more information about the CPU and GPU usage and how you measured it, for both CUDA55 and CUDA102WIN? Anything is welcome at your convenience - screenshots, logs.

Thanks in advance.
Georgi
10) Message boards : Problems and bug reports : Any task using SSE2 and SSE3 versions of app fails instantly (Message 6523)
Posted 31 Mar 2020 by Georgi Vidinski
Dear Ferdinand-QVC,

May I suggest that you check your computer for other issues, like overheating, memory errors, driver incompatibility, hard disk errors etc. It seems your troubles started somewhere around the 24th of March and that they actually affect Asteroids@home.
There were no changes at that time in the applications your client uses, nor in the system.

Thanks.
Georgi
11) Message boards : Problems and bug reports : Any task using SSE2 and SSE3 versions of app fails instantly (Message 6520)
Posted 31 Mar 2020 by Georgi Vidinski
Hi Ferdinand-QVC,
Thank you for participating in Asteroids@home and for your feedback.
We are working on resolving that issue at the moment. I hope it will be sorted out very shortly.

Thank you for your patience.

Regards,
Georgi
12) Message boards : Unix/Linux : All new w/u failed again after project reset. (Message 6515)
Posted 27 Mar 2020 by Georgi Vidinski
Hi nairb,

Thank you for contributing to Asteroids@home.

Recently we've been receiving a certain amount of misaligned input data from our feed, which led to the issue in your case. Sometimes our routines can't handle misaligned data well, and that can directly affect Work Units. Still, this is accounted for, so no data has been lost.

Please be patient, as our team is constantly working to improve the code base, data handling, and pre- and post-processing routines.

Thanks.
Georgi
13) Message boards : Problems and bug reports : Error: Number of lc points is greater than POINTS_MAX = 1000 (Message 6512)
Posted 26 Mar 2020 by Georgi Vidinski
Hi all,

As of yesterday, almost all CPU-related Windows x64 Period search applications have been updated with the bugfix for the 'POINTS_MAX' issue. The exception is the 'sse2' application.

If your client still receives tasks with an application version lower than 102.12 today, you may need to reset the project through BOINC Manager.
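
The version check described above can be sketched in a few lines of Python. This is only an illustration and not part of BOINC or the project's code: the function names and the dotted-string version format are assumptions, and only the threshold 102.12 comes from the post.

```python
# Threshold taken from the post; everything else here is a hypothetical sketch.
MIN_FIXED_VERSION = "102.12"


def parse_version(text):
    """Turn a dotted version string such as '102.12' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def needs_project_reset(app_version, minimum=MIN_FIXED_VERSION):
    """Return True when the client still runs an application older than `minimum`."""
    return parse_version(app_version) < parse_version(minimum)


if __name__ == "__main__":
    for version in ("102.10", "102.12", "102.13"):
        print(version, needs_project_reset(version))
```

Comparing tuples of integers rather than raw strings avoids treating 102.9 as newer than 102.12.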

Georgi
14) Message boards : Windows : NVIDIA GPU: Upgrade to the latest driver... (Message 6508)
Posted 26 Mar 2020 by Georgi Vidinski
Hi John,

First of all, I can't see any cuda application run by your account at all. But if you are planning to change this, I have a few notes.

I don't know what you mean by "latest nvidia driver APPROVED for my 64-bit PC". Have you checked this page: NVIDIA > Download Drivers? It states that the most up-to-date driver for your GPU is:
Version: 445.75 WHQL
Release Date: 2020.3.23

Having an MX150 with such an outdated driver version (398.35) will prevent you from running the latest cuda102 application. You can still use the older cuda55, but you may experience some issues. And you will definitely miss some improvements that NVIDIA has introduced with their latest CUDA SDK (Release notes). The most important reason for keeping applications and drivers updated is that newer versions contain bugfixes, leaving improvements aside. Unless you have a very important reason to keep an outdated driver, I strongly recommend upgrading yours, even if you are not planning to use your CUDA-enabled GPU for Asteroids@home.

Regards
Georgi
15) Message boards : Problems and bug reports : Error: Number of lc points is greater than POINTS_MAX = 1000 (Message 6507)
Posted 26 Mar 2020 by Georgi Vidinski
Hi Steve,

There were a few issues that we've been working on recently. Some of them were related to the errors you've seen in some of the WUs you've reported.

In the first case, WUs have been reported with the following error:
* Error: Number of lc points is greater than POINTS_MAX = 1000
This is due to a slightly different structure inside the newly generated WUs, after new sources of data were added in the last few months. We have already started applying a bugfix for it. SSE2 applications are next on the list.

The next one was found on a set of WUs on various platforms:
* Unhandled Exception Detected...
This is another known issue, and we'll need more time to investigate and fix it.
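
The first error above reads like a plain bounds check on the number of light-curve points. A minimal Python sketch of such a check follows. The real application is not written in Python, and the function name `check_lightcurve_points` and the exception type are hypothetical. Only the limit of 1000 and the message text are taken from the error itself.

```python
POINTS_MAX = 1000  # limit quoted in the error message


def check_lightcurve_points(points, points_max=POINTS_MAX):
    """Return the number of points, or raise ValueError if the limit is exceeded."""
    if len(points) > points_max:
        raise ValueError(
            f"Error: Number of lc points is greater than POINTS_MAX = {points_max}"
        )
    return len(points)
```

A WU whose light curve carries more points than the fixed limit fails immediately under a check like this, which would match the instant failures reported in this thread.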

As to your question about the lack of tasks observed from time to time: preparing WUs can be a heavy and very time-consuming task. That's why, most of the time, it lags behind the capabilities and computing power provided by all of you, our contributors.

Thank you for your patience.
Georgi
16) Message boards : News : New CUDA application for Windows x64 Release (Message 6502)
Posted 25 Mar 2020 by Georgi Vidinski
I'll try to shed some more light on the new CUDA application and the hardware that it addresses.

What is CUDA?
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). More info HERE and HERE.

That means CUDA applications can only work on NVIDIA-based video cards (GPUs), and thus will be sent only to BOINC clients that have NVIDIA GPUs enabled in their settings. In our case there are already two application versions.
First we had cuda55, which can run on older models with Compute Capabilities (CC) from 2.0 to 6.5. The new one, cuda102, was released on Monday, 23rd of March, and supports older devices starting with CC 3.0 as well as the newest ones with CC 7.5.
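
To make the two ranges concrete, here is a small Python sketch that lists which application versions cover a given Compute Capability. It is an illustration only: the table and the function name `supported_apps` are assumptions, and the ranges are simply the ones stated above, treated as inclusive. The server's real selection logic is not shown in this post.

```python
# Compute Capability ranges as stated in the post, treated as inclusive.
APP_CC_RANGES = {
    "cuda55": ((2, 0), (6, 5)),
    "cuda102": ((3, 0), (7, 5)),
}


def supported_apps(compute_capability):
    """Return the application versions whose CC range contains the device."""
    return [
        name
        for name, (low, high) in APP_CC_RANGES.items()
        if low <= compute_capability <= high
    ]


if __name__ == "__main__":
    for cc in ((2, 1), (6, 1), (7, 5)):
        print(cc, supported_apps(cc))
```

A CC is written as a (major, minor) tuple so that the comparison works numerically on both parts.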

This application does not address, and can't be run on, AMD-based Radeon GPUs. Radeon GPUs usually run applications developed on top of the OpenCL libraries, which are different from CUDA and thus not the subject of this announcement. More info HERE and HERE.

I hope this was helpful to those who were confused.

Regards!
Georgi
17) Message boards : Problems and bug reports : Any task using SSE2 and SSE3 versions of app fails instantly (Message 6496)
Posted 24 Mar 2020 by Georgi Vidinski
Hello everyone,

Thank you for participating in Asteroids@home.

It's a known issue and we've been working on a resolution for the past few weeks.
It will be fixed very soon.
Please excuse us for the inconvenience and thank you for your patience.

Regards,
Georgi
18) Message boards : News : New CUDA application for Windows x64 Release (Message 6494)
Posted 24 Mar 2020 by Georgi Vidinski
Hi Daniel,

Thank you for the information. That helps a lot.

That means those two tasks are definitely running on the same GPU card, which is very interesting. We'll need to investigate the issue more deeply, as it turns out the BOINC client does not address both GPU cards correctly, yet reports that the tasks are running on separate deviceIds at the same time.

Can you check one more thing, please? Which deviceId is assigned to which card, like:

GTX 1070 Ti | "GPU Core Load" = 0% | DeviceId = x
GTX 2070 | "GPU Core Load" = 100% | DeviceId = y

It would be interesting to see what's going on.

Meanwhile you can limit the number of simultaneously running GPU tasks to one for the Asteroids@home project. Sorry for the inconvenience.

Thanks.
Georgi
19) Message boards : News : New CUDA application for Windows x64 Release (Message 6491)
Posted 24 Mar 2020 by Georgi Vidinski
The new CUDA102WIN hangs after few fractions of an percent,...

Hi HausGeist,
Thank you for participating in Asteroids@home.

Actually, the application did not hang at all. The gaps between the reported fractions done occur because during that time the whole computation is done on the Device (GPU card), and the Host (Computer) does not know how far it has progressed.
So don't worry, everything is fine and works as expected. You can verify that from the status of your finished tasks. We are aware of this side effect and will try to improve the reporting routine in a future release.
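
The reporting gaps can be illustrated with a toy Python model. It is not the project's code: `run_batch_on_device` is a hypothetical stand-in for a long GPU computation, and the sketch assumes the host can only update the fraction done between such calls.

```python
def run_batch_on_device(batch):
    """Hypothetical stand-in for a long GPU computation; the host just waits."""
    return sum(batch)


def process(batches, report):
    """Run batches one by one and report the fraction done after each returns."""
    results = []
    for index, batch in enumerate(batches, start=1):
        results.append(run_batch_on_device(batch))  # no progress is visible here
        report(index / len(batches))  # the host learns the progress only now
    return results


if __name__ == "__main__":
    process([[1, 2], [3, 4], [5, 6], [7, 8]], lambda f: print(f"{f:.0%} done"))
```

In this model the reported value jumps in steps, and a long-running batch looks like a hang from the outside even though the work is proceeding.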

Thanks,
Georgi
20) Message boards : News : New CUDA application for Windows x64 Release (Message 6490)
Posted 24 Mar 2020 by Georgi Vidinski
I noticed something else that looks strange.

I am running the new application on a computer with 2 graphics cards (GTX 1070 Ti and GTX 2070).

In BOINC manager it shows 2 GPU tasks running (one on device0 and one on device1 as expected), but when I look at the GPU load using GPU-Z, I see that the GPU Load is 98% for the GTX 2070, but always 0% for the GTX 1070 Ti.

Hi Daniel,
Thank you for participating in Asteroids@home and for your feedback.

May I ask you to check two more things about the issue you are pointing out?
Can you please note the execution time per task when both cards are running tasks, and again when just a single card is running?
Because of the nature of the application, there should be a significant difference in execution times if for some reason two tasks ended up running on a single GPU card.
Can you also check what the Sensor screen of HWInfo shows in both cases?
The way the BOINC client addresses GPU cards is very straightforward: it uses "deviceId"-s, which should guarantee that different GPU cards are addressed at the same time.
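
The timing comparison requested above could be evaluated with something like the following Python sketch. It is a rough heuristic, not a project tool: the function name `likely_sharing_gpu` and the slowdown threshold of 1.5 are assumptions chosen for illustration, and two different card models will not run at identical speed anyway.

```python
def likely_sharing_gpu(time_single, time_concurrent, threshold=1.5):
    """Guess whether two concurrent tasks ended up on one GPU.

    time_single: seconds per task when only one card runs a task.
    time_concurrent: seconds per task when both cards appear to be busy.
    threshold: assumed slowdown ratio that counts as "significant".
    """
    if time_single <= 0:
        raise ValueError("time_single must be positive")
    return time_concurrent / time_single >= threshold
```

If the per-task time barely changes between the two measurements, the tasks most likely ran on separate cards. A large slowdown points to both tasks competing for the same device.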

Thanks.
Georgi




Copyright © 2020 Asteroids@home