We are live

Message boards : News : We are live
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 26 - Posted: 25 Jun 2020, 21:40:09 UTC - in response to Message 24.  

Meanwhilst, my AMD RX 5700 XT finished its first task, Run time 40 min 36 sec (or 2,436.47 seconds). [/url]
ID: 26 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 27 - Posted: 25 Jun 2020, 22:50:58 UTC

Just tested in preparation of installing 20.5.1 drivers. present task was at 27 minutes. Stopped BOINC, waited a minute. Started BOINC, task starts from the beginning.
It would be nice to have checkpointing, especially for those with slower GPUs.
ID: 27 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 28 - Posted: 25 Jun 2020, 23:16:09 UTC
Last modified: 25 Jun 2020, 23:29:31 UTC

My windows machines seem to be working fine now.

However, not so much with my linux machines. I have two identical machines. Hardware, GPUs, and OS, all the same. One can get tasks, but they never complete. The other can't get tasks at all. It asks for tasks, but nothing is sent. There is nothing in the event log explaining why.

When I look at other linux machines attached to this project, I am not seeing much success either.

Edit: It looks like one of my linux machines does not have OpenCL installed. I assume that is why it can't get work. I will do that now.
Reno, NV
Team: SETI.USA
ID: 28 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 29 - Posted: 25 Jun 2020, 23:31:28 UTC - in response to Message 28.  
Last modified: 25 Jun 2020, 23:32:33 UTC

The other can't get tasks at all. It asks for tasks, but nothing is sent. There is nothing in the event log explaining why.
You have sched_op_debug enabled for more information?

Nice to see you're still around. :)

Edit:
Edit: It looks like one of my linux machines does not have OpenCL installed. I assume that is why it can't get work. I will do that now.
That would probably do it.
ID: 29 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 30 - Posted: 25 Jun 2020, 23:40:11 UTC - in response to Message 29.  

Hi Jord! Yeah, sched_op_debug is enabled. For whatever reason, it doesn't say anything when the issue is the app needs OpenCL, but it's not installed. In any case, that fixed the work fetch problem with the one machine.

For the issue of not completing tasks, maybe that was fixed after all. I should know in an hour or two.
Reno, NV
Team: SETI.USA
ID: 30 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 31 - Posted: 25 Jun 2020, 23:41:40 UTC

Can stats export please be turned on?
Reno, NV
Team: SETI.USA
ID: 31 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 32 - Posted: 25 Jun 2020, 23:49:22 UTC - in response to Message 30.  

For whatever reason, it doesn't say anything when the issue is the app needs OpenCL, but it's not installed.
I don't think it's up to BOINC to tell you about that. You do send information about your whole system to the project with the sched_request*.xml file so it would be nice if they told you you missed something required. But they'll have to program such a response. And I gather these guys just started with BOINC, so baby steps. ;-)

For the issue of not completing tasks, maybe that was fixed after all. I should know in an hour or two.
Too bad you can't use that VII, it would be interesting to see how it compares to a 5700 XT.
One wingman I was paired against (but who apparently detached and reattached so his task is abandoned) runs a NVIDIA Tesla P100-PCIE-16GB (4095MB) (looks like he has 32bit driver problems) which does tasks in half the time my 5700 XT does.
ID: 32 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 33 - Posted: 26 Jun 2020, 0:30:54 UTC

It looks like this project has a similar problem to what SRBase has. A machine with multiple GPUs, all the tasks are running at the same time on just one of the GPUs. And the rest of the GPUs are idle. I am seeing this on all my machines, both Windows and Linux.
Reno, NV
Team: SETI.USA
ID: 33 · Report as offensive
Profile Steve Dodd

Send message
Joined: 26 Jun 20
Posts: 25
Credit: 123,735,290
RAC: 182
Message 34 - Posted: 26 Jun 2020, 3:26:23 UTC - in response to Message 33.  
Last modified: 26 Jun 2020, 3:45:34 UTC

ditto, except i have two types of GPUs in my machine (2 NVidia, 1 AMD) and i can get NVidia and AMD WUs to run but the NVidia WUs only seem to run on one of the cards. Also, the project doesn't correctly identify the machine resources - thinks I have 2 GTX 1060s, in reality I have 1 GTX 1060, 1 GTX 1070. (plus the Radeon WX5100)
ID: 34 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 35 - Posted: 26 Jun 2020, 4:54:00 UTC
Last modified: 26 Jun 2020, 4:57:49 UTC

Until the app gets fixed, for those with multiple GPU systems, add this to your cc_config.xml. That way you can free up the rest of your GPUs to run other projects. You will need to quit/restart BOINC for the changes to take effect.:

<exclude_gpu>
   <url>project_URL</url>
   [<device_num>N</device_num>]
   [<type>NVIDIA|ATI|intel_gpu</type>]
   [<app>appname</app>]
</exclude_gpu>

It runs on GPU "0" of each type by default. So exclude GPUs 1+ for each type of additional GPUs you have in your machine. For example, Here are the entries I have on a Machine with three Nvidia GPUs:

                <exclude_gpu>
                        <url>https://minecraftathome.com/minecrafthome/</url>
                        <type>NVIDIA</type>
                        <device_num>1</device_num>
                        <app>kaktwoos</app>
                </exclude_gpu>
                <exclude_gpu>
                        <url>https://minecraftathome.com/minecrafthome/</url>
                        <type>NVIDIA</type>
                        <device_num>2</device_num>
                        <app>kaktwoos</app>
                </exclude_gpu>

FYI, you can see which GPU is # 0, 1, 2, etc., bu looking at your event log at start up. This is what mine shows for this same machine:

6/25/2020 6:34:42 PM	CUDA: NVIDIA GPU 0: GeForce GTX 1660 Ti (driver version 440.10, CUDA version 10.2, compute capability 7.5, 4096MB, 3972MB available, 11336 GFLOPS peak)	
6/25/2020 6:34:42 PM	CUDA: NVIDIA GPU 1: GeForce GTX 1660 Ti (driver version 440.10, CUDA version 10.2, compute capability 7.5, 4096MB, 3972MB available, 11336 GFLOPS peak)	
6/25/2020 6:34:42 PM	CUDA: NVIDIA GPU 2: Quadro K2000 (driver version 440.10, CUDA version 10.2, compute capability 3.0, 2000MB, 1973MB available, 733 GFLOPS peak)	
6/25/2020 6:34:42 PM	OpenCL: NVIDIA GPU 0: GeForce GTX 1660 Ti (driver version 440.100, device version OpenCL 1.2 CUDA, 5942MB, 3972MB available, 11336 GFLOPS peak)	
6/25/2020 6:34:42 PM	OpenCL: NVIDIA GPU 1: GeForce GTX 1660 Ti (driver version 440.100, device version OpenCL 1.2 CUDA, 5945MB, 3972MB available, 11336 GFLOPS peak)	
6/25/2020 6:34:42 PM	OpenCL: NVIDIA GPU 2: Quadro K2000 (driver version 440.100, device version OpenCL 1.2 CUDA, 2000MB, 1973MB available, 733 GFLOPS peak)	

Reno, NV
Team: SETI.USA
ID: 35 · Report as offensive
zombie67 [MM]
Avatar

Send message
Joined: 24 Jun 20
Posts: 25
Credit: 448,784,541
RAC: 24,476
Message 36 - Posted: 26 Jun 2020, 6:41:34 UTC - in response to Message 32.  

Too bad you can't use that VII, it would be interesting to see how it compares to a 5700 XT.

FWIW, now that I have only one task running on my 2080 Ti at a time, it takes 1960 seconds.

Admin: It would be better if you turned on the "number crunching" sub-forum, so that things like this could be discussed without clogging up the news sub-forum.
Reno, NV
Team: SETI.USA
ID: 36 · Report as offensive
Nick Name

Send message
Joined: 24 Jun 20
Posts: 9
Credit: 47,348,036
RAC: 0
Message 37 - Posted: 26 Jun 2020, 7:43:26 UTC - in response to Message 35.  

Until the app gets fixed, for those with multiple GPU systems, add this to your cc_config.xml. That way you can free up the rest of your GPUs to run other projects. You will need to quit/restart BOINC for the changes to take effect.:

<exclude_gpu>
   <url>project_URL</url>
   [<device_num>N</device_num>]
   [<type>NVIDIA|ATI|intel_gpu</type>]
   [<app>appname</app>]
</exclude_gpu>

It runs on GPU "0" of each type by default. So exclude GPUs 1+ for each type of additional GPUs you have in your machine. For example, Here are the entries I have on a Machine with three Nvidia GPUs:

Does that actually work? This should but doesn't.

<ignore_nvidia_dev>0</ignore_nvidia_dev>


This from init_data in the slot folder:
<gpu_type>NVIDIA</gpu_type>
<gpu_device_num>1</gpu_device_num>
<gpu_opencl_dev_index>1</gpu_opencl_dev_index>
<gpu_usage>1.000000</gpu_usage>
<ncpus>0.997799</ncpus>


seems to indicate it should be running on the 2nd GPU, but it isn't.
ID: 37 · Report as offensive
Frank [NT]

Send message
Joined: 25 Jun 20
Posts: 2
Credit: 9,583,202
RAC: 0
Message 38 - Posted: 26 Jun 2020, 8:12:58 UTC - in response to Message 23.  

[quote]
Same here, 6 to 10 days on a 24 hour deadline.

Though the BM estimate is 6 - 10 days, mine completed in 4000 - 4600 seconds. Seems to be running fine here, you just need to ignore the estimated times.


Should look better now, let me know what you see


It seems the BM shows the correct progress and remaining time now, great work, thanks !

And agree @Zombie67, a Number Crunching forum would be good to discuss such things ;-)
ID: 38 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 39 - Posted: 26 Jun 2020, 11:28:57 UTC - in response to Message 36.  

Admin: It would be better if you turned on the "number crunching" sub-forum, so that things like this could be discussed without clogging up the news sub-forum.
In case he wonders how, rerun the html/ops/create_forums.php script after enabling catid 2: https://github.com/BOINC/boinc/blob/master/html/ops/create_forums.php

And yes I know chip's asking for BOINC experts to come forward via Discord, but I'm not so much of a talker. More of a typer. :)
ID: 39 · Report as offensive
Profile Steve Dodd

Send message
Joined: 26 Jun 20
Posts: 25
Credit: 123,735,290
RAC: 182
Message 40 - Posted: 26 Jun 2020, 11:59:32 UTC - in response to Message 35.  

well zombie67, i thought that would work and had already put that in my cc_config, but it didn't.
message in log at start up: 6/26/2020 4:53:08 AM | | Unrecognized tag in cc_config.xml: <exclude_gpu>

my cc_config:

<cc_config>
<exclude_gpu>
<url>https://minecraftathome.com/minecrafthome/</url>
<type>NVIDIA</type>
<device_num>1</device_num>
<app>kaktwoos</app>
</exclude_gpu>
<options>
<use_all_gpus>1</use_all_gpus>
<skip_cpu_benchmarks>1</skip_cpu_benchmarks>
<report_results_immediately>1</report_results_immediately>
</options>
</cc_config>

What did i mess up? :) (Running version 7.16.5 of BOINC)
ID: 40 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 41 - Posted: 26 Jun 2020, 12:04:43 UTC - in response to Message 40.  
Last modified: 26 Jun 2020, 12:09:06 UTC

How did you make/edit cc_config.xml? Used a default ASCII editor or a word processor? Saved it as ANSI format?


Ah I see it... your cc_config is set up wrong.
Try this:
<cc_config>
<options>
<exclude_gpu>
<url>https://minecraftathome.com/minecrafthome/</url>
<type>NVIDIA</type>
<device_num>1</device_num>
<app>kaktwoos</app>
</exclude_gpu>
<use_all_gpus>1</use_all_gpus>
<skip_cpu_benchmarks>1</skip_cpu_benchmarks>
<report_results_immediately>1</report_results_immediately>
</options>
</cc_config>

For reference: https://boinc.berkeley.edu/wiki/Client_configuration#Options, exclude_gpu is an option, so should be inside the <options></options> tags in cc_config.xml

And btw, for this project report results immediately isn't necessary as the deadline is 24 hours so any tasks done will be reported immediately automatically.
ID: 41 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 42 - Posted: 26 Jun 2020, 12:15:18 UTC
Last modified: 26 Jun 2020, 13:00:25 UTC

First task to validate against another user's Nvidia GT 1030 fetched me a validate error: https://minecraftathome.com/minecrafthome/workunit.php?wuid=1259872.
So let's see what it will do against an RX 580 later today: https://minecraftathome.com/minecrafthome/workunit.php?wuid=1260373
I hope that validate errors aren't a precedence for the rest of them.

It also doesn't help that when the wingman returns an error, or detaches/reattaches and thus abandons work, that remaining tasks stay unsent: https://minecraftathome.com/minecrafthome/workunit.php?wuid=1259732

I think I will only go for validate errors because that GT 1030 had a lot more info in its stderr.txt than my RX 5700 XT had:

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
19:09:40 (10408): wrapper (7.5.26014): starting
19:09:40 (10408): wrapper: running ../../projects/minecraftathome.com_minecrafthome/kaktwoos_1.12_opencl ( --start 11400000000000 --end 11500000000000 --chunkseed 9567961692053 --neighbor1 856 --neighbor2 344 --neighbor3 840 --diagonalindex 0 --cactusheight 12)
Received work unit: 9567961692053
Data: n1: 856, n2: 344, n3: 840, di: 0, ch: 12
    Found seed: 5861617559634779173, 182644612078629, height: 20
5861617559634779173
    Found seed: 6001229513563318277, 183010092132357, height: 20
6001229513563318277
    Found seed: 6149848403176123013, 183112001710725, height: 21
6149848403176123013
    Found seed: 5861618302365947205, 183387343246661, height: 20
5861618302365947205
    Found seed: 6149848715775767557, 183424601355269, height: 21
6149848715775767557
Speed: 5.95m/s 
Done
Processed 100000000000 seeds in 16798.995819 seconds
Found seeds: 
    5861617559634779173
    6001229513563318277
    6149848403176123013
    5861618302365947205
    6149848715775767557
23:51:02 (10408): client exited; CPU time 13158.495873
23:51:02 (10408): called boinc_finish(0)

</stderr_txt>
]]>

vs

<core_client_version>7.16.7</core_client_version>
<![CDATA[
<stderr_txt>
00:18:42 (2832): wrapper (7.7.26016): starting
00:18:42 (2832): wrapper: running ../../projects/minecraftathome.com_minecrafthome/kaktwoos_1.12_opencl_amd.exe ( --start 11400000000000 --end 11500000000000 --chunkseed 9567961692053 --neighbor1 856 --neighbor2 344 --neighbor3 840 --diagonalindex 0 --cactusheight 12)
00:46:25 (9924): wrapper (7.7.26016): starting
00:46:25 (9924): wrapper: running ../../projects/minecraftathome.com_minecrafthome/kaktwoos_1.12_opencl_amd.exe ( --start 11400000000000 --end 11500000000000 --chunkseed 9567961692053 --neighbor1 856 --neighbor2 344 --neighbor3 840 --diagonalindex 0 --cactusheight 12)
00:56:59 (11396): wrapper (7.7.26016): starting
00:56:59 (11396): wrapper: running ../../projects/minecraftathome.com_minecrafthome/kaktwoos_1.12_opencl_amd.exe ( --start 11400000000000 --end 11500000000000 --chunkseed 9567961692053 --neighbor1 856 --neighbor2 344 --neighbor3 840 --diagonalindex 0 --cactusheight 12)
01:36:57 (11396): client exited; CPU time 10.531250
01:36:57 (11396): called boinc_finish(0)

</stderr_txt>
]]>
So just because my tasks end well doesn't mean it's doing any useful work.

I also see for Nvidia users that their CPU time is high, whereas mine is 10 seconds. The kaktwoos application doesn't use any CPU in Task Manager Details. So it would seem that the application doesn't run correctly on my system. It does on that RX 580 I pointed out earlier, and there also the CPU time is high. So isn't the OpenCL app optimized for Navi GPUs?
ID: 42 · Report as offensive
Profile Steve Dodd

Send message
Joined: 26 Jun 20
Posts: 25
Credit: 123,735,290
RAC: 182
Message 43 - Posted: 26 Jun 2020, 13:56:03 UTC - in response to Message 41.  

Thank you, Jord. Figures that's what it was. Thought about trying that after seeing the message in the log, but I like "talking" to people, so... :)
ID: 43 · Report as offensive
Jord
Volunteer moderator
Help desk expert
Avatar

Send message
Joined: 24 Jun 20
Posts: 85
Credit: 207,156
RAC: 0
Message 44 - Posted: 26 Jun 2020, 15:40:10 UTC

Yup, I have validate errors for all tasks so far, so that means this cannot be run on an RX 5700 XT. Would be nice to have someone else with such a card, or at least 5000 series chime in to see if it's my system or not.
ID: 44 · Report as offensive
[H]auntjemima

Send message
Joined: 26 Jun 20
Posts: 4
Credit: 1,530,361
RAC: 0
Message 45 - Posted: 26 Jun 2020, 23:42:04 UTC

Anyone else running a 1080gtx and not getting any tasks? Event log says 0 tasks sent.
ID: 45 · Report as offensive
Previous · 1 · 2 · 3 · 4 · Next

Message boards : News : We are live