1.20 Find seeds with zero villages within a radius v1.08 failing on Quadro M5000

Message boards : Number crunching : 1.20 Find seeds with zero villages within a radius v1.08 failing on Quadro M5000
Message board moderation

To post messages, you must log in.

AuthorMessage
Pokihead

Send message
Joined: 19 Feb 25
Posts: 5
Credit: 66,037,500
RAC: 902,011
Message 1159 - Posted: 18 Apr 2025, 8:48:12 UTC

Hey,
dont know if that should be like that but all "1.20 Find seeds with zero villages within a radius v1.08 (cuda) windows_x86_64" are failing on my Quadro M5000.
On my P2200 they are slow but working.

<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
{Segment laden}
Eine VDM l�dt, l�scht oder bewegt ein MS-DOS- oder Win16-Segmentbild.
Eine Ausnahme wird veranlasst, damit ein Debugger innerhalb dieses 16-Bit-Segments Symbole und Haltepunkte laden, l�schen oder nachverfolgen kann.
 (0x2be) - exit code 702 (0x2be)</message>
<stderr_txt>
boinc gpu 0 gpuindex: 0 
No checkpoint to load
Maxwell device detected - running smaller kernel runs for compat.
GPUassert: the launch timed out and was terminated (code 702) main.cu 4890
</stderr_txt>
]]>

Link to one Workunit https://minecraftathome.com/minecrafthome/result.php?resultid=10540427
ID: 1159 · Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 8 Mar 21
Posts: 87
Credit: 1,286,102,973
RAC: 19,415,984
Message 1163 - Posted: 18 Apr 2025, 18:57:20 UTC - in response to Message 1159.  

I believe the card is too old an architecture to run on the app. Check with boysanic the developer.
ID: 1163 · Report as offensive     Reply Quote
Thefanman64

Send message
Joined: 30 Mar 25
Posts: 12
Credit: 50,590,000
RAC: 417,377
Message 1174 - Posted: 19 Apr 2025, 3:43:43 UTC

The m5000 is maxwell ( GTX 900 series equivalent i believe) I'd think it would run
ID: 1174 · Report as offensive     Reply Quote
Pokihead

Send message
Joined: 19 Feb 25
Posts: 5
Credit: 66,037,500
RAC: 902,011
Message 1177 - Posted: 19 Apr 2025, 8:24:03 UTC

Yes would think that too.
GTX 900 Series is not newest but also not extremely old^^
But got in 2 hours about 370 failed "1.20 Find seeds with zero villages within a radius v1.08 (cuda) windows_x86_64" works so it looks like actual it is not working.
Just wanted to if that is normal or a bug
ID: 1177 · Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 8 Mar 21
Posts: 87
Credit: 1,286,102,973
RAC: 19,415,984
Message 1186 - Posted: 20 Apr 2025, 0:48:03 UTC

The error message says it detects specifically a case for the Maxwell class gpu and makes some changes in the app code for it.
But it does not handle the card correctly after that and produces instant errors.
You still need to check with the developer boysanic and see if the app really CAN handle the Maxwell card and if it supposedly can, need to report to him the app is busted on your host.
ID: 1186 · Report as offensive     Reply Quote
boysanic
Project administrator
Project developer

Send message
Joined: 15 Jun 20
Posts: 94
Credit: 201,320,555
RAC: 3,295,430
Message 1187 - Posted: 20 Apr 2025, 1:38:04 UTC

Hi,

The message about "Maxwell detected" was specifically something I was trying to optimize kernel runtimes on older (and weaker) cards.
Not overall task runtime, but specifically shortening the per-GPU-execution time (of which there are many in a single task) as to try to resolve the "launch timeout" error.

I don't think we ever fully resolved it. But, I would expect that a M5000 could run it.

It's possible I need to do more work to get it running on GPUs that can't cope with the launch bounds we've given so far. I don't have access to a GPU in Maxwell besides a Titan X, but my Titan X runs these tasks without complaint.

I'll add this to my list to investigate further. Lately I've been focused on our CPU-based app, Xoroshigo, but it's running fairly stable after the fixes we deployed yesterday.
ID: 1187 · Report as offensive     Reply Quote
boysanic
Project administrator
Project developer

Send message
Joined: 15 Jun 20
Posts: 94
Credit: 201,320,555
RAC: 3,295,430
Message 1188 - Posted: 20 Apr 2025, 1:43:09 UTC

Oh, since this is a windows machine, something you can try to work around this is to change this registry value if you're comfortable with that.

HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control\GraphicsDrivers
Add a "DWORD" key called "TdrLevel" , set it to 0.

Fair warning, if it's your display GPU, you may find your screen unresponsive if you run GPU-based tasks.
If this is a concern to you, you can set TdrLevel to something higher. I'd start with 3, then try 10 if that also fails.

Please let me know if you do try this and how it goes for you.
ID: 1188 · Report as offensive     Reply Quote
Pokihead

Send message
Joined: 19 Feb 25
Posts: 5
Credit: 66,037,500
RAC: 902,011
Message 1190 - Posted: 20 Apr 2025, 12:43:26 UTC - in response to Message 1188.  

Yeah my screen output is on the igpu so no problem.
And this setting (with 0) seems to be working.
Workunit is only at 5%, but before it always was stopping with the error after a few seconds.
But i will post an update when it finished the task
ID: 1190 · Report as offensive     Reply Quote
Pokihead

Send message
Joined: 19 Feb 25
Posts: 5
Credit: 66,037,500
RAC: 902,011
Message 1193 - Posted: 20 Apr 2025, 21:00:34 UTC

Now it finished and everything is ok.
Wasnt really fast, but no errors.
ID: 1193 · Report as offensive     Reply Quote

Message boards : Number crunching : 1.20 Find seeds with zero villages within a radius v1.08 failing on Quadro M5000