watchdog does not work #79

Open
Optimusjuju opened this issue Nov 2, 2016 · 10 comments

Comments

@Optimusjuju

The miner crashes at random. I would like to know what your solution was.

WIN 10 15.12 AND 16.1.1

@ofpcarnage

Add -f 2 -z 1 to your script.

@Optimusjuju

Thank you.

@DefiantMiner

DefiantMiner commented Nov 3, 2016

Yeah...same here. All machines have GPUs that go to 0 S/s with no "auto-fix". Machines: Win10, 6xRX470
@ofpcarnage Adding -f 2 CAUSES EXIT on failure. Why the heck are you recommending that?

@mo35

mo35 commented Nov 3, 2016

-f 2 actually helps in that case: just run the miner in a loop, something like this:

title MinerX
:start
rem If necessary, add or change the platform switch (-P 0 or -P 1).
genoil -c pool.whatever.com:XXXX -u wallet.worker -p x -g 0 1 2 X -f 2
timeout /T 5
goto start
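
The loop above boils down to "run the miner, wait a few seconds after it exits, start it again". Here is a minimal sketch of the same idea in Python rather than batch, assuming the miner really does exit on failure, as -f 2 is described as doing in this thread. `run_forever`, `launch_miner`, and the injectable `launch` and `sleep` parameters are hypothetical names for illustration. The command line is copied from the post, placeholders included.

```python
import subprocess
import time

# Command copied from the post above; pool, port, wallet and the trailing
# "X" in the GPU list are placeholders that must be replaced before use.
MINER_CMD = ["genoil", "-c", "pool.whatever.com:XXXX", "-u", "wallet.worker",
             "-p", "x", "-g", "0", "1", "2", "X", "-f", "2"]


def run_forever(launch, delay=5.0, max_restarts=None, sleep=time.sleep):
    """Call launch() repeatedly, sleeping `delay` seconds after each exit.

    launch       -- callable that blocks until the miner exits and
                    returns its exit code
    max_restarts -- stop after this many runs (None loops until interrupted)
    Returns the list of exit codes seen.
    """
    codes = []
    while max_restarts is None or len(codes) < max_restarts:
        codes.append(launch())
        sleep(delay)
    return codes


def launch_miner():
    # Blocks until the miner process exits, then returns its exit code.
    return subprocess.call(MINER_CMD)
```

Calling `run_forever(launch_miner)` would loop until interrupted, like the `goto start` version. The `max_restarts` argument exists only so the loop can be exercised without a real miner.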

@DefiantMiner

I have it on a respawn timer, but since it takes 3-4 minutes just to load the binaries for 6 GPUs, I'd be spending more time booting the miner than actually hashing.

@jjziets

jjziets commented Nov 3, 2016

That bat file tends to crash after a few hours, so try this instead.

@echo off
rem Every 2 seconds, check whether genoil.exe is running and start it if it is not.
:check
tasklist /FI "imagename eq genoil.exe" | find /I /N "genoil.exe" >NUL
if "%ERRORLEVEL%"=="1" start genoil.exe -c eu1-zcash.flypool.org:3333 -u Address.worker -p x -i 20 -w 64 -P 0 -k zec -f 2 -z 1
timeout /t 2 /nobreak
goto :check
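
The batch file above polls `tasklist` and starts the miner only when no `genoil.exe` process is listed. A rough Python sketch of that check follows. `is_running`, `query_tasklist`, and `ensure_running` are hypothetical helper names, and the substring match mirrors `find /I` rather than parsing the table properly.

```python
import subprocess

IMAGE = "genoil.exe"


def is_running(tasklist_output, image=IMAGE):
    """True if any line of the tasklist output mentions the image name.

    Case-insensitive substring match, like `find /I` in the batch file.
    """
    wanted = image.lower()
    return any(wanted in line.lower() for line in tasklist_output.splitlines())


def query_tasklist(image=IMAGE):
    # Windows only: same filter expression as the batch file.
    result = subprocess.run(["tasklist", "/FI", f"imagename eq {image}"],
                            capture_output=True, text=True)
    return result.stdout


def ensure_running(start, query=query_tasklist, image=IMAGE):
    """Call start() only if the miner is not already listed.

    Returns True when start() was called, False otherwise.
    """
    if is_running(query(image), image):
        return False
    start()
    return True
```

A polling loop would call `ensure_running` every couple of seconds with a `start` function that launches the miner. Like the batch version, this only notices a process that has exited, not one that is still alive but hashing at 0 S/s.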

@pelelademadera

That's my script. The problem is that genoil sometimes fails, and sometimes drops to 0 S/s without failing, so I restart it every 1200 seconds (20 minutes). I use one script per GPU so I can see each one, and the start /min commands launch them minimized so they don't appear on screen.

:restart
start /min "GPU0" cmd /C start.bat
TIMEOUT /T 3
start /min "GPU1" cmd /C start2.bat
TIMEOUT /T 3
start /min "GPU2" cmd /C start3.bat
TIMEOUT /T 1200
taskkill /f /im genoil.exe
TIMEOUT /T 3
goto restart

@jjziets

jjziets commented Nov 3, 2016

I have a log file for my genoil 0.6. It is really random how it restarts.

started genoil at Thu 11/03/2016 15:44:04.28
started genoil at Thu 11/03/2016 15:46:07.44
started genoil at Thu 11/03/2016 15:50:11.46
started genoil at Thu 11/03/2016 15:51:34.02
started genoil at Thu 11/03/2016 15:53:42.07
started genoil at Thu 11/03/2016 15:55:51.63
started genoil at Thu 11/03/2016 15:56:17.16
started genoil at Thu 11/03/2016 15:58:01.21
started genoil at Thu 11/03/2016 15:58:55.44
started genoil at Thu 11/03/2016 16:03:42.19
started genoil at Thu 11/03/2016 16:08:25.16
started genoil at Thu 11/03/2016 16:10:45.88
started genoil at Thu 11/03/2016 16:11:44.34
started genoil at Thu 11/03/2016 16:14:45.39
started genoil at Thu 11/03/2016 16:18:30.01
started genoil at Thu 11/03/2016 16:18:47.05
started genoil at Thu 11/03/2016 16:19:07.44
started genoil at Thu 11/03/2016 16:21:34.53
started genoil at Thu 11/03/2016 16:25:36.08
started genoil at Thu 11/03/2016 16:26:32.51
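
To put a number on "really random", the gaps between consecutive start times can be computed from the log itself. The sketch below assumes every line ends in a `MM/DD/YYYY HH:MM:SS.cc` timestamp, as in the lines above. `parse_start` and `restart_gaps` are hypothetical helper names.

```python
from datetime import datetime

# The 20 log lines posted above, verbatim.
LOG = """\
started genoil at Thu 11/03/2016 15:44:04.28
started genoil at Thu 11/03/2016 15:46:07.44
started genoil at Thu 11/03/2016 15:50:11.46
started genoil at Thu 11/03/2016 15:51:34.02
started genoil at Thu 11/03/2016 15:53:42.07
started genoil at Thu 11/03/2016 15:55:51.63
started genoil at Thu 11/03/2016 15:56:17.16
started genoil at Thu 11/03/2016 15:58:01.21
started genoil at Thu 11/03/2016 15:58:55.44
started genoil at Thu 11/03/2016 16:03:42.19
started genoil at Thu 11/03/2016 16:08:25.16
started genoil at Thu 11/03/2016 16:10:45.88
started genoil at Thu 11/03/2016 16:11:44.34
started genoil at Thu 11/03/2016 16:14:45.39
started genoil at Thu 11/03/2016 16:18:30.01
started genoil at Thu 11/03/2016 16:18:47.05
started genoil at Thu 11/03/2016 16:19:07.44
started genoil at Thu 11/03/2016 16:21:34.53
started genoil at Thu 11/03/2016 16:25:36.08
started genoil at Thu 11/03/2016 16:26:32.51
"""


def parse_start(line):
    # Use only the last two fields (date and time); the weekday is skipped.
    date_s, time_s = line.split()[-2:]
    return datetime.strptime(f"{date_s} {time_s}", "%m/%d/%Y %H:%M:%S.%f")


def restart_gaps(log_text):
    """Seconds between each pair of consecutive start lines."""
    starts = [parse_start(l) for l in log_text.splitlines() if l.strip()]
    return [(b - a).total_seconds() for a, b in zip(starts, starts[1:])]


gaps = restart_gaps(LOG)
print(f"{len(gaps)} gaps: min {min(gaps):.2f}s, max {max(gaps):.2f}s, "
      f"mean {sum(gaps) / len(gaps):.2f}s")
```

For these 20 entries that gives 19 gaps, from about 17 seconds up to just under 5 minutes, with a mean of roughly 134 seconds.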

@DefiantMiner

DefiantMiner commented Nov 4, 2016

Here's a log of a typical failure mode: 6 GPUs drop to 3 over 10 minutes, then this happens:

22:03:32 main : zec-sa#0: 0.0S/s zec-sa#1: 25.3S/s zec-sa#2: 25.5S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 21.1S/s total: 72.0S/s
22:03:35 gpu#1: submitting solution
22:03:35 pool : received new job #7260602f8bd574ca844e
22:03:35 pool : not accepted: Stale job! (21)
22:03:35 pool : Retrying in 5 seconds...
22:03:36 gpu#1: submitting solution
zec-sa#1write: The file handle supplied is not valid
22:03:36 gpu#1: Mining thread quit
22:03:37 gpu#5: submitting solution
zec-sa#5write: The file handle supplied is not valid
22:03:37 gpu#5: Mining thread quit
22:03:37 main : zec-sa#0: 0.0S/s zec-sa#1: 21.3S/s zec-sa#2: 26.9S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 23.7S/s total: 72.0S/s
22:03:40 pool : Connecting to us1-zcash.flypool.org:3333
22:03:40 pool : Connected!
22:03:40 pool : subscribed
22:03:40 pool : received new target: 0x0083126e978d4f...
22:03:41 pool : received new job #6771047e7e2fdf94d7b8
22:03:42 main : 0.0S/s detected! Attempting to restart gpu#1
22:03:42 main : 0.0S/s detected! Attempting to restart gpu#5
22:03:42 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 22.1S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 22.1S/s
22:03:47 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 19.7S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 19.7S/s
22:03:50 gpu#2: submitting solution
22:03:50 pool : submitted and accepted
22:03:52 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 24.1S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 24.1S/s
22:03:57 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 21.3S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 21.3S/s
22:03:58 gpu#2: submitting solution
22:03:58 pool : submitted and accepted
22:04:02 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 19.7S/s zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 19.7S/s

It's on a respawn timer but limping along on ONE gpu for the rest of the period is killing my effective hash rate. Note the watchdogs activate...but do nothing.
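
Since the built-in watchdog fires without recovering the GPUs, an external script could read the `main :` status lines and decide for itself when to respawn. A small sketch, assuming the status-line format shown in the log above. `gpu_rates` and `stalled_gpus` are hypothetical helper names, and the sample line is copied from the log.

```python
import re

# Matches per-GPU entries such as "zec-sa#2: 22.1S/s"; the "total:" field
# and error lines like "zec-sa#1write: ..." do not match.
RATE_RE = re.compile(r"zec-sa#(\d+):\s*([\d.]+)S/s")


def gpu_rates(line):
    """Map GPU index -> S/s for a status line; empty dict for other lines."""
    return {int(idx): float(rate) for idx, rate in RATE_RE.findall(line)}


def stalled_gpus(line):
    """Sorted indices of GPUs reporting exactly 0.0 S/s on this line."""
    return sorted(idx for idx, rate in gpu_rates(line).items() if rate == 0.0)


SAMPLE = ("22:03:42 main : zec-sa#0: 0.0S/s zec-sa#1: 0.0S/s zec-sa#2: 22.1S/s "
          "zec-sa#3: 0.0S/s zec-sa#4: 0.0S/s zec-sa#5: 0.0S/s total: 22.1S/s")

print(stalled_gpus(SAMPLE))  # [0, 1, 3, 4, 5]
```

A wrapper could kill and restart the miner as soon as `stalled_gpus` returns a non-empty list for several status lines in a row, instead of waiting out the rest of a fixed respawn period.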

Hope this helps diagnose and fix.

Win 10, 6xRX470s, 1840, 4GB, 16.10.1 drivers

@DefiantMiner

@pelelademadera Stability is much better when running 6 separate processes (one per GPU) than when running a single process with all 6 GPUs. The binary load time is also instant with separate processes. So, to maximize output, I have the script re-spawn all 6 processes every 15 minutes to catch any strays that may have dropped off. This seems to be the only way to maximize multi-GPU output with the current software.
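
The one-process-per-GPU setup with a 15-minute respawn could look roughly like the sketch below. The flags are taken from commands quoted earlier in this thread. That `-g N` pins a process to a single GPU is an assumption based on the `-g 0 1 2` usage above, not something confirmed here. `per_gpu_commands` and `respawn_cycle` are hypothetical names, and the wallet address is a placeholder.

```python
import subprocess
import time

# Flags as quoted earlier in the thread; "Address.worker" is a placeholder.
BASE_CMD = ["genoil.exe", "-c", "us1-zcash.flypool.org:3333",
            "-u", "Address.worker", "-p", "x", "-k", "zec"]


def per_gpu_commands(gpu_count, base=BASE_CMD):
    """One command line per GPU; assumes `-g N` selects a single device."""
    return [base + ["-g", str(i)] for i in range(gpu_count)]


def respawn_cycle(commands, period_s=15 * 60,
                  spawn=subprocess.Popen, sleep=time.sleep):
    """Start every command, wait one period, then kill them all.

    Returns the number of processes that were started. Call it in a
    loop to get the periodic respawn described above.
    """
    procs = [spawn(cmd) for cmd in commands]
    sleep(period_s)
    for proc in procs:
        proc.kill()
        proc.wait()
    return len(procs)
```

Running `respawn_cycle(per_gpu_commands(6))` inside a `while True:` loop would restart all six miners every 15 minutes, whether or not any of them had stalled.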
