Calvin Clark wrote:
Remove the entry that says "--gpu 0 --gpu 1 --gpu 2 --gpu 3": true, as that is not a valid parameter to Lizzie and try again.
If that does not work, then we need to see the output of running leelaz by itself. To show that, do this from a a command prompt the same directory Lizzie.jar resides:
If you are running on Windows:
Code:
.\leela-zero\leelaz.exe -w network.gz
If you are running on Mac or Linux, use forward slashes:
Code:
./leela-zero/leelaz.exe -w network.gz
On a two-GPU system, you should see output like this:
Code:
E:\Lizzie>.\leela-zero\leelaz.exe -w network.gz
Using 2 thread(s).
RNG seed: 2961845906471187626
Leela Zero 0.16 Copyright (C) 2017-2018 Gian-Carlo Pascutto and contributors
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under certain conditions; see the COPYING file for details.
BLAS Core: Haswell
Detecting residual layers...v1...256 channels...40 blocks.
Initializing OpenCL (autodetecting precision).
Detected 1 OpenCL platforms.
Platform version: OpenCL 1.2 CUDA 9.2.102
Platform profile: FULL_PROFILE
Platform name: NVIDIA CUDA
Platform vendor: NVIDIA Corporation
Device ID: 0
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Device ID: 1
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Selected platform: NVIDIA CUDA
Selected device: Tesla K80
with OpenCL 1.2 capability.
Half precision compute support: No.
Detected 1 OpenCL platforms.
Platform version: OpenCL 1.2 CUDA 9.2.102
Platform profile: FULL_PROFILE
Platform name: NVIDIA CUDA
Platform vendor: NVIDIA Corporation
Device ID: 0
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Device ID: 1
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Selected platform: NVIDIA CUDA
Selected device: Tesla K80
with OpenCL 1.2 capability.
Half precision compute support: No.
Loaded existing SGEMM tuning.
Wavefront/Warp size: 32
Max workgroup size: 1024
Max workgroup dimensions: 1024 1024 64
Loaded existing SGEMM tuning.
Wavefront/Warp size: 32
Max workgroup size: 1024
Max workgroup dimensions: 1024 1024 64
Using OpenCL single precision (less than 5% slower than half).
Detected 1 OpenCL platforms.
Platform version: OpenCL 1.2 CUDA 9.2.102
Platform profile: FULL_PROFILE
Platform name: NVIDIA CUDA
Platform vendor: NVIDIA Corporation
Device ID: 0
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Device ID: 1
Device name: Tesla K80
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 397.44
Device speed: 823 MHz
Device cores: 13 CU
Device score: 1112
Selected platform: NVIDIA CUDA
Selected device: Tesla K80
with OpenCL 1.2 capability.
Half precision compute support: No.
Loaded existing SGEMM tuning.
Wavefront/Warp size: 32
Max workgroup size: 1024
Max workgroup dimensions: 1024 1024 64
Setting max tree size to 3736 MiB and cache size to 415 MiB.
Passes: 0 Black (X) Prisoners: 0
Black (X) to move White (O) Prisoners: 0
a b c d e f g h j k l m n o p q r s t
19 . . . . . . . . . . . . . . . . . . . 19
18 . . . . . . . . . . . . . . . . . . . 18
17 . . . . . . . . . . . . . . . . . . . 17
16 . . . + . . . . . + . . . . . + . . . 16
15 . . . . . . . . . . . . . . . . . . . 15
14 . . . . . . . . . . . . . . . . . . . 14
13 . . . . . . . . . . . . . . . . . . . 13
12 . . . . . . . . . . . . . . . . . . . 12
11 . . . . . . . . . . . . . . . . . . . 11
10 . . . + . . . . . + . . . . . + . . . 10
9 . . . . . . . . . . . . . . . . . . . 9
8 . . . . . . . . . . . . . . . . . . . 8
7 . . . . . . . . . . . . . . . . . . . 7
6 . . . . . . . . . . . . . . . . . . . 6
5 . . . . . . . . . . . . . . . . . . . 5
4 . . . + . . . . . + . . . . . + . . . 4
3 . . . . . . . . . . . . . . . . . . . 3
2 . . . . . . . . . . . . . . . . . . . 2
1 . . . . . . . . . . . . . . . . . . . 1
a b c d e f g h j k l m n o p q r s t
Hash: 9A930BE1616C538E Ko-Hash: A14C933E7669946D
Black time: 01:00:00
White time: 01:00:00
Leela:
The key thing to note here, other than I haven't gotten around to updating to LZ 0.17, is that two Device IDs are detected: 0 and 1. Show us what you are seeing. Also, have you recently added the 2nd GPU after running Lizzie before? If so, it's not a bad idea to remove the tuning file 'leelaz_opencl_tuning' before doing the above steps. This will cause Leela Zero to regenerate the tuning file by running some benchmarks. That can take a couple of minutes the first time, so be patient. The tuning process should also see your 2nd GPU. If it does not, then we need to look more deeply at OS specifics.
Removing gpu 0 to 3 does not help.
Using OpenCL batch size of 5
Using 10 thread(s).
RNG seed: 5707416252940862580
Leela Zero 0.17 Copyright (C) 2017-2019 Gian-Carlo Pascutto and contributors
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under certain conditions; see the COPYING file for details.
BLAS Core: Sandybridge
Detecting residual layers...v1...256 channels...40 blocks.
Initializing OpenCL (autodetecting precision).
Detected 2 OpenCL platforms.
Platform version: OpenCL 2.0 AMD-APP (2079.4)
Platform profile: FULL_PROFILE
Platform name: AMD Accelerated Parallel Processing
Platform vendor: Advanced Micro Devices, Inc.
Device ID: 0
Device name: Intel(R) Core(TM) i7-3930K CPU @ 3.20GHz
Device type: CPU
Device vendor: GenuineIntel
Device driver: 2079.4 (sse2,avx)
Device speed: 3200 MHz
Device cores: 6 CU
Device score: 520
Platform version: OpenCL 1.2 CUDA 10.0.150
Platform profile: FULL_PROFILE
Platform name: NVIDIA CUDA
Platform vendor: NVIDIA Corporation
Device ID: 1
Device name: GeForce RTX 2080 Ti
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 411.70
Device speed: 1545 MHz
Device cores: 68 CU
Device score: 1112
Device ID: 2
Device name: GeForce RTX 2080 Ti
Device type: GPU
Device vendor: NVIDIA Corporation
Device driver: 411.70
Device speed: 1545 MHz
Device cores: 68 CU
Device score: 1112
Selected platform: NVIDIA CUDA
Selected device: GeForce RTX 2080 Ti
with OpenCL 1.2 capability.
Half precision compute support: No.
Tensor Core support: Yes.
OpenCL: using fp16/half or tensor core compute support.
Started OpenCL SGEMM tuner.
Will try 380 valid configurations.
(1/380) KWG=16 KWI=2 MDIMA=8 MDIMC=8 MWG=64 NDIMB=8 NDIMC=8 NWG=64 SA=1 SB=1 STR
M=0 STRN=0 TCE=0 VWM=4 VWN=2 0.1145 ms (5150.0 GFLOPS)
(2/380) KWG=16 KWI=8 MDIMA=8 MDIMC=8 MWG=64 NDIMB=8 NDIMC=8 NWG=32 SA=1 SB=1 STR
M=0 STRN=0 TCE=0 VWM=4 VWN=4 0.0943 ms (6257.2 GFLOPS)
(5/380) KWG=32 KWI=2 MDIMA=16 MDIMC=16 MWG=128 NDIMB=16 NDIMC=32 NWG=32 SA=0 SB=
0 STRM=0 STRN=0 TCE=1 VWM=2 VWN=2 0.0921 ms (6404.4 GFLOPS)
(10/380) KWG=16 KWI=2 MDIMA=32 MDIMC=32 MWG=64 NDIMB=8 NDIMC=16 NWG=32 SA=0 SB=0
STRM=0 STRN=0 TCE=1 VWM=2 VWN=2 0.0915 ms (6449.3 GFLOPS)
(11/380) KWG=32 KWI=2 MDIMA=8 MDIMC=16 MWG=64 NDIMB=32 NDIMC=32 NWG=32 SA=0 SB=0
STRM=0 STRN=0 TCE=1 VWM=2 VWN=2 0.0823 ms (7165.0 GFLOPS)
(14/380) KWG=16 KWI=2 MDIMA=16 MDIMC=32 MWG=128 NDIMB=16 NDIMC=16 NWG=32 SA=0 SB
=0 STRM=0 STRN=0 TCE=1 VWM=2 VWN=2 0.0697 ms (8461.8 GFLOPS)
(25/380) KWG=32 KWI=2 MDIMA=8 MDIMC=8 MWG=128 NDIMB=32 NDIMC=32 NWG=32 SA=0 SB=0
STRM=0 STRN=0 TCE=1 VWM=2 VWN=2 0.0613 ms (9616.3 GFLOPS)
Wavefront/Warp size: 32
Max workgroup size: 1024
Max workgroup dimensions: 1024 1024 64
Setting max tree size to 3736 MiB and cache size to 415 MiB.
Passes: 0 Black (X) Prisoners: 0
Black (X) to move White (O) Prisoners: 0
a b c d e f g h j k l m n o p q r s t
19 . . . . . . . . . . . . . . . . . . . 19
18 . . . . . . . . . . . . . . . . . . . 18
17 . . . . . . . . . . . . . . . . . . . 17
16 . . . + . . . . . + . . . . . + . . . 16
15 . . . . . . . . . . . . . . . . . . . 15
14 . . . . . . . . . . . . . . . . . . . 14
13 . . . . . . . . . . . . . . . . . . . 13
12 . . . . . . . . . . . . . . . . . . . 12
11 . . . . . . . . . . . . . . . . . . . 11
10 . . . + . . . . . + . . . . . + . . . 10
9 . . . . . . . . . . . . . . . . . . . 9
8 . . . . . . . . . . . . . . . . . . . 8
7 . . . . . . . . . . . . . . . . . . . 7
6 . . . . . . . . . . . . . . . . . . . 6
5 . . . . . . . . . . . . . . . . . . . 5
4 . . . + . . . . . + . . . . . + . . . 4
3 . . . . . . . . . . . . . . . . . . . 3
2 . . . . . . . . . . . . . . . . . . . 2
1 . . . . . . . . . . . . . . . . . . . 1
a b c d e f g h j k l m n o p q r s t
Hash: 9A930BE1616C538E Ko-Hash: A14C933E7669946D
Black time: 01:00:00
White time: 01:00:00
Leela: