Starting up CUDA context...
Allocating and initializing host arrays...

Allocating and initializing CUDA arrays...

Running GPU bitonic sort (1 identical iterations)...

Testing array length 64 (16 arrays per batch)...
Average time: 1129.927002 ms


Validating the results...
...reading back GPU results
...inspecting keys array: OK
...inspecting keys and values array: OK
...stability property: NOT stable

Testing array length 128 (8 arrays per batch)...
Average time: 1049.439941 ms


Validating the results...
...reading back GPU results
...inspecting keys array: OK
...inspecting keys and values array: OK
...stability property: stable!

Testing array length 256 (4 arrays per batch)...
Average time: 975.627991 ms


Validating the results...
...reading back GPU results
...inspecting keys array: OK
...inspecting keys and values array: OK
...stability property: NOT stable

Testing array length 512 (2 arrays per batch)...
Average time: 1089.600952 ms


Validating the results...
...reading back GPU results
...inspecting keys array: OK
...inspecting keys and values array: OK
...stability property: NOT stable

Testing array length 1024 (1 arrays per batch)...
Average time: 1109.443970 ms

sortingNetworks-bitonic, Throughput = 0.0009 MElements/s, Time = 1.10944 s, Size = 1024 elements, NumDevsUsed = 1, Workgroup = 512

Validating the results...
...reading back GPU results
...inspecting keys array: OK
...inspecting keys and values array: OK
...stability property: NOT stable

Shutting down...
