AccelBench

Benchmark CPU and GPU throughput in TFLOPS by multiplying matrices with NumPy and PyTorch.

Example run: compute C = A[N,N] x B[N,N] and sum(C), with matrix size N = 10000:

$ python accelbench.py 10000
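
To illustrate how a TFLOPS figure can be derived from a timed matrix product, here is a minimal sketch of the CPU (NumPy) case. It is not the code of accelbench.py: the helper names `matmul_flops`, `to_tflops`, and `bench_numpy` are hypothetical, and the FLOP count assumes the conventional 2*N^3 approximation for an N x N by N x N product, ignoring the much smaller cost of sum(C).

```python
import time


def matmul_flops(n: int) -> int:
    """Approximate FLOPs for an n x n by n x n product (assumed 2 * n^3)."""
    return 2 * n ** 3


def to_tflops(flops: float, seconds: float) -> float:
    """Convert a FLOP count and an elapsed time in seconds to TFLOPS."""
    return flops / seconds / 1e12


def bench_numpy(n: int, repeats: int = 3) -> float:
    """Time C = A x B followed by sum(C) on the CPU; return the best TFLOPS."""
    import numpy as np  # imported here so the helpers above work without NumPy

    rng = np.random.default_rng(0)
    a = rng.random((n, n), dtype=np.float32)
    b = rng.random((n, n), dtype=np.float32)

    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        c = a @ b
        c.sum()  # include the reduction in the timed region
        best = min(best, time.perf_counter() - start)

    return to_tflops(matmul_flops(n), best)
```

Calling `bench_numpy(10000)` would correspond to the example run above. A GPU variant in PyTorch would presumably follow the same pattern with tensors on the device, and would need `torch.cuda.synchronize()` before reading the timer, since CUDA kernels are launched asynchronously.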