@jk good arrays are typically arranged sequentially in standard memory and often large amounts of memory are read at a time
You will likely see slightly faster runtimes if each thread is reading the arrays in memory contiguous order- so it's better to provide 8 arrays to 8 processes, instead of trying to segment the arrays by an indexed space per array