Excellent explanation, thanks!
For files on different physical devices, are you sure multithreading will be faster? My observation is that it can be even slower because of the bandwidth limit.
If those disks belong to different computers, then sure, it will be faster, but we don't need to discuss that scenario.
We have a cluster with more than 2000 compute nodes and one giant disk server. If all 2000 nodes read/write at the same time, you may just have to kill the job...
I just got to my office, so here are the actual numbers from my test:
File size 1098 x 1168 x 1001 x 335 (floating-point values); each read/write handles one block of 1098 x 1168 points.
If only one process reads/writes, it takes 15333 seconds (4 h 15 min 33 s), speed = 107 MB/s
If two processes read/write two files of the same size:
process 1 takes 52300 seconds (14 h 31 min 40 s), speed = 31.37 MB/s
process 2 takes 55254 seconds (15 h 20 min 54 s), speed = 29.69 MB/s
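
For anyone who wants to check the arithmetic, here is a small Python sketch that recomputes the speeds from the dimensions and times above. It rests on two assumptions of mine that the test description does not state: each value is a 4-byte float, and "MB" means 2**20 bytes. The function names are just illustrative.

```python
# Recompute throughput from the file dimensions and elapsed times in the post.
BYTES_PER_FLOAT = 4   # assumption: single-precision (4-byte) floats
MIB = 1024 ** 2       # assumption: "MB" in the post means 2**20 bytes


def file_size_bytes(dims, bytes_per_value=BYTES_PER_FLOAT):
    """Total file size in bytes for an array with the given dimensions."""
    count = 1
    for d in dims:
        count *= d
    return count * bytes_per_value


def throughput(size_bytes, seconds):
    """Average transfer speed in units of 2**20 bytes per second."""
    return size_bytes / MIB / seconds


def hms(seconds):
    """Format a whole number of seconds as hours, minutes, seconds."""
    hours, rem = divmod(seconds, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours} h {minutes} min {secs} s"


DIMS = (1098, 1168, 1001, 335)
size = file_size_bytes(DIMS)

single = throughput(size, 15333)                         # one process alone
two = [throughput(size, t) for t in (52300, 55254)]      # two processes

if __name__ == "__main__":
    print(f"file size: {size} bytes")
    print(f"one process:  {hms(15333)}, {single:.2f} MB/s")
    for i, (t, speed) in enumerate(zip((52300, 55254), two), start=1):
        print(f"process {i}:    {hms(t)}, {speed:.2f} MB/s")
    print(f"two processes combined: {sum(two):.2f} MB/s")
```

Under those assumptions the computed speeds match the figures above to within rounding. Adding the two per-process speeds gives roughly 61 MB/s, which is still well below the single-process speed, although that sum is only meaningful if the two processes were running concurrently for nearly their whole duration.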