Hi,
I would like to ask, if is it possible to echange data directly between two MICs in offload model? My application belongs to forward-in-time groups of algorithms. Due to data dependencies, I have to transfer data between devices in every time step. I am using up to 5 Intel KNL processors as accelerators (offload over fabric) and in this moment host memory is a point of data exchange. In order to optimize path of data transfer, I thought about transferring data directly between MICs. However I do not know, how to achieve this goal using offloading programming model.
Thanks for help.
Best regards,
Kamil :)