Quantcast
Channel: Intel® Software - Intel® Many Integrated Core Architecture (Intel MIC Architecture)
Viewing all articles
Browse latest Browse all 1789

Problem running MPI on two nodes: host and Xeon Phi

$
0
0

Hello,

I am having trouble running a simple hello world test program on two nodes. I was hoping someone would be able to help.

OS: CentOS 6

Here is the error:

[phi@localhost ~]$ mpirun -n 2 -host mic0 -iface mic0 ./hello.MIC : -n 2 -host localhost ./hello.XEON

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

CMA: unable to get RDMA device list

librdmacm: couldn't read ABI version.

librdmacm: assuming: 4

Hello World from rank 2 running on localhost.localdomain!

Hello World from rank 3 running on localhost.localdomain!

Hello World from rank 0 running on mic0.local!

MPI World size = 4 processes

Hello World from rank 1 running on mic0.local!



It still runs the code, but it takes a long time. and the more processes i use, the worse it becomes (obviously).

When i run the code on either the host or mic0 alone, everything seems to work fine.

if anyone has any idea on how to fix this, please help me out.

Thanks,

Charlie


Viewing all articles
Browse latest Browse all 1789

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>