[CPMD-list] help on parallel computing
weizhuang
weiz at mail.rochester.edu
Wed May 15 19:13:11 CEST 2002
Hi, friends:
I am running CPMD on a Linux cluster, with the restart file set to be
saved every 5 steps. However, every time the program is about to save
the restart file, the job crashes with the output below. Could
anybody give me a clue about what is wrong here, and any suggestions for how
to solve it? Thanks a lot.
wei zhuang
-----------------
MPI_Recv: process in local group is dead (rank 4, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 1, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 8, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 2, MPI_COMM_WORLD)
Rank (1, MPI_COMM_WORLD): Call stack within LAM:
Rank (1, MPI_COMM_WORLD): - MPI_Recv()
Rank (1, MPI_COMM_WORLD): - MPI_Barrier()
Rank (1, MPI_COMM_WORLD): - main()
Rank (2, MPI_COMM_WORLD): Call stack within LAM:
Rank (2, MPI_COMM_WORLD): - MPI_Recv()
Rank (2, MPI_COMM_WORLD): - MPI_Barrier()
Rank (2, MPI_COMM_WORLD): - main()
MPI_Recv: process in local group is dead (rank 5, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 9, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 10, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 3, MPI_COMM_WORLD)
Rank (4, MPI_COMM_WORLD): Call stack within LAM:
Rank (4, MPI_COMM_WORLD): - MPI_Recv()
Rank (4, MPI_COMM_WORLD): - MPI_Barrier()
Rank (4, MPI_COMM_WORLD): - main()
Rank (3, MPI_COMM_WORLD): Call stack within LAM:
Rank (3, MPI_COMM_WORLD): - MPI_Recv()
Rank (3, MPI_COMM_WORLD): - MPI_Barrier()
Rank (3, MPI_COMM_WORLD): - main()
Rank (5, MPI_COMM_WORLD): Call stack within LAM:
Rank (5, MPI_COMM_WORLD): - MPI_Recv()
Rank (5, MPI_COMM_WORLD): - MPI_Barrier()
Rank (5, MPI_COMM_WORLD): - main()
MPI_Recv: process in local group is dead (rank 6, MPI_COMM_WORLD)
Rank (6, MPI_COMM_WORLD): Call stack within LAM:
Rank (6, MPI_COMM_WORLD): - MPI_Recv()
Rank (6, MPI_COMM_WORLD): - MPI_Barrier()
Rank (6, MPI_COMM_WORLD): - main()
MPI_Recv: process in local group is dead (rank 11, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 12, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 13, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 14, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 7, MPI_COMM_WORLD)
Rank (9, MPI_COMM_WORLD): Call stack within LAM:
Rank (9, MPI_COMM_WORLD): - MPI_Recv()
Rank (9, MPI_COMM_WORLD): - MPI_Barrier()
Rank (9, MPI_COMM_WORLD): - main()
Rank (7, MPI_COMM_WORLD): Call stack within LAM:
Rank (7, MPI_COMM_WORLD): - MPI_Recv()
Rank (7, MPI_COMM_WORLD): - MPI_Barrier()
Rank (7, MPI_COMM_WORLD): - main()
Rank (8, MPI_COMM_WORLD): Call stack within LAM:
Rank (8, MPI_COMM_WORLD): - MPI_Recv()
Rank (8, MPI_COMM_WORLD): - MPI_Barrier()
Rank (8, MPI_COMM_WORLD): - main()
Rank (10, MPI_COMM_WORLD): Call stack within LAM:
Rank (10, MPI_COMM_WORLD): - MPI_Recv()
Rank (10, MPI_COMM_WORLD): - MPI_Barrier()
Rank (10, MPI_COMM_WORLD): - main()
Rank (12, MPI_COMM_WORLD): Call stack within LAM:
Rank (12, MPI_COMM_WORLD): - MPI_Recv()
Rank (12, MPI_COMM_WORLD): - MPI_Barrier()
Rank (12, MPI_COMM_WORLD): - main()
Rank (11, MPI_COMM_WORLD): Call stack within LAM:
Rank (11, MPI_COMM_WORLD): - MPI_Recv()
Rank (11, MPI_COMM_WORLD): - MPI_Barrier()
Rank (11, MPI_COMM_WORLD): - main()
Rank (14, MPI_COMM_WORLD): Call stack within LAM:
Rank (14, MPI_COMM_WORLD): - MPI_Recv()
Rank (14, MPI_COMM_WORLD): - MPI_Barrier()
Rank (14, MPI_COMM_WORLD): - main()
Rank (13, MPI_COMM_WORLD): Call stack within LAM:
Rank (13, MPI_COMM_WORLD): - MPI_Recv()
Rank (13, MPI_COMM_WORLD): - MPI_Barrier()
Rank (13, MPI_COMM_WORLD): - main()
MPI_Recv: process in local group is dead (rank 15, MPI_COMM_WORLD)
Rank (15, MPI_COMM_WORLD): Call stack within LAM:
Rank (15, MPI_COMM_WORLD): - MPI_Recv()
Rank (15, MPI_COMM_WORLD): - MPI_Barrier()
Rank (15, MPI_COMM_WORLD): - main()
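
A dump like the one above is easier to read once it is reduced to a list of ranks. The following is a minimal sketch, not part of CPMD or LAM/MPI: the helper name summarize_lam_log and its nprocs parameter are hypothetical. It assumes that the rank printed in each message is the rank of the process that reported the error, and that ranks are numbered from 0.

```python
import re

# Hypothetical helper for reading LAM/MPI error dumps; not part of CPMD or LAM.
# Assumption: the rank in each message identifies the process that printed it.
DEAD_RE = re.compile(
    r"^MPI_Recv: process in local group is dead \(rank (\d+), MPI_COMM_WORLD\)"
)
STACK_RE = re.compile(
    r"^Rank \((\d+), MPI_COMM_WORLD\): Call stack within LAM:"
)


def summarize_lam_log(text, nprocs=None):
    """Return which ranks reported an error, which printed a call stack,
    and which ranks in 0..nprocs-1 never appear in the log at all."""
    reporting = set()
    with_stack = set()
    for raw in text.splitlines():
        line = raw.strip()
        m = DEAD_RE.match(line)
        if m:
            reporting.add(int(m.group(1)))
            continue
        m = STACK_RE.match(line)
        if m:
            with_stack.add(int(m.group(1)))

    seen = reporting | with_stack
    if nprocs is None:
        # Assumption: the highest rank seen is the last rank of the job.
        nprocs = max(seen) + 1 if seen else 0
    silent = sorted(set(range(nprocs)) - seen)
    return {
        "reporting": sorted(reporting),
        "with_stack": sorted(with_stack),
        "silent": silent,
    }


if __name__ == "__main__":
    import sys

    # Usage: python summarize_lam_log.py < job_output.txt
    print(summarize_lam_log(sys.stdin.read()))
```

Applied to the output above, with the process count inferred from the highest rank seen, ranks 1 through 15 all appear and rank 0 is the only one that never does. That is consistent with rank 0 being the process that died while the others waited in MPI_Barrier, but the log alone does not prove it.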