Question about Checkpointing in LAM/MPI

From: tingyu (tz9_at_msstate.edu)
Date: Mon Jul 07 2003 - 08:28:45 PDT


Dear Sir,
         I have a question about your project that has me a little puzzled: 
since the project supports checkpointing of MPI programs, how do you define a 
"parallel MPI program checkpoint"?
         As far as I understand, your implementation has a 
mechanism for clearing out the messages still in transit on the network; then 
all of the processes involved in the communication are suspended, 
and later all of the processes (I am not sure about this part) are migrated to 
other nodes and restarted. Is that correct?
         I checked the paper but still did not get a complete picture.
         Thanks a lot for your help!


Tingyu
July 7, 2003