Checkpointing

From: sichiwai (sichiwai_at_informatik.stud.uni-erlangen.de)
Date: Thu Jul 21 2005 - 04:37:30 PDT

  • Next message: Jeff Squyres: "Problems with BLCR?"
    Hello,
      i have some questions regarding the checkpoint/restart project.
    
    According to FAQ (http://mantis.lbl.gov/blcr/doc/html/FAQ.html#faq7 and 
    http://mantis.lbl.gov/blcr/doc/html/FAQ.html#faq8) the BLCR can not 
    checkpoint TCP/IP and sockets. FAQ8 states that "You must arrange for 
    your program to release such resources before it is checkpointed (see 
    next FAQ)". What happens to those resources not checkpointed. Are they 
    simply ignored or do the cause problems?
    What I especially need to know, what happens to bound sockets, since 
    they need some time to be gracefully released and reacquired, which is 
    certainly not desirable for a performance based application.
    
    Also I'd like to know what happens to a pthread which has not called 
    cr_init(). I one of your examples (pthread_misc) the callback is 
    initialized for every thread. In the other pthread  example, no 
    checkpoint function is called at all.. . What is now the correct way to 
    checkpoint a application with multiple threads?
    
    Regards
      Christian Iwainsky
      Student
    

  • Next message: Jeff Squyres: "Problems with BLCR?"