RESCOMP Archives

May 2010

RESCOMP@LISTSERV.MIAMIOH.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Reply To:
Research Computing Support <[log in to unmask]>, Robin <[log in to unmask]>
Date:
Wed, 19 May 2010 14:55:04 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (27 lines)
http://dmtcp.sourceforge.net/, http://cryopid.berlios.de seem to do check-point.
I've not used it myself and definitely not installed on the server.

Seems to be a neat tool.

Robin




On May 19, 2010, at 2:41 PM, Robin, Robin wrote:

> Hi Steve,
> 
> 
>>> 1. What is the max runtime for a batch job on the new cluster, assuming one node is requested?
> 
> qmgr -c 'p s';
> Seems that the maxtime is 480 hours.
> 
>>> 2. What tools are available that might allow me to save the execution state of a batch job and continue it in a subsequent job?
> 
> I see: http://cryopid.berlios.de
> We can manually increase the time above the queue limit (when needed).
> 
> Robin

ATOM RSS1 RSS2