Module:	RJE_IRIDIS
Description:	Parallel processing on IRIDIS
Version:	1.10.2
Last Edit:	29/11/15

Imported modules: rje rje_seq rje_slimlist rje_uniprot rje_zen gopher_V2

See SLiMSuite Blog for further documentation. See rje for general commands.

Function

This module is designed to control and execute parallel processing jobs on the IRIDIS cluster based on the script written by Ivan Wolton. Initially, it will call other programs but, in time, it is envisaged that other programs will make use of this module and have parallelisation built-in.

In SeqBySeq mode, the program assumes that seqin=FILE and basefile=X are given and irun states the program to be run. Seqin will then be worked through in turn and each sequence farmed out to the irun program. Outputs given by OutList are then compiled, as is the Log, into the correct basefile=X given. In the case of *.csv and *.tdt files, the header row is copied for the first file and then excluded for all subsequent files. For all other files extensions, the whole output is copied.

Commandline

STANDARD RUN OPTIONS

irun=X iini=FILE pypath=PATH rjepy=T/F subsleep=X subjobs=LIST iolimit=X memfree=X test=T/F keepfree=X rsh=T/F

SEQBYSEQ OPTIONS

seqbyseq=T/F seqin=FILE basefile=X outlist=LIST pickup=X

SPECIAL iRUN OPTIONS

runid=X resfile=FILE sortrun=T/F loadbalance=T/F : Exectute a special iRun analysis on Iridis (gopher/slimfinder/qslimfinder/slimsearch/unifake) []
: Ini file to pass to the called program [None]
: Path to python modules ['/home/re1u06/Serpentry/']
: Whether program is an RJE *.py script (adds log processing) [True]
: Sleep time (seconds) between cycles of subbing out jobs to hosts [1]
: List of subjobs to farm out to IRIDIS cluster []
: Limit of number of IOErrors before termination [50]
: Min. proportion of node memory to be free before spawning job [0.0]
: Whether to produce extra output in "test" mode [False]
: Number of processors to keep free on head node [1]
: Whether to use rsh to run jobs on other nodes [True]
: Activate seqbyseq mode - assumes basefile=X option used for output [False]
: Input sequence file to farm out [None]
: Base for output files - compiled from individual run results [None]
: List of extensions of outputs to add to basefile for output (basefile.*) []
: Header to extract from OutList file and used to populate AccNum to skip []
: Text identifier for iX run [None]
: Main output file for iX run [islimfinder.csv]
: Whether to sort input files by size and run big -> small to avoid hang at end [True]
: Whether to split SortRun jobs equally between large & small to avoid memory issues [True]

History Module Version History

    # 0.0 - Initial Compilation.
    # 1.0 - Added additional functions to call other programs
    # 1.1 - Added UniFake.
    # 1.2 - Added generic seqbyseq option
    # 1.3 - Modified for IRIDIS3.
    # 1.4 - Added catching of IOErrors.
    # 1.5 - Added QSLiMFinder iRun
    # 1.6 - Modified iSLiMFinder job processing to try to catch errors better. (Not sure what is happening.)
    # 1.7 - Added memory checking before a run is spawned.
    # 1.8 - Added load balance option for SortRun: splits jobs equally between large and small input (& ends in middle).
    # 1.9 - Added scanning of legacy folder - moving GOPHER_V2!
    # 1.10- Modified freemem setting to run on Katana. Made rsh optional. Removed defunct IRIDIS3 option.
    # 1.10.1 - Attempted to fix SLiMFarmer batch run problem. (Should not be setting irun=batch!)
    # 1.10.2 - Trying to clean up unknown 30s pause. Might be freemem issue?
    # 1.10.3 - Fix issues with batch farming of subjobs splitting on commas.

SLiMSuite REST Server