|
RJE_SEQ V3.25.3DNA/Protein sequence list module
Copyright © 2005 Richard J. Edwards - See source code for GNU License Notice Imported modules:
See SLiMSuite Blog for further documentation. See FunctionContains Classes and methods for sets of DNA and protein sequences. Sequence Input/Output Options
Sequence Loading/Formatting Options
Sequence Filtering Options
System Info Options
blastpath=PATH : Path to BLAST programs ['']
blast+path=PATH : Path to BLAST+ programs [''] fastapath=PATH : Path to FASTA programs [''] clustalw=PATH : Path to CLUSTALW program ['clustalw' ]clustalo=PATH : Path to CLUSTAL Omega alignment program ['clustalo' ]mafft=PATH : Path to MAFFT alignment program ['mafft' ]muscle=PATH : Path to MUSCLE alignment program ['muscle' ]fsa=PATH : Path to FSA alignment program ['fsa'] pagan=PATH : Path to PAGAN alignment program ['pagan'] win32=T/F : Run in Win32 Mode [False ]alnprog=X : Choice of alignment program to use (clustalw/clustalo/muscle/mafft/fsa/pagan) [clustalo ]Sequence Manipulation/Function Optionspamdis : Makes an all by all PAM distance matrix DisMatrix Options
Special Options
ClassesSeqList(rje.RJE_Object): History Module Version History# 0.0 - Initial Compilation. # 0.1 - Renamed major attributes # 0.2 - New implementation on more generic OO approach. Non-Redundancy Output # 0.3 - No Out Object in Objects # 1.0 - Better Documentation to go with GASP V:1.2 # 1.1 - Better DNA stuff # 1.2 - Added ClustalW align # 1.3 - Separated Sequence object into rje_sequence.py # 1.4 - Add rudimentary gnspacc=T/F # 1.5 - Changed pwAln to use popen() # 1.6 - Fixed nrdic problem in Redundancy check and added user-definition of database list # 1.8 - Added UniProt input and acclist reading # 1.9 - Added 'reformat=scanseq' option but not properly implemented. Added align=T/F. # 2.0 - Major reworking of commandline options and introduction of self.list dictionary (rje v3.0) # 2.1 - Added reformat of UniProt with memsaver=T. # 2.2 - Added GABLAM non-redundancy # 2.3 - Added NR in memsaver mode # 2.4 - Changed some of the log output (REM and redundancy) to look better. # 2.5 - Added nr_qry to makeNR() # 2.6 - Added mysql reformat output: fastacmd, protein_id, acc_num, spec_code, description (delimited) # 2.7 - Added SeqCount() method. Incorporated reading of sequence case. # 2.8 - Added NEXUS output for MrBayes compatibility # 2.9 - Added setupSubDict(masking=True) for use in probabilistic conservation scores # 3.0 - Start of improvements for DNA sequences with dna=T. # 3.1 - Added relative conservation calculations for a whole alignment. # 3.2 - Added output of sequences for making alignments in R. # 3.3 - Added 6RF reformatting for DNA sequences. # 3.4 - Added HAQBAT option # 3.5 - Added extra alignment program, MAFFT # 3.6 - Added stripGap() method. Replaced self.seq with self.seqs() for reading. (Replace with list at some point.) # 3.7 - Added raw option for single sequence load. # 3.8 - Added maxGlob setting for screening out globular proteins. # 3.9 - Added reading of mafft format when not producing standard fasta. # 3.10- Added ntrim=X : Trims of regions >= X proportion N bases (X residues for protein) [0.5] # 3.11- Added mapdna=FILE option to map DNA sequences onto protein alignment # 3.12- Added countspec=T/F : Generate counts of different species and output in log [False] # 3.13- Updated sequence type checking for use with GABLAM 2.10. # 3.14- Added CLUSTAL Omega alignment program ['clustalo'] # 3.15- Added PAGAN alignment program ['pagan'] and (hopefully) fixed minor Windows fastacmd bug. # 3.16- Added BLAST+ path and seqFromBlastDBCmd() # 3.17- Updated to use BLAST+ and rje_blast_V2 # 3.18- Minor BLAST+ bug fixes. Added exceptions to readBLAST failure. # 3.19- Fixed BLAST+ sequence extraction name truncation error. # 3.20- Added run() method for SeqSuite. # 3.21.0 - Added extraction of uniprot IDs for seqin. # 3.22.0 - Added loading sequences from provided sequence files contents directly, bypassing file reading. # 3.22.1 - Fixed problem if seqin is blank, triggering odd Uniprot download. # 3.23.0 - Add speclist to reformat options. # 3.24.0 - Added REST seqout output. # 3.25.0 - 9spec=T/F : Whether to treat 9XXXX species codes as actual species (generally higher taxa) [False] # 3.25.1 - Fixed -long_seqids retrieval bug. # 3.25.2 - Fixed 9spec filtering bug. # 3.25.3 - Added some bug fixes from Norman that were giving him errors. RJE_SEQ REST Output formatsRun with&rest=docs for program documentation and options. A plain text version is accessed with &rest=help .&rest=OUTFMT can be used to retrieve individual parts of the output, matching the tabs in the default( &rest=format ) output. Individual OUTFMT elements can also be parsed from the full (&rest=full ) server output,which is formatted as follows: ###~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~### # OUTFMT: ... contents for OUTFMT section ... Available REST OutputsThere is currently no specific help available on REST output for this program.© 2015 RJ Edwards. Contact: richard.edwards@unsw.edu.au. |