public class AwkFastaConverter extends SeqModuleImpl
| Modifier and Type | Field and Description |
|---|---|
protected static String |
FUNCTION_CONVERT_454
Name of the bash function used to conver 454 format to BioLockJ friendly Illumina format:
"convert454"
|
protected static String |
FUNCTION_CONVERT_TO_FASTA
Name of the bash function that converts the file format to Fasta: "convertToFastA"
|
protected static String |
FUNCTION_GUNZIP
Name of the bash function used to decompress gzipped files: "decompressGzip"
|
GZIP_EXT, LOG_EXT, PDF_EXT, RETURN, SH_EXT, TAB_DELIM, TSV_EXT, TXT_EXTSCRIPT_BATCH_SIZE, SCRIPT_DEFAULT_HEADER, SCRIPT_NUM_THREADS, SCRIPT_PERMISSIONS, SCRIPT_TIMEOUTMAIN_SCRIPT_PREFIX, OUTPUT_DIR, TEMP_DIR| Constructor and Description |
|---|
AwkFastaConverter() |
| Modifier and Type | Method and Description |
|---|---|
List<List<String>> |
buildScript(List<File> files)
Method returns a nested list of bash script lines to classify samples containing forward reads only.
|
void |
cleanUp()
|
List<File> |
getSeqFiles(Collection<File> files)
Return only sequence files for sample IDs found in the metadata file.
If Config."metadata.required" = "Y", an
error is thrown to list the files that cannot be matched to a metadata row. |
List<String> |
getWorkerScriptFunctions()
This method generates the required bash functions used by the module scripts.
|
protected static String |
unzip(File file,
String targetPath)
Build script line to decompress gzipped sequence file.
|
getInputFiles, getSummary, isValidInputModulebuildScriptForPairedReads, checkDependencies, executeTask, getJobParams, getMainScript, getRuntimeParams, getScriptDir, getScriptErrors, getTimeout, hasScriptscacheInputFiles, compareTo, equals, findModuleInputFiles, getFileCache, getID, getModuleDir, getOutputDir, getPostRequisiteModules, getPreRequisiteModules, getTempDir, init, toString, validateFileNameUniqueclone, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitbuildScriptForPairedReads, getJobParams, getMainScript, getScriptDir, getScriptErrors, getTimeoutcheckDependencies, executeTask, getID, getModuleDir, getOutputDir, getPostRequisiteModules, getPreRequisiteModules, getTempDir, initprotected static final String FUNCTION_CONVERT_454
protected static final String FUNCTION_CONVERT_TO_FASTA
protected static final String FUNCTION_GUNZIP
public List<List<String>> buildScript(List<File> files) throws Exception
ScriptModulebuildScript in interface ScriptModulebuildScript in class ScriptModuleImplfiles - Files in the input directory that contain only forward readsException - if unable to generate script linespublic void cleanUp()
throws Exception
Config."internal.seqType" = "fasta"Config."internal.seqHeaderChar" =
SeqUtil.FASTA_HEADER_DEFAULT_DELIMcleanUp in interface BioModulecleanUp in class BioModuleImplException - if errors occurpublic List<File> getSeqFiles(Collection<File> files) throws Exception
SeqModuleConfig."metadata.required" = "Y", an
error is thrown to list the files that cannot be matched to a metadata row.getSeqFiles in interface SeqModulegetSeqFiles in class SeqModuleImplfiles - Module input filesException - if no input files are foundpublic List<String> getWorkerScriptFunctions() throws Exception
getWorkerScriptFunctions in interface ScriptModulegetWorkerScriptFunctions in class ScriptModuleImplException - if errors occur