public class AwkFastaConverter extends SeqModuleImpl
Modifier and Type | Field and Description |
---|---|
protected static String |
FUNCTION_CONVERT_454
Name of the bash function used to conver 454 format to BioLockJ friendly Illumina format:
"convert454"
|
protected static String |
FUNCTION_CONVERT_TO_FASTA
Name of the bash function that converts the file format to Fasta: "convertToFastA"
|
protected static String |
FUNCTION_GUNZIP
Name of the bash function used to decompress gzipped files: "decompressGzip"
|
GZIP_EXT, LOG_EXT, PDF_EXT, RETURN, SH_EXT, TAB_DELIM, TSV_EXT, TXT_EXT
SCRIPT_BATCH_SIZE, SCRIPT_DEFAULT_HEADER, SCRIPT_NUM_THREADS, SCRIPT_PERMISSIONS, SCRIPT_TIMEOUT
MAIN_SCRIPT_PREFIX, OUTPUT_DIR, TEMP_DIR
Constructor and Description |
---|
AwkFastaConverter() |
Modifier and Type | Method and Description |
---|---|
List<List<String>> |
buildScript(List<File> files)
Method returns a nested list of bash script lines to classify samples containing forward reads only.
|
void |
cleanUp()
|
List<File> |
getSeqFiles(Collection<File> files)
Return only sequence files for sample IDs found in the metadata file.
If Config ."metadata.required" = "Y", an
error is thrown to list the files that cannot be matched to a metadata row. |
List<String> |
getWorkerScriptFunctions()
This method generates the required bash functions used by the module scripts.
|
protected static String |
unzip(File file,
String targetPath)
Build script line to decompress gzipped sequence file.
|
getInputFiles, getSummary, isValidInputModule
buildScriptForPairedReads, checkDependencies, executeTask, getJobParams, getMainScript, getRuntimeParams, getScriptDir, getScriptErrors, getTimeout, hasScripts
cacheInputFiles, compareTo, equals, findModuleInputFiles, getFileCache, getID, getModuleDir, getOutputDir, getPostRequisiteModules, getPreRequisiteModules, getTempDir, init, toString, validateFileNameUnique
clone, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
buildScriptForPairedReads, getJobParams, getMainScript, getScriptDir, getScriptErrors, getTimeout
checkDependencies, executeTask, getID, getModuleDir, getOutputDir, getPostRequisiteModules, getPreRequisiteModules, getTempDir, init
protected static final String FUNCTION_CONVERT_454
protected static final String FUNCTION_CONVERT_TO_FASTA
protected static final String FUNCTION_GUNZIP
public List<List<String>> buildScript(List<File> files) throws Exception
ScriptModule
buildScript
in interface ScriptModule
buildScript
in class ScriptModuleImpl
files
- Files in the input directory that contain only forward readsException
- if unable to generate script linespublic void cleanUp() throws Exception
Config
."internal.seqType" = "fasta"Config
."internal.seqHeaderChar" =
SeqUtil.FASTA_HEADER_DEFAULT_DELIM
cleanUp
in interface BioModule
cleanUp
in class BioModuleImpl
Exception
- if errors occurpublic List<File> getSeqFiles(Collection<File> files) throws Exception
SeqModule
Config
."metadata.required" = "Y", an
error is thrown to list the files that cannot be matched to a metadata row.getSeqFiles
in interface SeqModule
getSeqFiles
in class SeqModuleImpl
files
- Module input filesException
- if no input files are foundpublic List<String> getWorkerScriptFunctions() throws Exception
getWorkerScriptFunctions
in interface ScriptModule
getWorkerScriptFunctions
in class ScriptModuleImpl
Exception
- if errors occur