Annotate¶
Summary¶
Run combined RNA + protein annotation on nucleotide viral sequences.
Description¶
This command orchestrates annotate-rna and annotate-prot into a single
workflow and writes results into rna_annotation/ and
protein_annotation/ subdirectories under the selected output path.
Use `--skip-steps` to disable specific stages and `--override-parameters`
to forward JSON overrides to sub-tools.
Usage¶
Options¶
-i,--input: Input nucleotide sequence file (fasta, fna, fa, or faa) (type:PATH; required; default:Sentinel.UNSET)-o,--output: Output file location. (type:TEXT; default:rolypoly_annotation)-t,--threads: Number of threads (type:INTEGER; default:1)-g,--log-file: Path to log file (type:TEXT; default:/clusterfs/jgi/scratch/science/metagen/neri/code/rolypoly/annotate_logfile.txt)-M,--memory: Memory in GB. Example: -M 8gb (type:TEXT; default:6gb)--override-parameters: JSON-like string of parameters to override. Example: --override-parameters '{"RNAfold": {"temperature": 37}, "ORFfinder": {"minimum_length": 150}}' (type:TEXT; default:{})--skip-steps: Comma-separated list of steps to skip. Example: --skip-steps RNA_annotation,protein_annotation or --skip-steps IRESfinder,RNAMotif or --skip-steps ORFfinder,hmmsearch (type:TEXT; default: ``)--secondary-structure-tool: Tool for secondary structure prediction (type:CHOICE; default:LinearFold)--ires-tool: Tool for IRES identification (type:CHOICE; default:IRESfinder)--trna-tool: Tool for tRNA identification (type:CHOICE; default:tRNAscan-SE)--rnamotif-tool: Tool for RNA sequence motif identification (type:CHOICE; default:lightmotif)--gene-prediction-tool: Tool for gene prediction (type:CHOICE; default:pyrodigal)--domain-db: Database for domain detection (NOTE: currently packaged with rolypoly data: Pfam, genomad, RVMT) (type:CHOICE; default:Pfam)--custom-domain-db: Path to a custom domain database in HMM format (for use with --domain-db custom) (type:TEXT; default: ``)--min-orf-length: Minimum ORF length for gene prediction (type:INTEGER; default:30)--search-tool: Tool/command for protein domain detection. hmmer commands are via pyhmmer bindings. (type:CHOICE; default:hmmsearch)