Skip to content

Help

Command reference for the TaxonoPy CLI.

taxonopy --help

usage: taxonopy [-h] [--cache-dir CACHE_DIR] [--cache-input CACHE_INPUT]
                [--show-cache-path] [--cache-stats] [--clear-cache]
                [--show-config] [--version]
                {resolve,trace,common-names} ...

TaxonoPy: Resolve taxonomic names using GNVerifier and trace data provenance.

positional arguments:
  {resolve,trace,common-names}
    resolve             Run the taxonomic resolution workflow
    trace               Trace data provenance of TaxonoPy objects
    common-names        Merge vernacular names (post-process) into resolved
                        outputs

options:
  -h, --help            show this help message and exit
  --cache-dir CACHE_DIR
                        Directory for TaxonoPy cache (can also be set with
                        TAXONOPY_CACHE_DIR environment variable) (default:
                        None)
  --cache-input CACHE_INPUT
                        Input dataset path to compute cache stats for when no
                        command is provided (default: None)
  --show-cache-path     Display the current cache directory path and exit
                        (default: False)
  --cache-stats         Display statistics about the cache and exit (default:
                        False)
  --clear-cache         Clear the TaxonoPy object cache. May be used in
                        isolation. (default: False)
  --show-config         Show current configuration and exit (default: False)
  --version             Show version number and exit

taxonopy resolve --help

usage: taxonopy resolve [-h] -i INPUT -o OUTPUT_DIR
                        [--output-format {csv,parquet}]
                        [--log-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}]
                        [--log-file LOG_FILE] [--force-input] [--full-rerun]
                        [--batch-size BATCH_SIZE] [--all-matches]
                        [--capitalize] [--fuzzy-uninomial] [--fuzzy-relaxed]
                        [--species-group] [--refresh-cache] [--cache-stats]

options:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        Path to input Parquet or CSV file/directory
  -o OUTPUT_DIR, --output-dir OUTPUT_DIR
                        Directory to save resolved and unsolved output files
  --output-format {csv,parquet}
                        Output file format
  --log-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}
                        Set logging level
  --log-file LOG_FILE   Optional file to write logs to
  --force-input         Force use of input metadata without resolution
  --full-rerun          Replace existing cache/output if detected for this
                        input

GNVerifier Settings:
  --batch-size BATCH_SIZE
                        Max number of name queries per GNVerifier
                        API/subprocess call
  --all-matches         Return all matches instead of just the best one
  --capitalize          Capitalize the first letter of each name
  --fuzzy-uninomial     Enable fuzzy matching for uninomial names
  --fuzzy-relaxed       Relax fuzzy matching criteria
  --species-group       Enable group species matching

Cache Management:
  --refresh-cache       Force refresh of cached objects (input parsing,
                        grouping) before running.
  --cache-stats         Display cache statistics for this input and exit.

taxonopy trace --help

usage: taxonopy trace [-h] [--cache-stats] {entry} ...

positional arguments:
  {entry}
    entry        Trace an individual taxonomic entry by UUID

options:
  -h, --help     show this help message and exit
  --cache-stats  Display cache statistics for the provided input and exit.

taxonopy trace entry --help

usage: taxonopy trace entry [-h] --uuid UUID --from-input FROM_INPUT
                            [--format {json,text}] [--verbose]

options:
  -h, --help            show this help message and exit
  --uuid UUID           UUID of the taxonomic entry
  --from-input FROM_INPUT
                        Path to the original input dataset
  --format {json,text}  Output format
  --verbose             Show full details including all UUIDs in groups

taxonopy common-names --help

usage: taxonopy common-names [-h] --resolved-dir ANNOTATION_DIR --output-dir
                             OUTPUT_DIR

options:
  -h, --help            show this help message and exit
  --resolved-dir ANNOTATION_DIR
                        Directory containing your *.resolved.parquet files
  --output-dir OUTPUT_DIR
                        Directory to write annotated .parquet files