Complete Biological Systems Prediction Workflow#
The pansystems command provides a comprehensive, all-in-one workflow to detect biological systems in multiple
pangenomes. This command combines gene family annotation, system detection,
and results visualization into a single streamlined process.
Pansystems Workflow#
The pansystems process integrates three main steps:
Gene Family Annotation: Uses either pre-computed metadata tables or HMM database searches. See annotation documentation for detailed information on annotation modes and options.
System Detection: Matches annotated families against functional models using genomic context analysis. See detection documentation for details on the detection algorithm and sensitivity modes.
Results Writing: Generates comprehensive outputs including projection, association, and partition analysis. See write_systems documentation for complete information on output options and file formats.
Command Line Usage#
HMM-based Annotation and Detection#
panorama pansystems \
--pangenomes pangenomes.tsv \
--source defense_finder \
--hmm hmms.tsv \
--models models.tsv \
--mode fast \
--k_best_hit 3 \
--jaccard 0.8 \
--projection \
--association RGPs spots \
--partition \
--output results/ \
--threads 16
Table-based Annotation and Detection#
panorama pansystems \
--pangenomes pangenomes.tsv \
--source KEGG \
--table annotations.tsv \
--models models.tsv \
--jaccard 0.8 \
--projection \
--output results/ \
--threads 8
Key Arguments#
Required Arguments#
Shortcut |
Argument |
Description |
|---|---|---|
|
|
TSV file listing .h5 pangenome files |
|
|
Name of the annotation source (e.g., |
|
|
Output directory for results |
Annotation Mode (Mutually Exclusive)#
Must provide either --table or --hmm:
Argument |
Description |
|---|---|
|
TSV linking pangenome names to pre-computed annotation files |
|
HMM metadata TSV file (from |
For detailed annotation options, see the annotation command documentation.
System Detection#
Argument |
Type |
Default |
Description |
|---|---|---|---|
|
Path |
- |
Required. Path to models list file |
|
float |
0.8 |
Minimum Jaccard similarity for context graph edges |
|
int |
3 |
Detection sensitivity (1-3, higher = more precise) |
For more detection parameters, see the systems command documentation.
Output Options#
Argument |
Description |
|---|---|
|
Project systems onto individual organisms |
|
Write partition heatmap showing system distribution |
|
Associate systems with pangenome elements (RGPs, spots, modules) |
Note
For complete output options and file formats, see:
Performance Options#
Argument |
Type |
Default |
Description |
|---|---|---|---|
|
int |
1 |
Number of parallel threads |
|
Path |
auto |
Temporary directory |
|
flag |
False |
Keep temporary files |
Output Structure#
The pansystems command creates the same organized output structure as the individual commands. See the write_systems output documentation for complete details on:
System summary files
Per-organism projections
Association matrices and visualizations
Partition heatmaps
Interactive plots
When to Use Pansystems vs Individual Commands#
Use pansystems when:
Running the complete workflow from annotation to visualization
Processing multiple pangenomes with the same annotation source
Need streamlined parameter validation and optimized data flow
Use individual commands when:
Only need specific analysis steps
Working with pre-annotated pangenomes
Need fine-grained control over intermediate outputs
Experimenting with different parameters for each step
Integration Notes#
The pansystems command ensures optimal integration between analysis steps:
Validates parameter compatibility across all workflow components
Eliminates needs for intermediate file management
Provides unified progress reporting and error handling
Maintains consistent source naming throughout the pipeline