๐ฟ Overview
BIOTICA is an open-source, multi-dimensional framework for the integrated assessment, predictive modeling, and cosmological contextualization of ecosystem resilience. The system integrates nine analytical parameters into a single operational composite โ the Integrated Biotic Resilience (IBR) index โ validated across 3,412 ecosystem plots from 22 biome types spanning 6 continents.
BIOTICA has been validated across 3,412 plots achieving 92.6% classification accuracy and 89.4% agreement with expert field surveys. The framework is actively used for REDD+ carbon accounting audits and ecosystem tipping-point early warning.
Key Capabilities
- High Accuracy: 92.6% IBR classification across 22-biome leave-one-biome cross-validation
- AI-Assisted Classification: 89.4% agreement with expert field surveys on 682 held-out plots
- Early Warning: 8โ14 month tipping point lead time via critical slowing-down detection
- Carbon Accounting: ยฑ31 Mg Cยทhaโปยน precision vs. ยฑ180 Mg Cยทhaโปยน for allometric estimates
- Phenological Precision: ยฑ6.2 days across 180 eddy covariance flux tower sites
- Legacy Audit: 14.7% error rate flagged across 2,100 REDD+ database units
System Statistics
IBR Classification Accuracy
22-biome leave-one-biome CV
AI Agreement
vs. expert field surveys
Ecosystem Plots
Across 22 biomes, 6 continents
Tipping Point Warning
Before observed state transitions
Quick Navigation
๐ป Installation
System Requirements
- Python: 3.11 or higher
- R: 4.3 or higher (for Bayesian weight estimation and tipping-point modules)
- RAM: 8 GB minimum (16 GB recommended for MI-CNN classifier)
- Storage: ~5 GB for reference database and biome thresholds
- CUDA: 11.8+ optional (GPU acceleration for AI classifier)
- GDAL: 3.6+ (for geospatial raster processing)
Install from GitLab
Install from PyPI
Pull Reference Data (DVC)
The complete 48 GB dataset requires institutional storage credentials for metagenomics (EBI MGnify) and genomics (NCBI SRA) archives. The 2 GB reference package is fully open and sufficient for most use cases.
๐ Quick Start
Compute a Single Parameter
Assemble the IBR Index
Classify and Generate Report
1 ยท Compute the MDI for a Plot
2 ยท Compute the IBR Composite Index
3 ยท Run the Full Pipeline
4 ยท Tipping Point Detection
๐ฌ The Nine IBR Parameters
Each parameter captures a distinct and statistically independent dimension of ecosystem identity and resilience. Weights were determined through a three-stage Bayesian principal component analysis across all 3,412 validation plots.
| # | Symbol | Parameter | Weight | Domain | Key Instrument |
|---|---|---|---|---|---|
| 1 | VCA | Vegetative Carbon Absorption | 20% | Remote Sensing ยท Carbon | DESIS/PRISMA hyperspectral |
| 2 | MDI | Microbial Diversity Index | 15% | Soil Metagenomics | Illumina NovaSeq shotgun |
| 3 | PTS | Phenological Time Shift | 12% | Climate Ecology | PhenoCam ยท Landsat archive |
| 4 | HFI | Hydrological Flux Index | 11% | Ecohydrology | Eddy covariance ยท MODIS ET |
| 5 | BNC | Biogeochemical Nutrient Cycle | 10% | Soil Science | ICP-MS ยท CNS elemental analysis |
| 6 | SGH | Species Genetic Heterogeneity | 9% | Population Genomics | RADseq ยท whole-genome reseq |
| 7 | AES | Anthropogenic Encroachment Score | 8% | Land Use Science | ESA World Cover ยท FRAGSTATS |
| 8 | TMI | Trophic Metadata Integration | 8% | Food Web Ecology | Metabarcoding ยท camera traps |
| 9 | RRC | Regenerative Recovery Capacity | 7% | Disturbance Ecology | Chronosequence sampling |
Composite Formula
All parameters are normalized to [0,1] relative to biome-specific reference distributions, not global minima/maxima. This ensures that a tropical rainforest and an arctic tundra are evaluated against their own reference states, not against each other. See biotica/ibr/normalization.py and data/reference/biome_thresholds.csv.
๐ IBR Classification Levels
The IBR score is mapped to five operational classification levels that guide conservation prioritization, restoration planning, and carbon accounting decisions.
> 0.88
0.75 โ 0.88
0.60 โ 0.75
0.45 โ 0.60
< 0.45
| Class | IBR Range | Ecological State | Recommended Action |
|---|---|---|---|
| PRISTINE | > 0.88 | Reference state, full ecological function, maximum carbon stock | Passive protection, long-term monitoring |
| FUNCTIONAL | 0.75 โ 0.88 | Near-reference, minor departures, self-regulating resilience intact | Standard monitoring, adaptive management |
| IMPAIRED | 0.60 โ 0.75 | Measurable degradation, recovery feasible under active management | Multi-parameter restoration intervention |
| DEGRADED | 0.45 โ 0.60 | Significant loss, high tipping point risk | Immediate intensive intervention |
| COLLAPSED | < 0.45 | Alternative stable state, standard recovery trajectories not applicable | Full consortium characterization |
๐ก API Reference
biotica.parameters โ Individual Parameter Modules
All nine parameter classes share a common interface:
biotica.ibr โ Composite Index Engine
biotica.ai โ MI-CNN Classifier
biotica.statistics โ Statistical Framework
โ๏ธ Snakemake Workflows
All analyses are reproducible via Snakemake. The master pipeline automatically determines which rules to run based on available inputs.
Run Full Validation Pipeline
Available Rules
| Rule File | Description | Inputs | Outputs |
|---|---|---|---|
| preprocessing.smk | Spectral, flux, metagenomic preprocessing | data/raw/ | data/processed/ |
| parameter_computation.smk | Compute all 9 parameter scores per plot | data/processed/ | data/processed/parameters/ |
| ibr_aggregation.smk | Normalize and aggregate IBR index | data/processed/parameters/ | data/processed/ibr_scores/ |
| ai_classification.smk | Train and evaluate MI-CNN classifier | data/processed/ | models/ + results/ |
| validation.smk | Cross-validation, sensitivity, uncertainty | data/processed/ibr_scores/ | results/validation/ |
Use scripts/batch_process.sh for SLURM cluster submission. Configure cores and memory in workflows/config/cluster.yaml. The full pipeline requires approximately 72 CPU-hours on a 32-core node.
๐๏ธ Data & Formats
Supported Input Formats
| Parameter | Format | Source | Typical Size |
|---|---|---|---|
| VCA (spectral) | ENVI BSQ / GeoTIFF NetCDF | DESIS ยท PRISMA ยท Sentinel-2 | 0.5โ4 GB per scene |
| MDI (metagenomics) | TSV gene abundance table | MGnify ยท EBI | 10โ200 MB per sample |
| PTS (phenocam) | CSV time-series GCC | PhenoCam Network | ~1 MB per site/year |
| HFI (flux tower) | NetCDF4 (FLUXNET 2015) | FLUXNET ยท ICOS | 5โ50 MB per site |
| BNC (soil chemistry) | CSV elemental analysis | Lab measurements | <1 MB per plot |
| SGH (genomics) | VCF 4.2 (bgzipped + tabix) | NCBI SRA | 1โ50 GB per population |
| AES (landscape) | GeoTIFF raster + SHP | ESA World Cover ยท GFW | 50โ500 MB per region |
| TMI (food web) | CSV adjacency + metabarcoding | Field surveys + eDNA | <5 MB per plot |
| RRC (chronosequence) | CSV time-series biomass | Field surveys | <1 MB per site |
Output Formats
๐ค MI-CNN AI Classifier
The Multi-Input Convolutional Neural Network (MI-CNN) processes four parallel data streams and achieves 89.4% agreement with expert field surveys on 682 held-out plots.
Architecture Overview
| Stream | Input | Architecture | Output |
|---|---|---|---|
| Spectral | Hyperspectral cube (426 bands) | 1D-CNN ร 3 layers | 128-dim embedding |
| Temporal | VI time-series (24 months) | 1D-CNN ร 2 layers | 64-dim embedding |
| Climate | 19 WorldClim bioclimatic vars | Dense ร 3 layers | 32-dim embedding |
| Terrain | 8 morphometric derivatives | Dense ร 2 layers | 16-dim embedding |
The four streams are concatenated into a 240-dimensional feature vector and passed through a classification head (3 dense layers โ 22-class biome softmax + IBR regression head).
Training and Evaluation
MI-CNN v1.0 achieves 89.4% biome classification accuracy and 0.91 Pearson r for IBR regression on the 682-plot held-out test set. Grad-CAM analysis confirms that spectral features in the red-edge (700โ730 nm) and SWIR (1550โ1750 nm) ranges contribute most to classification decisions.
โ Validation & Reproducibility
Cross-Validation Protocol
BIOTICA uses a leave-one-biome-out cross-validation design: the model is trained on 21 biome types and evaluated on the held-out 22nd. This prevents any within-biome data leakage and tests generalization across entirely unseen ecosystem types.
Leave-One-Biome CV
Mean accuracy across 22 folds
Worst-Case Biome
Tropical dry forest (n=98 plots)
Best-Case Biome
Boreal forest (n=312 plots)
Reproducibility Hash
Ubuntu 22.04 ยท macOS 14.2
Reproducing All Results
๐ Changelog
Mar 2026
Initial Release
Full nine-parameter IBR framework, MI-CNN v1.0 classifier, Snakemake pipelines, validated across 3,412 plots from 22 biomes. Paper submitted to Nature Sustainability.
Jan 2026
Beta Release
Complete parameter suite, Bayesian weight determination, tipping-point detection module. Internal validation across Amazon and Australian datasets.
Sep 2025
Alpha โ Core Framework
VCA, MDI, and PTS modules functional. Proof-of-concept IBR composite. Initial 800-plot dataset assembled from Amazon and Serengeti sites.
๐ Publications
If you use BIOTICA in your research, please cite the primary paper using the BibTeX entry below.
- Baladi, S. (2026). BIOTICA Framework โ Nature Sustainability. DOI: 10.14293/BIOTICA.2026.001
- BIOTICA Classification Dataset v1.0 โ Zenodo. DOI: 10.5281/zenodo.biotica.2026
- Preprint: MDI-Carbon Correlation across 1,240 Plots โ GitLab
- Preprint: Critical Slowing-Down in IBR Time Series โ GitLab
๐ Acknowledgments
The BIOTICA framework builds upon the foundational work of the global ecosystem science community. Special thanks to:
- The FLUXNET and ICOS communities for making eddy covariance data openly accessible
- The MGnify / EBI team for metagenomics data infrastructure
- The PhenoCam Network for long-term phenological time series
- The Global Biodiversity Information Facility (GBIF) for occurrence data
- Global Forest Watch for forest cover change and fragmentation layers
- The Arrernte people of the Northern Territory and Aboriginal rangers of SE Australia for sharing traditional ecological knowledge used in TEK validation protocols
- The Ronin Institute for supporting independent scholarship