Star ๅކๅฒ่ถ‹ๅŠฟ
ๆ•ฐๆฎๆฅๆบ: GitHub API ยท ็”Ÿๆˆ่‡ช Stargazers.cn
README.md

OpenClaw Medical Skills

GitHub Stars GitHub Forks GitHub Issues Skills Count License Platform

The largest open-source medical AI skill library for OpenClaw.

869 curated skills ยท Clinical ยท Genomics ยท Drug Discovery ยท Bioinformatics ยท Medical Devices

English | ไธญๆ–‡


What Is This?

OpenClaw Medical Skills is a curated collection of 869 AI agent skills covering the full spectrum of biomedical and clinical research. These skills are designed for OpenClaw / NanoClaw โ€” Claude-based personal AI assistant frameworks โ€” and transform a general-purpose AI agent into a powerful medical and scientific research companion.

Each skill is a self-contained module (a SKILL.md file) that:

  • Teaches the agent specialized domain knowledge and workflows
  • Connects to real databases, APIs, and computational tools
  • Produces structured, clinically or scientifically relevant outputs

We benefit from the open-source community. The full collection of resources can be found here: Awesome LLM Resources

Why This Collection Matters

Without SkillsWith OpenClaw Medical Skills
Generic AI responses about medicineReal PubMed / ClinicalTrials.gov / FDA queries
No bioinformatics capabilityRNA-seq, scRNA-seq, GWAS, variant calling pipelines
No drug intelligenceChEMBL, DrugBank, DDI prediction, pharmacovigilance
No clinical documentationSOAP notes, discharge summaries, prior auth decisions
No genomics supportVCF annotation, ACMG classification, PRS calculation
No regulatory guidanceFDA, CE mark, IEC 62304, ISO 14971 compliance

This collection aggregates skills from 12+ open-source skill repositories spanning academic research tools, clinical workflows, regulatory frameworks, and cutting-edge AI-driven protein design โ€” giving your AI agent capabilities comparable to a team of specialized research scientists.


Installation

Requirements

  • OpenClaw installed and running, or NanoClaw as an alternative
  • Git (for cloning this repo)

For OpenClaw Users

OpenClaw loads skills from two locations:

PriorityPathScope
High<workspace>/skills/Per-workspace (recommended)
Low~/.openclaw/skills/Global, shared across all agents

Method 1 โ€” Clone and Copy (Recommended)

# Clone this repository (skills only โ€” skips large data files) git clone --depth=1 --no-checkout https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills.git cd OpenClaw-Medical-Skills git sparse-checkout init --cone git sparse-checkout set skills git checkout main # Install to your workspace skills directory cp -r skills/* <your-workspace>/skills/ # Or install globally (available to all agents) cp -r skills/* ~/.openclaw/skills/

Note: Some skills bundle large data files (databases, datasets). The sparse-checkout method above avoids downloading them. If you need the full repo including all data, install Git LFS first, then run git clone https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills.git.

Skills are picked up automatically on the next session. No restart needed.

Method 2 โ€” OpenClaw CLI

If you use the OpenClaw plugin registry, you can search and install individual skills from there. For bulk install from this collection, Method 1 is faster.

openclaw plugins install <skill-slug> # install a single skill openclaw plugins update # update all installed skills

Method 3 โ€” Configure Extra Directories

To point OpenClaw at a cloned copy of this repo permanently, add it to ~/.openclaw/openclaw.json:

{ "plugins": { "local": ["/path/to/OpenClaw-Medical-Skills"] } }

This mounts the entire collection without copying files.

Method 4 โ€” Install Selected Skills Only

Pick skills relevant to your domain:

# Example: clinical + drug discovery stack SKILLS=( "clinical-reports" "tooluniverse-drug-research" "tooluniverse-pharmacovigilance" "clinicaltrials-database" "biomedical-search" "tooluniverse-drug-drug-interaction" ) for skill in "${SKILLS[@]}"; do cp -r OpenClaw-Medical-Skills/skills/$skill ~/.openclaw/skills/ done

For NanoClaw Users

NanoClaw loads skills into agent containers at startup from container/skills/.

# Clone and copy into NanoClaw container skills directory git clone https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills.git cp -r OpenClaw-Medical-Skills/skills/* /path/to/nanoclaw/container/skills/ # Rebuild the container to apply cd /path/to/nanoclaw ./container/build.sh

Verification

After installation, ask your agent:

What medical and clinical skills do you have available?

Your agent should list the installed skills with their capabilities.


Skills Overview

CategoryCountHighlights
General & Core10Browser/search, document tools, and developer workflow utilities
Medical & Clinical119Clinical reports, CDS, oncology, imaging, and healthcare AI
Scientific Databases43Genomics/protein/drug databases and biomedical knowledge retrieval
Bioinformatics (gptomics)239Variant analysis, sequencing QC, DE, pathways, single-cell, and epigenomics
Omics & Computational Biology59Single-cell/spatial, proteomics, cheminformatics, and protein design tools
ClawBio Pipelines21Orchestration pipelines for scRNA, GWAS, ancestry, and structural workflows
BioOS Extended Suite285Extended agent suite for oncology, immunology, clinical AI, and infrastructure
Data Science & Tools93Statistics, visualization, automation, simulation, and scientific tooling
Total869

Table of Contents

General & Core

Medical & Clinical

Scientific Databases

Bioinformatics (gptomics bio-* suite)

Omics & Computational Biology

ClawBio Pipelines

BioOS Extended Suite

Data Science & Tools


Skills List

๐Ÿงฐ General & Core

Expand/Collapse this category

General Tools

Click to expand skill list
SkillDescription
agent-browserBrowse the web for any task โ€” research topics, read articles, interact with web apps, fill forms, take screenshots, extract data, and test web pages. Use whenever a browser would be useful.
find-skillsHelps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities.
multi-search-engineMulti search engine integration with 17 engines (8 CN + 9 Global). Supports Baidu, Bing, 360, Sogou, WeChat, Google, DuckDuckGo, WolframAlpha and more. Supports advanced operators, time filters, site search. No API keys required.
wikipedia-searchSearch and fetch structured content from Wikipedia using the MediaWiki API for reliable, encyclopedic information. Supports multi-language queries.
deep-researchExecute autonomous multi-step deep research on any topic. Searches multiple sources, reads full content, synthesizes findings, and produces a structured report. Use for comprehensive research, literature reviews, competitive analysis, or topic deep-dives.
pdfComprehensive PDF toolkit โ€” extract text and tables, create new PDFs, merge/split documents, handle forms, OCR scanned PDFs. Use when working with any .pdf file.
docxCreate, edit, and analyze Word documents (.docx). Supports tracked changes, comments, formatting preservation, and text extraction. Use for drafting, redlining, or extracting content from Word files.
xlsxSpreadsheet creation, editing, and analysis. Supports formulas, formatting, data analysis, and visualization. Use for any .xlsx, .xlsm, .csv, or .tsv task.
pptxPresentation creation, editing, and analysis. Supports layouts, speaker notes, templates, and design. Use for any .pptx file.
doc-coauthoringGuide users through a structured workflow for co-authoring documentation. Use when writing documentation, proposals, technical specs, decision docs, or similar structured content.

๐Ÿฅ Medical & Clinical

Expand/Collapse this category

Medical Tools

Click to expand skill list
SkillDescription
pubmed-searchSearch PubMed for scientific literature. Use when the user asks to find papers, search literature, look up research, find publications, or asks about recent studies.
medical-research-toolkitQuery 14+ biomedical databases for drug repurposing, target discovery, clinical trials, and literature research. Access ChEMBL, PubMed, ClinicalTrials.gov, OpenTargets, OpenFDA, OMIM, Reactome, KEGG, UniProt, and more through a unified MCP endpoint.
medical-specialty-briefsGenerate daily or on-demand medical research briefs for any medical specialty. Searches latest research from top-tier journals (NEJM, Lancet, JAMA, BMJ, Nature Medicine), delivers concise summaries with 1-sentence takeaways and direct links. Use when user asks for medical news, research updates, or specialty-specific updates (endocrinology, cardiology, oncology, neurology, etc.).
usmlePrepare for US medical licensing exams with progress tracking, weak area analysis, question bank management, and residency match planning. Covers Step 1/2 CK/Step 3, IMG-specific guidance, score prediction, and wellbeing support.
medical-entity-extractorExtract medical entities (symptoms, medications, lab values, diagnoses) from patient messages.
patiently-aiSimplifies medical documents for patients. Takes doctor's letters, test results, prescriptions, discharge summaries, and clinical notes and explains them in clear, personalised language.
biomedical-searchComplete biomedical information search combining PubMed, preprints, clinical trials, and FDA drug labels. Powered by Valyu semantic search.
medical-imaging-reviewWrite comprehensive literature reviews for medical imaging AI research. Use when writing survey papers, systematic reviews, or literature analyses on imaging topics.
fhir-developer-skillFHIR API development guide for building healthcare endpoints (Patient, Observation, Encounter, Condition, MedicationRequest). Use when developing or integrating FHIR REST APIs.
clinical-trial-protocol-skillGenerate clinical trial protocols for medical devices or drugs. Use when designing clinical studies, creating FDA submission documentation, or developing protocols for investigational products.
prior-auth-review-skillAutomate payer review of prior authorization (PA) requests. Assesses medical necessity, validates against coverage policies, and generates PA decisions.
clinical-reportsWrite comprehensive clinical reports โ€” case reports (CARE guidelines), diagnostic reports (radiology/pathology/lab), clinical trial reports (ICH-E3, CSR), and patient documentation (SOAP, H&P, discharge summaries). HIPAA/FDA/ICH-GCP compliant.
clinicaltrials-databaseQuery ClinicalTrials.gov via API v2. Search trials by condition, drug, location, status, or phase. Retrieve trial details by NCT ID, export data for clinical research and patient matching.
clinical-decision-supportGenerate clinical decision support (CDS) documents for pharmaceutical and clinical research โ€” patient cohort analyses, treatment recommendation reports with GRADE evidence grading, biomarker integration, and statistical outputs (hazard ratios, survival curves).
tooluniverse-clinical-trial-designStrategic clinical trial design feasibility assessment. Evaluates patient population sizing, biomarker prevalence, endpoint selection, comparator analysis, safety monitoring, and regulatory pathways. Use when planning Phase 1/2 trials or assessing trial feasibility.
tooluniverse-disease-researchGenerate comprehensive disease research reports covering epidemiology, mechanisms, diagnostics, treatments, and ongoing trials. Use when asking about diseases, syndromes, or needing systematic disease analysis.
tooluniverse-literature-deep-researchDeep literature research with target disambiguation, evidence grading, and structured theme extraction. Resolves gene/protein IDs, identifies synonyms, synthesizes biological models, and generates testable hypotheses. Use for thorough literature reviews or target profiles.
tooluniverse-clinical-guidelinesSearch and retrieve clinical practice guidelines from 12+ sources (NICE, WHO, ADA, AHA/ACC, NCCN, SIGN, CPIC, etc.). Covers cardiology, oncology, diabetes, pharmacogenomics, and more. Use when asking about treatment recommendations or standard of care.
tooluniverse-drug-researchComprehensive drug research reports covering identity, pharmacology, targets, clinical trials, safety, pharmacogenomics, and ADMET. Use for drug profiling, safety assessment, or clinical development research.
tooluniverse-drug-repurposingIdentify drug repurposing candidates using target-based, compound-based, and disease-driven strategies. Finds new indications for approved drugs by analyzing targets, bioactivity, and safety profiles.
tooluniverse-drug-drug-interactionDrug-drug interaction prediction and risk assessment. Analyzes CYP450/transporter mechanisms, severity classification, and provides management strategies. Supports polypharmacy analysis (3+ drugs) and alternative drug recommendations.
tooluniverse-rare-disease-diagnosisDifferential diagnosis for rare diseases based on phenotype and genetic data. Matches symptoms to HPO terms, identifies candidate diseases from Orphanet/OMIM, and interprets variants of uncertain significance.
tooluniverse-pharmacovigilanceAnalyze drug safety signals from FDA adverse event reports, label warnings, and pharmacogenomic data. Calculates PRR/ROR, identifies serious adverse events, and assesses pharmacogenomic risk.
tooluniverse-clinical-trial-matchingPatient-to-trial matching for precision medicine and oncology. Ranks trials from ClinicalTrials.gov by molecular eligibility, clinical criteria, biomarker alignment, and geographic feasibility with a quantitative Trial Match Score (0-100).
literature-reviewSystematic literature reviews across multiple databases (PubMed, arXiv, bioRxiv, Semantic Scholar). Produces professionally formatted reports with verified citations in APA, Nature, Vancouver styles.
tooluniverse-precision-oncologyActionable treatment recommendations for cancer patients based on molecular profile. Interprets tumor mutations, identifies FDA-approved therapies, clinical trials, and resistance mechanisms.
tooluniverse-cancer-variant-interpretationClinical interpretation of somatic mutations in cancer. Given gene+variant (e.g., EGFR L858R, BRAF V600E), assesses oncogenicity, therapeutic implications, and trial eligibility.
tooluniverse-variant-analysisProduction-ready VCF processing, variant annotation, and mutation analysis. Parses VCF files, annotates with ClinVar/gnomAD/COSMIC, and interprets clinical significance.
tooluniverse-variant-interpretationSystematic clinical variant interpretation from raw calls to ACMG-classified recommendations. Aggregates evidence from ClinVar, gnomAD, literature, and population databases.
tooluniverse-structural-variant-analysisComprehensive structural variant (SV/CNV) analysis for clinical genomics. Classifies SVs, assesses pathogenicity, and interprets copy number alterations.
tooluniverse-polygenic-risk-scoreBuild and interpret polygenic risk scores (PRS) for complex diseases using GWAS summary statistics. Calculates genetic risk profiles and interprets PRS percentiles.
tooluniverse-precision-medicine-stratificationPatient stratification for precision medicine by integrating genomic, clinical, and therapeutic data. Identifies treatment-relevant subgroups and biomarker-driven therapy options.
tooluniverse-gwas-trait-to-geneDiscover genes associated with diseases and traits using GWAS Catalog (500k+ associations) and Open Targets Genetics locus-to-gene predictions.
tooluniverse-gwas-drug-discoveryTransform GWAS signals into drug targets and repurposing opportunities. Performs locus-to-gene mapping, druggability assessment, and existing drug identification.
tooluniverse-gwas-study-explorerCompare GWAS studies and assess replication across cohorts. Integrates NHGRI-EBI GWAS Catalog and Open Targets Genetics for cross-study meta-analysis.
tooluniverse-gwas-finemappingIdentify and prioritize causal variants at GWAS loci using statistical fine-mapping. Computes posterior probabilities and credible sets for causal variant identification.
tooluniverse-gwas-snp-interpretationInterpret SNPs from GWAS studies by aggregating evidence from GWAS Catalog, Open Targets Genetics, and ClinVar. Retrieves variant-trait associations and functional annotations.
tooluniverse-phylogeneticsPhylogenetics and sequence analysis โ€” alignment processing, evolutionary tree construction, and evolutionary metrics for pathogens or species.
tooluniverse-epigenomicsEpigenomics data processing โ€” methylation array analysis (CpG filtering, differential methylation), chromatin accessibility, and histone modification analysis.
tooluniverse-rnaseq-deseq2RNA-seq differential expression analysis using PyDESeq2. Performs normalization, dispersion estimation, Wald testing, LFC shrinkage, and pathway enrichment.
tooluniverse-single-cellSingle-cell RNA-seq analysis using scanpy. Performs QC, normalization, PCA, UMAP, Leiden clustering, trajectory analysis, and cell type annotation.
tooluniverse-spatial-transcriptomicsSpatial transcriptomics data analysis โ€” maps gene expression in tissue architecture. Supports 10x Visium, MERFISH, seqFISH, and Slide-seq platforms.
tooluniverse-spatial-omics-analysisComputational analysis for spatial multi-omics data integration โ€” spatially variable genes, domain annotation, and tissue-resolved omics.
tooluniverse-proteomics-analysisMass spectrometry proteomics analysis โ€” protein quantification, differential expression, PTMs, and protein-protein interaction network construction.
tooluniverse-metabolomicsMetabolomics research โ€” identifies metabolites and searches databases (HMDB 220k+ metabolites, MetaboLights, Metabolomics Workbench).
tooluniverse-metabolomics-analysisMetabolomics data analysis โ€” metabolite identification, quantification, pathway analysis, and metabolic flux from LC-MS, GC-MS, or NMR data.
tooluniverse-multi-omics-integrationIntegrate transcriptomics, proteomics, epigenomics, genomics, and metabolomics for systems biology and precision medicine.
tooluniverse-multiomic-disease-characterizationSystems-level disease characterization integrating genomics, transcriptomics, proteomics, pathway, and therapeutic layers.
tooluniverse-expression-data-retrievalRetrieve gene expression and omics datasets from ArrayExpress and BioStudies with quality assessment and structured reports.
tooluniverse-gene-enrichmentGene enrichment and pathway analysis using gseapy, PANTHER, STRING, Reactome. Supports GO enrichment, KEGG pathways, and 40+ ToolUniverse tools.
tooluniverse-systems-biologySystems biology and pathway analysis using Reactome, KEGG, WikiPathways, Pathway Commons, and BioModels. Network modeling and pathway simulation.
tooluniverse-protein-interactionsProtein-protein interaction network analysis using STRING, BioGRID, and SASBDB. Maps interaction networks with confidence scores and functional modules.
tooluniverse-protein-structure-retrievalRetrieve protein structure data from RCSB PDB, PDBe, and AlphaFold with quality assessment and comprehensive structural profiles.
tooluniverse-protein-therapeutic-designDesign novel protein therapeutics (binders, enzymes, scaffolds) using AI-guided de novo design โ€” RFdiffusion, ProteinMPNN, and ESM.
tooluniverse-antibody-engineeringAntibody engineering and optimization for therapeutics โ€” humanization, affinity maturation, developability assessment, and immunogenicity prediction.
tooluniverse-immune-repertoire-analysisTCR/BCR repertoire analysis from sequencing data โ€” clonality, diversity, V(D)J gene usage, clonal expansion, and antigen specificity prediction.
tooluniverse-immunotherapy-response-predictionPredict patient response to immune checkpoint inhibitors using multi-biomarker integration โ€” TMB, MSI, PD-L1, TIL signatures, and HLA typing.
tooluniverse-infectious-diseasePathogen characterization and drug repurposing for infectious disease outbreaks. Identifies taxonomy, essential proteins, structural targets, and treatment options.
tooluniverse-crispr-screen-analysisCRISPR screen analysis for functional genomics โ€” pooled or arrayed screens (knockout/activation/interference) to identify essential genes and hits.
tooluniverse-target-researchComprehensive biological target intelligence โ€” protein info, structure, interactions, pathways, expression, variant landscape, and drug pipeline.
tooluniverse-network-pharmacologyCompound-target-disease network analysis for drug repurposing, polypharmacology discovery, and systems pharmacology.
tooluniverse-statistical-modelingStatistical modeling on biomedical datasets โ€” linear/logistic regression, mixed-effects models, survival analysis, and Bayesian methods.
tooluniverse-image-analysisBiomedical microscopy image analysis โ€” colony morphometry, cell counting, fluorescence quantification, and statistical comparison of imaging data.
literature-searchComprehensive scientific literature search across PubMed, arXiv, bioRxiv, medRxiv using natural language queries powered by Valyu semantic search.
medrxiv-searchSearch medRxiv medical preprints with natural language queries powered by Valyu semantic search.
clinical-trials-searchSearch ClinicalTrials.gov with natural language queries โ€” find trials by condition, enrollment status, and outcomes via Valyu.
drug-discovery-searchEnd-to-end drug discovery platform combining ChEMBL, DrugBank, targets, and FDA labels via natural language Valyu search.
drug-labels-searchSearch FDA drug labels with natural language queries โ€” indications, dosing, and safety data via Valyu.
chembl-searchSearch ChEMBL bioactive molecules database โ€” compounds, assay data, and bioactivity via Valyu semantic search.
open-targets-searchSearch Open Targets drug-disease associations and target validation via Valyu semantic search.
patents-searchSearch global patents with natural language queries โ€” prior art, patent landscapes, and innovation tracking via Valyu.
drugbank-searchSearch DrugBank comprehensive drug database โ€” mechanisms, interactions, and safety data via Valyu semantic search.
arxiv-searchSearch arXiv preprints (biology, medicine, AI) using natural language queries powered by Valyu semantic search.
gwas-databaseQuery NHGRI-EBI GWAS Catalog for SNP-trait associations by rs ID, disease/trait, or gene. Retrieve p-values and summary statistics for genetic epidemiology.
scikit-survivalSurvival analysis and time-to-event modeling in Python โ€” Kaplan-Meier, Cox regression, log-rank tests, and censored data handling using scikit-survival.

Drug Safety & Chemical Biology

Click to expand skill list
SkillDescription
tooluniverse-adverse-event-detectionDetect and analyze adverse drug event signals using FDA FAERS data, drug labels, disproportionality analysis (PRR, ROR, IC), and biomedical evidence. Generates quantitative safety signal scores (0-100).
tooluniverse-binder-discoveryDiscover novel small molecule binders for protein targets using structure-based and ligand-based approaches. Creates actionable reports with candidate compounds, ADMET profiles, and synthesis feasibility.
tooluniverse-chemical-compound-retrievalRetrieves chemical compound information from PubChem and ChEMBL with disambiguation, cross-referencing, and quality assessment. Comprehensive compound profiles with identifiers, properties, bioactivity.
tooluniverse-chemical-safetyComprehensive chemical safety and toxicology assessment integrating ADMET-AI predictions, CTD toxicogenomics, FDA label safety data, DrugBank safety profiles, and STITCH chemical-protein interactions.
tooluniverse-drug-target-validationComputational validation of drug targets across 10 dimensions: disambiguation, disease association, druggability, chemical matter, clinical precedent, safety, and expression evidence.
tooluniverse-sequence-retrievalRetrieve biological sequences (DNA, RNA, protein) from NCBI and ENA with gene disambiguation, accession type handling, and comprehensive sequence profiles.

Medical Imaging & Pathology

Click to expand skill list
SkillDescription
pydicomPython library for working with DICOM medical imaging files. Reading, writing, modifying DICOM data, extracting pixel data, handling metadata and multi-frame files.
histolabDigital pathology image processing toolkit for whole slide images (WSI). Process H&E or IHC stained tissue images, extract tiles from gigapixel slides.
pathmlComputational pathology toolkit for analyzing WSI and multiparametric imaging data. H&E stained images, multiplex immunofluorescence, spatial omics integration.
omero-integrationMicroscopy data management platform. Access images via Python, retrieve datasets, analyze pixels, manage ROIs/annotations, for high-content screening workflows.
neurokit2Comprehensive biosignal processing: ECG, EEG, EDA, RSP, PPG, EMG, EOG signals. Cardiovascular signal analysis, neurophysiology, and physiological data processing.
neuropixels-analysisNeuropixels neural recording analysis. Load SpikeGLX/OpenEphys data, Kilosort4 spike sorting, quality metrics, Allen/IBL curation, for neuroscience research.

Healthcare ML & Clinical AI

Click to expand skill list
SkillDescription
pyhealthComprehensive healthcare AI toolkit for developing ML models with clinical data (EHR, claims). Task definition API, model training, evaluation for clinical NLP and prediction.
scikit-learnMachine learning in Python: supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning.
transformersPre-trained transformer models for NLP, computer vision, audio, and multimodal tasks. Text generation, classification, question answering, and biomedical NLP (BioBERT, ClinicalBERT).
shapModel interpretability using SHAP (SHapley Additive exPlanations). Explain ML model predictions, compute feature importance, generate SHAP plots for biomedical models.
umap-learnUMAP dimensionality reduction. Fast nonlinear manifold learning for 2D/3D visualization, clustering preprocessing (HDBSCAN), for high-dimensional omics data.

Health & Wellness Analytics

Click to expand skill list
SkillDescription
nutrition-analyzerComprehensive nutrition analysis: macro/micronutrient tracking, dietary assessment, meal planning, food data lookup, and nutritional recommendations.
mental-health-analyzerMental health data analysis: mood tracking, symptom patterns, PHQ/GAD scoring, behavioral insights, and wellness recommendations.
sleep-analyzerSleep quality analysis: sleep stages, duration, efficiency metrics, circadian rhythm assessment, and sleep hygiene recommendations.
rehabilitation-analyzerRehabilitation progress tracking: functional assessments, exercise programs, recovery milestones, and outcome measurement for physical/occupational therapy.
fitness-analyzerFitness performance analysis: exercise tracking, strength/cardio metrics, training load, VO2max estimation, and periodization planning.
health-trend-analyzerLongitudinal health trend analysis: vital sign tracking, biomarker trends, risk factor monitoring, and predictive health insights.
weightloss-analyzerWeight management analytics: caloric balance, body composition tracking, progress monitoring, and evidence-based weight loss strategies.
goal-analyzerHealth goal tracking and analysis: SMART goal setting, progress metrics, habit formation, and motivational insights for wellness objectives.
occupational-health-analyzerOccupational health assessment: workplace ergonomics, exposure risk, work-related illness surveillance, and return-to-work planning.
travel-health-analyzerTravel medicine: destination health risks, vaccination requirements, malaria prophylaxis, altitude sickness, and traveler health preparation.
family-health-analyzerFamily health management: pediatric milestones, family medical history, preventive screening schedules, and multigenerational health tracking.
tcm-constitution-analyzerTraditional Chinese Medicine constitution analysis: TCM body type assessment, pattern differentiation, herbal recommendations, and lifestyle guidance.
emergency-cardGenerate emergency medical information cards with critical health data, medications, allergies, and emergency contacts for patient safety.
ai-analyzerAI-powered comprehensive health data interpretation combining multiple biomarkers and health metrics for holistic wellness assessment.
wellally-techTechnical framework for WellAlly health analytics platform: integration patterns, data pipelines, and health AI infrastructure.

Mental Health & Crisis Intervention

Click to expand skill list
SkillDescription
crisis-detection-intervention-aiDetect crisis signals using NLP and mental health sentiment analysis. Implements suicide ideation detection, automated escalation, and crisis resource integration for mental health apps and recovery platforms.
crisis-response-protocolHandle mental health crisis situations safely: crisis detection, safety protocols, emergency escalation, suicide prevention, and hotline integration for AI coaching applications.
hipaa-complianceEnsure HIPAA compliance when handling PHI. Audit logging, data access controls, security event tracking, and compliance verification for health data applications.
clinical-diagnostic-reasoningIdentify and counteract cognitive biases in medical decision-making through systematic error analysis, differential diagnosis frameworks, and clinical judgment improvement.
speech-pathology-aiAI-powered speech-language pathology: phoneme analysis, articulation visualization, voice disorder assessment, fluency intervention, AAC, and stuttering treatment support.
hrv-alexithymia-expertHeart rate variability biometrics and emotional awareness training. HRV analysis, interoception training, biofeedback, vagal tone assessment, and autonomic nervous system evaluation.
adhd-daily-plannerADHD-optimized daily planning: time-blind friendly scheduling, executive function support, dopamine-aware task design, and neurodivergent-friendly productivity systems.
grief-companionCompassionate bereavement support, memorial creation, grief education, and healing journey guidance through the non-linear path of loss.
jungian-psychologistJungian analytical psychology: shadow work, archetypal analysis, dream interpretation, active imagination, addiction/recovery through depth psychology lens, and individuation process.
modern-drug-rehab-computerComprehensive addiction recovery knowledge system: evidence-based treatment (CBT, DBT, MI, EMDR, MAT), recovery resources, crisis intervention, and family systems for rehab environments.
recovery-community-moderatorTrauma-informed AI moderation for addiction recovery communities: harm reduction, 12-step traditions, conflict detection, and crisis post identification.

Medical Device & Regulatory

Click to expand skill list
SkillDescription
iso-13485-certificationComprehensive toolkit for ISO 13485 QMS documentation for medical devices: gap analysis, Quality Manuals, procedures, Medical Device Files. Covers FDA QMSR, EU MDR compliance.

๐Ÿ—‚๏ธ Scientific Databases

Expand/Collapse this category

Scientific Databases (Genomics & Variants)

Click to expand skill list
SkillDescription
clinvar-databaseQuery NCBI ClinVar for variant clinical significance. Search by gene/position, interpret pathogenicity classifications, access via E-utilities API or FTP, annotate VCFs, for genomic medicine.
clinpgx-databaseAccess ClinPGx pharmacogenomics data (successor to PharmGKB). Query gene-drug interactions, CPIC guidelines, allele functions, for precision medicine and genotype-guided dosing decisions.
cosmic-databaseAccess COSMIC cancer mutation database. Query somatic mutations, Cancer Gene Census, mutational signatures, gene fusions, for cancer research and precision oncology. Requires authentication.
ensembl-databaseQuery Ensembl genome database REST API for 250+ species. Gene lookups, sequence retrieval, variant analysis, comparative genomics, orthologs, VEP predictions, for genomic research.
gene-databaseQuery NCBI Gene via E-utilities/Datasets API. Search by symbol/ID, retrieve gene info (RefSeqs, GO, locations, phenotypes), batch lookups, for gene annotation and functional analysis.
geo-databaseAccess NCBI GEO for gene expression/genomics data. Search/download microarray and RNA-seq datasets (GSE, GSM, GPL), retrieve SOFT/Matrix files, for transcriptomics and expression analysis.
ena-databaseAccess European Nucleotide Archive via API/FTP. Retrieve DNA/RNA sequences, raw reads (FASTQ), genome assemblies by accession, for genomics and bioinformatics pipelines.
ggetCLI/Python toolkit for rapid bioinformatics queries with access to 20+ databases: Ensembl, UniProt, AlphaFold, ARCHS4, Enrichr, OpenTargets, COSMIC, BLAST, and more.
pysamGenomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

Scientific Databases (Proteins, Pathways & Drugs)

Click to expand skill list
SkillDescription
alphafold-databaseAccess AlphaFold's 200M+ AI-predicted protein structures. Retrieve structures by UniProt ID, download PDB/mmCIF files, analyze confidence metrics (pLDDT, PAE), for drug discovery and structural biology.
pdb-databaseAccess RCSB PDB for 3D protein/nucleic acid structures. Search by text/sequence/structure, download coordinates (PDB/mmCIF), retrieve metadata, for structural biology and drug discovery.
uniprot-databaseDirect REST API access to UniProt. Protein searches, FASTA retrieval, ID mapping, Swiss-Prot/TrEMBL. For multi-database workflows, prefer bioservices (unified interface to 40+ services).
string-databaseQuery STRING API for protein-protein interactions (59M proteins, 20B interactions). Network analysis, GO/KEGG enrichment, interaction discovery, 5000+ species, for systems biology.
kegg-databaseDirect REST API access to KEGG (academic use). Pathway analysis, gene-pathway mapping, metabolic pathways, drug interactions, ID conversion.
reactome-databaseQuery Reactome REST API for pathway analysis, enrichment, gene-pathway mapping, disease pathways, molecular interactions, expression analysis, for systems biology.
brenda-databaseAccess BRENDA enzyme database via SOAP API. Retrieve kinetic parameters (Km, kcat), reaction equations, organism data, substrate-specific enzyme info for biochemical research.
hmdb-databaseAccess Human Metabolome Database (220K+ metabolites). Search by name/ID/structure, retrieve chemical properties, biomarker data, NMR/MS spectra, pathways, for metabolomics.
metabolomics-workbench-databaseAccess NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, for metabolomics and biomarker discovery.
pubchem-databaseQuery PubChem via PUG-REST API (110M+ compounds). Search by name/CID/SMILES, retrieve properties, similarity/substructure searches, bioactivity, for cheminformatics.
chembl-databaseQuery ChEMBL's bioactive molecules and drug discovery data. Search compounds by structure/properties, retrieve bioactivity data (IC50, Ki), find inhibitors, for medicinal chemistry.
drugbank-databaseAccess comprehensive drug information from DrugBank including drug properties, interactions, targets, pathways, chemical structures, and pharmacology data.
zinc-databaseAccess ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening.
opentargets-databaseQuery Open Targets Platform for target-disease associations, drug target discovery, tractability/safety data, genetics/omics evidence, known drugs, for therapeutic target identification.
fda-databaseQuery openFDA API for drugs, devices, adverse events, recalls, regulatory submissions (510k, PMA), substance identification (UNII), for FDA regulatory data analysis.
pubmed-databaseDirect REST API access to PubMed. Advanced Boolean/MeSH queries, E-utilities API, batch processing, citation management.
openalex-databaseQuery and analyze scholarly literature using the OpenAlex database. Search for academic papers, analyze research trends, find works by authors or institutions.
biorxiv-databaseSearch bioRxiv preprint server by keywords, authors, date ranges, or categories, retrieving paper metadata for life sciences preprint discovery.
bioservicesPrimary Python tool for 40+ bioinformatics services. Unified API for UniProt, KEGG, ChEMBL, PubChem, Reactome, QuickGO โ€” preferred for multi-database workflows.
uspto-databaseAccess USPTO APIs for patent/trademark searches, examination history (PEDS), assignments, citations, office actions, for IP analysis and prior art searches.

Cancer Genomics Databases

Click to expand skill list
SkillDescription
cbioportal-databaseQuery cBioPortal for cancer genomics: somatic mutations, copy number, gene expression, and survival data across hundreds of cancer studies. Cancer target validation, oncogene analysis, and patient-level genomic profiling.
depmapQuery the Cancer Dependency Map (DepMap) for cancer cell line gene dependency scores (CRISPR Chronos), drug sensitivity, and gene effect profiles. Identify cancer-specific vulnerabilities and synthetic lethal interactions.
imaging-data-commonsQuery and download public cancer imaging data from NCI Imaging Data Commons. Access radiology (CT, MR, PET) and pathology datasets for AI training or research. No authentication required.

Genomic & Molecular Databases

Click to expand skill list
SkillDescription
bindingdb-databaseQuery BindingDB for measured drug-target binding affinities (Ki, Kd, IC50, EC50). Drug discovery, lead optimization, polypharmacology, and SAR studies.
gnomad-databaseQuery gnomAD for population allele frequencies, variant constraint scores (pLI, LOEUF), and loss-of-function intolerance. Variant pathogenicity interpretation and rare disease genetics.
gtex-databaseQuery GTEx for tissue-specific gene expression, eQTLs, and sQTLs. Link GWAS variants to gene regulation and interpret non-coding variant effects.
interpro-databaseQuery InterPro for protein family, domain, and functional site annotations. Integrates Pfam, PANTHER, PRINTS, SMART, and 11+ databases for protein function prediction.
jaspar-databaseQuery JASPAR for transcription factor binding site profiles (PWMs/PFMs). Regulatory genomics, motif analysis, and GWAS regulatory variant interpretation.
monarch-databaseQuery the Monarch Initiative knowledge graph for disease-gene-phenotype associations. Integrates OMIM, ORPHANET, HPO, ClinVar for rare disease gene discovery.
tiledbvcfScalable VCF/BCF ingestion, storage, and parallel queries using TileDB for population genomics at scale.

Structural Biology & Drug Discovery

Click to expand skill list
SkillDescription
molecular-dynamicsRun and analyze molecular dynamics simulations with OpenMM and MDAnalysis. Protein/small molecule systems, force fields, energy minimization, RMSD/RMSF analysis, free energy surfaces.
glycoengineeringAnalyze and engineer protein glycosylation. Predict N/O-glycosylation sites, access glycoengineering tools (NetOGlyc, GlycoShield). Therapeutic antibody optimization and vaccine design.
adaptyvCloud laboratory platform for automated protein testing: binding assays, expression testing, thermostability, enzyme activity. Protein sequence optimization with NetSolP, SoluProt, ESM.
ginkgo-cloud-labSubmit and manage protocols on Ginkgo Bioworks Cloud Lab for autonomous lab execution. Cell-free protein expression, protocol workflows, and biotech automation.

๐Ÿงฌ Bioinformatics (gptomics bio-* suite)

Expand/Collapse this category

Bioinformatics Tools & Pipelines

Click to expand skill list
SkillDescription
biopythonPrimary Python toolkit for molecular biology: PubMed/NCBI queries (Bio.Entrez), sequence manipulation, file parsing (FASTA, GenBank, FASTQ, PDB), BLAST workflows.
scikit-bioBiological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, for microbiome analysis.
etetoolkitPhylogenetic tree toolkit (ETE). Tree manipulation (Newick/NHX), evolutionary event detection, orthology/paralogy, NCBI taxonomy, visualization (PDF/SVG), for phylogenomics.
deeptoolsNGS analysis toolkit. BAM to bigWig conversion, QC (correlation, PCA, fingerprints), heatmaps/profiles (TSS, peaks), for ChIP-seq, RNA-seq, ATAC-seq visualization.
nextflow-developmentRun nf-core bioinformatics pipelines (rnaseq, sarek, atacseq) on sequencing data. Use for RNA-seq, WGS/WES, or ATAC-seq from local FASTQs or public datasets (GEO/SRA).
fastq-analysisSRA downloading, FASTQ quality control, STAR alignment, gene quantification, and single-cell kallisto/bustools pipelines for bulk and single-cell sequencing data.
genimlGenomic interval data (BED files) for machine learning tasks. Train region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis.
gtarsHigh-performance genomic interval analysis in Rust with Python bindings. Genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models.
arboretoInfer gene regulatory networks (GRNs) from gene expression data using GRNBoost2 and GENIE3 algorithms. For bulk RNA-seq and single-cell RNA-seq regulatory network inference.
lamindbOpen-source biological data framework for queryable, traceable, reproducible, and FAIR datasets (scRNA-seq, genomics, imaging).
dnanexus-integrationDNAnexus cloud genomics platform. Build apps/applets, manage data, dxpy Python SDK, run workflows, FASTQ/BAM/VCF, for genomics pipeline development.
latchbio-integrationLatch platform for bioinformatics workflows. Build pipelines with Latch SDK, @workflow/@task decorators, deploy serverless workflows, Nextflow/Snakemake integration.

Bioinformatics โ€” Clinical Databases & Variant Analysis

Click to expand skill list
SkillDescription
bio-clinical-databases-clinvar-lookupQuery ClinVar for clinical variant classifications, pathogenicity assertions, and review status.
bio-clinical-databases-dbsnp-queriesQuery dbSNP for SNP frequency, allele, and functional annotation data.
bio-clinical-databases-gnomad-frequenciesRetrieve population allele frequencies from gnomAD for rare variant interpretation.
bio-clinical-databases-hla-typingHLA typing from sequencing data using standard typing tools and databases.
bio-clinical-databases-myvariant-queriesBatch query MyVariant.info for aggregated variant annotations from multiple databases.
bio-clinical-databases-pharmacogenomicsPharmGKB/CPIC pharmacogenomics variant lookup for drug-gene interactions.
bio-clinical-databases-polygenic-riskCalculate polygenic risk scores from GWAS summary statistics and genotype data.
bio-clinical-databases-somatic-signaturesExtract and classify mutational signatures from somatic variant catalogs (COSMIC).
bio-clinical-databases-tumor-mutational-burdenCompute tumor mutational burden (TMB) from somatic variant calls.
bio-clinical-databases-variant-prioritizationRank and filter candidate variants by pathogenicity scores, inheritance, and phenotype match.
bio-variant-callingGATK-based germline variant calling pipeline from aligned BAM/CRAM files.
bio-variant-calling-clinical-interpretationInterpret variant calls in clinical context with ACMG guidelines.
bio-variant-calling-deepvariantDeepVariant deep-learning variant caller for short-read WGS/WES data.
bio-variant-calling-filtering-best-practicesApply VQSR and hard-filtering best practices to variant call sets.
bio-variant-calling-joint-callingJoint genotyping across multiple samples for improved variant discovery.
bio-variant-calling-structural-variant-callingCall structural variants (SVs) from long-read or paired-end sequencing.
bio-variant-annotationAnnotate VCF files with functional, population, and clinical consequence data.
bio-variant-normalizationNormalize variant representations (left-alignment, decomposition) for consistent comparison.
bio-vcf-basicsRead, write, and parse VCF files; filter by quality, region, and sample.
bio-vcf-manipulationAdvanced VCF manipulation: merging, splitting, reformatting, subset extraction.
bio-vcf-statisticsCompute variant statistics: ts/tv ratio, heterozygosity, depth distributions.
bio-gatk-variant-callingEnd-to-end GATK HaplotypeCaller variant calling with BQSR and joint genotyping.
bio-copy-number-cnv-annotationAnnotate CNV calls with gene content, database overlap, and clinical significance.
bio-copy-number-cnv-visualizationVisualize copy number profiles and segment plots from WGS/WES data.
bio-copy-number-cnvkit-analysisCNVKit copy number analysis for targeted sequencing and WES data.
bio-copy-number-gatk-cnvGATK4 somatic copy number alteration calling pipeline.
bio-tumor-fraction-estimationEstimate tumor purity and ploidy from allele frequencies and copy number data.
bio-ctdna-mutation-detectionDetect circulating tumor DNA mutations from liquid biopsy ultra-deep sequencing.
bio-cfdna-preprocessingProcess cell-free DNA sequencing data: adapter trimming, deduplication, QC.
bio-methylation-based-detectionDetect methylation-based cancer signals from cfDNA methylation data.
bio-longitudinal-monitoringTrack somatic variant evolution and clonal dynamics across serial samples.

Bioinformatics โ€” Sequencing & Read QC

Click to expand skill list
SkillDescription
bio-fastq-qualityAssess FASTQ read quality with FastQC/MultiQC; generate per-sample QC reports.
bio-read-qc-adapter-trimmingTrim sequencing adapters with Trimmomatic, Cutadapt, or fastp.
bio-read-qc-contamination-screeningScreen reads for human/microbial contamination using FastQ Screen or Kraken.
bio-read-qc-fastp-workflowEnd-to-end read QC and preprocessing with fastp including UMI handling.
bio-read-qc-quality-filteringApply quality-score and length filters to remove low-quality reads.
bio-read-qc-quality-reportsAggregate multi-sample QC reports with MultiQC.
bio-read-qc-umi-processingDeduplicate PCR duplicates using UMI-tools for accurate quantification.
bio-paired-end-fastqHandle paired-end FASTQ files: validation, interleaving, splitting.
bio-alignment-ioRead/write SAM/BAM/CRAM alignment files with pysam and samtools.
bio-alignment-msa-parsingParse and analyze multiple sequence alignments (FASTA, ClustalW, Stockholm).
bio-alignment-msa-statisticsCompute MSA statistics: conservation, gap content, entropy.
bio-alignment-pairwisePairwise sequence alignment using Smith-Waterman, Needleman-Wunsch, BLAST.
bio-longread-alignmentAlign long reads (ONT/PacBio) with minimap2; sort and index BAM files.
bio-longread-qcQuality control for long-read sequencing: read length, N50, error rate.
bio-longread-medakaConsensus polishing and variant calling with Oxford Nanopore Medaka.
bio-longread-structural-variantsCall large structural variants from long-read data with Sniffles/PBSV.
bio-basecallingBase-call raw ONT signals with Dorado/Guppy; convert FAST5 to FASTQ.
bio-compressed-filesHandle compressed bioinformatics files: bgzip, tabix, zstd, htslib.
bio-format-conversionConvert between bioinformatics formats: FASTQโ†”FASTA, BAMโ†”CRAM, BEDโ†”GTF.
bio-sequence-statisticsCompute sequence statistics: GC content, length distributions, complexity.
bio-read-sequencesRead and iterate over biological sequences from FASTA/FASTQ files.
bio-write-sequencesWrite biological sequences to FASTA/FASTQ with metadata preservation.
bio-filter-sequencesFilter sequences by length, quality, pattern, or taxonomy label.
bio-batch-processingBatch-process large bioinformatics datasets across samples and cohorts.
bio-rnaseq-qcRNA-seq specific QC: strandedness, rRNA contamination, gene body coverage.
bio-long-read-sequencing-clair3-variantsCall variants from long-read sequencing with Clair3 deep-learning model.
bio-long-read-sequencing-isoseq-analysisIso-Seq full-length transcript analysis for isoform discovery.
bio-long-read-sequencing-nanopore-methylationCall CpG methylation from Oxford Nanopore sequencing with Modbam2bed.
bio-splicing-qcRNA splicing quality assessment: junction read coverage, novel splice sites.
bio-splicing-quantificationQuantify alternative splicing events: PSI/inclusion levels per isoform.
bio-sashimi-plotsGenerate sashimi plots for visualizing RNA-seq splicing at specific loci.
bio-consensus-sequencesGenerate consensus FASTA sequences by applying VCF variants to a reference using bcftools consensus; useful for sample-specific references and haplotype reconstruction.

Bioinformatics โ€” Differential Expression & Transcriptomics

Click to expand skill list
SkillDescription
bio-de-deseq2-basicsDESeq2 differential expression analysis: design matrix, size factors, dispersion.
bio-de-edger-basicsEdgeR differential expression for count data with empirical Bayes dispersion.
bio-de-resultsExtract, filter, and annotate DESeq2/EdgeR results tables.
bio-de-visualizationVolcano plots, MA plots, and heatmaps for differential expression results.
bio-differential-expression-batch-correctionRemove batch effects with ComBat/limma for multi-cohort DE analysis.
bio-differential-expression-timeseries-deTime-series differential expression with splines and mixed models.
bio-differential-splicingDetect differential alternative splicing events with rMATS or MAJIQ.
bio-isoform-switchingIdentify isoform switching events with DRIMSeq and IsoformSwitchAnalyzeR.
bio-ribo-seq-orf-detectionDetect translated ORFs from ribosome profiling data with RiboTaper/Ribo-TISH.
bio-ribo-seq-riboseq-preprocessingPreprocess ribosome profiling reads: adapter trimming, rRNA removal, alignment.
bio-ribo-seq-ribosome-periodicityAssess triplet periodicity and ribosome footprint quality in Ribo-seq data.
bio-ribo-seq-ribosome-stallingIdentify ribosome stalling sites and pausing from Ribo-seq profiles.
bio-ribo-seq-translation-efficiencyCompute translation efficiency ratios from matched RNA-seq and Ribo-seq.

Bioinformatics โ€” Pathway & Network Analysis

Click to expand skill list
SkillDescription
bio-pathway-go-enrichmentGene Ontology enrichment analysis with clusterProfiler or g:Profiler.
bio-pathway-gseaGene Set Enrichment Analysis (GSEA) with pre-ranked or count-based statistics.
bio-pathway-kegg-pathwaysKEGG pathway enrichment and visualization for metabolic/signaling pathways.
bio-pathway-reactomeReactome pathway analysis with hierarchical enrichment and visualization.
bio-pathway-wikipathwaysWikiPathways enrichment and network visualization.
bio-pathway-enrichment-visualizationDot plots, enrichment maps, and network visualizations for pathway results.

Bioinformatics โ€” Single-Cell & Spatial Omics

Click to expand skill list
SkillDescription
bio-single-cell-batch-integrationIntegrate scRNA-seq datasets across batches with Harmony, BBKNN, scVI.
bio-single-cell-cell-annotationAnnotate single-cell clusters using marker genes and reference atlases.
bio-single-cell-cell-communicationInfer ligand-receptor cell-cell communication with CellChat or NicheNet.
bio-single-cell-clusteringCluster single cells with Leiden/Louvain algorithms in Scanpy/Seurat.
bio-single-cell-data-ioRead/write AnnData, Seurat, and 10x Genomics h5ad/h5 formats.
bio-single-cell-doublet-detectionRemove doublets from scRNA-seq with Scrublet or DoubletFinder.
bio-single-cell-lineage-tracingReconstruct cell lineage trees from scRNA-seq with clonal barcodes.
bio-single-cell-markers-annotationIdentify cluster marker genes and auto-annotate cell types.
bio-single-cell-metabolite-communicationInfer metabolite-mediated intercellular communication from scRNA-seq.
bio-single-cell-multimodal-integrationIntegrate scRNA-seq with ATAC, CITE-seq, or spatial using WNN/MultiVI.
bio-single-cell-perturb-seqAnalyze genetic perturbation screens from Perturb-seq / CROP-seq data.
bio-single-cell-preprocessingSingle-cell preprocessing: count filtering, normalization, HVG selection.
bio-single-cell-scatac-analysisscATAC-seq peak calling, TF motif enrichment, and chromatin accessibility.
bio-single-cell-splicingRNA velocity and splicing dynamics with scVelo or Alevin.
bio-single-cell-trajectory-inferenceInfer pseudotime trajectories with Monocle3, PAGA, or Slingshot.
bio-spatial-transcriptomics-image-analysisAnalyze histology images co-registered with spatial transcriptomics data.
bio-spatial-transcriptomics-spatial-communicationLigand-receptor communication analysis with spatial context (COMMOT, SpatialDE).
bio-spatial-transcriptomics-spatial-data-ioLoad and process Visium, Slide-seq, MERFISH, and STARmap datasets.
bio-spatial-transcriptomics-spatial-deconvolutionDeconvolve cell type proportions in spatial spots with RCTD, SPOTlight.
bio-spatial-transcriptomics-spatial-domainsIdentify spatially variable genes and tissue domains with SpatialDE/BANKSY.
bio-spatial-transcriptomics-spatial-multiomicsIntegrate spatial transcriptomics with proteomics, metabolomics, or imaging.
bio-spatial-transcriptomics-spatial-neighborsBuild spatial neighbor graphs and perform neighborhood enrichment analysis.
bio-spatial-transcriptomics-spatial-preprocessingPreprocess spatial transcriptomics: QC, normalization, spot filtering.
bio-spatial-transcriptomics-spatial-proteomicsAnalyze spatial proteomics data from CODEX, IMC, or MIBI platforms.
bio-spatial-transcriptomics-spatial-statisticsSpatial statistics: Moran's I, spatial autocorrelation, co-localization.
bio-spatial-transcriptomics-spatial-visualizationVisualize spatial gene expression maps and tissue section overlays.

Bioinformatics โ€” Epigenomics & Chromatin

Click to expand skill list
SkillDescription
bio-atac-seq-atac-peak-callingCall ATAC-seq chromatin accessibility peaks with MACS2/MACS3.
bio-atac-seq-atac-qcATAC-seq quality control: TSS enrichment, fragment size, FRiP score.
bio-atac-seq-differential-accessibilityDifferential chromatin accessibility between conditions with DESeq2/DiffBind.
bio-atac-seq-footprintingTranscription factor footprinting from ATAC-seq with TOBIAS or HINT-ATAC.
bio-atac-seq-motif-deviationTF motif deviation scoring with chromVAR for single-cell ATAC data.
bio-atac-seq-nucleosome-positioningInfer nucleosome positioning from ATAC-seq fragment length distributions.
bio-chipseq-differential-bindingDifferential ChIP-seq binding analysis with DiffBind.
bio-chipseq-motif-analysisDe novo and known motif discovery from ChIP-seq peaks with HOMER/MEME.
bio-chipseq-peak-annotationAnnotate ChIP-seq peaks with genomic features and nearest genes.
bio-chipseq-peak-callingCall ChIP-seq peaks with MACS2 for TF binding and histone marks.
bio-chipseq-qcChIP-seq quality metrics: FRiP, SCC, phantompeakqualtools.
bio-chipseq-super-enhancersIdentify super enhancers from H3K27ac ChIP-seq with ROSE.
bio-chipseq-visualizationHeatmaps and aggregate profiles at peak regions with deepTools.
bio-hi-c-analysis-compartment-analysisCall A/B compartments from Hi-C contact matrices.
bio-hi-c-analysis-contact-pairsProcess Hi-C contact pairs: filtering, deduplication, binning.
bio-hi-c-analysis-hic-data-ioRead and write Hi-C data formats: .hic, cool, mcool with cooler/hicstuff.
bio-hi-c-analysis-hic-differentialDifferential Hi-C interaction analysis between conditions.
bio-hi-c-analysis-hic-visualizationVisualize Hi-C contact maps, TADs, and loops with pyGenomeTracks.
bio-hi-c-analysis-loop-callingDetect chromatin loops from Hi-C data with Mustache or HICCUPS.
bio-hi-c-analysis-matrix-operationsNormalize Hi-C matrices: ICE, KR, VC; compute observed/expected.
bio-hi-c-analysis-tad-detectionIdentify topologically associating domains (TADs) from Hi-C data.
bio-methylation-bismark-alignmentAlign bisulfite sequencing reads and extract CpG methylation with Bismark.
bio-methylation-callingCall CpG methylation from WGBS/RRBS alignments.
bio-methylation-dmr-detectionIdentify differentially methylated regions (DMRs) with DSS or MethylKit.
bio-methylation-methylkitMethylation analysis with MethylKit: CpG tiles, DMR calling, annotation.

Bioinformatics โ€” Metagenomics & Microbiome

Click to expand skill list
SkillDescription
bio-metagenomics-abundanceEstimate microbial taxon abundances from shotgun metagenomics.
bio-metagenomics-amr-detectionDetect antimicrobial resistance genes with AMRFinder or RGI/CARD.
bio-metagenomics-functional-profilingFunctional profiling of metagenomes with HUMAnN3 for pathway/gene families.
bio-metagenomics-krakenTaxonomic classification of metagenomic reads with Kraken2/Bracken.
bio-metagenomics-metaphlanClade-specific marker-based profiling of microbial communities with MetaPhlAn4.
bio-metagenomics-strain-trackingTrack microbial strains across samples with StrainPhlan or inStrain.
bio-metagenomics-visualizationVisualize microbiome composition with Krona charts and stacked bar plots.
bio-microbiome-amplicon-processingProcess 16S/ITS amplicon sequencing with QIIME2 or DADA2.
bio-microbiome-differential-abundanceTest differential microbial abundance with ANCOM-BC, MaAsLin2, or ALDEx2.
bio-microbiome-diversity-analysisAlpha/beta diversity analysis: Shannon, PD, UniFrac, PCoA.
bio-microbiome-functional-predictionPredict functional capacity from 16S data with PICRUSt2 or Tax4Fun.
bio-microbiome-qiime2-workflowEnd-to-end QIIME2 workflow: denoising, diversity, differential abundance.
bio-microbiome-taxonomy-assignmentAssign taxonomy to ASVs/OTUs using SILVA, GTDB, or Greengenes2.

Bioinformatics โ€” Immunoinformatics & Flow Cytometry

Click to expand skill list
SkillDescription
bio-immunoinformatics-epitope-predictionPredict MHC-I/II epitopes from protein sequences with NetMHCpan/MHCflurry.
bio-immunoinformatics-immunogenicity-scoringScore peptide immunogenicity for vaccine and neoantigen prioritization.
bio-immunoinformatics-mhc-binding-predictionPredict peptide-MHC binding affinities for multiple alleles.
bio-immunoinformatics-neoantigen-predictionPredict neoantigens from somatic mutations for personalized cancer vaccines.
bio-immunoinformatics-tcr-epitope-bindingPredict TCR-epitope binding with ERGO, pMTnet, or NetTCR.
bio-tcr-bcr-analysis-immcantation-analysisAnalyze B/T cell receptor repertoires with the Immcantation suite.
bio-tcr-bcr-analysis-mixcr-analysisMiXCR V(D)J alignment and clonotype assembly for immune repertoires.
bio-tcr-bcr-analysis-repertoire-visualizationVisualize repertoire diversity, clonal expansion, and V-gene usage.
bio-tcr-bcr-analysis-scirpy-analysisSingle-cell TCR/BCR analysis integrated with scRNA-seq using Scirpy.
bio-tcr-bcr-analysis-vdjtools-analysisImmune repertoire statistics and overlap analysis with VDJtools.
bio-flow-cytometry-bead-normalizationNormalize flow cytometry data using calibration beads.
bio-flow-cytometry-clustering-phenotypingCluster and phenotype cell populations with FlowSOM, PhenoGraph, or UMAP.
bio-flow-cytometry-compensation-transformationApply compensation matrices and biexponential/arcsinh transformations.
bio-flow-cytometry-cytometry-qcQuality control for flow/mass cytometry: signal drift, spillover, outlier detection.
bio-flow-cytometry-differential-analysisStatistical comparison of cell populations between conditions.
bio-flow-cytometry-doublet-detectionDetect and remove doublets from flow cytometry data.
bio-flow-cytometry-fcs-handlingRead, write, and manipulate FCS files with FlowCore/FlowKit.
bio-flow-cytometry-gating-analysisManual and algorithmic gating strategies for cell population identification.
bio-imaging-mass-cytometry-cell-segmentationSegment cells in IMC images with Mesmer or CellProfiler.
bio-imaging-mass-cytometry-data-preprocessingPreprocess imaging mass cytometry data: hot pixel removal, normalization.
bio-imaging-mass-cytometry-interactive-annotationInteractively annotate cell types in IMC spatial datasets.
bio-imaging-mass-cytometry-phenotypingPhenotype immune and tumor cells from multi-marker IMC panels.
bio-imaging-mass-cytometry-quality-metricsQuality metrics for IMC acquisitions: signal-to-noise, tissue coverage.
bio-imaging-mass-cytometry-spatial-analysisSpatial cell neighborhood analysis from imaging mass cytometry data.

Bioinformatics โ€” Multi-Omics Integration

Click to expand skill list
SkillDescription
bio-multi-omics-data-harmonizationHarmonize multi-omics datasets: sample matching, batch correction, feature alignment.
bio-multi-omics-mixomics-analysisMulti-omics factor analysis with mixOmics (DIABLO, MOFA, sPLS-DA).
bio-multi-omics-mofa-integrationMulti-Omics Factor Analysis (MOFA+) for latent factor discovery across modalities.
bio-multi-omics-similarity-networkSimilarity Network Fusion (SNF) for patient stratification from multi-omics.

Bioinformatics โ€” Proteomics & Metabolomics

Click to expand skill list
SkillDescription
bio-proteomics-data-importImport DDA/DIA proteomics data from MaxQuant, Proteome Discoverer, FragPipe.
bio-proteomics-dia-analysisDIA proteomics analysis with DIA-NN or Spectronaut.
bio-proteomics-differential-abundanceDifferential protein abundance with limma, MSstats, or DEqMS.
bio-proteomics-peptide-identificationPeptide spectrum matching and database search result parsing.
bio-proteomics-protein-inferenceProtein grouping, parsimony, and FDR control for proteomics experiments.
bio-proteomics-proteomics-qcProteomics QC: peptide counts, coverage, missing values, CV.
bio-proteomics-ptm-analysisPost-translational modification analysis: phospho, ubiquitin, glycan enrichment.
bio-proteomics-quantificationLabel-free, TMT/iTRAQ, and SILAC quantification workflows.
bio-proteomics-spectral-librariesBuild and use spectral libraries for DIA data analysis.
bio-metabolomics-lipidomicsLipidomics data analysis: lipid class annotation, fatty acid composition.
bio-metabolomics-metabolite-annotationAnnotate mass spec features with HMDB, MZmine, SIRIUS, or MetFrag.
bio-metabolomics-msdial-preprocessingMS-DIAL-based LC-MS/GC-MS data preprocessing and peak detection.
bio-metabolomics-normalization-qcMetabolomics normalization: PQN, LOESS, median, batch correction.
bio-metabolomics-pathway-mappingMap identified metabolites to KEGG, MetaCyc, or Reactome pathways.
bio-metabolomics-statistical-analysisUnivariate/multivariate stats for metabolomics: PCA, PLS-DA, ANOVA.
bio-metabolomics-targeted-analysisTargeted metabolomics with MRM/SRM: calibration curves, quantification.
bio-metabolomics-xcms-preprocessingXCMS-based LC-MS peak detection, alignment, and grouping.

Bioinformatics โ€” Structural Biology & Cheminformatics

Click to expand skill list
SkillDescription
bio-structural-biology-alphafold-predictionsUse AlphaFold2/3 predictions: model quality assessment, confidence scores.
bio-structural-biology-modern-structure-predictionModern structure prediction with ESMFold, RoseTTAFold, and OpenFold.
bio-pdb-geometric-analysisGeometric analysis of protein structures: distances, angles, contacts, RMSD.
bio-pdb-structure-ioRead and write PDB/mmCIF structure files with BioPython or Gemmi.
bio-pdb-structure-modificationModify protein structures: add hydrogens, mutate residues, energy minimize.
bio-pdb-structure-navigationNavigate and inspect PDB structures: chain, residue, atom selection.
bio-molecular-descriptorsCalculate molecular descriptors (RDKit): MW, LogP, TPSA, fingerprints.
bio-molecular-ioRead/write chemical structure formats: SDF, SMILES, MOL2, PDB with RDKit.
bio-reaction-enumerationEnumerate reactions and products from SMARTS reaction templates.
bio-similarity-searchingMolecular similarity search: Tanimoto, fingerprint-based, scaffold hopping.
bio-substructure-searchSubstructure searching in chemical databases using SMARTS patterns.
bio-virtual-screeningVirtual screening workflows: docking, scoring, pose filtering with AutoDock/Vina.
bio-admet-predictionPredict ADMET properties: absorption, distribution, metabolism, excretion, toxicity.

Bioinformatics โ€” Epidemiological & Causal Genomics

Click to expand skill list
SkillDescription
bio-epidemiological-genomics-amr-surveillanceAntimicrobial resistance surveillance from genomic epidemiology data.
bio-epidemiological-genomics-pathogen-typingPathogen molecular typing: MLST, wgMLST, cgMLST for outbreak analysis.
bio-epidemiological-genomics-phylodynamicsPhylodynamics: molecular clock, population dynamics, BEAST2/TreeTime.
bio-epidemiological-genomics-transmission-inferenceInfer transmission networks from pathogen genomics with TransPhylo/outbreaker2.
bio-epidemiological-genomics-variant-surveillanceTrack pathogen variant emergence and spread from genomic surveillance.
bio-causal-genomics-colocalization-analysisColocalization analysis of GWAS and eQTL signals with coloc or eCAVIAR.
bio-causal-genomics-fine-mappingFine-map causal variants at GWAS loci with SuSiE or FINEMAP.
bio-causal-genomics-mediation-analysisCausal mediation analysis for gene expression intermediaries.
bio-causal-genomics-mendelian-randomizationTwo-sample Mendelian randomization with MR-Base/TwoSampleMR.
bio-causal-genomics-pleiotropy-detectionDetect horizontal pleiotropy and heterogeneity in MR analyses.
bio-genome-engineering-base-editing-designDesign base editors (CBE/ABE) for precise single-base correction.
bio-genome-engineering-grna-designDesign and score CRISPR guide RNAs with Cas-OFFinder and CRISPOR.
bio-genome-engineering-hdr-template-designDesign HDR templates for precise knock-in via homology-directed repair.
bio-genome-engineering-off-target-predictionPredict CRISPR off-target sites genome-wide for safety assessment.
bio-genome-engineering-prime-editing-designDesign pegRNAs and nickase gRNAs for prime editing experiments.
bio-crispr-screens-base-editing-analysisAnalyze base editing screens: guide efficiency, editing outcomes.
bio-crispr-screens-batch-correctionCorrect batch effects in CRISPR screen data across replicates.
bio-crispr-screens-crispresso-editingQuantify editing outcomes with CRISPResso2 from amplicon sequencing.
bio-crispr-screens-hit-callingCall hits from CRISPR screens using MAGeCK, BAGEL2, or casTLE.
bio-crispr-screens-jacks-analysisCRISPR screen analysis with JACKS hierarchical Bayesian model.
bio-crispr-screens-library-designDesign CRISPR screen libraries: guide selection, controls, coverage.
bio-crispr-screens-mageck-analysisMAGeCK MLE/RRA analysis for CRISPR pooled screens.
bio-crispr-screens-screen-qcQuality control for CRISPR screens: Gini index, read distribution.

๐Ÿ”ฌ Omics & Computational Biology

Expand/Collapse this category

Single-Cell & Spatial Omics

Click to expand skill list
SkillDescription
anndataWorking with annotated data matrices in Python for single-cell genomics analysis, managing experimental measurements with metadata and large-scale omics data.
scanpySingle-cell RNA-seq analysis. Load .h5ad/10X data, QC, normalization, PCA/UMAP/t-SNE, Leiden clustering, marker genes, cell type annotation, trajectory.
scvi-toolsDeep learning for single-cell analysis: data integration/batch correction (scVI/scANVI), ATAC-seq (PeakVI), CITE-seq (totalVI), multiome (MultiVI), spatial deconvolution (DestVI).
single-cell-rna-qcQuality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations.
cellxgene-censusQuery CZ CELLxGENE Census (61M+ cells). Filter by cell type/tissue/disease, retrieve expression data, integrate with scanpy/PyTorch, for population-scale single-cell analysis.
pydeseq2Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots.
bulk-combat-correctionRemove batch effects from merged bulk RNA-seq or microarray cohorts using pyComBat, with corrected matrix export and pre/post correction visualizations.
bulk-deg-analysisBulk RNA-seq DEG pipeline: gene ID mapping, DESeq2 normalization, statistical testing, visualization, and pathway enrichment via OmicVerse.
bulk-deseq2-analysisPyDESeq2-based differential expression analysis with ID mapping, DE testing, fold-change thresholding, and enrichment visualization.
bulk-stringdb-ppiQuery STRING for protein interactions, build PPI graphs with pyPPI, and render network figures for bulk gene lists.
bulk-to-single-deconvolutionConvert bulk RNA-seq cohorts to synthetic single-cell datasets using Bulk2Single workflow for cell fraction estimation and beta-VAE generation.
bulk-trajblend-interpolationExtend scRNA-seq developmental trajectories with BulkTrajBlend by generating intermediate cells from bulk RNA-seq using beta-VAE and GNN models.
bulk-wgcna-analysisRun PyWGCNA through OmicVerse โ€” co-expression module construction, eigengene visualization, and hub gene extraction.
single-annotationSingle-cell annotation workflows: SCSA, MetaTiME, CellVote, CellMatch, GPTAnno, and weighted KNN transfer for annotating cell types across modalities.
single-cellphone-dbRun CellPhoneDB v5 on annotated single-cell data to infer ligand-receptor networks and produce CellChat-style visualizations.
single-clusteringSingle-cell clustering workflow: QC, multimethod clustering, topic modeling, cNMF, and cross-batch integration in OmicVerse.
single-downstream-analysisOmicVerse downstream tutorials covering AUCell scoring, metacell DEG, and related exports for single-cell data.
single-multiomicsOmicVerse multi-omics tutorials: MOFA, GLUE pairing, SIMBA integration, TOSICA transfer, and StaVIA cartography.
single-preprocessingSingle-cell preprocessing in OmicVerse: QC, normalization, HVG detection, PCA/embedding pipelines (CPU/GPU).
single-to-spatial-mappingMap scRNA-seq atlases onto spatial transcriptomics slides using Single2Spatial workflow for deep-forest training and marker visualization.
single-trajectoryOmicVerse trajectory workflows: PAGA, Palantir, VIA, velocity coupling, and fate scoring.
spatial-tutorialsSpatial transcriptomics tutorials: preprocessing, deconvolution, and downstream modeling across Visium, Visium HD, Stereo-seq, and Slide-seq.
tcga-preprocessingIngest TCGA sample sheets, expression archives, and clinical carts into OmicVerse, with survival metadata initialization and AnnData export.
gsea-enrichmentGene set enrichment analysis in OmicVerse with correct geneset format handling for loading pathway databases and running GSEA.

Cheminformatics & Drug Discovery

Click to expand skill list
SkillDescription
rdkitCheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity.
datamolPythonic RDKit wrapper with simplified interface for standard drug discovery: SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformer generation.
medchemMedicinal chemistry filters. Apply drug-likeness rules (Lipinski, Veber), PAINS filters, structural alerts, complexity metrics, for compound prioritization and library filtering.
diffdockDiffusion-based molecular docking. Predict protein-ligand binding poses from PDB/SMILES, confidence scores, virtual screening, for structure-based drug design.
molfeatMolecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.
deepchemMolecular machine learning toolkit. Property prediction (ADMET, toxicity), GNNs (GCN, MPNN), MoleculeNet benchmarks, pretrained models, for drug discovery ML.
torchdrugGraph-based drug discovery toolkit. Molecular property prediction (ADMET), protein modeling, knowledge graph reasoning, molecular generation, retrosynthesis, GNNs.
torch_geometricGraph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, molecular property prediction, for geometric deep learning in drug discovery.
pytdcTherapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML.
cobrapyConstraint-based metabolic modeling (COBRA). FBA, FVA, gene knockouts, flux sampling, SBML models, for systems biology and metabolic engineering.

Proteomics & Mass Spectrometry

Click to expand skill list
SkillDescription
matchmsMass spectrometry spectral analysis. Process mzML/MGF/MSP files, spectral similarity (cosine, modified cosine), metadata harmonization, compound identification.
pyopenmsPython interface to OpenMS for LC-MS/MS proteomics and metabolomics workflows. File handling (mzML, mzXML, mzTab, pepXML, mzIdentML) and quantification.
flowioParse FCS (Flow Cytometry Standard) files v2.0-3.1. Extract events as NumPy arrays, read metadata/channels, convert to CSV/DataFrame, for flow cytometry data preprocessing.

Protein Structure & Design

Click to expand skill list
SkillDescription
esmESM3 generative multimodal protein design (sequence, structure, function) and ESM C efficient protein embeddings. Protein language models for sequence scoring and embedding.
alphafoldValidate protein designs using AlphaFold2 structure prediction. Validates designed sequences, predicts binder-target complex structures, calculates pLDDT/PAE metrics.
boltzStructure prediction using Boltz-1/Boltz-2, an open biomolecular structure predictor for protein complexes, binder validation, and open-source AlphaFold alternative.
boltzgenAll-atom protein design using BoltzGen diffusion model. Side-chain aware design from the start, designing around small molecules or ligands.
chaiStructure prediction using Chai-1 foundation model for protein-protein complexes, binder validation, and protein-small molecule interaction prediction.
rfdiffusionGenerate protein backbones using RFdiffusion diffusion model for de novo protein structure generation and binder scaffold design.
bindcraftEnd-to-end binder design using BindCraft hallucination with built-in AF2 validation for production-quality binder campaigns.
binder-designGuidance for choosing the right protein binder design tool (BoltzGen, BindCraft, or RFdiffusion) and planning binder design campaigns.
proteinmpnnDesign protein sequences using ProteinMPNN inverse folding for RFdiffusion backbones, sequence redesign, and partial fixed-position design.
ligandmpnnLigand-aware protein sequence design using LigandMPNN for sequences around small molecules, enzyme active site design, and binding pocket optimization.
solublempnnSolubility-optimized protein sequence design using SolubleMPNN for E. coli expression, reducing aggregation, and solubility optimization.
foldseekStructure similarity search with Foldseek for finding similar structures in PDB/AFDB databases, structural homology search, and evolutionary relationship discovery.
ipsaeBinder design ranking using ipSAE (interprotein Score from Aligned Errors) for ranking binder designs and filtering BindCraft or RFdiffusion outputs.
pdbFetch and analyze protein structures from RCSB PDB by PDB ID, search for similar structures, prepare targets for binder design.
protein-design-workflowEnd-to-end guidance for protein design pipelines from project initiation to experimental validation.
protein-qcQuality control metrics and filtering thresholds for protein design: pLDDT, PAE, ipTM for binding, expression, and structure evaluation.
cell-free-expressionGuidance for cell-free protein synthesis (CFPS) optimization, troubleshooting low yield/aggregation, and optimizing DNA template design.
binding-characterizationGuidance for SPR and BLI binding characterization experiments, kinetics interpretation, and troubleshooting poor binding signal.

Single-Cell & Trajectory Analysis

Click to expand skill list
SkillDescription
scveloRNA velocity analysis. Estimate cell state transitions from unspliced/spliced mRNA dynamics, infer trajectory directions, compute latent time, and identify driver genes in scRNA-seq data.

Phylogenetics & Network Analysis

Click to expand skill list
SkillDescription
phylogeneticsBuild and analyze phylogenetic trees using MAFFT, IQ-TREE 2, and FastTree. Evolutionary analysis, microbial genomics, viral phylodynamics, and molecular clock studies.
networkxNetwork and graph analysis in Python. Biological network analysis, protein interaction networks, pathway graphs, community detection, and centrality measures.
torch-geometricGraph Neural Networks (PyG) for molecular property prediction, drug-target interaction modeling, and geometric deep learning on biological graphs.

โš™๏ธ ClawBio Pipelines

Expand/Collapse this category

Bioinformatics Orchestration & Pipelines (ClawBio)

Click to expand skill list
SkillDescription
bio-orchestratorMeta-agent routing bioinformatics requests to specialized sub-skills. Handles file type detection (VCF, FASTQ, BAM, PDB, h5ad), analysis planning, report generation, and reproducibility export.
scrna-orchestratorLocal Scanpy pipeline for single-cell RNA-seq QC, clustering, marker discovery, and two-group differential expression from raw-count .h5ad files.
seq-wranglerSequence QC, alignment, and BAM processing. Wraps FastQC, BWA/Bowtie2, SAMtools for automated read-to-BAM pipelines.
vcf-annotatorAnnotate VCF variants with VEP, ClinVar, gnomAD frequencies, and ancestry-aware context. Generates prioritized variant reports.
repro-enforcerExport bioinformatics analyses as reproducible bundles with Conda environment, Singularity container definition, and Nextflow pipeline.
galaxy-bridgeGalaxy tool discovery, recommendation, and execution โ€” 8,000+ bioinformatics tools from usegalaxy.org with multi-signal scoring and workflow suggestions.

Genomics, Ancestry & Pharmacogenomics (ClawBio)

Click to expand skill list
SkillDescription
gwas-lookupFederated variant lookup across 9 genomic databases: GWAS Catalog, Open Targets, PheWeb (UKB, FinnGen, BBJ), GTEx, eQTL Catalogue, and more.
gwas-prsCalculate polygenic risk scores from DTC genetic data (23andMe/AncestryDNA) using the PGS Catalog.
pharmgx-reporterPharmacogenomic report from DTC genetic data โ€” 12 genes, 31 SNPs, 51 drugs with CPIC guidelines and personalized dosage cards.
clinpgxQuery the ClinPGx API for pharmacogenomic gene-drug data, clinical annotations, CPIC guidelines, and FDA drug labels.
drug-photoIdentify a medication from a packaging photo via Claude vision, then retrieve genotype-informed dosage guidance.
claw-ancestry-pcaAncestry decomposition PCA against the Simons Genome Diversity Project (345 samples, 164 global populations).
genome-compareCompare genome to reference individuals and estimate ancestry composition via IBS and EM admixture.
equity-scorerCompute HEIM diversity and equity metrics from VCF or ancestry data. Generates heterozygosity, FST, PCA plots, and HEIM Equity Score with markdown reports.
claw-metagenomicsShotgun metagenomics profiling: taxonomy (Kraken2/Bracken), resistome (CARD/RGI), and functional pathways (HUMAnN3) from paired-end FASTQ.
ukb-navigatorSemantic search across UK Biobank's 12,000+ data fields and publications โ€” find the right variables for your research question.

Structural Biology & Literature (ClawBio)

Click to expand skill list
SkillDescription
struct-predictorLocal protein structure prediction with AlphaFold, Boltz, or Chai. Compare structures, compute RMSD, visualize 3D models.
lit-synthesizerSearch PubMed and bioRxiv, summarize papers with LLM, build citation graphs, and generate literature review sections.
claw-semantic-simSemantic Similarity Index for disease research literature using PubMedBERT embeddings. Compute research equity metrics (HEIM).
labstepInteract with the Labstep electronic lab notebook API. Query experiments, protocols, resources, and inventory.
profile-reportGenerate structured bioinformatics analysis profile reports.

๐Ÿง  BioOS Extended Suite

Expand/Collapse this category

BioOS Extended Bioinformatics Suite (mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills-)

Sequence & Alignment Tools

Click to expand skill list
SkillDescription
bio-alignment-sortingSort SAM/BAM files by coordinate or name with samtools sort.
bio-alignment-filteringFilter alignments by flag, quality, region, or paired status.
bio-alignment-indexingIndex BAM/CRAM files with samtools index for random access.
bio-alignment-validationValidate alignment file integrity and detect truncated/corrupt files.
bio-alignment-files-bam-statisticsCompute alignment statistics: flagstat, idxstats, coverage depth.
bio-sam-bam-basicsRead, inspect, and manipulate SAM/BAM files with samtools/pysam.
bio-duplicate-handlingMark and remove PCR duplicates with Picard or samtools markdup.
bio-pileup-generationGenerate base-level pileup from BAM for variant calling and coverage.
bio-reference-operationsDownload, index, and manage reference genome FASTA files.
bio-blast-searchesRun BLAST searches against local or remote databases for sequence homology.
bio-local-blastSet up and run BLAST+ locally with custom databases.
bio-entrez-searchSearch NCBI Entrez databases (PubMed, gene, nucleotide, protein, SRA).
bio-entrez-fetchFetch records from NCBI Entrez by accession or UID.
bio-entrez-linkRetrieve linked records across NCBI Entrez databases.
bio-uniprot-accessQuery UniProt for protein sequences, annotations, and cross-references.
bio-geo-dataDownload and parse GEO datasets and series matrices.
bio-sra-dataDownload raw sequencing data from NCBI SRA with fasterq-dump.
bio-batch-downloadsBatch download bioinformatics data from NCBI, EBI, Ensembl.

Sequence Analysis

Click to expand skill list
SkillDescription
bio-seq-objectsWork with BioPython sequence objects: SeqRecord, features, annotations.
bio-sequence-propertiesCompute sequence properties: MW, pI, hydrophobicity, extinction coefficient.
bio-sequence-similarityCompute sequence similarity with pairwise alignment and percent identity.
bio-sequence-slicingSlice, extract, and manipulate subsequences from FASTA/FASTQ.
bio-motif-searchSearch sequences for regulatory motifs using FIMO, MAST, or regex.
bio-codon-usageAnalyze codon usage bias and optimize sequences for expression.
bio-transcription-translationTranscribe and translate DNA sequences; handle genetic code variations.
bio-reverse-complementCompute reverse complement and strand-aware sequence operations.
bio-primer-design-primer-basicsDesign PCR primers with Primer3 for standard amplification.
bio-primer-design-primer-validationValidate primer specificity by BLAST and thermodynamic analysis.
bio-primer-design-qpcr-primersDesign qPCR/RT-PCR primers with efficiency and specificity optimization.
bio-restriction-sitesFind restriction enzyme recognition sites in DNA sequences.
bio-restriction-mappingCreate restriction maps and in silico digestion patterns.
bio-restriction-fragment-analysisAnalyze restriction fragment patterns for cloning and gel prediction.
bio-restriction-enzyme-selectionSelect restriction enzymes for cloning based on cut sites and compatibility.

Read Alignment

Click to expand skill list
SkillDescription
bio-read-alignment-bwa-alignmentAlign short reads to reference genome with BWA-MEM.
bio-read-alignment-bowtie2-alignmentAlign short reads with Bowtie2; local and end-to-end modes.
bio-read-alignment-hisat2-alignmentSplice-aware RNA-seq alignment with HISAT2.
bio-read-alignment-star-alignmentHigh-speed STAR aligner for RNA-seq with junction detection.

Genome Assembly

Click to expand skill list
SkillDescription
bio-genome-assembly-long-read-assemblyDe novo assembly from ONT/PacBio long reads with Flye or Canu.
bio-genome-assembly-hifi-assemblyHiFi (CCS) read assembly with Hifiasm for high-accuracy genomes.
bio-genome-assembly-short-read-assemblyIllumina de novo assembly with SPAdes for metagenomes/bacteria/transcriptomes.
bio-genome-assembly-metagenome-assemblyMetagenomic assembly: co-assembly, binning, MAG recovery.
bio-genome-assembly-assembly-qcAssess assembly quality with QUAST, BUSCO, and NGA50 metrics.
bio-genome-assembly-assembly-polishingPolish assemblies with Medaka (ONT) or NextPolish (Illumina).
bio-genome-assembly-scaffoldingScaffold contigs with Hi-C, optical mapping, or long reads.
bio-genome-assembly-contamination-detectionDetect and remove contamination in assembled genomes.

Genome Intervals & Annotation

Click to expand skill list
SkillDescription
bio-genome-intervals-bed-file-basicsRead, write, and filter BED files with pybedtools/bedtools.
bio-genome-intervals-interval-arithmeticIntersect, subtract, merge, and complement genomic intervals.
bio-genome-intervals-proximity-operationsFind nearest features and compute distances between intervals.
bio-genome-intervals-coverage-analysisCompute read depth coverage across genomic regions.
bio-genome-intervals-bigwig-tracksCreate and query BigWig signal tracks from BAM/bedGraph.
bio-genome-intervals-gtf-gff-handlingParse and manipulate GTF/GFF annotation files.
bio-bedgraph-handlingProcess bedGraph coverage files: arithmetic, normalization, conversion.

RNA Quantification

Click to expand skill list
SkillDescription
bio-rna-quantification-featurecounts-countingCount reads per gene with featureCounts from subread package.
bio-rna-quantification-alignment-free-quantPseudo-alignment quantification with Salmon or Kallisto.
bio-rna-quantification-tximport-workflowImport Salmon/Kallisto quantification into R/DESeq2 with tximport.
bio-rna-quantification-count-matrix-qcQC count matrices: library size, zero inflation, gene detection rates.
bio-expression-matrix-counts-ingestLoad and validate count matrices from multiple quantification tools.
bio-expression-matrix-gene-id-mappingMap between Ensembl, Entrez, HGNC, and gene symbol identifiers.
bio-expression-matrix-metadata-joinsJoin sample metadata to expression matrices for downstream analysis.
bio-expression-matrix-sparse-handlingHandle sparse count matrices efficiently with scipy sparse formats.

Epitranscriptomics & CLIP-seq

Click to expand skill list
SkillDescription
bio-epitranscriptomics-merip-preprocessingPreprocess MeRIP-seq data for m6A methylation analysis.
bio-epitranscriptomics-m6a-peak-callingCall m6A peaks from MeRIP-seq with exomePeak2 or MACS2.
bio-epitranscriptomics-m6anet-analysisNanopore direct RNA m6A detection with m6Anet deep learning.
bio-epitranscriptomics-m6a-differentialDifferential m6A methylation analysis between conditions.
bio-epitranscriptomics-modification-visualizationVisualize RNA modification profiles and metagene plots.
bio-clip-seq-clip-preprocessingPreprocess CLIP-seq/eCLIP data: adapter trimming, demultiplexing.
bio-clip-seq-clip-alignmentAlign CLIP-seq reads with STAR; handle unique mappers.
bio-clip-seq-clip-peak-callingCall RBP binding peaks from CLIP-seq with PureCLIP or MACS2.
bio-clip-seq-binding-site-annotationAnnotate CLIP-seq peaks with genomic features and RNA regions.
bio-clip-seq-clip-motif-analysisDiscover RBP binding motifs from CLIP-seq peak sequences.

Small RNA-seq

Click to expand skill list
SkillDescription
bio-small-rna-seq-smrna-preprocessingPreprocess small RNA-seq: adapter trimming, size selection.
bio-small-rna-seq-mirdeep2-analysisIdentify and quantify known/novel miRNAs with miRDeep2.
bio-small-rna-seq-mirge3-analysismiRNA annotation and quantification with miRge3.0.
bio-small-rna-seq-target-predictionPredict miRNA target genes with TargetScan or miRDB.
bio-small-rna-seq-differential-mirnaDifferential miRNA expression analysis with DESeq2/edgeR.

Population Genetics & Phasing

Click to expand skill list
SkillDescription
bio-population-genetics-plink-basicsPLINK2 for GWAS QC, LD pruning, and basic population genetics.
bio-population-genetics-population-structurePopulation stratification with PCA, ADMIXTURE, and STRUCTURE.
bio-population-genetics-linkage-disequilibriumCompute LD metrics (rยฒ, D') and LD decay analysis.
bio-population-genetics-association-testingGWAS association testing with PLINK, BOLT-LMM, or SAIGE.
bio-population-genetics-scikit-allel-analysisPopulation genetics analysis with scikit-allel: diversity, Fst, haplotypes.
bio-population-genetics-selection-statisticsDetect natural selection signatures: iHS, XP-EHH, Tajima's D.
bio-phasing-imputation-haplotype-phasingPhase variants with SHAPEIT4 or BEAGLE.
bio-phasing-imputation-genotype-imputationImpute missing genotypes using Michigan/TOPMed imputation servers.
bio-phasing-imputation-reference-panelsSelect and prepare reference panels (1KGP, HRC, TOPMed) for imputation.
bio-phasing-imputation-imputation-qcQC imputed data: Rยฒ filter, INFO score, allele concordance.

Comparative Genomics & Phylogenetics

Click to expand skill list
SkillDescription
bio-comparative-genomics-ortholog-inferenceInfer orthologs and paralogs with OrthoFinder or OMA.
bio-comparative-genomics-synteny-analysisDetect syntenic blocks between genomes with MCScan or SyRI.
bio-comparative-genomics-positive-selectionTest for positive selection with PAML, HyPhy, or dN/dS ratios.
bio-comparative-genomics-hgt-detectionDetect horizontal gene transfer events in microbial genomes.
bio-comparative-genomics-ancestral-reconstructionReconstruct ancestral sequences and traits with ASR methods.
bio-phylo-tree-ioRead/write phylogenetic trees in Newick, Nexus, PhyloXML formats.
bio-phylo-modern-tree-inferenceMaximum likelihood tree inference with IQ-TREE 2 or FastTree.
bio-phylo-tree-manipulationRoot, prune, reorder, and annotate phylogenetic trees.
bio-phylo-tree-visualizationVisualize trees with iTOL, ETE3, or ggtree.
bio-phylo-distance-calculationsCompute pairwise phylogenetic distances and diversity metrics.

Systems Biology & Metabolic Modeling

Click to expand skill list
SkillDescription
bio-systems-biology-flux-balance-analysisFlux balance analysis (FBA) with COBRApy for metabolic network modeling.
bio-systems-biology-metabolic-reconstructionReconstruct genome-scale metabolic models from genome annotations.
bio-systems-biology-gene-essentialityPredict essential genes by single gene knockouts in metabolic models.
bio-systems-biology-context-specific-modelsBuild context-specific metabolic models from expression data (GIMME, iMAT).
bio-systems-biology-model-curationCurate SBML metabolic models: mass/charge balance, gap filling.

Experimental Design & Reporting

Click to expand skill list
SkillDescription
bio-experimental-design-sample-sizePower analysis and sample size calculation for omics experiments.
bio-experimental-design-power-analysisStatistical power analysis for detecting differential signals.
bio-experimental-design-batch-designOptimize sample batching to minimize confounding with ComBat design.
bio-experimental-design-multiple-testingMultiple testing correction: Bonferroni, BH/FDR, q-values.
bio-machine-learning-omics-classifiersTrain classifiers on omics data: random forest, SVM, XGBoost.
bio-machine-learning-biomarker-discoveryIdentify biomarkers from omics data with LASSO, elastic net, SHAP.
bio-machine-learning-model-validationCross-validation, AUC-ROC, calibration, and permutation testing.
bio-machine-learning-survival-analysisSurvival ML: RSF, DeepSurv, CoxBoost from omics features.
bio-machine-learning-atlas-mappingMap query cells to reference atlases with scANVI or Symphony.
bio-machine-learning-prediction-explanationExplain omics ML predictions with SHAP and feature importance.
bio-reporting-automated-qc-reportsGenerate automated MultiQC-style reports for omics pipelines.
bio-reporting-jupyter-reportsCreate Jupyter notebook reports with reproducible analysis code.
bio-reporting-rmarkdown-reportsRender Rmarkdown reports with integrated plots and statistics.
bio-reporting-quarto-reportsBuild Quarto multi-format reports (HTML/PDF) from analysis code.
bio-reporting-figure-exportExport publication-quality figures in PDF/SVG/TIFF at specified DPI.
bio-research-tools-biomarker-signature-studioBuild, validate, and visualize multi-omic biomarker signatures.

End-to-End Workflow Pipelines

Click to expand skill list
SkillDescription
bio-workflows-fastq-to-variantsComplete FASTQ โ†’ alignment โ†’ variant calling pipeline.
bio-workflows-rnaseq-to-deRNA-seq โ†’ alignment โ†’ counts โ†’ DESeq2 differential expression.
bio-workflows-scrnaseq-pipelineSingle-cell RNA-seq end-to-end: Cell Ranger โ†’ Scanpy โ†’ clustering.
bio-workflows-atacseq-pipelineATAC-seq: trimming โ†’ alignment โ†’ peak calling โ†’ differential.
bio-workflows-chipseq-pipelineChIP-seq: alignment โ†’ peak calling โ†’ motif analysis โ†’ annotation.
bio-workflows-methylation-pipelineWGBS/RRBS: bismark alignment โ†’ methylation calling โ†’ DMR detection.
bio-workflows-metagenomics-pipelineMetagenomics: QC โ†’ classification โ†’ functional profiling โ†’ AMR.
bio-workflows-metabolomics-pipelineLC-MS/GC-MS: preprocessing โ†’ annotation โ†’ statistical analysis.
bio-workflows-proteomics-pipelineDDA/DIA proteomics: search โ†’ quantification โ†’ differential abundance.
bio-workflows-gwas-pipelineGWAS: QC โ†’ imputation โ†’ association โ†’ fine-mapping โ†’ annotation.
bio-workflows-somatic-variant-pipelineTumor-normal somatic variant calling with GATK Mutect2/Strelka2.
bio-workflows-cnv-pipelineCopy number variant detection: WGS/WES CNV calling and annotation.
bio-workflows-spatial-pipelineSpatial transcriptomics: alignment โ†’ deconvolution โ†’ domain detection.
bio-workflows-multi-omics-pipelineMulti-omics integration pipeline: MOFA, SNF, similarity network fusion.
bio-workflows-multiome-pipeline10x Multiome: joint scRNA-seq + scATAC-seq processing and integration.
bio-workflows-hic-pipelineHi-C contact map generation, normalization, TAD/loop calling.
bio-workflows-neoantigen-pipelineNeoantigen prediction: somatic variants โ†’ MHC binding โ†’ immunogenicity.
bio-workflows-microbiome-pipelineMicrobiome: 16S/ITS amplicon or shotgun โ†’ diversity โ†’ differential.
bio-workflows-crispr-screen-pipelineCRISPR screen: guide counting โ†’ MAGeCK โ†’ hit calling โ†’ visualization.
bio-workflows-crispr-editing-pipelineCRISPR editing: amplicon sequencing โ†’ CRISPResso2 โ†’ outcome analysis.
bio-workflows-tcr-pipelineTCR/BCR: V(D)J alignment โ†’ clonotype โ†’ repertoire analysis.
bio-workflows-riboseq-pipelineRibo-seq: footprint alignment โ†’ periodicity โ†’ ORF detection.
bio-workflows-smrna-pipelineSmall RNA-seq: miRNA identification โ†’ quantification โ†’ DE analysis.
bio-workflows-merip-pipelineMeRIP-seq: m6A peak calling โ†’ differential โ†’ motif analysis.
bio-workflows-clip-pipelineCLIP-seq: peak calling โ†’ binding site annotation โ†’ motif discovery.
bio-workflows-imc-pipelineImaging mass cytometry: segmentation โ†’ phenotyping โ†’ spatial analysis.
bio-workflows-cytometry-pipelineFlow/mass cytometry: QC โ†’ gating โ†’ clustering โ†’ differential.
bio-workflows-longread-sv-pipelineLong-read structural variant calling and annotation pipeline.
bio-workflows-genome-assembly-pipelineDe novo genome assembly: raw reads โ†’ assembly โ†’ QC โ†’ annotation.
bio-workflows-outbreak-pipelinePathogen genomics: assembly โ†’ typing โ†’ phylodynamics โ†’ transmission.
bio-workflows-biomarker-pipelineBiomarker discovery: omics โ†’ feature selection โ†’ validation โ†’ report.
bio-workflows-metabolic-modeling-pipelineMetabolic model reconstruction โ†’ FBA โ†’ simulation โ†’ visualization.
bio-splicing-pipelineAlternative splicing analysis: rMATS โ†’ PSI โ†’ differential โ†’ sashimi.
bio-liquid-biopsy-pipelineLiquid biopsy: cfDNA/ctDNA QC โ†’ mutation detection โ†’ TMB โ†’ MRD.
bio-workflow-management-snakemake-workflowsCreate and manage Snakemake reproducible bioinformatics workflows.
bio-workflow-management-nextflow-pipelinesBuild and run Nextflow (DSL2) bioinformatics pipelines.
bio-workflow-management-cwl-workflowsWrite Common Workflow Language (CWL) portable workflow definitions.
bio-workflow-management-wdl-workflowsCreate WDL workflows for Terra/Cromwell bioinformatics execution.
bio-workflows-expression-to-pathwaysEnd-to-end workflow from differential expression to GO/KEGG/Reactome enrichment and pathway visualization.

Data Visualization (Bioinformatics)

Click to expand skill list
SkillDescription
bio-data-visualization-heatmaps-clusteringHierarchical clustering heatmaps with ComplexHeatmap or seaborn.
bio-data-visualization-volcano-customizationCustomized volcano plots with ggplot2 or matplotlib for DE results.
bio-data-visualization-circos-plotsCircular genome visualization with Circos or pycirclize.
bio-data-visualization-genome-browser-tracksGenerate genome browser tracks and IGV sessions from BAM/BigWig.
bio-data-visualization-genome-tracksMulti-panel genome track plots with pyGenomeTracks.
bio-data-visualization-ggplot2-fundamentalsR ggplot2 for publication-quality genomics and omics figures.
bio-data-visualization-interactive-visualizationInteractive omics visualizations with Plotly, Bokeh, or shiny.
bio-data-visualization-upset-plotsUpSet plots for multi-set intersection visualization.
bio-data-visualization-multipanel-figuresCompose multipanel publication figures with cowplot or patchwork.
bio-data-visualization-color-palettesScientific color palettes: colorblind-safe, perceptually uniform, diverging.
bio-data-visualization-specialized-omics-plotsSpecialized plots: lollipop (mutations), circomap, oncoprint.

Oncology & Precision Medicine Agents (BioOS)

Click to expand skill list
SkillDescription
autonomous-oncology-agentAutonomous oncology research agent: literature mining, trial matching, biomarker analysis, and treatment hypothesis generation.
precision-oncology-agentPrecision oncology: tumor molecular profiling โ†’ actionable alterations โ†’ treatment recommendations.
pan-cancer-multiomics-agentPan-cancer multi-omics integration for cross-cancer pattern discovery and driver identification.
tumor-clonal-evolution-agentModel tumor clonal evolution: phylogenetic trees, clonal dynamics, branching patterns from somatic variants.
tumor-heterogeneity-agentAnalyze intratumoral heterogeneity from bulk and single-cell sequencing data.
tumor-mutational-burden-agentCompute TMB and assess its predictive value for immunotherapy response.
chromosomal-instability-agentQuantify chromosomal instability (CIN) from copy number and SV data.
cancer-metabolism-agentAnalyze tumor metabolic reprogramming from transcriptomic and metabolomic data.
liquid-biopsy-analytics-agentComprehensive liquid biopsy analytics: ctDNA detection, MRD monitoring, treatment response.
ctdna-dynamics-mrd-agentTrack ctDNA dynamics for minimal residual disease detection and treatment monitoring.
mrd-edge-detection-agentUltra-sensitive MRD detection from deep sequencing with error suppression.
hrd-analysis-agentHomologous recombination deficiency (HRD) analysis for PARP inhibitor response prediction.
computational-pathology-agentComputational pathology: WSI analysis, tissue segmentation, histological feature extraction.
multimodal-radpath-fusion-agentFuse radiology and pathology imaging for integrated cancer phenotyping.
radiomics-pathomics-fusion-agentExtract radiomic and pathomic features and integrate for predictive modeling.
radgpt-radiology-reporterAI-assisted radiology report generation from imaging findings.
organoid-drug-response-agentAnalyze drug response in patient-derived organoids for personalized therapy prediction.
pdx-model-analysis-agentPatient-derived xenograft model analysis for drug efficacy and biomarker discovery.
deep-visual-proteomics-agentDeep visual proteomics: spatial proteomic analysis from laser-capture microdissection MS data.
exosome-ev-analysis-agentExtracellular vesicle and exosome analysis: cargo profiling and biomarker discovery.
microbiome-cancer-agentTumor microbiome analysis and its role in cancer progression and immunotherapy response.
bio-fragment-analysisAnalyze cfDNA fragment size distributions and fragmentomics features (FinaleToolkit/Griffin) for cancer detection and tissue-of-origin assessment.

Hematology & Blood Disorders (BioOS)

Click to expand skill list
SkillDescription
myeloma-mrd-agentMultiple myeloma MRD assessment from flow cytometry and NGS data.
mpn-progression-monitor-agentMyeloproliferative neoplasm progression monitoring from serial molecular data.
mpn-research-assistantResearch assistant for myeloproliferative neoplasms: literature, mutation analysis, treatment.
bone-marrow-ai-agentBone marrow analysis: blast counting, immunophenotyping, disease classification.
hemoglobinopathy-analysis-agentHemoglobin variant analysis, sickle cell, and thalassemia genotype-phenotype assessment.
chip-clonal-hematopoiesis-agentClonal hematopoiesis of indeterminate potential (CHIP) variant detection and risk assessment.
coagulation-thrombosis-agentCoagulation pathway analysis, thrombophilia assessment, anticoagulation guidance.

Immunology & Cell Therapy (BioOS)

Click to expand skill list
SkillDescription
cart-design-optimizer-agentOptimize CAR-T cell construct design: scFv selection, linker, co-stimulatory domain.
armored-cart-design-agentDesign armored CAR-T cells with cytokine payloads and resistance mechanisms.
tcell-exhaustion-analysis-agentAnalyze T cell exhaustion from scRNA-seq and ATAC-seq data.
nk-cell-therapy-agentNK cell therapy design: receptor engineering, expansion protocols, persistence.
tcr-pmhc-prediction-agentPredict TCR-pMHC binding affinity and selectivity for TCR therapy design.
tcr-repertoire-analysis-agentTCR repertoire analysis: V(D)J usage, clonotype dynamics, antigen specificity.
immune-checkpoint-combination-agentPredict optimal immune checkpoint combination strategies from tumor immune microenvironment.
tme-immune-profiling-agentTumor microenvironment immune profiling: cell type deconvolution and spatial mapping.
cytokine-storm-analysis-agentCytokine storm detection, severity scoring, and intervention modeling.

Single-Cell & Spatial Agents (BioOS)

Click to expand skill list
SkillDescription
cellagent-annotationAI-driven single-cell cluster annotation using marker gene databases.
universal-single-cell-annotatorUniversal scRNA-seq annotator using foundation models and multi-reference integration.
scfoundation-model-agentSingle-cell foundation model inference (scFoundation/scGPT) for zero-shot annotation.
rna-velocity-agentRNA velocity analysis with scVelo for trajectory and fate decision inference.
spatial-transcriptomics-agentEnd-to-end spatial transcriptomics analysis: QC, deconvolution, domain detection.
spatial-transcriptomics-analysisSpatial transcriptomics analysis with Squidpy and SpatialDE.
spatial-agentSpatial omics agent: integrate spatial data with imaging, protein, and genomic layers.
nicheformer-spatial-agentSpatial niche analysis with Nicheformer foundation model for tissue microenvironment.
spatial-epigenomics-agentSpatial epigenomics analysis: spatially resolved chromatin accessibility and gene regulation.
bioinformatics-singlecellGeneral single-cell bioinformatics: clustering, trajectory, cell communication.
scrna-qcSingle-cell RNA-seq quality control: doublet removal, ambient RNA, filtering thresholds.
compbioagent-explorerComputational biology exploration agent for multi-omics dataset analysis.
simo-multiomics-integration-agentSingle-cell multi-omics integration with SIMO/MOFA+ for joint embedding.
epigenomics-methylgpt-agentEpigenomics and DNA methylation analysis with MethylGPT-inspired approaches.
biomaster-workflowsBioMaster workflow orchestration for end-to-end bioinformatics analyses.

Drug Discovery & Design (BioOS)

Click to expand skill list
SkillDescription
agentd-drug-discoveryAgentD autonomous drug discovery: target identification, hit finding, ADMET optimization.
chematagent-drug-discoveryCheMatAgent: chemistry-aware drug design with retrosynthesis and property optimization.
chemcrow-drug-discoveryChemCrow drug discovery toolkit: web search, Python, chemical tools integration.
medea-therapeutic-discoveryMEDEA therapeutic discovery: multimodal evidence aggregation for target-disease validation.
molecule-evolution-agentDirected molecular evolution: generative models for compound optimization and library design.
molecular-glue-discovery-agentMolecular glue discovery: induced proximity degraders and ternary complex stabilizers.
protac-design-agentPROTAC design: E3 ligase ligand selection, linker optimization, ternary complex modeling.
tpd-ternary-complex-agentTargeted protein degradation ternary complex modeling and cooperativity prediction.
mage-antibody-generatorMAGE antibody generator: sequence design, humanization, affinity maturation.
antibody-design-agentAntibody design: epitope mapping, CDR engineering, bispecific construction.
aav-vector-design-agentAAV vector design: capsid selection, promoter optimization, payload capacity.
protein-structure-predictionProtein structure prediction with AlphaFold3, ESMFold, or Boltz with comparison.
crispr-guide-designCRISPR guide RNA design with on-target scoring and off-target minimization.
crispr-offtarget-predictorPredict CRISPR Cas9/Cas12 off-target sites genome-wide with CRISPOR/Cas-OFFinder.
chemical-property-lookupLook up chemical properties from PubChem, ChEMBL, DrugBank by name/SMILES.
chemistry-agentGeneral chemistry agent for synthesis planning, reaction prediction, and property calculation.
cryoem-ai-drug-design-agentAI-guided drug design from cryo-EM structures: binding site analysis and docking.
time-resolved-cryoem-agentTime-resolved cryo-EM analysis for dynamic structural biology.
cnv-caller-agentSpecialized CNV detection agent integrating multiple callers with ensemble scoring.
popeve-variant-predictor-agentVariant pathogenicity prediction using EVE population-based evolutionary models.
varcadd-pathogenicityVARCADD pathogenicity scoring for coding variants from structure and evolution.
variant-interpretation-acmgACMG/AMP variant interpretation with evidence-based classification framework.
gene-panel-design-agentDesign targeted gene panels for clinical or research sequencing applications.
pharmacogenomics-agentPharmacogenomics analysis: variant-drug interaction prediction and dosing recommendations.
multi-ancestry-prs-agentMulti-ancestry polygenic risk score computation with ancestry-specific weighting.
prs-net-deep-learning-agentDeep learning PRS prediction with PRSnet for complex traits.
cellfree-rna-agentCell-free RNA analysis: plasma cfRNA profiling for liquid biopsy diagnostics.
long-read-sequencing-agentLong-read sequencing analysis: SV calling, methylation, isoform discovery, assembly.
bayesian-optimizerBayesian optimization for experimental design and hyperparameter tuning in biomedical research.

Clinical AI & Healthcare (BioOS)

Click to expand skill list
SkillDescription
chatehr-clinician-assistantEHR clinical assistant: note summarization, structured data extraction, clinical decision support.
clinical-note-summarizationSummarize clinical notes into structured SOAP format with key findings.
clinical-nlp-extractorExtract clinical entities (diagnoses, medications, procedures) from unstructured text.
ehr-fhir-integrationEHR-FHIR integration: HL7 FHIR resource creation, querying, and workflow automation.
fhir-developmentFHIR API development: build SMART on FHIR apps and FHIR resource endpoints.
digital-twin-clinical-agentCreate patient digital twins for treatment simulation and outcome prediction.
trial-eligibility-agentAssess patient eligibility for clinical trials from EHR data and trial criteria.
trialgpt-matchingTrialGPT patient-to-trial matching with eligibility assessment from clinical notes.
wearable-analysis-agentAnalyze wearable sensor data: activity, sleep, HRV, ECG for health monitoring.
multimodal-medical-imagingMultimodal medical imaging analysis: CT, MRI, PET fusion and segmentation.
prior-auth-coworkerPrior authorization workflow assistant for insurance approval processes.
care-coordinationCare coordination agent: multi-disciplinary team communication and care plan management.
claims-appealsInsurance claims appeals: documentation preparation and denial reasoning analysis.
lab-resultsLab result interpretation: reference ranges, trend analysis, critical value alerts.
drug-interaction-checkerCheck drug-drug interactions from patient medication lists with severity scoring.
regulatory-drafterDraft regulatory submissions: FDA, EMA, ICH document preparation.
regulatory-draftingRegulatory writing and document structuring for medical device/drug submissions.
biomedical-data-analysisComprehensive biomedical data analysis: statistics, visualization, and interpretation.
data-visualization-biomedicalBiomedical-specific data visualization: clinical trial plots, survival curves, forest plots.

Research Infrastructure & Agents (BioOS)

Click to expand skill list
SkillDescription
biomni-general-agentBioMni general biomedical agent for flexible multi-step research tasks.
biomni-research-agentBioMni research-focused agent with literature, database, and analysis integration.
biokernelBioKernel: unified computational kernel for bioinformatics tool orchestration.
biomcp-serverBioMCP: Model Context Protocol server for bioinformatics tool access.
mcpmed-bioinformatics-serverMCP server providing medical bioinformatics tool access to agents.
kragen-knowledge-graphKRAGEN knowledge graph for biomedical entity relationships and reasoning.
leads-literature-miningLEADS literature mining: automated extraction of biological findings from papers.
knowledge-synthesisSynthesize knowledge from multiple biomedical sources into structured summaries.
deep-research-swarmMulti-agent swarm for deep scientific research with parallel literature synthesis.
research-literatureResearch literature management: search, organize, and synthesize scientific papers.
search-strategyDesign systematic search strategies for scientific literature and databases.
scientific-manuscriptScientific manuscript writing and revision with journal-specific formatting.
cellular-senescence-agentCellular senescence analysis: marker scoring, SASP profiling, tissue aging assessment.
ngs-analysisNext-generation sequencing data analysis orchestration and QC.
opentrons-protocol-agentOpentrons liquid handler protocol design for automated lab workflows.
virtual-lab-agentVirtual lab agent for in silico experiment simulation and protocol optimization.
data-visualization-expertExpert data visualization for complex scientific and clinical datasets.
lobster-bioinformaticsRun bioinformatics analyses via Lobster AI: scRNA-seq, bulk RNA-seq, literature mining, dataset discovery, QC, and visualization.

๐Ÿ“Š Data Science & Tools

Expand/Collapse this category

Statistics & Data Analysis

Click to expand skill list
SkillDescription
statistical-analysisStatistical analysis toolkit. Hypothesis tests (t-test, ANOVA, chi-square), regression, correlation, Bayesian stats, power analysis, assumption checks, APA reporting.
statsmodelsStatistical modeling: OLS, GLM, logistic, ARIMA, time series, hypothesis tests, diagnostics, AIC/BIC, for rigorous statistical inference.
pymcBayesian modeling with PyMC. Build hierarchical models, MCMC (NUTS), variational inference, LOO/WAIC comparison, posterior checks, for probabilistic programming.
simpyProcess-based discrete-event simulation for clinical systems: queues, resources, time-based events. Useful for modeling hospital workflows and patient flow.
exploratory-data-analysisComprehensive exploratory data analysis on scientific data files across 200+ file formats โ€” structure, content, quality assessment, and visualization.
data-stats-analysisStatistical tests, hypothesis testing, correlation analysis, and multiple testing corrections using scipy and statsmodels (OmicVerse).
data-transformTransform, clean, reshape, and preprocess biological data using pandas and numpy (OmicVerse).
data-viz-plotsCreate publication-quality plots and visualizations using matplotlib and seaborn (OmicVerse).
scientific-visualizationCreate publication figures with matplotlib/seaborn/plotly. Multi-panel layouts, error bars, significance markers, colorblind-safe, PDF/EPS/TIFF export.

Lab Automation & Integration

Click to expand skill list
SkillDescription
opentrons-integrationLab automation platform for Flex/OT-2 robots. Write Protocol API v2 protocols, liquid handling, hardware modules (heater-shaker, thermocycler), labware management.
pylabrobotLaboratory automation toolkit for controlling liquid handlers, plate readers, pumps, heater shakers, incubators, centrifuges, and analytical equipment.
benchling-integrationBenchling R&D platform integration. Access registry (DNA, proteins), inventory, ELN entries, workflows via API, build Benchling Apps, for lab data management automation.
labarchive-integrationElectronic lab notebook API integration. Access notebooks, manage entries/attachments, backup notebooks, integrate with Protocols.io/Jupyter/REDCap.
protocolsio-integrationIntegration with protocols.io API for managing scientific protocols โ€” search, create, update, publish protocols, and manage protocol steps and reagents.
instrument-data-to-allotropeConvert laboratory instrument output files (PDF, CSV, Excel, TXT) to Allotrope Simple Model (ASM) JSON format for LIMS systems, data lakes, and downstream analysis.

Scientific Research & Writing

Click to expand skill list
SkillDescription
scientific-writingWrite scientific manuscripts in full paragraphs using a two-stage process: section outlines then full text. Covers all sections of research papers.
scientific-critical-thinkingEvaluate research rigor. Assess methodology, experimental design, statistical validity, biases, confounding, evidence quality (GRADE, Cochrane ROB).
scientific-brainstormingResearch ideation partner. Generate hypotheses, explore interdisciplinary connections, challenge assumptions, develop methodologies, identify research gaps.
hypothesis-generationGenerate testable hypotheses. Formulate from observations, design experiments, explore competing explanations, develop predictions, propose mechanisms.
scientific-problem-selectionHelp scientists with research problem selection, project ideation, troubleshooting stuck projects, and strategic scientific decisions.
peer-reviewSystematic peer review toolkit. Evaluate methodology, statistics, design, reproducibility, ethics, figure integrity, reporting standards, for manuscript and grant review.
citation-managementComprehensive citation management. Search Google Scholar and PubMed for papers, extract accurate metadata, validate citations, generate BibTeX entries.
research-grantsWrite competitive research proposals for NSF, NIH, DOE, and DARPA. Agency-specific formatting, review criteria, budget preparation, broader impacts.
research-lookupLook up current research using Perplexity's Sonar Pro Search or Sonar Reasoning Pro via OpenRouter. Automatically selects best model for the query complexity.
biomniAutonomous biomedical AI agent framework for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis.
treatment-plansGenerate concise (3-4 page) medical treatment plans in LaTeX/PDF format for all clinical specialties including general medicine, rehabilitation, mental health, and chronic disease.

Analyst Personas

Click to expand skill list
SkillDescription
biologist-analystExpert biologist analyst persona for interpreting biological experiments, sequencing data, cell biology assays, and molecular biology research.
chemist-analystExpert chemist analyst persona for interpreting chemical data, synthesis routes, spectroscopic results, reaction mechanisms, and laboratory analyses.
epidemiologist-analystExpert epidemiologist analyst persona for study design, cohort analysis, risk factor assessment, public health surveillance, and causal inference.
psychologist-analystExpert psychologist analyst persona for behavioral data analysis, psychological assessment interpretation, clinical case formulation, and mental health research.

Public Health & Time Series

Click to expand skill list
SkillDescription
datacommons-clientAccess public health statistics from Google Data Commons: disease prevalence, demographic data, health indicators across global sources.
timesfm-forecastingZero-shot time series forecasting with Google's TimesFM. For vital sign trends, health sensor data, and longitudinal health monitoring without custom model training.
aeonTime series ML: classification, regression, clustering, anomaly detection, segmentation for temporal health data and sequential clinical measurements.

Scientific Literature & Reference Management

Click to expand skill list
SkillDescription
bgpt-paper-searchSearch scientific papers with BGPT MCP server. Returns 25+ structured fields per paper: methods, results, sample sizes, quality scores. For literature reviews and evidence synthesis.
pyzoteroInteract with Zotero reference libraries programmatically via Zotero Web API v3. Retrieve, create, update items, export citations, upload PDFs, and build research automation workflows.
open-notebookSelf-hosted NotebookLM alternative. Ingest PDFs, videos, web pages, documents; generate AI-powered notes; chat with research materials; supports 16+ AI providers.

Data Processing & Scientific Computing

Click to expand skill list
SkillDescription
daskDistributed computing for larger-than-RAM genomics/omics datasets. Scale pandas/NumPy beyond memory, parallel file processing, distributed ML.
polarsFast in-memory DataFrame library (1-100GB). Faster pandas replacement for biomedical data ETL and analysis pipelines.
vaexOut-of-core DataFrame operations for billions of rows. Fast statistics and visualization for large genomic and clinical datasets.
zarr-pythonChunked N-D arrays for cloud storage. Compressed arrays, parallel I/O, S3/GCS integration for large-scale omics data.
pytorch-lightningOrganized PyTorch deep learning for biomedical AI: multi-GPU training, callbacks, logging, distributed training for clinical/genomic models.

Scientific Visualization & Communication

Click to expand skill list
SkillDescription
matplotlibLow-level plotting library for full customization. Publication-quality figures for scientific manuscripts and journals.
seabornStatistical visualization with pandas integration. Box plots, violin plots, heatmaps, pair plots for biomedical data exploration.
plotlyInteractive visualization. Hover info, zoom, dashboards for exploratory biomedical analysis and presentations.
infographicsCreate professional scientific infographics with iterative AI refinement. Supports 10 infographic types and 8 industry styles.
scientific-schematicsPublication-quality scientific diagrams: neural network architectures, biological pathways, system diagrams, flowcharts.
scientific-slidesBuild research presentation slide decks for conferences, seminars, thesis defenses. PowerPoint and LaTeX Beamer support.
latex-postersCreate professional research posters in LaTeX (beamerposter, tikzposter). Conference posters with multi-column layouts.
pptx-postersHTML/CSS research posters exportable to PDF or PPTX. Modern web-based poster design.
markdown-mermaid-writingScientific documentation with Markdown and 24 Mermaid diagram types. 9 document templates for scientific reports.
paper-2-webConvert academic papers to interactive websites, presentation videos, and conference posters (Paper2Web, Paper2Video, Paper2Poster).

Additional Scientific Tools

Click to expand skill list
SkillDescription
pymooMulti-objective optimization with PYMOO. Drug design parameter optimization, Pareto front analysis, evolutionary algorithms.
markitdownConvert documents (PDF, DOCX, PPTX, HTML, images) to Markdown for processing and analysis.
perplexity-searchAI-powered search via Perplexity for real-time scientific information retrieval.
geopandasGeospatial data analysis with GeoPandas. Epidemiology mapping, disease distribution, spatial health analytics.
hypogenicAutomated hypothesis generation and testing on tabular datasets. Combine literature insights with data-driven hypothesis validation.
pdf-processingAdvanced PDF processing: text extraction, table parsing, annotation, form filling.
pdf-processing-proProfessional PDF processing with enhanced OCR, multi-column layout handling, and batch processing.
pdf-anthropicAnthropic-optimized PDF analysis for scientific and medical document comprehension.
xlsx-officialOfficial Excel/XLSX skill for spreadsheet creation, analysis, and data management.
docx-officialOfficial Word/DOCX skill for document creation, editing, and formatting.
pptx-officialOfficial PowerPoint/PPTX skill for presentation creation and editing.

Computational Simulation & Ontology (HeshamFS/materials-simulation-skills)

Click to expand skill list
SkillDescription
ontology-validatorValidate biomedical ontology structures and term relationships (HPO, GO, MeSH, SNOMED, OBO).
ontology-explorerNavigate and query biomedical ontologies: term hierarchies, annotations, cross-references.
ontology-mapperMap between biomedical ontologies: HPOโ†”OMIM, GOโ†”UniProt, diseaseโ†”phenotype cross-ontology.
slurm-job-script-generatorGenerate SLURM sbatch scripts for HPC genomics/bioinformatics pipeline jobs with optimized resource requests.
numerical-integrationSelect and configure ODE/PDE time integration for biological model simulation (stiff systems, IMEX).
nonlinear-solversConfigure nonlinear solvers for biological network optimization, parameter fitting, FBA.
parameter-optimizationDesign of experiments, sensitivity analysis, Bayesian optimization for biological model calibration.
linear-solversSelect linear solvers for large-scale biological network and metabolic model computations.
numerical-stabilityAnalyze numerical stability for time-dependent biological simulations (CFL criteria, stiffness).
simulation-orchestratorOrchestrate multi-simulation campaigns: parameter sweeps, batch jobs, result aggregation.
simulation-validatorValidate simulations: pre-flight checks, runtime monitoring, convergence, NaN/Inf detection.
convergence-studySpatial/temporal convergence analysis with Richardson extrapolation for simulation verification.
post-processingExtract, analyze, and visualize simulation output data: time series, field profiles, statistics.
performance-profilingIdentify computational bottlenecks, analyze scaling behavior, optimize HPC simulation jobs.
differentiation-schemesSelect finite difference/volume/spectral schemes for PDE discretization in biological models.
time-steppingAdaptive time-step control for biological dynamics: CFL constraints, checkpoint scheduling.
mesh-generationMesh generation for numerical simulations: resolution, quality metrics, adaptive refinement.

Developer Workflow Skills (obra/superpowers)

Click to expand skill list
SkillDescription
test-driven-developmentTDD workflow: write tests before implementation, red-green-refactor cycle for reliable code.
systematic-debuggingStructured debugging approach: hypothesis formation, evidence gathering, root cause analysis.
dispatching-parallel-agentsOrchestrate parallel subagents for independent tasks to maximize throughput.
writing-plansWrite structured implementation plans before touching code for complex multi-step tasks.
executing-plansExecute written implementation plans with review checkpoints in isolated sessions.
brainstormingStructured creative exploration of requirements and design before implementation.
writing-skillsCreate and verify new SKILL.md skills with proper format and deployment validation.
verification-before-completionRun verification commands and confirm outputs before claiming work is complete.
requesting-code-reviewStructure code review requests with context, changes summary, and specific questions.
receiving-code-reviewProcess code review feedback with technical rigor rather than blind acceptance.
subagent-driven-developmentBreak development tasks into subtasks for parallel subagent execution.
using-git-worktreesCreate isolated git worktrees for feature work and plan execution.
finishing-a-development-branchComplete development branches: merge, PR, or cleanup with structured decision options.
using-superpowersMeta-skill: discover and use available skills for any task at conversation start.

Acknowledgements

We have benefited from the following excellent projects. If youโ€™re interested, please check them out.

  1. awesome-claude-skills
  2. Anthropics Skills
  3. Skillsmp
  4. awesome-claude-skills
  5. ClawHub
  6. ๆฐดไบงๅธ‚ๅœบ
  7. Skills.Sh
  8. awesome-agent-skills
  9. llmbase
  10. OpenClaw
  11. awesome-openclaw-skills
  12. claude-scientific-skills
  13. LLMs-Universal-Life-Science-and-Clinical-Skills-
  14. BioClaw
  15. ClawBio

Star History

Star History Chart

ๅ…ณไบŽ About

The largest open-source medical AI skills library for OpenClaw๐Ÿฆž.
awesomeclaude-codeclawhubmedicalnanoclawopenclawopenclaw-skillsskills

่ฏญ่จ€ Languages

Python67.2%
Jupyter Notebook23.3%
TeX2.4%
Shell2.0%
R1.9%
HTML1.6%
JavaScript1.2%
CSS0.1%
Gherkin0.1%
TypeScript0.0%
Makefile0.0%
Common Workflow Language0.0%
WDL0.0%
Nextflow0.0%
Batchfile0.0%
Dockerfile0.0%

ๆไบคๆดป่ทƒๅบฆ Commit Activity

ไปฃ็ ๆไบค็ƒญๅŠ›ๅ›พ
่ฟ‡ๅŽป 52 ๅ‘จ็š„ๅผ€ๅ‘ๆดป่ทƒๅบฆ
46
Total Commits
ๅณฐๅ€ผ: 29ๆฌก/ๅ‘จ
Less
More

ๆ ธๅฟƒ่ดก็Œฎ่€… Contributors