
KG-Microbe -- Knowledge graph construction for microbial traits and beyond
Setup
- Install pipx
- Install poetry using
pipx install poetry git clone https://github.com/Knowledge-Graph-Hub/kg-microbe.gitcd kg-microbepoetry install
Download resources needed
poetry run kg download: This will download the resources needed for this project.
Transform
poetry run kg transform: This transforms the resources into knowledge graphs (KGs).
Merge
poetry run kg merge: This merges all transformed graphs above.
Release
Requirements
In order to be able to make KG releases on this repository, you'll need:
- Appropriate permissions to this repository.
- A Github token that has permissions on this repository. This is how you set it in GitHub. Make sure your token has access to this project.
- Save this token locally assigned to the environemnt variable
GH_TOKEN
or add it to yourexport GH_TOKEN = XXXX~/.bash_profileor~/.bashrcfile.
It should be noted that the KG construction process, particularly the transform step involving trimming of NCBI Taxonomy for any KG and the steps involving the microbial UniProt dataset for KG-Microbe-Function and KG-Microbe-Biomedical-Function, is computationally intensive. Successful execution on a local machine may require significant memory resources (e.g., >500 GB of RAM), further details can be found in the project's code repository.
Contributors
Please remember to run poetry run tox before every commit to make sure the code you commit is error-free.
Environment Variables
If you need to use environment variables for this project, copy .env.example to .env and set the environment variables accordingly:
cp .env.example .env
Then edit the .env file to configure the required environment variables for your setup.
Acknowledgements
This cookiecutter project was developed from the kg-cookiecutter template and will be kept up-to-date using cruft.
Version History
master @ e2861c4 (earliest) Created 5th Jan 2026 at 06:36 by marcin joachimiak
Merge pull request #479 from Knowledge-Graph-Hub/trembl-fix
Filter out TrEMBL protein nodes from EC ontology transform
Frozen
master
e2861c4
Creators and SubmitterCreator
Submitter
Views: 830 Downloads: 128
Created: 5th Jan 2026 at 06:36
Tags
AttributionsNone
View on GitHub
https://orcid.org/0000-0001-8175-045X