sedatasets
a project to bring Bioconductor datasets to AI area
Python and R/Bioconductor packages sedatasets are developped to transfer SummarizedExperiment data structure to AI-friendly Huggingface datasets format.
Usage
Command line
python -m sedatasets.cli -h
Python Module
from sedatasets.se_convert import AD2Datasets, SE2Datasets
SE2Datasets(
efiles={"exp": "tests/data/rse_counts.csv"},
pfile="tests/data/rse_cdata.csv",
ffile="tests/data/rse_rdata.csv",
outdir='/tmp/rse',
)
AD2Datasets("tests/data/adata.h5ad", outdir='/tmp/anndata')