EnDecon

R package supporting the paper "EnDecon: cell type deconvolution of spatially resolved transcriptomics data via ensemble learning".

EnDecon applies ensemble learning to the deconvolution of spatial transcriptomic data: it integrates the results of multiple base deconvolution methods using a weighted optimization model to generate a more accurate result. EnDecon consists of two main steps: (1) running each base deconvolution method individually to obtain the base cell type deconvolution results, and (2) integrating these base results into a single, more accurate result using a newly proposed ensemble strategy. EnDecon obtains the ensemble result by alternately updating (a) the ensemble result, as a weighted median of the base deconvolution results, and (b) the weights of the base results, based on their distance from the ensemble result.
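
The alternating scheme described above can be sketched in base R. This is a minimal illustration under stated assumptions, not the code behind solve_ensemble: the function names weighted_median and ensemble_sketch, the L1 distance, the exponential weight rule and the lambda parameter are all hypothetical choices made for the example.

```r
# Weighted median of a numeric vector x with non-negative weights w.
weighted_median <- function(x, w) {
  ord <- order(x)
  x <- x[ord]
  w <- w[ord]
  cum <- cumsum(w) / sum(w)
  x[which(cum >= 0.5)[1]]
}

# Hypothetical sketch: alternate between the ensemble estimate and the weights.
# base_results: list of spot-by-cell-type proportion matrices of equal dimension.
ensemble_sketch <- function(base_results, lambda = NULL, n_iter = 50, tol = 1e-6) {
  K <- length(base_results)
  stacked <- simplify2array(base_results)  # spots x cell types x K
  w <- rep(1 / K, K)
  H <- NULL
  for (iter in seq_len(n_iter)) {
    # Step 1: entry-wise weighted median across the K base results.
    H <- apply(stacked, c(1, 2), weighted_median, w = w)
    # Step 2: L1 distance of each base result from the current ensemble.
    d <- vapply(base_results, function(Hk) sum(abs(Hk - H)), numeric(1))
    # Assumed weight rule: a smaller distance gives a larger weight.
    lam <- if (is.null(lambda)) max(mean(d), .Machine$double.eps) else lambda
    w_new <- exp(-(d - min(d)) / lam)
    w_new <- w_new / sum(w_new)
    converged <- max(abs(w_new - w)) < tol
    w <- w_new
    if (converged) break
  }
  # Renormalise rows so the proportions in each spot sum to one.
  rs <- rowSums(H)
  rs[rs == 0] <- 1
  list(H = H / rs, w = w)
}
```

With two base results that agree and one that differs, the sketch assigns the smallest weight to the one that differs, which is the behaviour the description suggests.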

The EnDecon package has the following main R package dependencies: SCDC, spacexr, MuSiC, DeconRNASeq, DWLS, Seurat, SPOTlight, Giotto, STdeconvolve, spatstat.geom, CARD, parallel, doParallel, foreach and reticulate. It also depends on several Python packages: scvi-tools, cell2location, scanpy and anndata. Most of the R dependencies are installed along with the EnDecon R package when you run:

devtools::install_github("Zhangxf-ccnu/EnDecon")

However, if the dependencies are not installed correctly, please install them yourself by following the instructions below. We checked all code and examples in the package on a computer running Ubuntu 18.04 with 64 GB RAM, an i7-10700 CPU and an RTX 3080 GPU. We also tested the R package on a Linux server with an A100 GPU.
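
To see which R dependencies still need manual installation, a check along the following lines may help. It is a sketch that uses only base R; the package names are taken from the dependency list in this README, and the variable names are illustrative.

```r
# R packages named in this README. requireNamespace() checks that a package
# can be loaded without attaching it to the search path.
deps <- c("SCDC", "spacexr", "MuSiC", "DeconRNASeq", "DWLS", "Seurat",
          "SPOTlight", "Giotto", "STdeconvolve", "spatstat.geom", "CARD",
          "parallel", "doParallel", "foreach", "reticulate")
installed <- vapply(deps, requireNamespace, logical(1), quietly = TRUE)
missing_deps <- deps[!installed]
if (length(missing_deps) > 0) {
  message("Missing R dependencies: ", paste(missing_deps, collapse = ", "))
} else {
  message("All listed R dependencies are installed.")
}
```

Any package reported as missing can then be installed with the matching command in the sections below.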

Install individual dependencies

Install python dependencies

# Create the EnDecon Python environment with the GPU version of PyTorch
conda env create -f requirments_EnDecon_GPU.yml
# Create the EnDecon Python environment with the CPU version of PyTorch
conda env create -f requirments_EnDecon_CPU.yml

If you want to include DWLS, SpatialDWLS, Stereoscope and cell2location in the ensemble, we advise installing Anaconda and running the commands above in a terminal (Ubuntu) or CMD (Windows) to install the Python dependencies these methods need. Because our computer has an RTX 3080 GPU, we installed PyTorch with cudatoolkit. If you do not want to use the provided *.yml files, you can install the Python dependencies with the following commands.

pip install scvi-tools
pip install cell2location
pip install scanpy
pip install anndata
pip install igraph
pip install networkx
pip install leidenalg
pip install community
pip install smfishHmrf
pip install scikit-learn
# install the CPU or GPU version of PyTorch

After installing the Python dependencies, find the path to the Python interpreter of the conda environment and pass it as the python_env argument of the EnDecon_individual_methods function in our package. The path is similar to "~/.conda/envs/EnDecon_env/bin/python" on Ubuntu and "~/anaconda3/envs/EnDecon_env/python.exe" on Windows.
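
Before passing the path to EnDecon_individual_methods, it can be worth confirming that it points at an existing file. The helper below is a hypothetical convenience written for this README, not part of the package, and it uses only base R.

```r
# Hypothetical helper: expand "~" and report whether the interpreter exists.
check_python_env <- function(path) {
  full_path <- path.expand(path)
  found <- file.exists(full_path)
  if (!found) {
    warning("No Python interpreter found at: ", full_path)
  }
  invisible(found)
}

# Example path from this README; replace it with the path of your own environment.
python_env <- "~/.conda/envs/EnDecon_env/bin/python"
env_ok <- suppressWarnings(check_python_env(python_env))
```

If env_ok is FALSE, running `conda env list` in a terminal shows where the environment was actually created.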

Install R dependencies

SCDC

install.packages("remotes")
remotes::install_github("renozao/xbioc")
install.packages("devtools")
devtools::install_github("meichendong/SCDC")

RCTD

devtools::install_github("dmcable/spacexr", build_vignettes = FALSE)

MuSiC

devtools::install_github('xuranw/MuSiC')

DeconRNASeq

if (!require("BiocManager", quietly = TRUE))
  install.packages("BiocManager")
BiocManager::install("DeconRNASeq")

DWLS

remotes::install_github("sistia01/DWLS")

Seurat

install.packages("Seurat")

SPOTlight (Version 0.1.7)

devtools::install_github("https://github.com/MarcElosua/SPOTlight/tree/spotlight-0.1.7")

Giotto

devtools::install_github('RubD/Giotto')

spatstat.geom

install.packages("spatstat.geom")

CARD

devtools::install_github('YingMa0107/CARD')

STdeconvolve

require(remotes)
remotes::install_github('JEFworks-Lab/STdeconvolve')

parallel and doParallel

# parallel ships with base R, so only doParallel needs to be installed
install.packages("doParallel")

reticulate

install.packages('reticulate')

Run the example

library(EnDecon)
data("breast.sc.ref")
data("breast.sc.cell.label")
data("breast.st")
data("breast.st.loc")
# Path to the Python interpreter on our Ubuntu machine
python_env <- "~/.conda/envs/EnDecon_GPU/bin/python"
# Run the 14 individual deconvolution methods with default settings
Results.dec.mouse <- EnDecon_individual_methods(
  sc_exp = breast.sc.ref, sc_label = breast.sc.cell.label,
  spot_exp = breast.st, spot_loc = breast.st.loc,
  python_env = python_env, use_gpu = TRUE,
  gene_det_in_min_cells_per = 0.01,
  RCTD.CELL_MIN_INSTANCE = 5, saving_results = FALSE)
# Integrate the base results into the ensemble result
ensemble.results <- solve_ensemble(Results.dec.mouse[[1]])

Users can choose which individual deconvolution methods are used for ensemble learning by setting the corresponding parameters of the EnDecon_individual_methods function. For example, the following code runs cell2location, RCTD and CARD for ensemble learning.

Results.dec.mouse <- EnDecon_individual_methods(
  sc_exp = breast.sc.ref, sc_label = breast.sc.cell.label,
  spot_exp = breast.st, spot_loc = breast.st.loc,
  python_env = python_env, use_gpu = TRUE,
  gene_det_in_min_cells_per = 0.01,
  RCTD.CELL_MIN_INSTANCE = 5, saving_results = FALSE,
  SCDC = FALSE, RCTD = TRUE, MuSiC = FALSE, DeconRNASeq = FALSE,
  DestVI = FALSE, DWLS = FALSE, SPOTlight = FALSE, SpatialDWLS = FALSE,
  Stereoscope = FALSE, cell2location = TRUE, CARD = TRUE, STdeconvolve = FALSE)
ensemble.results <- solve_ensemble(Results.dec.mouse[[1]])

Recommendation for the selection of base deconvolution methods

Accuracy is important for a computational method, but running time also needs to be considered, so we also report the computational time required by the deconvolution methods. We measured running times on a workstation with an Intel Core i7-10700 CPU (2.90GHz*16), 64 GB RAM and an RTX 3080 GPU. The first figure presents the running times of the 14 individual methods and of our ensemble process on the six datasets across the three scenarios in the simulation experiments. Cell2location, DestVI, DWLS and Stereoscope require more time than the other methods. Once the individual deconvolution methods have been run, EnDecon integrates their results in a short time. In addition, we provide an overview of the deconvolution methods in terms of PCC, 1-RMSE, 1-JSD and running time on all simulated datasets (the second figure) to help users select appropriate individual deconvolution methods for integration.
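
For readers who want to compute comparable scores on their own simulated data, the three accuracy metrics can be sketched in base R as follows. The function names jsd and evaluate_sketch are hypothetical, and the choices of log base 2 and of averaging the JSD over spots are assumptions; this README does not state how the paper aggregates the metrics.

```r
# Jensen-Shannon divergence between two probability vectors.
# Log base 2 is assumed, which bounds the value between 0 and 1.
jsd <- function(p, q) {
  m <- (p + q) / 2
  kl <- function(a, b) sum(ifelse(a > 0, a * log2(a / b), 0))
  (kl(p, m) + kl(q, m)) / 2
}

# est and truth: spot-by-cell-type proportion matrices of equal dimension.
evaluate_sketch <- function(est, truth) {
  per_spot_jsd <- vapply(seq_len(nrow(est)),
                         function(i) jsd(est[i, ], truth[i, ]),
                         numeric(1))
  c(PCC = cor(as.vector(est), as.vector(truth)),
    one_minus_RMSE = 1 - sqrt(mean((est - truth)^2)),
    one_minus_JSD = 1 - mean(per_spot_jsd))
}
```

All three scores equal 1 when the estimated proportions match the true proportions exactly, so higher values indicate better agreement.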

Tutorials

Please do not hesitate to contact Prof. Zhang at [email protected] to seek any clarifications regarding any content or operation of the archive.
