Single Cell
BioAlpha single-cell is an algorithmic framework that can run full single-cell analysis pipeline on Python for large biological datasets in realtime.
Import BioAlpha single-cell as:
from bioalpha import singlecell as sc
or:
from bioalpha import sc
Following the BioAlpha Tutorial and AUCell Tutorial to see how to use it.
Preprocessing: pp
bioalpha.singlecell.pp
provides basic transformation on data.
Filter cell outliers based on counts and numbers of genes expressed. |
|
Filter genes based on number of cells or counts. |
|
Calculate quality control metrics. |
|
Normalize counts per cell and calculate log1p. |
|
Logarithmize the data matrix. |
|
Annotate highly variable genes. |
|
Regress out (mostly) unwanted sources of variation. |
|
Scale data to unit variance and zero mean. |
|
Principal component analysis. |
|
Subsample to a fraction of the number of observations. |
|
Use harmony algorithm to integrate different experiments. |
|
Combine keys in |
|
Compute a neighborhood graph of observations. |
Tool: tl
Provide methods for an interpretable annotation and visualizing with parallel module on bioalpha.sc.pl
.
Cluster cells into subgroups. |
|
Cluster cells into subgroups. |
|
Run t-SNE t-distributed stochastic neighborhood embedding (tSNE). |
|
Embed the neighborhood graph using UMAP (Uniform Manifold Approximation and Projection). |
|
Rank genes for characterizing groups. |
|
Calculate enrichment of gene signatures for single cells. |
Plotting: pl
Fraction of counts assigned to each gene over all cells. |
|
Plot dispersions or normalized variance versus means for genes. |
|
Scatter plot in tSNE basis. |
|
Scatter plot in UMAP basis. |
|
Plot ranking of genes using volcano plot. |
|
Plotting AUC scores of cells and gene_sets. |
|
Plotting AUC scores of cells and gene_sets. |
Reading
Fast read |
|
Read 10x-Genomics-formatted hdf5 file. |
|
Read .h5ad-formatted hdf5 file. |
|
Fast reading algorithm for |
|
Read gene signatures from |
|
Read gene signatures from |
|
Read gene signatures from |
|
Get gene signature name from |
|
Download suplementary file of data from GEO (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi). |
Writing
Write Bioturing format. |
Experimental
Identifies marker genes. |
|
Find threshold to cut-off and smooth the expression matrix. |