This package allows calling the Leiden algorithm for clustering on an igraph object from R. See the Python and Java implementations for more details:
This package requires the ‘leidenalg’ and ‘igraph’ modules for python (2) to be installed on your system. For example:
pip install leidenalg igraph
If you do not have root access, you can use
pip install --user
or pip install --prefix
to
install these in your user directory (which you have write permissions
for) and ensure that this directory is in your PATH so that Python can
find it.
The ‘devtools’ package will be used to install ‘leiden’ and the dependancies (igraph and reticulate). To install the development version:
if (!requireNamespace("devtools"))
install.packages("devtools")
devtools::install_github("TomKellyGenetics/leiden")
The current release on CRAN can be installed with:
First set up a compatible adjacency matrix:
adjacency_matrix <- rbind(cbind(matrix(round(rbinom(400, 1, 0.8)), 20, 20),
matrix(round(rbinom(400, 1, 0.3)), 20, 20),
matrix(round(rbinom(400, 1, 0.1)), 20, 20)),
cbind(matrix(round(rbinom(400, 1, 0.3)), 20, 20),
matrix(round(rbinom(400, 1, 0.8)), 20, 20),
matrix(round(rbinom(400, 1, 0.2)), 20, 20)),
cbind(matrix(round(rbinom(400, 1, 0.3)), 20, 20),
matrix(round(rbinom(400, 1, 0.1)), 20, 20),
matrix(round(rbinom(400, 1, 0.9)), 20, 20)))
str(adjacency_matrix)
#> num [1:60, 1:60] 1 1 1 1 1 1 1 1 1 1 ...
dim(adjacency_matrix )
#> [1] 60 60
An adjacency matrix is any binary matrix representing links between nodes (column and row names). It is a directed graph if the adjacency matrix is not symmetric.
library("igraph")
rownames(adjacency_matrix) <- 1:60
colnames(adjacency_matrix) <- 1:60
graph_object <- graph_from_adjacency_matrix(adjacency_matrix, mode = "directed")
graph_object
#> IGRAPH ca51815 DN-- 60 1527 --
#> + attr: name (v/c)
#> + edges from ca51815 (vertex names):
#> [1] 1->1 1->2 1->3 1->5 1->7 1->8 1->9 1->10 1->11 1->12 1->13 1->14 1->15 1->16 1->17 1->18 1->20 1->22 1->34
#> [20] 1->38 1->39 1->40 2->1 2->3 2->4 2->5 2->7 2->8 2->9 2->10 2->11 2->12 2->14 2->15 2->17 2->18 2->19 2->20
#> [39] 2->26 2->27 2->28 2->33 2->39 2->43 2->49 2->57 2->59 3->1 3->3 3->4 3->5 3->6 3->7 3->8 3->9 3->10 3->11
#> [58] 3->12 3->13 3->14 3->15 3->16 3->17 3->18 3->19 3->20 3->22 3->23 3->24 3->25 3->26 3->28 3->33 3->35 3->36 3->38
#> [77] 3->40 3->42 3->55 4->1 4->2 4->3 4->4 4->5 4->6 4->7 4->8 4->9 4->10 4->11 4->12 4->15 4->16 4->17 4->18
#> [96] 4->19 4->20 4->26 4->30 4->36 4->38 4->40 4->41 5->1 5->2 5->3 5->4 5->5 5->6 5->7 5->10 5->11 5->12 5->13
#> [115] 5->14 5->16 5->17 5->18 5->19 5->20 5->22 5->26 5->28 5->31 5->35 5->36 5->37 5->45 5->46 5->60 6->1 6->3 6->4
#> [134] 6->5 6->6 6->7 6->8 6->10 6->11 6->13 6->14 6->15 6->16 6->18 6->19 6->20 6->22 6->23 6->30 6->31 6->41 6->46
#> + ... omitted several edges
This represents the following graph structure.
Then the Leiden algorithm can be run on the igraph object.
Here we can see partitions in the plotted results. The nodes that are more interconnected have been partitioned into separate clusters.
library("RColorBrewer")
node.cols <- brewer.pal(max(c(3, partition)),"Pastel1")[partition]
plot(graph_object, vertex.color = node.cols)
Arguments can be passed to the leidenalg implementation in Python:
#run with defaults
partition <- leiden(graph_object)
#run with ModularityVertexPartition"
partition <- leiden(graph_object, partition_type = "ModularityVertexPartition")
#run with resolution parameter
partition <- leiden(graph_object, resolution_parameter = 0.95)
In particular, the resolution parameter can tune the number of clusters to be detected.
partition <- leiden(graph_object, resolution_parameter = 0.5)
node.cols <- brewer.pal(max(c(3, partition)),"Pastel1")[partition]
plot(graph_object, vertex.color = node.cols)
partition <- leiden(graph_object, resolution_parameter = 1.8)
node.cols <- brewer.pal(max(c(3, partition)),"Pastel1")[partition]
plot(graph_object, vertex.color = node.cols)
Weights for edges an also be passed to the leiden algorithm either as a separate vector or as a weighted graph_object. Weights will be derived from a weighted graph object.
# generate (unweighted) igraph object in R
library("igraph")
adjacency_matrix[adjacency_matrix > 1] <- 1
snn_graph <- graph_from_adjacency_matrix(adjacency_matrix)
partition <- leiden(snn_graph)
table(partition)
# pass weights to python leidenalg
adjacency_matrix[adjacency_matrix >= 1 ] <- 1
snn_graph <- graph_from_adjacency_matrix(adjacency_matrix, weighted = NULL)
weights <- sample(1:10, sum(adjacency_matrix!=0), replace=TRUE)
partition <- leiden(snn_graph, weights = weights)
table(partition)
# generate (weighted) igraph object in R
library("igraph")
adjacency_matrix[adjacency_matrix >= 1] <- weights
snn_graph <- graph_from_adjacency_matrix(adjacency_matrix, weighted = TRUE)
partition <- leiden(snn_graph)
table(partition)
See the documentation on the leidenalg Python module for more information: https://leidenalg.readthedocs.io/en/latest/reference.html
To use Leiden with the Seurat pipeline for a Seurat Object
object
that has an SNN computed (for example with
Seurat::FindClusters
with save.SNN = TRUE
).
This will compute the Leiden clusters and add them to the Seurat Object
Class. The R implementation of Leiden can be run directly on the snn
igraph object in Seurat.
library("Seurat")
FindClusters(pbmc_small)
membership <- leiden(pbmc_small@snn)
table(membership)
pbmc_small@ident <- as.factor(membership)
names(pbmc_small@ident) <- rownames(pbmc_small@meta.data)
pbmc_small@meta.data$ident <- as.factor(membership)
Note that this code is designed for Seurat version 2 releases.
Note that the object for Seurat version 3 has changed. For example an SNN can be generated:
library("Seurat")
FindClusters(pbmc_small)
membership <- leiden(pbmc_small@graphs$RNA_snn)
table(membership)
For Seurat version 3 objects, the Leiden algorithm has been
implemented in the Seurat version 3 package with
Seurat::FindClusters
and
algorithm = "leiden"
). See the documentation for these
functions.