Type: | Package |
Title: | Community Estimation in G-Models via CORD |
Version: | 0.1.1 |
Date: | 2015-09-18 |
Author: | Xi (Rossi) LUO, Florentina Bunea, Christophe Giraud |
Maintainer: | Xi (Rossi) LUO <xi.rossi.luo@gmail.com> |
Description: | Partition data points (variables) into communities/clusters, similar to clustering algorithms, such as k-means and hierarchical clustering. This package implements a clustering algorithm based on a new metric CORD, defined for high dimensional parametric or semi-parametric distributions. Read http://arxiv.org/abs/1508.01939 for more details. |
License: | GPL-3 |
Suggests: | pcaPP |
Imports: | Rcpp |
LinkingTo: | Rcpp, RcppArmadillo |
Packaged: | 2015-09-20 02:30:10 UTC; xluo |
NeedsCompilation: | yes |
Repository: | CRAN |
Date/Publication: | 2015-09-20 08:01:07 |
Community estimation in G-models via CORD
Description
Partition data points (variables) into clusters/communities. Reference: Bunea, F., Giraud, C., & Luo, X. (2015). Community estimation in G
-models via CORD. arXiv preprint arXiv:1508.01939. http://arxiv.org/abs/1508.01939.
Usage
cord(X, tau = 2 * sqrt(log(ncol(X))/nrow(X)), kendall = T,
input = c("data", "cor", "dist"))
Arguments
X |
Input data matrix. It should be an n (samples) by p (variables) matrix when |
tau |
Threshold to use at each iteration. A theoretical choice is about |
kendall |
Whether to compute Kendall's tau correlation matrix from |
input |
Type of input |
Value
list
with one element: a vector of integers showing which cluster/community each point is assigned to.
Examples
set.seed(100)
X <- 2*matrix(rnorm(200*2), 200, 10)+matrix(rnorm(200*10), 200, 10)
cord(X)