Lecture3_One-hundred-software-in-bioinformatics094_ConsensusClusterPlus

Jimmyjmzeng1314@outlook.com

Keywords: ConsensusClusterPlus/clust/R/package

Data: ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE62nnn/GSE62254/matrix/GSE62254_series_matrix.txt.gz

 

#biocLite() is no longer supported; current Bioconductor releases install through BiocManager

#install.packages("BiocManager")

#BiocManager::install("ConsensusClusterPlus")

library(ConsensusClusterPlus)

#wget ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE62nnn/GSE62254/matrix/GSE62254_series_matrix.txt.gz

#read the series matrix; metadata lines start with "!" and are skipped

a=read.table("GSE62254_series_matrix.txt.gz",comment.char = "!",stringsAsFactors=FALSE,header=TRUE)

rownames(a)=a[,1]

a=a[,-1]

#> dim(a)

#[1] 54675   300

d=a

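#rank probes by median absolute deviation (MAD) and keep the 5000 most variable

mads=apply(d,1,mad)

d=d[order(mads,decreasing=TRUE)[1:5000],]

#median-center each probe (row)

d = sweep(d,1, apply(d,1,median,na.rm=TRUE))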
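
The filtering and centering steps above can be checked on a tiny hand-made matrix. This is only a sketch: the matrix `toy`, its row names, and the choice of keeping the top 2 rows are made up for illustration and are not part of the GSE62254 data.

```r
# Toy matrix: 4 "probes" (rows) x 5 "samples" (columns); values are invented.
toy <- rbind(g1 = c(1, 2, 3, 4, 5),
             g2 = c(10, 10, 10, 10, 11),
             g3 = c(0, 50, 100, 150, 200),
             g4 = c(5, 5, 5, 5, 5))

# MAD per row; rows with little spread get a MAD close to 0.
mads_toy <- apply(toy, 1, mad)

# Keep the 2 most variable rows (the real analysis keeps 5000).
top <- toy[order(mads_toy, decreasing = TRUE)[1:2], ]

# Subtract each row's median, so every kept row is centered on 0.
centered <- sweep(top, 1, apply(top, 1, median, na.rm = TRUE))

print(rownames(centered))
print(apply(centered, 1, median))
```

The near-constant rows g2 and g4 drop out, which is the point of the MAD filter: probes that barely vary across samples carry little information for clustering.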

title="./" #all figures and data will be written to this directory

results = ConsensusClusterPlus(as.matrix(d),maxK=6,reps=50,pItem=0.8,pFeature=1,

                               title=title,clusterAlg="hc",distance="pearson",seed=1262118388.71279,plot="png")
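
The object returned by ConsensusClusterPlus is a list indexed by k, and the per-sample cluster labels for a given k are stored in the `consensusClass` element. The sketch below shows how to read them, using a small simulated matrix so that it runs quickly. The matrix `toy`, the sample names, the seed, and the reduced `maxK` and `reps` are assumptions for illustration only; with the real run above, the same lookup would be `results[[k]][["consensusClass"]]` for whichever k is chosen.

```r
library(ConsensusClusterPlus)

# Simulated data: 50 features x 20 samples, with the first 10 samples
# shifted upward on the first 25 features (values are invented).
set.seed(1)
toy <- matrix(rnorm(50 * 20), nrow = 50,
              dimnames = list(paste0("f", 1:50), paste0("S", 1:20)))
toy[1:25, 1:10] <- toy[1:25, 1:10] + 3

# Send plots to a null device so that no image files are written.
pdf(NULL)
toy_res <- ConsensusClusterPlus(toy, maxK = 4, reps = 20, pItem = 0.8, pFeature = 1,
                                title = "toy", clusterAlg = "hc", distance = "pearson",
                                seed = 123, plot = NULL)
dev.off()

# Cluster label of every sample for k = 2.
cls <- toy_res[[2]][["consensusClass"]]
print(table(cls))
```

The same list element also sits next to `consensusMatrix` and `consensusTree` for each k, so the labels can be matched back to the consensus heatmap that the package draws.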

 

Number of subtypes: 4

Useful links: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE62254

http://www.bio-info-trainee.com/?p=945

http://www.bio-info-trainee.com/?p=969

http://www.bioconductor.org/packages/release/bioc/html/ConsensusClusterPlus.html

http://www.bioconductor.org/packages/release/bioc/vignettes/ConsensusClusterPlus/inst/doc/ConsensusClusterPlus.pdf