阅读背景:

R中的群集PAM - 如何忽略列/变量但仍保留它

来源:互联网 

I would like to use the Cluster PAM algorithm in R to cluster a dataset of around 6000 rows. I want the PAM algorithm to ignore a column called "ID" (Not use it in the clustering) but i do not want to delete that column. I want to use that column later on to combine my clustered data with the original dataset. basically what i want is to add a cluster column to the original dataset. I am want to use the PAM as a data compression/variables reduction method. I have 220 variables and i would like to cluster some of the variables and reduce the dimensionality of my dataset so i can apply a classification algorithm (Most likely a tree) to classify a problem that i am trying to solve. If anyone knows a way around this or a better approach, please let me know. Thank youI would like to use the Cluster PAM algorithm i




你的当前访问异常,请进行认证后继续阅读剩余内容。

分享到: