I am trying to calculate distance between rows (data points) on the basis of categorical variables in columns. The simplest method I have seen is to calculate the overlap. In other words in what proportion of variables do x and y take identical values.I am trying to calculate distance between rows