Text this: A semiparametric model for count data clusterization with application to medical data