October 9th, 2008, 10:15 am
I have a dataset of two correlated poisson variablesX1 ~ Poisson(lambda1)X2 ~ Poisson(lambda2)Pr(X1,X2)<>Pr(X1)*Pr(X2)I would like to specify the joint distribution of X1 and X2 so that it fits a dataset.So far I have tried two things:i, created a correlation matrix for X1 and X2, cholesky decomposed it and generated correlated random outcomes (in detail I created two gaussian independent variables, correlated them and then transformed them back to uniform rvs and then to poissons via the cumulative distribution function). perhaps this is a mad approach, please advise if so. to my disappointment the resulting joint distribution does not look anything like the dataset.ii, i tried various copula types (frank, clayton, gumbel) to tie the poisson marginals to fit the empirical dataset, but again no resemblance.what is striking is that the joint distribution for the dataset have a significant higher probability of seeing X1=X2 than any of the two above approaches.in particular i am fighting with the following questions:a, is it valid to test the fit of the model generated joint distribution with the dataset using least squares? i have heard about maximum likelihood, how can i create a ML function for this particular case?b, is there a well known approach to creating a model for an empirical joint distribution (other than my two naive approaches above)?it is important that the modelled joint distribution fits the empirical one (or at least resembles it) and that this relationship is maintained when lambda1 and lambda2 changes. any suggestions on how I can proceed are more than welcome. thanks.