Hello everyone! I’m a PhD student working with eye-tracking data for the first time. First of all, I would like to thank Dev_start for the amazing tutorials provided on their website, through which I learned a lot about eye-tracking data processing and analysis.
I would like to share with the forum an issue I’m encountering when checking my model’s assumptions, particularly whether the predicted data distribution corresponds well to the observed data distribution. My dependent variable is saccadic latency (ms), and the distribution of my data is right-skewed and clearly non-normal.
I therefore decided to implement a GLMM using lme4 in R, choosing the Gamma family. However, the correspondence between the estimated and observed data distributions was poor, and the residuals were not uniformly distributed, possibly violating model assumptions. Subsequently, I chose a lognormal distribution (using the glmmTMB package). There was a slight improvement in the distribution correspondence, although the model still underestimates the median peak of my data; however, the residual distribution appears to have improved.
I am unsure at what point I should stop searching for a distribution family that better fits my data. Do you have any suggestions for other distributions that are suitable for reaction times or latencies? How can I determine when the correspondence between predicted and observed distributions is acceptable?
Thank you again for your help, and I hope this discussion may be useful to others who are approaching GLMMs for the first time.
Best regards,
Martina