Should I switch from Poisson to negative binomial for overdispersed count data?
#1
I’m trying to decide if I should use a Poisson regression for my count data on website errors per day, but the variance is almost double the mean. I’ve read that this overdispersion means the model assumptions are violated, but I’m not sure what to use instead.
Reply
#2
I ran into this last year with daily website errors. Poisson fed me numbers that looked fine but the residuals told a different story and the standard errors were blown up.
Reply
#3
My go to fix was a negative binomial model which lets the variance be bigger than the mean. It tended to fit better and the inference felt more reasonable.
Reply
#4
Could the zeros be the hidden problem instead of dispersion?
Reply
#5
I also tried a couple of tweaks like adding an offset for site traffic and checking a quasi Poisson alternative, but the improvements were small and I stayed unsure about the right path.
Reply


[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Forum Jump: