Interesting problem Chris, I'll give some scattered thoughts, but you're clearly in an area where there are numerous ways to go.
I would model the probability the device 'turns-on' (a success), as a function of the individual, i, and the device, d
(I think you were calling this experiment). I would then model each using a random-effects (Bayesian Hierarchical) model:
logit(p) = beta + alpha_i + theta_d
alpha_i ~ N(0,tau^2); model tau^2 _ Gamma^-1
theta_d ~ N(0, sigma^2); model sigma^2 ~ Gamma^-1
beta ~ N(m,s^2)
I think each of the hierarchical aspects are important, certainly the mouse effects, but modeling the device effects
this way will certainly provide better estimates.
I think the randomization issue is an interesting one -- and it's more about comparisons.
Given there is no comparison to anything going on, the lack of randomization is likely not a big deal. If you model the
individual effects (mouse) and dont worry about 'fatigue' issues -- getting multiple devices implanted, it doesnt seem
that bad. Randomizing within the 5 mice doesnt do much as you still never have concurrent comparisons. Randomization
doesnt help that. What would be great is if device1 and device2 were compared directly, but the sequential nature of the
learning likely prevents that. There may be increased value with keeping the previous device as a control -- so randomize
4 new devices and 1 old device in each batch of 5 mice in the new 100. You then could also estimate a possible "fatigue"
aspect. But you always have a bridging of devices across time (you may have A vs. B, B vs. C, C vs. D, etc, but this could
allow comparisons from A to D).
In humans we worry the population changes as the experiments progress, they get healthier, sicker, younger, older,
have other devices, etc, a nice aspect of mice is that they are probably very homogenous, and the question you are posing
is whether they can evolve much, etc, which makes a control critical. If you assume a mouse is a mouse, I'm not sure
randomization is a big deal at all. Now, I know Bayesian don't need randomization as a basis of a sampling distribution,
but the generalizability may not be a big deal. If mice (humans) selected treatment now we have big worries.
Depending on timing of this you also could have adaptive questions of the experiment (you may estimate after 30 devices
that this change was good/bad and thus evolve faster in the devices).
So, randomization is nice, but concurrent comparison is likely more important.
-------------------------------------------
Scott Berry
Berry Consultants
-------------------------------------------