Thanks for uploading the patch; I found a potential cause for the delay. The original patch uses SoundToGlobalControllers (STGCs), some of which depend on the values of other STGCs. The best practice, especially when timing-accuracy is crucial, is to put EventExpressions directly into the parameter fields where they are used, not to broadcast them using STGCs. STGCs are useful for displaying the results in the VCS but are not intended for deriving any clocks or timing triggers; they are primarily useful for graphic display or classroom demonstrations to show what's going on behind the scenes.
The link below is your patch, rewritten to use TransformEventValue to replace each instance of each EventValue that was previously being broadcast via an STGC with the corresponding EventExpression. This should improve the timing accuracy, and it has the added advantage of gathering all the related EventValue mappings into a single spot so it's easier to see what maps to what. There's also a MultiSample in place of the original Sample that chose its file based on ?VoiceNumber.
Please give this improved time-accuracy Euclidean Drum Box a try. Hope it's helpful!