Dude, this isn't too bad. A few things:
The residual grain is both large and slow-moving, which is not a very appealing combination. There also appear to be some other slight temporal issues; I can't tell whether they come from the original capture, the denoising, or the median/average of the captures.
The audio loses sync slightly over the duration of the movie.
Any chance you could upload the best of the raw captures before combining them and post-processing? Might allow better feedback as to how to improve your final product.
-G