Good stuff, but I have a suggestion:
Instead of announcing which version, playing, then announcing the next version and playing, skip the pause and talk and use a graphic on the screen to show which version is being heard. The "stop and talk" breaks things in a way that makes it tougher to notice any audible change(if any happens).