It does seem like text intros/explanations are a big key, and not just at the start; they could be worked in at different scenes too. They could even have their own music, but I think it would be neat, if possible, to start a particular song during the text at a lower volume, then ramp it up a bit when the scene it leads into starts.
I'm just thinking out loud without trying to overthink things. :p