The thing is:
The audio is given in 5.1 or better. The dialogue is on the center channel. So you can just take the center channel and get rid a lot of the music, but sound effects as well. Unfortunately, the center channel isn't free from music and sound effects. So there still remains a lot of the unwanted stuff. But it's lot better than taking the full 5.1 or so.