I'll figure more out as I work with the audio. Right now I'm cutting the video because that's the really hard and time-consuming bit, and I'll leave the audio alone until the video is done (the computer is totally bogged down with that job). To me, it sounds like the sound editor simply took the first few seconds of audio from the deleted scene and laid it over the first few seconds of the following scene so that the two blended together. However, I'm a little concerned that they may have also added an echo effect and a fade-in to better match the existing dialogue.
According to the progress bar on my computer, I should be done cutting the video sometime next week ;)