It works like this:
you have a 2:12h PAL version, resize it with CCE to a 2:12h PAL version with 720x480 (NTSC size), then use a pulldown program to change the framerate from 25 to 29.97 and there it is: NTSC (2:3 pulldown). The time stays the exact same, so the audio stays in sync of course.