Hi, I would like to work in this project. I have the required skills to get it done. The video you've posted is made by using a simple trick of image/video processing. It uses the concept of chroma key. Every frame in the final video is made by overlaying three frames from different sources(different videos/images). Which means one frame is the man in the photographic studio, other is the gorilla and the third is the jungle. the gorilla is filmed by using the chroma key, which means the background will be transparent. The jungle seems to be a static frame(maybe a photo with a mask made in Photoshop) with transparent regions. So, if you take into account all the transparency information and draw the man first following the gorilla frame and then the jungle, you get that effect.
I can do this project in Delphi by using Directshow. Besides, I can give you all support you happen to need later on.
If you like my bid and want more information, PM me.