以多視角為基礎之運動視訊編輯系統

呂華綸
As digital videos become more and more popular nowadays, the application of videos in different fields has been wide spreading. The researches had turned from simply getting the shots to using techniques like abstraction, summarization to display the exciting performances in the clips; or even increasing audio techniques to let users listen to music while watching the films.

However, among these developments, the filming still relies on only one video camera. When filming, it makes it impossible to catch the performance and a close-up on the performers at the same time. Therefore, using multiple video cameras to film can achieve the expectations of letting the users capture bounteous contents and close-ups. The thesis takes multi-view based video as the foundation to build up the automatic video editing system. There are two important issues for video composition: video synchronization and video switching. Video synchronization is to match the time of the videos from different viewing directions to the global time axis. The system firstly uses abrupt video shot detection to segment the abrupt shots of the captured videos, and then uses the velocity curve similarity to search the synchro-point. The goal of video switching is to retrieve different contents of videos to appeal the users and allow them to watch the attention shots of videos. We designed three shots that based on the contents for the system, and we categorized these three shots in considering the parts that users will take notice of on videos, for example: camera motion shot, face shot, and fragment shot; we calculate the importance of each shot to determine whether those shots should be selected into the compositive film or not. The experiments uses ball games as the filming content, and they contain the conditions of the different viewing angles, different content based shot weighting, the environmental change of indoors and outdoors, and the filming of a lot of people. Also, we analyze the synchronization of the film and the importance of the shots in different circumstances.

在DV日漸盛行之下,以video作為應用的領域也隨之增長。以往研究從擷取影像shot,發展至abstraction、summarization等剪輯技術用於表現影片中精采的片段資訊,甚至增加audio技術讓使用者能隨著音樂的節奏觀賞精采鏡頭。然而在眾多的發展中,仍以單台攝影機作為基礎,因此在拍攝影片時會因而產生表演內容與人物特寫無法同時被擷取的狀況,因此以多台攝影機作為拍攝的工具可以達到預期的目的,讓使用者能拍攝更豐富的內容與特寫。本論文研究是以多視角影片為基礎,建立自動化的視訊編輯系統,予以解決多台攝影機拍攝之影像重合的問題。多攝影機視訊合成的問題有(1)多視角影片的同步化( video synchronization ),(2)對於影片的轉換( video switching )作為主要的討論主軸。多視角影片的同步化是把影片中本身的時間對應到共同的時間軸,在結合成影片的同時不會發生重疊的問題。本系統首先利用Abrupt video shot detection切割影片中發生瞬時變化的片段,並利用於我們所設計的video synchronization技術讓系統能搜尋到同步時間點。因為在不同角度下拍攝的人,所表現出來的速度會有相似的狀況,因此在此技術中利用velocity curve作為偵測上的特徵,並利用相似度的比較達到正確的偵測判斷。轉換影片的目的是以擷取不同影片內容讓使用者能提高興趣並能觀看到當時的重點鏡頭。本系統設計三種以內容為基礎的shot,考慮使用者對於影片所關注的部份而分成camera motion shot、face shot與fragment shot,並利用我們設計的video switching計算每段shot的重要性予以判斷是否會被結合於最終的影片。本研究實驗以球類比賽為拍攝內容。對於拍攝角度上的問題、content-based shot權重的不同、室內室外環境的改變與拍攝多人影片的問題做討論,並分析影片的同步化與shot在不同狀況下的重要性。本論文提出方法以解決多影像編輯所發生的同步化與影片轉換問題,並設計利用velocity curve、content-based shot與importance shot value來完成更順暢且活潑的影片。