This talk presents a tracking method for the 3-D reconstruction of planar surfaces in the context of video editing. End-user interaction is allowed in order to avoid occlusions of the tracked objects and to correct possible errors. The method accounts for all perspective transformations by means of template matching. It first estimates the translation and rotation of the object from large templates, and then estimates the perspective transformation with more localized template matching.
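
The coarse-to-fine procedure described above can be illustrated with a minimal sketch. It is not the method presented in the talk, and it rests on several assumptions: the function names (`ssd_match`, `homography_from_points`, `track_plane`) are hypothetical, the similarity measure is taken to be the sum of squared differences, the coarse stage recovers only a translation (rotation is omitted for brevity), and the perspective transformation is obtained from four small corner templates through a direct linear transform. Images are assumed to be 2-D floating-point NumPy arrays.

```python
import numpy as np


def ssd_match(image, template, guess, radius):
    """Best top-left (row, col) for `template` within `radius` pixels of `guess`.

    Assumption: sum of squared differences (SSD) is the similarity measure,
    and both arrays are floating point (no integer overflow).
    """
    th, tw = template.shape
    r0, c0 = guess
    best_score, best_pos = np.inf, guess
    for r in range(max(0, r0 - radius), min(image.shape[0] - th, r0 + radius) + 1):
        for c in range(max(0, c0 - radius), min(image.shape[1] - tw, c0 + radius) + 1):
            diff = image[r:r + th, c:c + tw] - template
            score = float(np.sum(diff * diff))
            if score < best_score:
                best_score, best_pos = score, (r, c)
    return best_pos


def homography_from_points(src, dst):
    """Direct linear transform: 3x3 H such that dst ~ H @ src, points as (x, y)."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=float))
    h = vt[-1].reshape(3, 3)  # null-space vector of the constraint matrix
    return h / h[2, 2]


def track_plane(ref, cur, box, patch=6, coarse_radius=6, fine_radius=2):
    """Coarse-to-fine estimate of the homography mapping `box` in `ref` to `cur`.

    `box` is (row, col, height, width) of the planar region in `ref`.
    """
    r, c, h, w = box
    # Stage 1: one large template gives the global shift (rotation omitted here).
    r1, c1 = ssd_match(cur, ref[r:r + h, c:c + w], (r, c), coarse_radius)
    dr, dc = r1 - r, c1 - c
    # Stage 2: small templates at the four corners refine the motion locally.
    src, dst = [], []
    for pr, pc in [(r, c), (r, c + w - patch), (r + h - patch, c),
                   (r + h - patch, c + w - patch)]:
        small = ref[pr:pr + patch, pc:pc + patch]
        qr, qc = ssd_match(cur, small, (pr + dr, pc + dc), fine_radius)
        src.append((pc, pr))  # (x, y) = (col, row)
        dst.append((qc, qr))
    return homography_from_points(src, dst)
```

Calling `track_plane(ref, cur, (row, col, height, width))` returns a 3x3 matrix in (x, y) pixel coordinates. When the second frame is a pure shift of the first, the four corner displacements coincide and the result reduces to a translation matrix. In a fuller version, the coarse stage would also search over rotation angles before the local refinement.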