German Word Level Lip Reading with Deep Learning

Abstract

Lip Reading, also known as Visual Speech Recognition is a challenging task in the intersection between Computer Vision and Natural Language Processing. Most works in this field focus on the English language. This thesis analyses, implements and evaluates different neural networks on the German language. DLIP is created and served as the dataset.