Lip reading, also known as Visual Speech Recognition, is a challenging task at the intersection of Computer Vision and Natural Language Processing. Most work in this field focuses on the English language. This thesis analyses, implements, and evaluates different neural networks for lip reading in German. For this purpose, the DLIP dataset is created and serves as the data basis.