Audio-visual synchronization detection using deep learning with modern Python architecture. This is a refactored and enhanced version of the original SyncNet ...