[washingtonpost] improve format extraction and add support for video pages extraction