Modulated Residual Network
Modern technology has brought about incredible advancements in many areas, including visual question answering. MODERN, short for Modulated Residual Network, is an architecture used in visual question answering that employs conditional batch normalization to allow for linguistic embedding. This linguistic embedding from an LSTM modulates the batch normalization parameters of a ResNet, enabling the manipulation of entire feature maps by scaling them up or down, negating them, or shutting them off