MATE
MATE is a type of Transformer architecture that has been specifically designed to help people model web tables. Its design is centered around sparse attention, which enables each head to attend to either the rows or the columns of a table in an efficient way. Additionally, MATE makes use of attention heads that can reorder the tokens found either at the rows or columns of the table, and then apply a windowed attention mechanism. Understanding Sparse Attention in MATE The sparse attention mech