May 17, 2022
But then again, the vanilla Transformer model does not have a sense of hierarchical representations, at least for the vision modality.
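
To make the point concrete, here is a rough sketch of how the token grid evolves through the network. It only does the arithmetic and is not a model implementation. The helper `token_grid_sizes` and its parameters are hypothetical names made up for illustration. The settings are assumptions: a 224x224 input with 16x16 patches for a ViT-style model, and 4x4 patches with a 2x spatial reduction between stages for a Swin-like hierarchical model.

```python
def token_grid_sizes(image_size, patch_size, num_stages, downsample_per_stage):
    """Return the side length of the square token grid at each stage.

    Hypothetical helper for illustration only: it tracks spatial
    resolution and ignores channels, attention, and everything else.
    """
    side = image_size // patch_size  # grid side after patch embedding
    sizes = []
    for _ in range(num_stages):
        sizes.append(side)
        # A factor of 1 means the resolution is left unchanged.
        side = side // downsample_per_stage
    return sizes


# Vanilla ViT-style (assumed settings): the grid never changes,
# so every block sees the same single-scale set of tokens.
vanilla = token_grid_sizes(224, 16, num_stages=4, downsample_per_stage=1)

# Swin-like hierarchical model (assumed settings): the grid is halved
# between stages, giving a multi-scale feature pyramid.
hierarchical = token_grid_sizes(224, 4, num_stages=4, downsample_per_stage=2)

print(vanilla)       # [14, 14, 14, 14]
print(hierarchical)  # [56, 28, 14, 7]
```

The constant grid in the first case is what I mean by no sense of hierarchy: nothing in the architecture itself produces coarser, lower-resolution feature maps as depth increases.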