@_akhaliq
MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features

Paper page: https://t.co/g2NlbmWher

Self-supervised learning of visual representations has been focusing on learning content features, which do not capture object… https://t.co/SM4Hqwe51o https://t.co/ahZviHf3HQ