[TOC]

CNN

AlexNet

Paper: ImageNet Classification with Deep Convolutional Neural Networks

AlexNet

ResNet

Paper: Deep Residual Learning for Image Recognition

ResNet

NLP

Transformer

Paper: Attention Is All You Need

Transformer

BERT

Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

BERT

GPT

Papers: Improving Language Understanding by Generative Pre-Training (GPT-1); Language Models are Unsupervised Multitask Learners (GPT-2); Language Models are Few-Shot Learners (GPT-3)

GPT

Transformer in CV

ViT

Paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

ViT

MAE

Paper: Masked Autoencoders Are Scalable Vision Learners

MAE

Swin Transformer

Paper: Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer

Object Detection

DETR

Paper: DETR (End-to-End Object Detection with Transformers)

DETR

Contrastive Learning

Contrastive Learning

InstDisc

Paper: Unsupervised Feature Learning via Non-Parametric Instance Discrimination

InstDisc

InvaSpread

Paper: Unsupervised Embedding Learning via Invariant and Spreading Instance Feature

InvaSpread

CPC

Paper: Representation Learning with Contrastive Predictive Coding

CPC

CMC

Paper: Contrastive Multiview Coding

CMC

MOCO

MOCOv1

Paper: Momentum Contrast for Unsupervised Visual Representation Learning

MOCOv1

MOCOv2

Paper: Improved Baselines with Momentum Contrastive Learning

MOCOv2

MOCOv3

Paper: An Empirical Study of Training Self-Supervised Vision Transformers

MOCOv3

SimCLR

Papers: A Simple Framework for Contrastive Learning of Visual Representations; Big Self-Supervised Models are Strong Semi-Supervised Learners

SimCLR

SwAV

Paper: Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

SwAV

BYOL

Paper: Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

BYOL

SimSiam

Paper: Exploring Simple Siamese Representation Learning

SimSiam

DINO

Paper: Emerging Properties in Self-Supervised Vision Transformers

DINO

Generative Model

GAN

Paper: Generative Adversarial Nets

GAN

VAE

Paper: Variational AutoEncoder

VAE

VQVAE

Paper: Vector Quantised-Variational AutoEncoder

VQVAE

DALL.E

Paper: Zero-Shot Text-to-Image Generation

DALL.E

DALL.E 2

Paper: Hierarchical Text-Conditional Image Generation with CLIP Latents

DALL.E 2

Video Understanding

DeepVideo

Paper: Large-scale Video Classification with Convolutional Neural Networks

DeepVideo

Two-Stream

Paper: Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream

Beyond Short Snippets

Paper: Beyond Short Snippets: Deep Networks for Video Classification

Beyond Short Snippets

Convolutional fusion

Paper: Convolutional Two-Stream Network Fusion for Video Action Recognition

Convolutional fusion

MultiModal Learning