论文记录
[TOC]
CNN
AlexNet
文章地址:ImageNet Classification with Deep Convolutional
ResNet
文章地址:Deep Residual Learning for Image Recognition
NLP
Transformer
文章地址:Attention Is All You Need
Bert
文章地址:Pre-training of Deep Bidirectional Transformers for Language Understanding
GPT
文章地址:Improving Language Understanding by Generative Pre-Training(GPT-1) 、Language Models are Unsupervised Multitask Learner(GPT-2)、Language Models are Few-Shot Learners(GPT-3)
Transformer in CV
ViT
文章地址:Transformer for Image Recognition at Scale
MAE
文章地址:Masked Autoencoders Are Scalable Vision Learner
Swin Transformer
文章地址:Swin Transformer: Hierarchical Vision Transformer using Shifted Wind
Object Detection
DETR
文章地址:DETR(End-to-End Object Detection with Transformer)
Contrastive Learning
InscDisc
文章地址:Unsupervised Feature Learning via Non-Parametric Instance Discrimin
InvaSpread
文章地址:Unsupervised Embedding Learning via Invariant and Spreading Instance Feature
CPC
文章地址:Representation Learning with Contrastive Predictive Coding
CMC
文章地址:Contrastive Multiview Coding
MOCO
MOCOv1
文章地址:Momentum Contrast for Unsupervised Visual Representation Learning
MOCOv2
文章地址:Improved Baselines with Momentum Contrastive Learning
MOCOv3
文章地址:An Empirical Study of Training Self-Supervised Vision Transformer
SimCLR
文章地址:A Simple Framework for Contrastive Learning of Visual Representation、Big Self-Supervised Models are Strong Semi-Supervised Learner
SWaV
文章地址:Unsupervised Learning of Visual Features by Contrasting Cluster Assignment
BYOL
文章地址:Bootstrap Your Own Latent A New Approach to Self-Supervised Learning
SimSiam
文章地址:Exploring Simple Siamese Representation Learning
DINO
文章地址:Emerging Properties in Self-Supervised Vision Transformers
Generative Model
GAN
文章地址:Generative Adversarial Nets
VAE
VQVAE
文章地址:Vector Quantised-variational AutoEncoder
DALL.E
文章地址:Zero-Shot Text-to-Image Generation
DALL.E 2
文章地址:Hierarchical Text-Conditional Image Generation with CLIP Latent
Video Understanding
DeepVideo
文章地址:Large-scale Video Classification with Convolutional Neural Network
Two-Stream
文章地址:Two-Stream Convolutional Networks for Action Recognition in Videos
Beyond Short Snippets
文章地址:Beyond Short Snippets: Deep Networks for Video Classification
Convolutional fusion
文章地址:Convolutional Two-Stream Network Fusion for Video Action Recognition