↖ click there to get TOC
Record papers I have read or reproduced since 2020 which were beneficial to my work.
My Xmind notes:
AI Art (TalkingHead/Text2Image/Text2Video etc.)
Recommend:
hmr-survey by tinatiansjz
Hand3DResearch by SeanChenxy
Human-Video-Generation by yule-li.
HelloFace by becauseofAI
awesome-NeRF by koolo233
awesome-ai-painting by hua1995116
Awesome-Face-Restoration by TaoWangzj
Year | Name | Paper | Codes |
---|---|---|---|
2019 | speech2gesture | Learning Individual Styles of Conversational Gesture | official |
2020 | Monoport | Monoport: Monocular Volumetric Human Teleportation | official |
2020 | PiFuHD | PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization | Meta |
2021 | iPERCore | Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis | official |
2021 | ContactHumanDynamics | Contact and Human Dynamics from Monocular Video | Stanford |
2021 | HuMoR | HuMoR: 3D Human Motion Model for Robust Pose Estimation | Stanford |
2021 | MeTRAbs | MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation | official |
2022 | DeepMotion | official |
Year | Name | Paper | Codes |
---|---|---|---|
2021 | ParameterizedMotion | Learning a family of motor skills from a single motion clip | official |
2021 | 1165048017 Blog | official | |
2021 | TDPT | official | |
2021 | IK/FABRIK/CCDIK | UE4 doc |
Year | Name | Paper | Codes |
---|---|---|---|
2D Kp | |||
2018 | AlphaPose | RMPE: Regional Multi-Person Pose Estimation | official |
3D Kp | |||
2019 | mvpose | Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views | ZJU3DV |
2022 | PoseTriplet | Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision | official |
combine | |||
2021 | MediaPipe | official | |
2021 | mmpose | official |
Year | Name | Paper | Codes |
---|---|---|---|
2017 | MANO | Embodied Hands: Modeling and Capturing Hands and Bodies Together | official |
2020 | Mediapipe | MediaPipe Hands: On-device Real-time Hand Tracking | |
2021 | MocapNETv3 | Towards Holistic Real-time Human 3D Pose Estimation using MocapNETs | official |
2021 | S2HAND | S2HAND: Model-based 3D Hand Reconstruction via Self-Supervised Learning | Tencent |
Year | Name | Paper | Codes |
---|---|---|---|
2021 | MLP-Mixer | MLP-Mixer: An all-MLP Architecture for Vision | official |
2021 | Noisy Student | Self-training with Noisy Student improves ImageNet classification | official |
2021 | ImageNet-21K | ImageNet-21K Pretraining for the Masses | official |
2021 | MicroNet | MicroNet: Improving Image Recognition with Extremely Low FLOPs | official |
2021 | RepVGG | RepVGG: Making VGG-style ConvNets Great Again | official |
2022 | ConvNeXt | A ConvNet for the 2020s | official |
Year | Name | Paper | Codes |
---|---|---|---|
2016 | MTCNN | Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks | unofficial |
2020 | DSFD | DSFD: Dual Shot Face Detector | official |
2021 | SCRFD | Sample and Computation Redistribution for Efficient Face Detection | official |
Year | Name | Paper | Codes |
---|---|---|---|
2019 | FSGAN | FSGAN: Subject Agnostic Face Swapping and Reenactment | official |
2020 | Disney | High-Resolution Neural Face Swapping for Visual Effects | unofficial |
2020 | FaceShifter | FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping | unofficial |
2021 | SimSwap | SimSwap: An Efficient Framework For High Fidelity Face Swapping | official |
2021 | InfoSwap | Information Bottleneck Disentanglement for Identity Swapping | official |
2021 | ShapeEditer | ShapeEditer: a StyleGAN Encoder for Face Swapping | |
2021 | HifiFace | HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping | unofficial |
2022 | MobileFaceSwap | MobileFaceSwap: A Lightweight Framework for Video Face Swapping | baidu |
2022 | Stitch it in Time | Stitch it in Time: GAN-Based Facial Editing of Real Videos | official |
Year | Name | Paper | Codes |
---|---|---|---|
2019 | SPADE | Semantic Image Synthesis with Spatially-Adaptive Normalization | Nvidia |
2021 | OASIS | You Only Need Adversarial Supervision for Semantic Image Synthesis | official |
2017 | pix2pix | Image-to-Image Translation with Conditional Adversarial Networks | official |
2018 | pix2pixHD | High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | nvidia |
2018 | vid2vid | Video-to-Video Synthesis | nvidia |
anime face | |||
2019 | TalkingHeadAnime | official | |
2021 | TalkingHeadAnime2 | official | |
2022 | EasyVtuber | official |
Year | Name | Paper | Codes |
---|---|---|---|
2019 | StyleGan | A Style-Based Generator Architecture for Generative Adversarial Networks | Nvidia |
2019 | StyleGan2 | Analyzing and Improving the Image Quality of StyleGAN | Nvidia |
2021 | stylegan2-ada | Training Generative Adversarial Networks with Limited Data | Nvidia |
2021 | StyleGan3 | Alias-Free Generative Adversarial Networks | Nvidia |
2021 | SemanticGAN | Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization | Nvidia |
Year | Name | Paper | Codes |
---|---|---|---|
2020 | FirstOrder | First Order Motion Model for Image Animation | official |
2021 | speech2gesture | NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video | ZJU3DV |
2021 | StyleGestures | Style-controllable speech-driven gesture synthesis using normalising flows | official |
2021 | face-vid2vid | One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing | Nvidia Project unofficial unofficial-2 |
2022 | DaGAN | Depth-Aware Generative Adversarial Network for Talking Head Video Generation | official |
Year | Name | Paper | Codes |
---|---|---|---|
2020 | 100 Days of Hands | Understanding Human Hands in Contact at Internet Scale | official |
2021 | YOLOX | YOLOX: Exceeding YOLO Series in 2021 | Megvii |
Year | Name | Paper | Codes |
---|---|---|---|
2017 | google Attention | Attention Is All You Need | official |
2020 | ViT | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | |
2021 | Token Labeling | All Tokens Matter: Token Labeling for Training Better Vision Transformers | official |
2021 | Tokens-to-Token ViT | Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet | official |
2021 | MAE | Masked Autoencoders Are Scalable Vision Learners | Meta |
Year | Name | Paper | Codes |
---|---|---|---|
2017 | AdaIN | Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | official |
2018 | lpips | The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | OpenAI |
2020 | IBA | Restricting the Flow: Information Bottlenecks for Attribution | official |
2021 | Focal Frequency Loss | Focal Frequency Loss for Image Reconstruction and Synthesis | official |
2022 | ffcv | MIT |
Year | Name | Paper | Codes |
---|---|---|---|
2020 | 3d photo inpainting | 3D Photography using Context-aware Layered Depth Inpainting | official |
2021 | ParameterizedMotion | Learning a family of motor skills from a single motion clip | official |
2021 | AnimeInterp | Deep Animation Video Interpolation in the Wild | SenseTime |
2021 | DALLE | Zero-Shot Text-to-Image Generation | OpenAI |