Joined July 2019
cclin retweeted
so cool to see PapersWithCode back!
Introducing a revival of PapersWithCode! As @ilyasut said, we're back to the "age of research". Hence, it's important to share research and build on each other's work.
> find SOTA per domain, not just LLMs
> leaderboards
> methods
> all parsed at scale using AI agents.
cclin retweeted
4 Jul 2023
Thx @_akhaliq! Check out our DisCo at disco-dance.github.io.🔥🔥🔥 🧙‍♂️High Generalizability. No need for human-specific fine-tuning! 💃Extensive human-related applications with disentangled control! 👨‍💻Easy-to-follow framework and fully open-source code!

4 Jul 2023
DisCo: Disentangled Control for Referring Human Dance Generation in Real World
paper page: huggingface.co/papers/2307.0…
Generative AI has made significant strides in computer vision, particularly in image/video synthesis conditioned on text descriptions. Despite the advancements, it remains challenging especially in the generation of human-centric content such as dance synthesis. Existing dance synthesis methods struggle with the gap between synthesized content and real-world dance scenarios. In this paper, we define a new problem setting: Referring Human Dance Generation, which focuses on real-world dance scenarios with three important properties: (i) Faithfulness: the synthesis should retain the appearance of both human subject foreground and background from the reference image, and precisely follow the target pose; (ii) Generalizability: the model should generalize to unseen human subjects, backgrounds, and poses; (iii) Compositionality: it should allow for composition of seen/unseen subjects, backgrounds, and poses from different sources. To address these challenges, we introduce a novel approach, DISCO, which includes a novel model architecture with disentangled control to improve the faithfulness and compositionality of dance synthesis, and an effective human attribute pre-training for better generalizability to unseen humans. Extensive qualitative and quantitative results demonstrate that DISCO can generate high-quality human dance images and videos with diverse appearances and flexible motions.
cclin retweeted
Xuedong Huang, CTO, Azure AI, will present his keynote at CVPR 2022 @CVPR today at 5PM CT where he will share progress on the application of Integrative AI on computer vision and its promising results. Virtual conference registrants can tune in here: msft.it/6017bWAUz
cclin retweeted
Interested in Vision Language Pre-training (VLP) but do not know where to start? Hard to track the rapid progress in VLP? Come and join us at our CVPR2022 VLP tutorial on 19th Jun (9am-5pm CDT) in person in New Orleans or virtually. vlp-tutorial.github.io #CVPR2022
cclin retweeted
We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understanding. Explore imagen.research.google and Imagen! A large rusted ship stuck in a frozen lake. Snowy mountains and beautiful sunset in the background. #imagen
cclin retweeted
20 Oct 2021
NeuralDiff: Segmenting 3D objects that move in egocentric videos abs: arxiv.org/abs/2110.09936 project page: robots.ox.ac.uk/~vadim/neura…
cclin retweeted
Here are the video recordings of the workshop: youtube.com/watch?v=VmKc_sEJ…
Announcing @ICCV_2021 workshop on "Unsupervised 3D Learning in the Wild" with an incredible line-up of speakers on this topic! #ICCV2021
🚩 Website: unsup3d.github.io
📅 Time: 7:00-18:00 EDT / 12:00-23:00 BST, 11 Oct 2021
Calendar: calendar.google.com/calendar… (mark it down!)
cclin retweeted
13 Aug 2021
We're sharing Unidentified Video Objects (UVO), a new benchmark to facilitate research in open-world segmentation, an important computer vision task that aims to detect, segment, and track all objects exhaustively in a video. Learn more: ow.ly/rH8650FQ7vp
cclin retweeted
2 Aug 2021
FAIR research scientist, Ishan Misra (@imisra_) sat down with @lexfridman to demystify self-supervised learning & its impact in #AI: youtube.com/watch?v=FUS6ceIv…. Read the blog post that inspired the conversation: ai.facebook.com/blog/self-su…

cclin retweeted
Writing Related Work I enjoy reading/writing the related work section of a paper. It helps organize prior research and put the contributions of the work in proper context. But HOW? Check the thread below👇
cclin retweeted
22 Jun 2021
Today, we are announcing the open source release of DeepLab2, a modern TensorFlow library for deep labeling that aims to facilitate future research on dense pixel labeling by providing a unified, state-of-the-art, and easy-to-use TensorFlow codebase → goo.gle/3d3SnVE
cclin retweeted
We dive deep into self-supervised learning with Dr. Ishan Misra @imisra_ from FAIR @facebookai and cover their recent cluster of vision papers; with @ykilcher @RisingSayak CC: @ylecun @skornblith @mcaron31 @HugoTouvron youtu.be/EXJmodhu4_4
cclin retweeted
5 May 2021
DINO’s attention maps can discover and segment objects in an image or a #Video with absolutely no supervision and without being given a segmentation-targeted objective. #computervision ai.facebook.com/blog/dino-pa…
30 Apr 2021
Here’s our new computer vision system achieving state-of-the-art results in image segmentation, without needing any labeled training data. This new model was trained on random, unlabeled data, but quickly achieved state-of-the-art results. It’s awesome.
cclin retweeted
Want to learn about meta-learning? Lecture videos for CS330 are now online! youtube.com/playlist?list=PL… Topics incl. MTL, few-shot learning, Bayesian meta-learning, lifelong learning, meta-RL & more: cs330.stanford.edu 3 guest lectures from Kate Rakelly, @svlevine, @jeffclune
19 Jul 2019
Join us and consider submitting a paper to the Second workshop on Moving Camera at @ICCV19. More details on the website: sites.google.com/view/mcmvs2… Deadline: August 5 #ICCV2019 #ComputerVIsion