Sergio Paniego

Sergio Paniego

Photos and videos

Tweets

cclin retweeted

Sergio Paniego

@SergioPaniego

Jun 2

so cool to see PapersWithCode back!

Niels Rogge @NielsRogge

May 18

Introducing a revival of PapersWithCode! As @ilyasut said, we're back to the "age of research". Hence, it's important to share research and build on each other's work. > find SOTA per domain, not just LLMs > leaderboards > methods > all parsed at scale using AI agents.

3:01

13,300

Tan Wang

cclin retweeted

Tan Wang @Wangt97

4 Jul 2023

Thx @_akhaliq! Check out our DisCo at disco-dance.github.io.🔥🔥🔥 🧙‍♂️High Generalizability. No need human-specific fine-tuning! 💃Extensive human-related applications with disentangled control! 👨‍💻Easy-to-follow framework and totally opensource code!

@_akhaliq

4 Jul 2023

DisCo: Disentangled Control for Referring Human Dance Generation in Real World paper page: huggingface.co/papers/2307.0… Generative AI has made significant strides in computer vision, particularly in image/video synthesis conditioned on text descriptions. Despite the advancements, it remains challenging especially in the generation of human-centric content such as dance synthesis. Existing dance synthesis methods struggle with the gap between synthesized content and real-world dance scenarios. In this paper, we define a new problem setting: Referring Human Dance Generation, which focuses on real-world dance scenarios with three important properties: (i) Faithfulness: the synthesis should retain the appearance of both human subject foreground and background from the reference image, and precisely follow the target pose; (ii) Generalizability: the model should generalize to unseen human subjects, backgrounds, and poses; (iii) Compositionality: it should allow for composition of seen/unseen subjects, backgrounds, and poses from different sources. To address these challenges, we introduce a novel approach, DISCO, which includes a novel model architecture with disentangled control to improve the faithfulness and compositionality of dance synthesis, and an effective human attribute pre-training for better generalizability to unseen humans. Extensive qualitative and quantitative results demonstrate that DISCO can generate high-quality human dance images and videos with diverse appearances and flexible motions.

0:29

3,851

Microsoft Research

cclin retweeted

Microsoft Research

@MSFTResearch

22 Jun 2022

Xuedong Huang, CTO, Azure AI, will present his keynote at CVPR 2022 @CVPR today at 5PM CT where he will share progress on the application of Integrative AI on computer vision and its promising results. Virtual conference registrants can tune in here: msft.it/6017bWAUz

CVPR 2022 Plenary 2

youtube.com

Linjie (Lindsey) Li

cclin retweeted

Linjie (Lindsey) Li @LINJIEFUN

18 Jun 2022

Interested in Vision Language Pre-training (VLP) but do not know where to start? Hard to track the rapid progress in VLP? Come and join us at our CVPR2022 VLP tutorial on 19th Jun (9am-5pm CDT) in person in New Orleans or virtually. vlp-tutorial.github.io #CVPR2022

106

Chitwan Saharia

cclin retweeted

Chitwan Saharia @Chitwan_Saharia

24 May 2022

We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understanding. Explore imagen.research.google and Imagen! A large rusted ship stuck in a frozen lake. Snowy mountains and beautiful sunset in the background. #imagen

292

1,611

AK

cclin retweeted

@_akhaliq

20 Oct 2021

NeuralDiff: Segmenting 3D objects that move in egocentric videos abs: arxiv.org/abs/2110.09936 project page: robots.ox.ac.uk/~vadim/neura…

1:21

Elliott / Shangzhe Wu

cclin retweeted

Elliott / Shangzhe Wu @elliottszwu

27 Oct 2021

Here are the video recordings of the workshop: youtube.com/watch?v=VmKc_sEJ…

Layered Neural Representations for Video - Tali Dekel

Tali DekelLayered Neural Representations for Videohttps://unsup3d...

youtube.com

Elliott / Shangzhe Wu @elliottszwu

6 Oct 2021

Announcing @ICCV_2021 workshop on "Unsupervised 3D Learning in the Wild" with an incredible line-up of speakers on this topic! #ICCV2021 🚩 Website: unsup3d.github.io 📅 Time: 7:00-18:00 EDT / 12:00-23:00 BST, 11 Oct 2021 Calendar: calendar.google.com/calendar… (mark it down!)

AI at Meta

cclin retweeted

AI at Meta

@AIatMeta

13 Aug 2021

We're sharing Unidentified Video Objects (UVO), a new benchmark to facilitate research in open-world segmentation, an important computer vision task that aims to detect, segment, and track all objects exhaustively in a video. Learn more: ow.ly/rH8650FQ7vp

361

AI at Meta

cclin retweeted

AI at Meta

@AIatMeta

2 Aug 2021

FAIR research scientist, Ishan Misra (@imisra_) sat down with @lexfridman to demystify self-supervised learning & its impact in #AI: youtube.com/watch?v=FUS6ceIv…. Read the blog post that inspired the conversation: ai.facebook.com/blog/self-su…

458

Jia-Bin Huang

cclin retweeted

Jia-Bin Huang

@jbhuang0604

19 Jul 2021

Writing Related Work I enjoy reading/writing the related work section of a paper. It helps organize prior research and put the contributions of the work in proper context. But HOW? Check the thread below👇

164

725

Google AI

cclin retweeted

Google AI

@GoogleAI

22 Jun 2021

Today, we are announcing the open source release of DeepLab2, a modern TensorFlow library for deep labeling that aims to facilitate future research on dense pixel labeling by providing a unified, state-of-the-art, and easy-to-use TensorFlow codebase → goo.gle/3d3SnVE

283

1,208

Machine Learning Street Talk

cclin retweeted

Machine Learning Street Talk

@MLStreetTalk

21 Jun 2021

We dive deep into self-supervised learning with Dr. Ishan Misra @imisra_ from FAIR @facebookai and cover their recent cluster of vision papers; with @ykilcher @RisingSayak CC: @ylecun @skornblith @mcaron31 @HugoTouvron youtu.be/EXJmodhu4_4

230

AI at Meta

cclin retweeted

AI at Meta

@AIatMeta

5 May 2021

DINO’s attention maps can discover and segment objects in an image or a #Video with absolutely no supervision and without being given a segmentation-targeted objective. #computervision ai.facebook.com/blog/dino-pa…

DINO and PAWS: Advancing the state of the art in computer vision

Working with Inria researchers, we’ve developed a self-supervised image representation method, DINO, which produces remarkable results when trained with Vision Transformers. We are also detailing...

ai.meta.com

Mike Schroepfer

@schrep

30 Apr 2021

Here’s our new computer vision system achieving state of the art results in image segmentation, without needing any labeled training data. This new model was trained on random, unlabeled data, but quickly achieved state-of-the-art results. It’s awesome.

0:13

285

Chelsea Finn

cclin retweeted

Chelsea Finn

@chelseabfinn

25 Feb 2020

Want to learn about meta-learning? Lecture videos for CS330 are now online! youtube.com/playlist?list=PL… Topics incl. MTL, few-shot learning, Bayesian meta-learning, lifelong learning, meta-RL & more: cs330.stanford.edu 3 guest lectures from Kate Rakelly, @svlevine, @jeffclune

658

2,539

cclin

cclin @cclin_

19 Jul 2019

Join us and consider submitting a paper to the Second workshop on Moving Camera at @ICCV19. More details on the website: sites.google.com/view/mcmvs2… Deadline: August 5 #ICCV2019 #ComputerVIsion

Moving Cameras 2019

Introduction

sites.google.com