Most ARM chips can't run decent AI models. Introducing EfficientSpeech, a 266k-param TTS model. Low cost ARM chips like in RPi4 can generate 104sec of speech mel spec in 1sec. Here's an AI-generated video w/ voice from EfficientSpeech. Info: github.com/roatienza/efficieβ¦#ICASSP2023
"The UP National Engineering Center Analytics and Data Science Certifications announced the development on Wednesday." gmanetwork.com/news/scitech/β¦ via @gmanews
Idea: If data augmentation improves model generalization, why not use it to generate 2 new inputs and force the representations to agree. Result: Additional model performance improvement. Comparison: Unlike Label Smoothing, the performance of our method, AgMax, is consistent.
Weβre introducing GSLM, the first language model that breaks free completely of the dependence on text for training. This βtextless NLPβ approach learns to generate expressive speech using only raw audio recordings as input. Learn more and get the code:
ai.facebook.com/blog/textlesβ¦
Yesterday, my former grad student Daryl gave a talk at Sony CSL Paris about his thesis on Next View Policy for 3D Reconstruction. Youtube: youtu.be/KdyDj3bjU0I
After the Christmas break, we come back with our #enticelle seminar series! We welcome Daryl Peralta from @upsystem with his talk about βNext-Best View Policy for 3D Reconstructionβ. Register here to attend on Wednesday 13 at 11am (CET) csl.sony.fr/seminars/
Our paper Next-Best View Policy for 3D Reconstruction
was presented (oral) Aug 28 at ECCV 2020 Workshops. Key contribs: 1) 3K 3D House Models dataset 2)Learning algo to efficiently scan 3D houses. Paper: arxiv.org/abs/2008.12664 Youtube: youtu.be/OXynAHTDTTA
Advanced Deep Learning with TensorFlow 2 and Keras 2nd Ed (@PacktPub) has been added to bookwatch. Author Rowel Atienza (@jacobe) introduces the practical side of deep learning i-programmer.info/book-watchβ¦