Our new preprint is available on biorXiv: doi.org/10.1101/2023.12.14.5… !🥳 We present scplainer, a principled computational approach to streamline and standardise #MassSpec#SingleCell#proteomics data analysis. Here's a 🧵 with our key messages.
As always, many thanks to my promoter Laurent Gatto for his supervision. I also warmly thank Manon Martin and Bernadette Govaerts for their guidance on the APCA framework.
Our last work on #SingleCellMultiModal data collection is finally out! 🎉
It has been a long journey 🚀, but it taught me a lot! 🥸
Thanks to all the co-authors (in the comments), I really enjoyed working with all of you! 🤩
doi.org/10.1371/journal.pcbi…
(1/5)
#Reproducible data analysis is so important! Our decision to emphasize on software details and providing code was sparked by an email I received from @harrisonspecht "Reanalysis of reanalysis".
Software details do matter! For instance, the same KNN algorithm implemented in two pieces of software leads to different results, because they either impute by samples (cells) or by variable (proteins). Here's the impact on cell correlations.
Harrison raised the points that the correlations he computed were not the same as the correlations we showed in our previous article, while we were both processing the exact same dataset with the exact same algorithms (or were we?) !
Because the @slavovLab does such a nice job of providing the code to analyze their data, I could quickly identify that we were KNN imputing by variable and Harrison was KNN imputing by sample. This would have taken me ages if I add to guess this from a blurry methods section!