Hi @willtownes,
My typical Seurat workflow for multiple samples and conditions is to run PCA followed by CCA-based integration (now IntegrateLayers in Seurat v5), then identify one joint set of clusters. If I want to try swapping in GLM-PCA, is that supposed to work as-is or do I need to adjust somehow?
I just ran a test on real data using the approximation method mentioned here, where I used nullResiduals on my raw counts then ran PCA on that using the top 3k deviant genes, followed by IntegrateLayers.
The resulting UMAP and clusters looked nothing like my PCA-based analysis, so either I did something wrong or it is not appropriate to do this in the first place.
Can you share your thoughts on whether this is possible, and if so, some best practices for doing this in Seurat when the dataset is large (>50k cells)? The RunGLMPCA() helper function was itself taking too long on these data.
Thanks!
Hi @willtownes,
My typical Seurat workflow for multiple samples and conditions is to run PCA followed by CCA-based integration (now
IntegrateLayersin Seurat v5), then identify one joint set of clusters. If I want to try swapping in GLM-PCA, is that supposed to work as-is or do I need to adjust somehow?I just ran a test on real data using the approximation method mentioned here, where I used
nullResidualson my raw counts then ran PCA on that using the top 3k deviant genes, followed byIntegrateLayers.The resulting UMAP and clusters looked nothing like my PCA-based analysis, so either I did something wrong or it is not appropriate to do this in the first place.
Can you share your thoughts on whether this is possible, and if so, some best practices for doing this in Seurat when the dataset is large (>50k cells)? The
RunGLMPCA()helper function was itself taking too long on these data.Thanks!