- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- BIRS Workshop Lecture Videos /
- CORTEX seq-FISH: integration with scRNA-seq data
Open Collections
BIRS Workshop Lecture Videos
BIRS Workshop Lecture Videos
CORTEX seq-FISH: integration with scRNA-seq data Singh, Amrit
Description
Dr. Amrit Singh (https://amritsingh.ca) is a post-doctoral research fellow at the PROOF Centre of Excellence. University of British Columbia, Pathology. He uses statistical methodologies to extract signals from big biological data. He have a background in biology, math, and programming (R/Python/Node) and has developed new methods to integrate multi-source biological data as part of the mixOmics data integration project . He is interested in using web and voice to develop interactive user interfaces for users to extract maximal information from their data. Dr. Amrit Singh work addressed the question if scRNA-seq data be overlaid onto seqFISH for resolution enhancement The published approach trained a multiclass SVM on the scRNAseq data and applied it to the seqFISH data to estimate the cell-types labels. My approach uses a penalized regression method (glmnet) with a semi-supervised approach in order to build a model using both the scRNAseq+seqFISH data. This strategy uses a recursive approach that involves multiple rounds of training glmnet models using labeled data (label and imputed) and predicting the cell-type labels of unlabeled data. At each iteration, cell-type labels with high confidence (probability > 0.5) are retained for the next iteration, where a new glmnet model is trained with the scRNAseq data and seqFISH data with imputed cell-type labels with high confidence. This process is repeated until all cell-types in the seqFISH data have been labeled or until 50 iterations have been reached (in order to reduce compute times). The advantage of this approach is that more data in used for model training such that the resulting model may generalize better to new data. The performance of this approach was estimated using cross-validation, using only the scRNAseq data as the test set. This work was done in collaboration with Prof Kim-Anh Le Cao (University of Melbourne) . Code is available at https://github.com/singha53/ssenet
Item Metadata
Title |
CORTEX seq-FISH: integration with scRNA-seq data
|
Creator | |
Publisher |
Banff International Research Station for Mathematical Innovation and Discovery
|
Date Issued |
2020-06-15T08:21
|
Description |
Dr. Amrit Singh (https://amritsingh.ca) is a post-doctoral research fellow at the PROOF Centre of Excellence. University of British Columbia, Pathology. He uses statistical methodologies to extract signals from big biological data. He have a background in biology, math, and programming (R/Python/Node) and has developed new methods to integrate multi-source biological data as part of the mixOmics data integration project . He is interested in using web and voice to develop interactive user interfaces for users to extract maximal information from their data.
Dr. Amrit Singh work addressed the question if scRNA-seq data be overlaid onto seqFISH for resolution
enhancement
The published approach trained a multiclass SVM on the scRNAseq data and applied it to the seqFISH data to estimate the cell-types labels. My approach uses a penalized regression method (glmnet) with a semi-supervised approach in order to build a model using both the scRNAseq+seqFISH data. This strategy uses a recursive approach that involves multiple rounds of training glmnet models using labeled data (label and imputed) and predicting the cell-type labels of unlabeled data. At each iteration, cell-type labels with high confidence (probability > 0.5) are retained for the next iteration, where a new glmnet model is trained with the scRNAseq data and seqFISH data with imputed cell-type labels with high confidence. This process is repeated until all cell-types in the seqFISH data have been labeled or until 50 iterations have been reached (in order to reduce compute times). The advantage of this approach is that more data in used for model training such that the resulting model may generalize better to new data. The performance of this approach was estimated using cross-validation, using only the scRNAseq data as the test set.
This work was done in collaboration with Prof Kim-Anh Le Cao (University of Melbourne) .
Code is available at https://github.com/singha53/ssenet
|
Extent |
19.0 minutes
|
Subject | |
Type | |
File Format |
video/mp4
|
Language |
eng
|
Notes |
Author affiliation: University of British Columbia
|
Series | |
Date Available |
2020-12-14
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0395268
|
URI | |
Affiliation | |
Peer Review Status |
Unreviewed
|
Scholarly Level |
Postdoctoral
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International