__STDOUT LOG__
Job runs on fgcz-h-162
at /scratch/o5495_o5444_ScSeurat_2024-12-05--10-03-00_Undifferentiated_lung_cells_temp1411859
Starting EzAppScSeurat ScSeurat o5495_o5444_ScSeurat_2024-12-05--10-03-00_Undifferentiated_lung_cells_temp1411859 2024-12-05 10:03:08
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
Uploading data to Enrichr... Done.
Querying Azimuth_Cell_Types_2021... Done.
Parsing results... Done.
INFO [2024-12-05 10:27:34] Skipping pathway and TF activity
__STDERR LOG__
Loading required package: data.table
Loading required package: Biostrings
Loading required package: BiocGenerics
Attaching package: ‘BiocGenerics’
The following objects are masked from ‘package:stats’:
IQR, mad, sd, var, xtabs
The following objects are masked from ‘package:base’:
anyDuplicated, aperm, append, as.data.frame, basename, cbind,
colnames, dirname, do.call, duplicated, eval, evalq, Filter, Find,
get, grep, grepl, intersect, is.unsorted, lapply, Map, mapply,
match, mget, order, paste, pmax, pmax.int, pmin, pmin.int,
Position, rank, rbind, Reduce, rownames, sapply, setdiff, table,
tapply, union, unique, unsplit, which.max, which.min
Loading required package: S4Vectors
Loading required package: stats4
Attaching package: ‘S4Vectors’
The following objects are masked from ‘package:data.table’:
first, second
The following object is masked from ‘package:utils’:
findMatches
The following objects are masked from ‘package:base’:
expand.grid, I, unname
Loading required package: IRanges
Attaching package: ‘IRanges’
The following object is masked from ‘package:data.table’:
shift
Loading required package: XVector
Loading required package: GenomeInfoDb
Attaching package: ‘Biostrings’
The following object is masked from ‘package:base’:
strsplit
Loading required package: GenomicRanges
Loading required package: tidyverse
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr 1.1.4 ✔ readr 2.1.5
✔ forcats 1.0.0 ✔ stringr 1.5.1
✔ ggplot2 3.5.1 ✔ tibble 3.2.1
✔ lubridate 1.9.3 ✔ tidyr 1.3.1
✔ purrr 1.0.2
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ lubridate::%within%() masks IRanges::%within%()
✖ dplyr::between() masks data.table::between()
✖ dplyr::collapse() masks Biostrings::collapse(), IRanges::collapse()
✖ dplyr::combine() masks BiocGenerics::combine()
✖ purrr::compact() masks XVector::compact()
✖ dplyr::desc() masks IRanges::desc()
✖ tidyr::expand() masks S4Vectors::expand()
✖ dplyr::filter() masks stats::filter()
✖ dplyr::first() masks S4Vectors::first(), data.table::first()
✖ lubridate::hour() masks data.table::hour()
✖ lubridate::isoweek() masks data.table::isoweek()
✖ dplyr::lag() masks stats::lag()
✖ dplyr::last() masks data.table::last()
✖ lubridate::mday() masks data.table::mday()
✖ lubridate::minute() masks data.table::minute()
✖ lubridate::month() masks data.table::month()
✖ ggplot2::Position() masks BiocGenerics::Position(), base::Position()
✖ lubridate::quarter() masks data.table::quarter()
✖ purrr::reduce() masks GenomicRanges::reduce(), IRanges::reduce()
✖ dplyr::rename() masks S4Vectors::rename()
✖ lubridate::second() masks S4Vectors::second(), data.table::second()
✖ lubridate::second<-() masks S4Vectors::second<-()
✖ dplyr::slice() masks XVector::slice(), IRanges::slice()
✖ purrr::transpose() masks data.table::transpose()
✖ lubridate::wday() masks data.table::wday()
✖ lubridate::week() masks data.table::week()
✖ lubridate::yday() masks data.table::yday()
✖ lubridate::year() masks data.table::year()
ℹ Use the conflicted package () to force all conflicts to become errors
unknown param: partition
unknown param: tissue
unknown param: Azimuth
unknown param: SingleR
unknown param: cellxgeneUrl
unknown param: cellxgeneLabel
unknown param: sushi_app
unknown param: isLastJob
Loading required package: DelayedArray
Loading required package: Matrix
Attaching package: ‘Matrix’
The following objects are masked from ‘package:tidyr’:
expand, pack, unpack
The following object is masked from ‘package:S4Vectors’:
expand
Loading required package: MatrixGenerics
Loading required package: matrixStats
Attaching package: ‘matrixStats’
The following object is masked from ‘package:dplyr’:
count
Attaching package: ‘MatrixGenerics’
The following objects are masked from ‘package:matrixStats’:
colAlls, colAnyNAs, colAnys, colAvgsPerRowSet, colCollapse,
colCounts, colCummaxs, colCummins, colCumprods, colCumsums,
colDiffs, colIQRDiffs, colIQRs, colLogSumExps, colMadDiffs,
colMads, colMaxs, colMeans2, colMedians, colMins, colOrderStats,
colProds, colQuantiles, colRanges, colRanks, colSdDiffs, colSds,
colSums2, colTabulates, colVarDiffs, colVars, colWeightedMads,
colWeightedMeans, colWeightedMedians, colWeightedSds,
colWeightedVars, rowAlls, rowAnyNAs, rowAnys, rowAvgsPerColSet,
rowCollapse, rowCounts, rowCummaxs, rowCummins, rowCumprods,
rowCumsums, rowDiffs, rowIQRDiffs, rowIQRs, rowLogSumExps,
rowMadDiffs, rowMads, rowMaxs, rowMeans2, rowMedians, rowMins,
rowOrderStats, rowProds, rowQuantiles, rowRanges, rowRanks,
rowSdDiffs, rowSds, rowSums2, rowTabulates, rowVarDiffs, rowVars,
rowWeightedMads, rowWeightedMeans, rowWeightedMedians,
rowWeightedSds, rowWeightedVars
Loading required package: S4Arrays
Loading required package: abind
Attaching package: ‘S4Arrays’
The following object is masked from ‘package:abind’:
abind
The following object is masked from ‘package:base’:
rowsum
Loading required package: SparseArray
Attaching package: ‘DelayedArray’
The following object is masked from ‘package:purrr’:
simplify
The following objects are masked from ‘package:base’:
apply, scale, sweep
Loading required package: rhdf5
Attaching package: ‘HDF5Array’
The following object is masked from ‘package:rhdf5’:
h5ls
Loading required package: Biobase
Welcome to Bioconductor
Vignettes contain introductory material; view with
'browseVignettes()'. To cite Bioconductor, see
'citation("Biobase")', and for packages 'citation("pkgname")'.
Attaching package: ‘Biobase’
The following object is masked from ‘package:SparseArray’:
rowMedians
The following object is masked from ‘package:MatrixGenerics’:
rowMedians
The following objects are masked from ‘package:matrixStats’:
anyMissing, rowMedians
Loading required package: annotate
Loading required package: AnnotationDbi
Attaching package: ‘AnnotationDbi’
The following object is masked from ‘package:dplyr’:
select
Loading required package: XML
Loading required package: graph
Attaching package: ‘graph’
The following object is masked from ‘package:XML’:
addNode
The following object is masked from ‘package:stringr’:
boundary
The following object is masked from ‘package:Biostrings’:
complement
Loading required package: SummarizedExperiment
Loading required package: SeuratObject
Loading required package: sp
Attaching package: ‘sp’
The following object is masked from ‘package:IRanges’:
%over%
Attaching package: ‘SeuratObject’
The following object is masked from ‘package:SummarizedExperiment’:
Assays
The following object is masked from ‘package:GSEABase’:
intersect
The following object is masked from ‘package:GenomicRanges’:
intersect
The following object is masked from ‘package:Biostrings’:
intersect
The following object is masked from ‘package:GenomeInfoDb’:
intersect
The following object is masked from ‘package:IRanges’:
intersect
The following object is masked from ‘package:S4Vectors’:
intersect
The following object is masked from ‘package:BiocGenerics’:
intersect
The following objects are masked from ‘package:base’:
intersect, t
Attaching package: ‘Seurat’
The following object is masked from ‘package:SummarizedExperiment’:
Assays
The following object is masked from ‘package:ezRun’:
SingleCorPlot
Loading required package: SingleCellExperiment
Welcome to enrichR
Checking connection ...
Enrichr ... Connection is Live!
FlyEnrichr ... Connection is Live!
WormEnrichr ... Connection is Live!
YeastEnrichr ... Connection is Live!
FishEnrichr ... Connection is Live!
OxEnrichr ... Connection is Live!
Attaching package: ‘decoupleR’
The following object is masked from ‘package:data.table’:
:=
Registered S3 method overwritten by 'SeuratDisk':
method from
as.sparse.H5Group Seurat
Attaching shinyBS
failed to open the port 11476, trying a new port...
Loading required package: future
Assuming the input to be a matrix of counts or expected counts.
Clustering cells...
9 clusters
Creating ~4532 artificial doublets...
Dimensional reduction
Evaluating kNN...
Training model...
iter=0, 220 cells excluded from training.
iter=1, 202 cells excluded from training.
iter=2, 208 cells excluded from training.
Threshold found:0.34
218 (3.8%) doublets called
Loading required package: scran
Warning: Cannot find a parent environment called Seurat
Warning: The default method for RunUMAP has changed from calling Python UMAP via reticulate to the R-native UWOT using the cosine metric
To use Python UMAP via reticulate, set umap.method to 'umap-learn' and metric to 'correlation'
This message will be shown once per session
10:23:02 UMAP embedding parameters a = 0.9922 b = 1.112
10:23:02 Read 5446 rows and found 20 numeric columns
10:23:02 Using Annoy for neighbor search, n_neighbors = 30
10:23:02 Building Annoy index with metric = cosine, n_trees = 50
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
10:23:03 Writing NN index file to temp file /tmp/RtmpBUuwg7/file158b3c2a32f505
10:23:03 Searching Annoy index using 4 threads, search_k = 3000
10:23:03 Annoy recall = 100%
10:23:05 Commencing smooth kNN distance calibration using 4 threads with target n_neighbors = 30
10:23:09 Initializing from normalized Laplacian + noise (using RSpectra)
10:23:09 Commencing optimization for 500 epochs, with 223366 positive edges
Using method 'umap'
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
10:23:16 Optimization finished
Attaching package: 'celda'
The following objects are masked from 'package:scater':
normalizeCounts, plotHeatmap
The following object is masked from 'package:scuttle':
normalizeCounts
The following object is masked from 'package:ezRun':
selectFeatures
The following object is masked from 'package:S4Vectors':
params
--------------------------------------------------
Starting DecontX
--------------------------------------------------
Thu Dec 5 10:23:19 2024 .. Analyzing all cells
Thu Dec 5 10:23:19 2024 .... Generating UMAP
Thu Dec 5 10:23:50 2024 .... Estimating contamination
Thu Dec 5 10:23:53 2024 ...... Completed iteration: 10 | converge: 0.02182
Thu Dec 5 10:23:56 2024 ...... Completed iteration: 20 | converge: 0.01495
Thu Dec 5 10:23:58 2024 ...... Completed iteration: 30 | converge: 0.01009
Thu Dec 5 10:24:01 2024 ...... Completed iteration: 40 | converge: 0.006617
Thu Dec 5 10:24:03 2024 ...... Completed iteration: 50 | converge: 0.005275
Thu Dec 5 10:24:06 2024 ...... Completed iteration: 60 | converge: 0.004179
Thu Dec 5 10:24:08 2024 ...... Completed iteration: 70 | converge: 0.00333
Thu Dec 5 10:24:11 2024 ...... Completed iteration: 80 | converge: 0.002683
Thu Dec 5 10:24:13 2024 ...... Completed iteration: 90 | converge: 0.002192
Thu Dec 5 10:24:16 2024 ...... Completed iteration: 100 | converge: 0.001816
Thu Dec 5 10:24:18 2024 ...... Completed iteration: 110 | converge: 0.001526
Thu Dec 5 10:24:21 2024 ...... Completed iteration: 120 | converge: 0.001327
Thu Dec 5 10:24:23 2024 ...... Completed iteration: 130 | converge: 0.001165
Thu Dec 5 10:24:26 2024 ...... Completed iteration: 140 | converge: 0.001051
Thu Dec 5 10:24:27 2024 ...... Completed iteration: 145 | converge: 0.0009979
Thu Dec 5 10:24:27 2024 .. Calculating final decontaminated matrix
--------------------------------------------------
Completed DecontX. Total time: 1.181975 mins
--------------------------------------------------
236 genes passed tf-idf cut-off and 0 soup quantile filter. Taking the top 0.
Error in autoEstCont(sc, tfidfMin = tfidfMin, forceAccept = T, doPlot = FALSE) :
No plausible marker genes found. Is the channel low complexity (see help)? If not, reduce tfidfMin or soupQuantile
594 genes passed tf-idf cut-off and 3 soup quantile filter. Taking the top 3.
Using 16 independent estimates of rho.
Estimated global rho of 0.05
Expanding counts from 13 clusters to 5446 cells.
Calculating cluster 0
Calculating cluster 1
Calculating cluster 2
Calculating cluster 3
Calculating cluster 4
Calculating cluster 5
Calculating cluster 6
Calculating cluster 7
Calculating cluster 8
Calculating cluster 9
Calculating cluster 10
Calculating cluster 11
Calculating cluster 12
Warning in .AUCell_calcAUC(geneSets = geneSets, rankings = rankings, nCores = nCores, :
The following gene sets will be excluded from the analysis(less than 20% of their genes are available):
Airway secretory cell, Alveolar pneumocyte Type II, Anti-tumor immune cell, Basophil, CD4 T cell, CD8 T cell, Clara cell, Conventional dendritic cell 2(cDC2), Conventional dendritic cell(cDC), Exhausted CD8+ T cell, Naive B cell, Red blood cell (erythrocyte)
Warning: Overwriting miscellanous data for model
Warning: Adding a dimensional reduction (refUMAP) without the associated assay being present
detected inputs from HUMAN with id type Gene.name
reference rownames detected HUMAN with id type Gene.name
Normalizing query using reference SCT model
Warning: 796 features of the features specified were not present in both the reference query assays.
Continuing with remaining 2204 features.
Projecting cell embeddings
Finding query neighbors
Finding neighborhoods
Finding anchors
Found 534 anchors
Finding integration vectors
Finding integration vector weights
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
Predicting cell labels
Predicting cell labels
Predicting cell labels
Predicting cell labels
Predicting cell labels
Predicting cell labels
Integrating dataset 2 with reference dataset
Finding integration vectors
Integrating data
Computing nearest neighbors
Running UMAP projection
10:33:17 Read 5446 rows
10:33:17 Processing block 1 of 1
10:33:17 Commencing smooth kNN distance calibration using 4 threads with target n_neighbors = 20
10:33:17 Initializing by weighted average of neighbor coordinates using 4 threads
10:33:17 Commencing optimization for 67 epochs, with 108920 positive edges
Using method 'umap'
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
10:33:18 Finished
Projecting reference PCA onto query
Finding integration vector weights
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
Projecting back the query cells into original PCA space
Finding integration vector weights
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
Computing scores:
Finding neighbors of original query cells
Finding neighbors of transformed query cells
Computing query SNN
Determining bandwidth and computing transition probabilities
0% 10 20 30 40 50 60 70 80 90 100%
[----|----|----|----|----|----|----|----|----|----|
**************************************************|
Total elapsed time: 3.68986988067627
/var/spool/slurmd/job11598/slurm_script: line 96: 1411900 Killed R --vanilla --slave <