The iMVP-utils

interactive epitranscriptomic Motif Visualization and Sub-type Partitioning (iMVP) is a strategy inspired by the commonly used single cell analysis strategy of dimensional reduction followed by clustering. Different from the digital counts in single cell analysis, we here use the RNA sequence (and/or structure) as an input. Here, we firstly transform RNA sequences into a one-hot format; then these transformed sequences were projected to a 2D plane with UMAP, which can not only gather the similar sequences together, but also maintain the relationships between each other (compared with t-SNE); the dimensional reduced data were further clustered by density, with the super-efficient algorithm HDBSCAN, to highlight the enriched sequences.

Background

Installation

For beginners

For advanced users

Algorithm and parameter selections

Dimension Reduction Algorithms

Unsupervised clustering algorithms

Towards huge dataset

Limitations

Notebooks

API Reference

Indices and tables