Dataset Version

These version notes are for the current version of CLUE. For past versions, see here.


This beta release is an expansion upon the previous 2017 data release and contains ~3M gene expression profiles and ~1M replicate-collapsed signatures.

Total number of signatures = ~1M replicate-collapsed signatures.

3.02M profiles

1.16M signatures

81,979 perturbagens

33,609 compounds

657 MoAs

9,288 unique genes targeted

240 cell contexts (12 primary)


Current Version

Data version 1.0

Touchstone-P is a library of proteomic signatures that serves as a reference database. Data version 1.0 contains {3,396} total samples, spanning {90} small-molecule perturbations and {6} cell lines in {2} mass spectrometry-based assays (P100 and GCP).

Tool version 1.7.1

Please see the Proteomics Signature Pipeline Github page to access the tools used to compute similarities and connectivities. This system of versioning aligns with the tag/release system on Github. Now using cmapPy 3.2.


Current Version

The Cell App is a library of cell lines that serves as a reference database. The current Version includes 2,705 cell lines and their annotations.