mosaicmpi.Dataset.get_category_overrepresentation

mosaicmpi.Dataset.get_category_overrepresentation#

Dataset.get_category_overrepresentation(layer: str, truncate_negative: bool = True, subset_categories: Collection[str] = None) DataFrame#

Calculate Pearson residual of chi-squared test, associating programs for each rank (k) to categories of samples/observations. By default, truncates negative values.

Parameters:
  • layer (str) – name of categorical data layer

  • truncate_negative (bool, optional) – Truncate negative residuals to 0, defaults to True

  • subset_categories (Collection[str]) – Provide a subset of categories for calculating overrepresentation

Returns:

category × program matrix of overrepresentation values

Return type:

pd.DataFrame