mosaicmpi.integration.Integration.get_usages

Integration.get_usages(discretize=False, normalize=False) → DataFrame

Calculate usage of each program in each dataset and sample/observation.

Parameters:
  • discretize (bool, optional) – Discretize the usage matrix so that, for each value of k, each sample uses only one program (the one with the maximum usage). Defaults to False.

  • normalize (bool, optional) – Normalize the program usage matrix so that, for each value of k, the usage of all programs sums to 1. Defaults to False.
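
The effect of the two options can be illustrated on a toy usage matrix for a single value of k. This is a minimal sketch in plain pandas, not mosaicmpi's implementation: the helper names normalize_usage and discretize_usage are hypothetical, and it assumes that normalization is applied per sample (row) and that discretization yields a 0/1 indicator for the top program.

```python
import pandas as pd

# Toy usage matrix for one value of k: rows are samples, columns are programs.
usage = pd.DataFrame(
    [[2.0, 1.0, 1.0],
     [0.0, 3.0, 1.0]],
    index=["sample_a", "sample_b"],
    columns=["program_1", "program_2", "program_3"],
)


def normalize_usage(df):
    """Hypothetical helper: scale each row so usage of all programs sums to 1."""
    return df.div(df.sum(axis=1), axis=0)


def discretize_usage(df):
    """Hypothetical helper: keep only the maximum-usage program per row.

    Assumption: the result is a 0/1 indicator; ties go to the first program.
    """
    top = df.idxmax(axis=1)  # label of the maximum-usage program per sample
    return pd.DataFrame(
        {col: (top == col).astype(float) for col in df.columns},
        index=df.index,
    )


normalized = normalize_usage(usage)
discretized = discretize_usage(usage)
print(normalized)
print(discretized)
```

In this sketch every row of normalized sums to 1, and every row of discretized has exactly one nonzero entry. The real method may differ in detail, for example in how ties are broken or in whether the discretized matrix keeps the original usage value.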

Returns:

samples × programs matrix of usage values

Return type:

pd.DataFrame