Release notes
1.9.1
Bug fixes
get_pseudobulknow handlespandas>=2.0.0and properly handles ``NA``s.
Changes
All OmniPath functions now query directly the OmniPath server,
omnipathandpypath-omnipathpackages are no longuer needed. Now orthology conversion usesliana’s implementation instead.All OmniPath functions now include a
licenseparameter, allowing you to choose between"academic","commercial", or"nonprofit"licenses (by default, the academic license is used, as it contains the most information).
Additions
Added function
show_organisms.Added new vignette on how to use different licenses to access OmniPath databases.
1.8.0
Bug fixes
Fixed
scipy >= 1.14.0errors due to deprecated.Aattribute.get_pseudobulknow handles categorical columns.
Changes
prefunctions now are more efficient.- Refactored
run_gsva: -
kcdfdensity argument now accepts either'gaussian','poisson'orNone. -mx_diffandabs_rnknow can be set to different values. -Deprecatedtauparameter
- Refactored
Additions
Added method
zscore.Added
prefunctionbreak_ties.
1.7.0
Bug fixes
Fixed error in in
get_contrastby reverting use ofcopy.deepcopytocopy.Fixed verbose error regarding the number of unique sources being used in
benchmark.Added check for minimum version of
igraph>=0.10.0to properly renderplot_network.Fixed return error of methods triggered when an observation was empty and input was
AnnData.
Changes
Resource functions such as
get_resourceorget_collectrinow accept differentgenesymbol_resourcethan UniProt for gene translation to other organisms.Deprecated
sklearnand switched tosklearnforudt.
Additions
Added metric
recallto the benchmark pipeline.
1.6.0
Bug fixes
Fixed wrong corrected p-values of
get_metadata_associationswhenever more than 10 factors were present.Added static fallback for the most used
omnipathresources, enabling their retrieval when internet connection is unstable.Fixed ZeroDivisionError of
gseawhen many 0s were present.dense_runnow returns sorted features, which allows to run the benchmark pipeline ignoring zeros.
Changes
Methods now throw an error if there are repeated feature names in
mat.Functions can now accept
AnnDataobjects with any sparse format than justscipy.sparse.csr_matrix.get_pseudobulknow can return empty observations and features with the parameterremove_empty=False.Fixed
pandaswarnings for categorical data across several functions.plot_volcanoandplot_volcano_dfnow can use different colors for positive, negative and null genes.benchmarknow has increased verbosity.get_metadata_associationsnow has averboseargument.plot_associationsnow uses themarsileaplotting library instead ofPyComplexHeatmap.
Additions
Added
plot_networkto visualize the network structure of top sources and their targets.Added metrics
rankandnrankto the benchmark pipeline.Added
denseargument todecoupleto run methods ignoring zeroes per observation.
1.5.0
Bug fixes
run_mlmnow uses the correct number of degrees of freedom to compute the t-value (one less than before), scores might change slightly.rank_sources_groupsnow handlesnanp-value results and skips whenever the group is equal to the reference.get_pseudobulknow handles any sparse matrix format, can use individuallambdafunctions asmodeand handles negative values when filtering formin_countsandmin_cells.Plotting functions with a grid now show it below dots instead of above.
Changes
All functions that deal with
matnow do not enforcecsr_matrixformat automatically.The benchmark pipeline now handles methods that provide only positive enrichment scores. Also added support for
nanscores.Modified
get_ora_dfto make it easier to use results of differential expression analysis, now returns different statistics.gseanormalized score now is computed as in the original publication(ES+/mean(null+) and ES-/mean(null-))instead of(ES - mean(null)) / sd(null).run_ulmnow runs faster.get_collectrinow returns the PMIDs supporting each interaction when available.
Additions
Added
get_gsea_dffunction to rungseadirectly on results of differential expression analysis, returns different statistics and the leading edge.Added
plot_dotplotfunction to plot results ofget_ora_df.Added
plot_barplot_dffunction to plot results ofget_ora_df.Added
plot_running_scorefunction to plot results ofget_gsea_df.Added
plot_associationsfunction to check for significant covariates in the metadata.
1.4.0
Changes
get_pseudobulkchanges:Default values now do not filter features. For feature filtering check the new functions
filter_by_exprorfilter_by_prop.If feature filters are used, it may return more genes than before due to a change of
> min_propsto>= min_props.Now it returns quality control metrics such as
psbulk_n_cells,psbulk_countsandpsbulk_props.Now
groups_colaccepts take multiple keys.Now
modeaccepts a dictionary of callable functions. The resulting profiles will be stored in.layers.
swap_layernow has a new argumentX_layer_key, a.layerskey where to move and store the original.X.Pseudobulk and bulk vignettes have been updated to use the PyDESeq2 package
run_consensusnow accepts extra arguments with the new parametterargsthat are passed down todecouple.Omnipath functions now return resources with sorted indexes and throw a warning if the version is too old.
run_wsumandrun_wmeannow correctly accept empty null distributions.
Additions
Added
filter_by_exprfeature filtering function from edgeR.Added
filter_by_propfeature filtering function. In previous versions it was incorporated insideget_pseudobulk.Added
plot_psbulk_samplesto assess the quality of pseudobulk samples.Added
plot_filter_by_exprto assess which filtering thresholds to use infilter_by_expr.Added
plot_filter_by_propto assess which filtering thresholds to use infilter_by_prop.Added
plot_volcano_dfto plot volcano plots from long format dataframes.Added
plot_targetsto plot downstream target genes of a source by their change and weight.Added
get_collectrito retrieve the CollecTRI gene regulatory network.Added
get_ksn_omnipathto retrieve the Kinase-Substrate network from omnipath.Added
rank_sources_groupsto identify marker sources (TFs, pathways, etc.) per group of samples/cells.
1.3.4
Changes
get_pseudobulknow has new arguments:modeto change how to summarize profiles andskip_checksto bypass checks.OmniPath functions now accept more organism synonyms.
Bug fixes
Fixed empty text labels bug for
adjustText==0.8.
1.3.3
Bug fixes
read_gmtis now properly exported.
1.3.2
Bug fixes
plot_metrics_scatter_colsnow deals with missing sources when comparing nets.
Changes
get_pseudobulkandget_actsnow have adtypeargument due to futureAnnDatachanges.plot_metrics_scatterandplot_metrics_boxplotnow useGroupBy.mean(numeric_only=True).
Additions
Added
swap_layerfunction to easily moveadatalayers to.X.Added
read_gmtfunction to read GMT files containing gene sets.
1.3.1
Changes
Omnipath wrappers (
get_resource,get_dorotheaandget_progeny) now accept any organism name.
1.3.0
Bug fixes
Fixed change in api from
sklearn.tree.Forced gene names in
extractto be inunicodeformat.Changed integer format from
int32toint64to accommodate larger datasets across methods.
Additions
Added conversion utility function
translate_netto translate nets across organisms.
1.2.0
Bug fixes
Removed
python <3.10limitation.Forced
np.float32type to output ofget_contrast.Made
summarize_actscompatible with older versions of numpy.extract_psbulk_inputsnow checks if mat and obs have matching indexes.plot_volcanonow correctly can plot networks with different source names.
Changes
extractnow removes empty samples and features.run_consensusnow follows the same format as other methods, old function is now calledcons.get_pseudobulknow checks if input are raw integer counts.plot_volcanonow can plot without subsetting features by a network and can save plots to disk.plot_volcanonow usesadjustTextto better plot text labels.plot_volcanonow can set logFCs and p-value limits for outliers.get_top_targetsnow can also work without subsetting features by a network and returns significant adjusted p-values.get_contrastnow can also work without needing to group.udtandmdtnow check ifskrangerandsklearnare installed, respectively.get_toy_datanow contains more example TFs.get_top_targetsnow returnslogFCsandpvalsas column names instead oflogFCandpval.format_contrast_resultsnow returns also the adjusted p-value.
Additions
Added
dense_runutil function which runs methods ignoring zeros in the data.Added
plot_violinsandplot_barplotfunctions.Added
p_adjust_fdrutil function to correct p-values for FDR.Added
get_ora_dffunction to infer ora from lists of genes instead of an input mat.Added
shuffle_netfunction to randomize networks.Added benchmarking metrics
metric_auroc,metric_auprc,metric_mcaurocandmetric_mcauprc.Added
get_toy_benchmark_datafunction to generate a toy example for benchmarking.Added
show_metricsfunction to show available metrics.Added
benchmark,format_benchmark_inputsandget_performancesfunctions to benchmark methods and nets.Added
plot_metrics_scatterfunction to plot the results of running the benchmarking pipeline.Added
plot_metrics_scatter_colsfunction to plot the results of running the benchmarking pipeline grouped by two levels.Added
plot_metrics_scatterfunction to plot the results of running the benchmarking pipeline.Added
plot_metrics_boxplotfunction to plot the distributions of Monte-Carlo benchmarking metrics.
1.1.0
Bug fixes
Fixed
get_pseudobulkerrors.Fixed
get_progenyto correctly return non duplicate entries.Fixed
run_viperparallelization error.Fixed
run_orato correctly deal with random ties.
Changes
get_dorotheanow returns an ordered dataframe.get_contrastnow prints warnings instead of returning an empty dataframe.
Additions
Added
get_top_targetsutil function.Added
format_contrast_resultsutil function.