Pairwise compare for optimization #11

jaceybronte · 2024-12-16T19:11:00Z

This PR implements the pairwise compare tool to optimize conditions for ALSF.

review-notebook-app · 2024-12-16T19:11:05Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

MattsonCam

I added some suggestions to consider. Additionally, if you want to perform a sanity check, you could follow these steps:

Calculate the number of comparisons you expect and make sure it equals the number of rows in the output dataframe. (This is implemented as a test in Pairwise Compare with the NF1 dataset.)
Choose n pairs of samples from the input dataframes and compute the Pearson correlation of these pairs using an existing tool, such as a pandas correlation function, then compare the results to the output dataframe for the corresponding pairs. (This is not implemented as a test in Pairwise Compare.)
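The two sanity-check steps above could be sketched roughly as follows. This is a minimal illustration on hypothetical toy data, not the test that ships with Pairwise Compare. It assumes each row is one well, that wells are compared only within a group sharing the same cell line, and that each unordered pair is counted once. The `profiles` dataframe, the `feat_*` columns, and the `reference` dictionary are made-up names for illustration.

```python
from itertools import combinations
from math import comb

import pandas as pd

# Hypothetical toy profiles: one row per well, grouped by cell line.
profiles = pd.DataFrame(
    {
        "Metadata_cell_line": ["A", "A", "A", "B", "B"],
        "Metadata_Well": ["B02", "B03", "B04", "C02", "C03"],
        "feat_1": [0.1, 0.4, 0.3, 1.2, 0.9],
        "feat_2": [1.0, 0.8, 1.1, 0.2, 0.5],
        "feat_3": [0.5, 0.7, 0.2, 0.3, 0.4],
    }
)
feat_cols = ["feat_1", "feat_2", "feat_3"]

# Step 1: expected number of comparisons, assuming every unordered pair of
# wells within a group is compared exactly once: sum of C(n, 2) over groups.
group_sizes = profiles.groupby("Metadata_cell_line").size()
expected_comparisons = sum(comb(int(n), 2) for n in group_sizes)

# Step 2: reference Pearson correlations computed with pandas, keyed by the
# pair of wells, to spot-check against the tool's output for the same pairs.
reference = {}
for _, group in profiles.groupby("Metadata_cell_line"):
    wells = group.set_index("Metadata_Well")[feat_cols]
    for well_a, well_b in combinations(wells.index, 2):
        # Series.corr uses the Pearson method by default.
        reference[(well_a, well_b)] = wells.loc[well_a].corr(wells.loc[well_b])

print(expected_comparisons, len(reference))
```

In practice, `expected_comparisons` would be compared with the number of rows in the tool's output dataframe, and a handful of entries from `reference` would be compared with the matching rows using a small numeric tolerance.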

MattsonCam · 2024-12-16T21:33:02Z

4.optimization/scripts/0.pairwise-compare.py

+
+results = []
+
+for plate in plate_names:


You could also combine all of the plates into a single dataframe and add the plate column name (e.g. Metadata_Plate) to _same_columns instead of using a for loop. The loop works too, though, and may be the better solution if memory is a concern.
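As a rough sketch of the single-dataframe alternative, the per-plate dataframes could be concatenated with a plate label before being handed to the tool. The `plate_dfs` dictionary and its contents are hypothetical, and `Metadata_Plate` is only the example column name from the comment above. The call to PairwiseCompareManager is left out because its exact behavior is not shown in this thread.

```python
import pandas as pd

# Hypothetical per-plate dataframes, keyed by plate name.
plate_dfs = {
    "plate_1": pd.DataFrame({"Metadata_Well": ["B02", "B03"], "feat_1": [0.1, 0.4]}),
    "plate_2": pd.DataFrame({"Metadata_Well": ["B02", "B03"], "feat_1": [0.7, 0.2]}),
}

# Label each row with its plate, then stack all plates into one dataframe.
combined_plates = pd.concat(
    [df.assign(Metadata_Plate=name) for name, df in plate_dfs.items()],
    ignore_index=True,
)

# Adding the plate column to the "same" columns keeps comparisons within a plate.
same_columns = [
    "Metadata_Plate",
    "Metadata_cell_line",
    "Metadata_seeding_density",
    "Metadata_time_point",
]

print(combined_plates)
```

The trade-off is the one noted in the comment: a single concatenated dataframe holds every plate in memory at once, whereas the loop only needs one plate at a time.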

MattsonCam · 2024-12-16T21:36:08Z

4.optimization/scripts/0.pairwise-compare.py

+        _same_columns=["Metadata_cell_line", "Metadata_seeding_density", "Metadata_time_point"],
+        _different_columns=["Metadata_Well"],
+        _feat_cols=feat_cols,
+        _drop_cols=["Metadata_Concentration", "Metadata_Well"],


If you specify a column in _drop_cols that is not in _same_columns or _different_columns, the PairwiseCompareManager will ignore it. Consider removing Metadata_Concentration here to improve clarity.
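One quick way to spot such entries before constructing the manager is a plain set check. This sketch only mirrors the behavior described in the comment above and does not call the tool itself. The variable names are illustrative.

```python
same_columns = ["Metadata_cell_line", "Metadata_seeding_density", "Metadata_time_point"]
different_columns = ["Metadata_Well"]
drop_cols = ["Metadata_Concentration", "Metadata_Well"]

# Per the review comment, drop columns outside the same/different columns
# have no effect, so list them as candidates for removal.
grouping_columns = set(same_columns) | set(different_columns)
unused_drop_cols = [col for col in drop_cols if col not in grouping_columns]

print(unused_drop_cols)  # ['Metadata_Concentration']
```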

MattsonCam · 2024-12-16T21:43:27Z

4.optimization/scripts/0.pairwise-compare.py

+    comparer = PairwiseCompareManager(
+        _df=plate_df.copy(),
+        _comparator=pearsons_comparator,
+        _same_columns=["Metadata_cell_line", "Metadata_seeding_density", "Metadata_time_point"],
+        _different_columns=["Metadata_Well"],
+        _feat_cols=feat_cols,
+        _drop_cols=["Metadata_Concentration", "Metadata_Well"],


Consider describing what you are aiming to compare here for people less familiar with the tool. You could also provide a link to the tool for those who are curious. It may also be worth adding a comment here, at the top of the notebook, or in both places that describes the purpose of these comparisons.

MattsonCam · 2024-12-16T21:46:01Z

4.optimization/scripts/0.pairwise-compare.py

+worst_results = (
+    combined_df.loc[
+        combined_df.groupby("Metadata_cell_line__antehoc_group0")["pearsons_correlation"].idxmin(),
+        [
+            "Metadata_cell_line__antehoc_group0", 
+            "Metadata_seeding_density__antehoc_group0", 
+            "Metadata_time_point__antehoc_group0", 
+            "pearsons_correlation"
+        ]
+    ]
+)


Changing the column name here could make sense, although it may be worth changing it in the PairwiseCompareManager instead. I kept it the same because the other tool (PairwiseCompare.py) has no concept of same or different columns.
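If the renaming is done on the notebook side rather than in the tool, a minimal sketch could select the lowest-correlation row per cell line and then strip the suffix. The toy `combined_df` below is hypothetical. Only the column names are taken from the diff above.

```python
import pandas as pd

# Hypothetical comparison results using the column names from the diff.
combined_df = pd.DataFrame(
    {
        "Metadata_cell_line__antehoc_group0": ["A", "A", "B", "B"],
        "Metadata_seeding_density__antehoc_group0": [1000, 2000, 1000, 2000],
        "Metadata_time_point__antehoc_group0": [24, 48, 24, 48],
        "pearsons_correlation": [0.91, 0.35, 0.62, 0.88],
    }
)

suffix = "__antehoc_group0"

# Index label of the lowest correlation within each cell line.
worst_idx = combined_df.groupby("Metadata_cell_line__antehoc_group0")[
    "pearsons_correlation"
].idxmin()

# Select those rows and drop the suffix from the column names for readability.
worst_results = combined_df.loc[worst_idx].rename(
    columns=lambda name: name[: -len(suffix)] if name.endswith(suffix) else name
)

print(worst_results)
```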

jaceybronte added 2 commits December 16, 2024 12:06

Pairwise compare for optimization

af8ed63

Python file

818a731

jaceybronte requested a review from MattsonCam December 16, 2024 20:20

MattsonCam approved these changes Dec 16, 2024

Fixes

f49c77b

jaceybronte merged commit 1116d15 into WayScience:main Feb 18, 2025

jaceybronte deleted the pairwise-compare branch February 18, 2025 17:40
