Return LM Filter Probabilities by sidjha1 · Pull Request #125 · lotus-data/lotus

sidjha1 · 2025-02-20T04:45:31Z

Introduces return_probs. Output can look like

                   Course Name Department  Level  filter_label  probs_filter
0  Introduction to Programming         CS    100         False      0.001927
1         Advanced Programming         CS    200         False      0.000024
2               Cooking Basics   Culinary    100          True      0.880797
3       Advanced Culinary Arts   Culinary    200          True      0.851953
4              Data Structures         CS    300         False      0.000261
5                   Algorithms         CS    300         False      0.000553
6               French Cuisine   Culinary    200          True      0.985936
7              Italian Cooking   Culinary    200          True      0.997527

Or

              Course Name Department  Level  probs_filter
2          Cooking Basics   Culinary    100      0.851953
3  Advanced Culinary Arts   Culinary    200      0.851953
6          French Cuisine   Culinary    200      0.985936
7         Italian Cooking   Culinary    200      0.999665

Depending on if return_all is set.

This PR also fixes a bug in the way that logprobs were analyzed for the True/False prob calculations.

sidjha1 · 2025-02-20T04:46:34Z

-    return pd.DataFrame({
-        "Course Name": [
-            "Introduction to Programming",
-            "Advanced Programming",
-            "Cooking Basics",
-            "Advanced Culinary Arts",
-            "Data Structures",
-            "Algorithms",
-            "French Cuisine",
-            "Italian Cooking"
-        ],
-        "Department": [
-            "CS", "CS", "Culinary", "Culinary",
-            "CS", "CS", "Culinary", "Culinary"
-        ],
-        "Level": [
-            100, 200, 100, 200,
-            300, 300, 200, 200
-        ]
-    })


This file contains a bunch of ruff formatting changes. It's that weird thing where the CI does not work the same way as local. In any case this formatting is better.

sidjha1 · 2025-02-20T04:46:46Z

+class TestFilterWithProbs(BaseTest):
+    def test_filter_with_probs(self, sample_df):
+        """Test semantic filter with probabilities returned to the user"""
+        lm = LM(model="gpt-4o-mini")
+        lotus.settings.configure(lm=lm)
+        result = sample_df.sem_filter("{Course Name} will be fun", return_probs=True)
+        print(result)
+        assert "probs_filter" in result.columns
+
+    def test_filter_with_probs_and_return_all(self, sample_df):
+        """Test semantic filter with probabilities returned to the user"""
+        lm = LM(model="gpt-4o-mini")
+        lotus.settings.configure(lm=lm)
+        result = sample_df.sem_filter("{Course Name} will be fun", return_probs=True, return_all=True)
+        print(result)
+        assert "probs_filter" in result.columns
+
+        for idx, row in result.iterrows():
+            if row["filter_label"]:
+                assert row["probs_filter"] > 0.5
+            else:
+                assert row["probs_filter"] <= 0.5


Core tests are here

sidjha1 · 2025-02-20T04:47:12Z

+vs = FaissVS()

-lotus.settings.configure(lm=gpt_4o, helper_lm=gpt_4o_mini, rm=rm)
+lotus.settings.configure(lm=gpt_4o, helper_lm=gpt_4o_mini, rm=rm, vs=vs)


Not related to my logic changes. But vs needs to be set here for the example to run, given the recent merge.

sidjha1 · 2025-02-20T04:49:58Z

+                cleaned_token = logprob.token.lower().strip()
+                if cleaned_token not in ["true", "false"]:


Needed to add this .lower().strip() cleaning in a few places so we can process True, True true, etc.

liana313

For the most part looks good -- I left a couple of comments about naming. Once those are updated, we can also update the filter docs section

liana313 · 2025-02-28T01:50:39Z

            new_df["raw_output" + suffix] = filtered_raw_outputs

+        if return_scores:
+            new_df["score"] = filtered_scores


we should add an index to the end of the output col labels, since if a user filters twice, there will be a naming conflict. we should also add a test for filtering twice

sidjha1 added 4 commits February 19, 2025 17:40

Initial work

618b913

Merge branch 'main' into sid/filter-probs

bde1326

More work

9000328

Add another test

1619657

sidjha1 requested a review from liana313 February 20, 2025 04:47

sidjha1 commented Feb 20, 2025

View reviewed changes

sidjha1 added 2 commits February 19, 2025 20:59

Refactor

956db54

Switch to scores and add score_method

5587074

liana313 reviewed Feb 28, 2025

View reviewed changes

sidjha1 added 2 commits April 1, 2025 09:00

Merge branch 'main' into sid/filter-probs

610c76c

Address comments

8fee423

sidjha1 requested a review from liana313 April 1, 2025 16:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return LM Filter Probabilities#125

Return LM Filter Probabilities#125
sidjha1 wants to merge 8 commits intomainfrom
sid/filter-probs

sidjha1 commented Feb 20, 2025

Uh oh!

sidjha1 Feb 20, 2025

Uh oh!

sidjha1 Feb 20, 2025

Uh oh!

sidjha1 Feb 20, 2025

Uh oh!

sidjha1 Feb 20, 2025

Uh oh!

liana313 left a comment

Uh oh!

liana313 Feb 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		cleaned_token = logprob.token.lower().strip()
		if cleaned_token not in ["true", "false"]:

Conversation

sidjha1 commented Feb 20, 2025

Uh oh!

sidjha1 Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

sidjha1 Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

sidjha1 Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

sidjha1 Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

liana313 left a comment

Choose a reason for hiding this comment

Uh oh!

liana313 Feb 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants