
Show the number of not classified instances for multi-label models (#3964) #4150

Open · wants to merge 1 commit into master

Conversation

piri-p commented Apr 28, 2024

Fix #3964 - Show the number of not classified instances for multi-label models @suhaibmujahid

Done by:

  • Passing an extra parameter y_test to calculate the total number of not-classified instances (y_test - classified); see the sketch below
  • Adding a table header
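
A minimal sketch of that counting logic, with hypothetical names; only the arithmetic (ground-truth total minus already-classified counts) comes from this PR:

    # Minimal sketch with hypothetical names; only the arithmetic
    # (ground-truth total minus classified counts) comes from this PR.
    def append_not_classified(table_row, y_test_column, class_value):
        """Append the 'Not classified' count for one class value of one label.

        table_row: [class_value, classified_count_0, classified_count_1, ...]
        y_test_column: ground-truth values of this label over the test set.
        """
        y_true_count = list(y_test_column).count(class_value)
        table_row.append(y_true_count - sum(table_row[1:]))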

Run log attached.

issue-3964-run-log.txt

suhaibmujahid (Member) left a comment

Thank you, @piri-p! Please see my comments. Also, please fix the linter (you may want to consider installing pre-commit¹).

Footnotes

  1. https://github.com/mozilla/bugbug#auto-formatting

Comment on lines +152 to +153
    # Total test instances whose ground truth for this label (column num) is i
    y_true_count = (test_classes[:, num].tolist()).count(i)
    # "Not classified" = that total minus the counts already in the row
    # (table[i][0] is assumed to be the row label, hence the [1:] slice)
    table[i].append(y_true_count - sum(table[i][1:]))
suhaibmujahid (Member):

The number of "Not classified" was originally part of the table but dropped earlier:

    # Don't show the Not classified row in the table output
    if "__NOT_CLASSIFIED__" in labels and not is_multilabel:
        confusion_matrix_table.pop(labels.index("__NOT_CLASSIFIED__"))

It may be cleaner to avoid dropping it in the first place.
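
For illustration, one possible reading of this suggestion (a sketch only, reusing the names from the snippet above and assuming the table is row-major with one column per predicted label; none of this code is from the PR):

    # Sketch (not from the PR): read the "Not classified" counts from the full
    # confusion matrix before popping the row, instead of re-deriving them
    # from y_test later. Assumes one column per predicted label.
    if "__NOT_CLASSIFIED__" in labels:
        nc_idx = labels.index("__NOT_CLASSIFIED__")
        not_classified_counts = [row[nc_idx] for row in confusion_matrix_table]
        # ...then append not_classified_counts to the per-label tables before
        # the __NOT_CLASSIFIED__ row itself is removed from the display.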

@@ -499,8 +504,9 @@ def train(self, importance_cutoff=0.15, limit=None):

    tracking_metrics["report"] = report

    # no confidence threshold - no need to handle 'Not classified' instances
suhaibmujahid (Member):

I think this comment is helpful in the context of reviewing this PR. However, without that context (i.e., after the PR is merged), the comment feels out of place.

@@ -567,8 +573,9 @@ def train(self, importance_cutoff=0.15, limit=None):
    labels=confidence_class_names,
    )
    )
    # with confidence threshold - handle 'Not classified' instances by passing y_test
suhaibmujahid (Member):

Same as the comment above.

piri-p (Author) commented Apr 29, 2024

Hi @suhaibmujahid, thank you for the feedback!

> Also, please fix the linter (you may want to consider installing pre-commit)

Done :)

> The number of "Not classified" was originally part of the table but dropped earlier:
>
>     # Don't show the Not classified row in the table output
>     if "__NOT_CLASSIFIED__" in labels and not is_multilabel:
>         confusion_matrix_table.pop(labels.index("__NOT_CLASSIFIED__"))
>
> It may be cleaner to avoid dropping it in the first place.

I just ran the command in issue-3964-run-log.txt again and checked the confusion matrix of the "Not classified" class; it gives [[6, 0], [1, 1]] for confidence threshold > 0.6.

Now I am not sure how to interpret this: that matrix doesn't correspond to a specific class, but we would like to see the number of 'Not classified' instances in each specific class's visualizer.

In my current implementation, I calculate the number of "Not classified" instances class-wise, i.e., any non-classified samples from the test set are counted. Here the test set has size 200, so any non-classified samples among these 200 are counted. This way, the total sum of all elements in each matrix is exactly the test size, 200.

Say we compare the two: the current implementation gives the tables below, while the confusion-matrix version would give the A/B tables further down.

Current implementation - any non-classified samples from the test set are counted (here: 192 and 190):

label: crash
    total sum = 1 + 0 + 192 + 0 + 7 + 0 = 200
    192 = instances whose ground-truth label isn't 'crash' and which the model classifies as neither 'crash' nor 'not crash', i.e. the predicted probability is ambiguous, e.g. close to 0.5
      0 = instances whose ground-truth label is 'crash' and which the model classifies as neither 'crash' nor 'not crash'
╒════════════╤═════════════════╤═════════════════╤══════════════════╕
│            │   0 (Predicted) │   1 (Predicted) │   Not classified │
╞════════════╪═════════════════╪═════════════════╪══════════════════╡
│ 0 (Actual) │               1 │               0 │              192 │
├────────────┼─────────────────┼─────────────────┼──────────────────┤
│ 1 (Actual) │               0 │               7 │                0 │
╘════════════╧═════════════════╧═════════════════╧══════════════════╛

label: memory - total sum = 7 + 0 + 190 + 0 + 1 + 2 = 200
╒════════════╤═════════════════╤═════════════════╤══════════════════╕
│            │   0 (Predicted) │   1 (Predicted) │   Not classified │
╞════════════╪═════════════════╪═════════════════╪══════════════════╡
│ 0 (Actual) │               7 │               0 │              190 │
├────────────┼─────────────────┼─────────────────┼──────────────────┤
│ 1 (Actual) │               0 │               1 │                2 │
╘════════════╧═════════════════╧═════════════════╧══════════════════╛
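
As a quick sanity check (matrices copied from the two tables above):

    # Sanity check: with class-wise counting, each per-label matrix sums to
    # the full test size of 200.
    crash = [[1, 0, 192], [0, 7, 0]]
    memory = [[7, 0, 190], [0, 1, 2]]
    for matrix in (crash, memory):
        assert sum(sum(row) for row in matrix) == 200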

We would have this instead using the confusion matrix:

Implementation using __NOT_CLASSIFIED__ in confusion_matrix_table (here: A and B)
label: crash
╒════════════╤═════════════════╤═════════════════╤══════════════════╕
│            │   0 (Predicted) │   1 (Predicted) │   Not classified │
╞════════════╪═════════════════╪═════════════════╪══════════════════╡
│ 0 (Actual) │               1 │               0 │                A │
├────────────┼─────────────────┼─────────────────┼──────────────────┤
│ 1 (Actual) │               0 │               7 │                B │
╘════════════╧═════════════════╧═════════════════╧══════════════════╛

label: memory
╒════════════╤═════════════════╤═════════════════╤══════════════════╕
│            │   0 (Predicted) │   1 (Predicted) │   Not classified │
╞════════════╪═════════════════╪═════════════════╪══════════════════╡
│ 0 (Actual) │               7 │               0 │                A │
├────────────┼─────────────────┼─────────────────┼──────────────────┤
│ 1 (Actual) │               0 │               1 │                B │
╘════════════╧═════════════════╧═════════════════╧══════════════════╛
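
For illustration only, here is a guess at how A and B could be computed per label, assuming a prediction counts as "Not classified" when neither class clears the confidence threshold (hypothetical names; not code from the PR):

    import numpy as np

    # Hypothetical sketch (not from the PR): split one label's not-classified
    # instances by their actual value. A prediction is treated as
    # "Not classified" when neither class clears the confidence threshold.
    def not_classified_by_actual(y_true_col, proba_col, threshold=0.6):
        # Unclassified: P(1) <= threshold and P(0) = 1 - P(1) <= threshold
        unclassified = (proba_col <= threshold) & (proba_col >= 1 - threshold)
        a = int(np.sum(unclassified & (y_true_col == 0)))  # actual 0 -> cell A
        b = int(np.sum(unclassified & (y_true_col == 1)))  # actual 1 -> cell B
        return a, b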

> I think this comment is helpful in the context of reviewing this PR. However, without that context (i.e., after the PR is merged), the comment feels out of place.

Should I remove these comments, or how would you prefer to handle them?
