
Guidance for adding the 1PL-IRT model #147

Open
giacomoran opened this issue Jan 2, 2025 · 8 comments
@giacomoran
Hi,

I want to start contributing to this repo by adding the 1PL-IRT model (also known as Rasch model), which I think is a great baseline for SRS.

The 1PL-IRT model estimates:

  • Person ability parameters $\theta$ (one parameter per user)
  • Item difficulty parameters $\beta$ (one parameter per card)

The model assumes the probability of a correct response is:
$$P(\text{correct}) = \sigma(\theta - \beta)$$
where $\sigma$ is the logistic function.
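For concreteness, here is a minimal PyTorch sketch of the model I have in mind (names and shapes are placeholders, not benchmark code):

```python
import torch
import torch.nn as nn

class OnePLIRT(nn.Module):
    """Minimal 1PL-IRT (Rasch) sketch; num_users/num_items are placeholders."""

    def __init__(self, num_users: int, num_items: int):
        super().__init__()
        self.theta = nn.Parameter(torch.zeros(num_users))  # person abilities
        self.beta = nn.Parameter(torch.zeros(num_items))   # item difficulties

    def forward(self, user_ids: torch.Tensor, item_ids: torch.Tensor) -> torch.Tensor:
        # P(correct) = sigmoid(theta - beta)
        return torch.sigmoid(self.theta[user_ids] - self.beta[item_ids])

# Training would minimize binary cross-entropy against observed recall, e.g.:
# loss = nn.functional.binary_cross_entropy(model(user_ids, item_ids), labels)
```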


I have a few questions:

  • 1PL-IRT was originally developed outside the context of SRS, where items (aka cards) are typically shared between users. In the 10k Anki collection dataset, given two cards belonging to different users but with the same ID, should they be considered the same card (for example, because they come from the same shared deck)? If not, the model simplifies and can be trained independently for each user.
  • I've been looking at other.py, in particular at the DASH family of models, which essentially extend 1PL-IRT with review history data. The per-user and per-card parameters seem to be missing; was that intentional? Why?
@Expertium
Contributor

Expertium commented Jan 2, 2025

> Given two cards belonging to different users but with the same ID, should they be considered the same card?

That is very unlikely to occur. IDs are generated from UNIX timestamps with millisecond resolution, so the only way for two cards to have the same ID is for them to be created at exactly the same time, down to 1/1000 of a second. That being said, if they are from the same shared deck, yes, it's possible. But I also want to confirm this with @L-M-Sherlock; I'm not 100% sure.
But anyway, all algorithms in our benchmark assume independence of cards. We have tested using information from sibling cards in FSRS, but the results were not promising.
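As a toy illustration (the ID below is made up), a card ID decodes straight to its creation time:

```python
from datetime import datetime, timezone

card_id = 1704153600000  # hypothetical card ID: milliseconds since the UNIX epoch
created = datetime.fromtimestamp(card_id / 1000, tz=timezone.utc)
print(created)  # 2024-01-02 00:00:00+00:00
```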

> I've been looking at other.py, in particular at the DASH family of models, which essentially extend 1PL-IRT with review history data. The per-user and per-card parameters seem to be missing; was that intentional? Why?

Idk about that, you'd have to wait for LMSherlock to respond. Those models are the ones I am least familiar with.
On second thought, I'm not sure what you mean. Every parameter is per-user (well, per-collection, technically), they are optimized for each collection independently.
Also, weren't you helping LMSherlock with implementing DASH? #51

@giacomoran
Author

> if they are from the same shared deck, yes, it's possible

> Every parameter is per-user (well, per-collection, technically), they are optimized for each collection independently

I see. Given these two points, I think the benchmark deviates from the literature when it introduces models like 1PL-IRT and the DASH family: those models are usually trained across user collections. But I see the point of further optimizing for each individual user.

The benchmark includes "FSRS-5 default param.", which is trained on all 10k collections. Would it be possible to do the same for other models?

@Expertium
Contributor

It's not exactly "trained on all 10k collections": we don't combine them into one giant collection. Instead, we optimize FSRS on every collection individually, and then take the median of each parameter, and then use those median parameters.
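Roughly like this (toy numbers, not real FSRS parameters):

```python
import numpy as np

# One optimized parameter vector per collection (values are made up).
per_collection_params = np.array([
    [0.40, 1.20, 3.10],
    [0.35, 1.50, 2.90],
    [0.50, 1.10, 3.30],
])

# The "default" parameters are the element-wise median across collections,
# not the result of fitting one pooled dataset.
default_params = np.median(per_collection_params, axis=0)
print(default_params)  # [0.4 1.2 3.1]
```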

@giacomoran
Author

I'd be curious to see the benchmark results of models trained on the "one giant collection".

Is there anything in the training/testing setup blocking this?

@Expertium
Contributor

@L-M-Sherlock there are some questions here that you should be able to answer better than me

@L-M-Sherlock
Member

> Is there anything in the training/testing setup blocking this?

My device's RAM will cry.

@L-M-Sherlock
Member

L-M-Sherlock commented Jan 5, 2025

Do you have any other suggestions or further questions?

@giacomoran
Author

Not right now
