Release v3 proper #1298

Merged
merged 270 commits into from
Jan 9, 2025

@kba kba commented Jan 8, 2025

It's time to flip the switch to v3.0.0.

This PR

bertsky and others added 30 commits July 6, 2024 19:00
- `Processor.__init__`: remove non-processing kwargs
- `Processor.__init__`: deprecate processing kwargs (passing file groups etc)
- `Processor`: define members for all non-processing calls
- `ocrd_cli_wrap_processor`: non-processing calls instead of init kwargs
- `run_processor` and `get_processor` and `cli.bashlib` and `cli.ocrd_tool`:
  always set processing attributes _after_ init
- `Processor.process_workspace`: delegate fileGrp checking to `verify`
  (still empty)
- `DummyProcessor.setup`: no more fileGrp assertions here (too early!)

(This is meant to ensure that existing processor implementations, i.e.
subclasses of `Processor`, no longer call `setup` in the constructor.
That way, v3.0 stays backwards compatible in more respects, which
makes adopting it incrementally easier.)
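
The lifecycle described in this commit can be sketched roughly as follows. This is a minimal illustration and not the OCR-D API: `SketchProcessor` and `run_sketch` are hypothetical names, and the bodies of `setup` and `verify` are placeholders. It only shows the ordering: construct first, assign the processing attributes afterwards, and check fileGrps at processing time.

```python
from typing import Optional


class SketchProcessor:
    """Hypothetical stand-in for a processor base class (not the OCR-D API)."""

    def __init__(self, parameter: Optional[dict] = None):
        # The constructor only stores configuration: no setup, no fileGrp checks.
        self.parameter = parameter or {}
        self.input_file_grp: Optional[str] = None
        self.output_file_grp: Optional[str] = None
        self.ready = False

    def setup(self) -> None:
        # Expensive initialisation would go here instead of __init__.
        self.ready = True

    def verify(self) -> None:
        # fileGrp checks run only once the attributes have been assigned.
        if not self.input_file_grp or not self.output_file_grp:
            raise ValueError("input_file_grp and output_file_grp must be set")

    def process_workspace(self) -> str:
        self.verify()
        if not self.ready:
            self.setup()
        return f"{self.input_file_grp} -> {self.output_file_grp}"


def run_sketch(input_file_grp: str, output_file_grp: str) -> str:
    """Hypothetical runner: set processing attributes after construction."""
    processor = SketchProcessor(parameter={})
    processor.input_file_grp = input_file_grp
    processor.output_file_grp = output_file_grp
    return processor.process_workspace()
```

Because nothing in the constructor depends on file groups, an instance can also be created for non-processing calls without any workspace at hand.
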
- `ocrd_tool.schema.yml`:
  - deprecate `input_file_grp`
  - deprecate `output_file_grp`
  - introduce+require `input_file_grp_cardinality`: exact number, or minimum and maximum
  - introduce+require `output_file_grp_cardinality`: exact number, or minimum and maximum
- `ocrd_utils.assert_file_grp_cardinality`: deprecate
- `Processor.verify`: check that
  - fileGrp attributes exist,
  - input fileGrp(s) exist in METS,
  - input/output fileGrp(s) match the cardinality constraints,
    if specified in ocrd-tool.json: either an exact number, or a minimum
    and a maximum number (a negative or zero bound is skipped)

(Processor implementors must now specify `input_file_grp_cardinality`
and `output_file_grp_cardinality` in order to have a valid `ocrd-tool.json`
again.)
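
The cardinality rule described above could be checked along these lines. This is a sketch under stated assumptions, not the actual `Processor.verify` code: the function name `cardinality_ok` is hypothetical, file groups are assumed to arrive as one comma-separated string, and the constraint is assumed to be either a single integer or a two-element `[min, max]` sequence.

```python
from typing import Sequence, Union

# Assumed representation: an exact count, or a (minimum, maximum) pair.
Cardinality = Union[int, Sequence[int]]


def cardinality_ok(file_grps: str, spec: Cardinality) -> bool:
    """Return True if the comma-separated fileGrp string satisfies spec.

    Hypothetical helper; zero or negative bounds are treated as
    "no constraint", following the commit message above.
    """
    count = len([grp for grp in file_grps.split(",") if grp])
    if isinstance(spec, int):
        # Exact number required, unless the bound is non-positive.
        return spec <= 0 or count == spec
    minimum, maximum = spec
    if minimum > 0 and count < minimum:
        return False
    if maximum > 0 and count > maximum:
        return False
    return True
```

For example, `cardinality_ok("OCR-D-IMG,OCR-D-SEG", [1, -1])` accepts any number of groups from one upwards, since the negative maximum is skipped.
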
bertsky and others added 27 commits October 23, 2024 22:12
Processor parallel pages: switch from multithreading to multiprocessing
@kba kba requested a review from bertsky January 8, 2025 19:41
@kba kba merged commit 68786a6 into master Jan 9, 2025
22 checks passed