Commit 2075797

bump: v0.3.0 → v0.4.0
1 parent 1064cbb commit 2075797

2 files changed: 32 additions, 1 deletion

CHANGELOG.md

Lines changed: 31 additions & 0 deletions
@@ -1,3 +1,34 @@
+## v0.4.0 (2026-03-04)
+
+### BREAKING CHANGE
+
+- MetricsResult.language replaced by input_languages/output_languages
+
+### Feat
+
+- add lazy execution filtering and ExecutionMode enum
+- add cross-lingual aggregation modes for per-language metrics
+- freelancer project ranking
+- add unicode normalization to lexical baseline preprocessing
+- add lexical baselines for ranking
+
+### Fix
+
+- remove from example the dataset that uses ESCO 1.0.5 but defines UK as supported language
+- add language field to MetricsResult for proper per-language aggregation
+- solve issues in example files
+- include lowercase setting in lexical baseline model names
+- import SkillSkape
+
+### Refactor
+
+- use language-grouped averaging in per-task aggregation
+- migrate freelancer task to dataset_id-based language mapping
+- make language_aggregation_mode a non-optional parameter in evaluate()
+- migrate freelancer project matching tasks to load_dataset API
+- rename language_results to datasetid_results for consistency with dataset_id abstraction
+- generalize dataset indexing from language-based to dataset_id-based
+
 ## v0.3.0 (2026-01-09)
 
 ### Feat

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ packages = ["src/workrb"]
 
 [project]
 name = "workrb"
-version = "0.3.0"
+version = "0.4.0"
 description = "WorkRB: Work Research Benchmark. Easy benchmarking of AI progress in the work domain."
 readme = "README.md"
 requires-python = ">=3.10,<4.0"
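
This commit changes the version in two places: the top heading of CHANGELOG.md and the `version` field in pyproject.toml. A minimal sketch of a consistency check between the two is shown below. It is not part of the commit or of workrb; the function names are hypothetical, and the regular expressions assume the heading and field formats visible in this diff.

```python
import re
from typing import Optional


def changelog_version(changelog_text: str) -> Optional[str]:
    """Return the version of the first '## vX.Y.Z' heading, or None if absent."""
    match = re.search(r"^## v(\d+\.\d+\.\d+)", changelog_text, flags=re.MULTILINE)
    return match.group(1) if match else None


def pyproject_version(pyproject_text: str) -> Optional[str]:
    """Return the first 'version = "..."' value, or None if absent.

    Assumption: a line-based regex is enough for the simple layout in this
    diff; a real TOML parser would be more robust.
    """
    match = re.search(r'^version\s*=\s*"([^"]+)"', pyproject_text, flags=re.MULTILINE)
    return match.group(1) if match else None


def versions_agree(changelog_text: str, pyproject_text: str) -> bool:
    """True only when both versions are present and identical."""
    found_in_changelog = changelog_version(changelog_text)
    found_in_pyproject = pyproject_version(pyproject_text)
    return found_in_changelog is not None and found_in_changelog == found_in_pyproject
```

A check like this could run in CI after a bump so that a release with mismatched version strings is caught early.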
