Add `ahisto_named_entity_search.recognition.evaluator`

Adds the `AggregateMeanFScore` class, which can be used to evaluate NER
models on different categories and to aggregate the scores for clusters
of categories such as B-PER and I-PER.
2 jobs for main
in 8 minutes and 42 seconds, using 0.0 compute credits, and was queued for 9 seconds