Rename `AggregateMeanFScore` to `AggregateMeanFScoreEvaluator`

2 jobs for main
in 9 minutes, using 0.0 compute credits, and was queued for 9 seconds