Issue #604 - Add support for nominal forecasts #837

nikosbosse · 2024-06-02T15:34:53Z

Description

This PR closes #604.

Nominal forecasts are forecasts for outcomes that can fall in one of several unordered categories. This PR implements support for nominal forecasts (see #604, #607, and #608).

Specifically, the PR

- creates a new forecast_nominal class with
  - an assert_input_nominal function that checks the inputs passed to a scoring function
  - a check_input_nominal function that does the same without throwing an error. UPDATE: I think I deleted this because I didn't use it for any checks. See "Clean up input checks" (#840) for some discussion on when to check what.
  - an assert_forecast.forecast_nominal method that checks that a data.table complies with the required input format
  - a default list of metrics, provided via metrics_nominal
  - a new method score.forecast_nominal
- adds new example data
- updates as_forecast() to accept a new predicted_label argument
- updates get_forecast_type() and adds a check function to make sure that the forecast type is nominal
- implements the log score for nominal forecasts
- adds tests
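
To illustrate the log score mentioned in the list above, here is a minimal base-R sketch. It is not the package's implementation. It assumes the observed outcomes are a factor, the predictions are a matrix with one row per forecast and one column per factor level (in the same order as the levels), and each row sums to one. The function name `log_score_nominal_sketch` is hypothetical.

```r
# Hypothetical helper, not part of scoringutils: log score for nominal forecasts.
# observed:  factor of length n with k levels
# predicted: n x k matrix of probabilities, columns ordered like levels(observed)
log_score_nominal_sketch <- function(observed, predicted) {
  stopifnot(
    is.factor(observed),
    is.matrix(predicted),
    nrow(predicted) == length(observed),
    ncol(predicted) == nlevels(observed),
    all(predicted >= 0),
    all(abs(rowSums(predicted) - 1) < 1e-8)
  )
  # For each forecast, pick the probability assigned to the observed level
  idx <- cbind(seq_along(observed), as.integer(observed))
  # Negative log probability: smaller values mean better forecasts
  -log(predicted[idx])
}

observed <- factor(c("high", "low"), levels = c("low", "moderate", "high"))
predicted <- rbind(
  c(0.1, 0.2, 0.7),
  c(0.5, 0.3, 0.2)
)
scores <- log_score_nominal_sketch(observed, predicted)
```

Under the sign convention used in this sketch, a forecast that puts more probability on the category that actually occurred receives a lower score.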

Note:
Throughout the process, I noticed that, sadly, scoringutils is currently not "easily extensible". Making this work required jumping through quite a few hoops. Some of this will become simpler once we implement a separate as_forecast_nominal() function instead of a single as_forecast() function that has to do all the guesswork.

Still missing (likely for a future PR)

- Updating the manuscript to include nominal forecasts
- Other kinds of docs
  - Creating a vignette that walks through a hubVerse example
- A helper function that completes the forecast so that users don't have to specify every single option (see "Define input format for categorical forecasts", #608)
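
One possible shape for such a completion helper, sketched in base R. This is purely illustrative and rests on an assumption about what "completing" means: that categories a user did not list should be added with probability zero. The function name `complete_nominal_sketch` and the column names `model`, `predicted_label` and `predicted` are hypothetical choices for this example.

```r
# Hypothetical sketch, not part of scoringutils: add missing categories with
# probability 0 so that every forecast covers all outcome labels.
complete_nominal_sketch <- function(df, labels) {
  # All model/label combinations that a complete forecast would contain
  full <- expand.grid(
    model = unique(df$model),
    predicted_label = labels,
    stringsAsFactors = FALSE
  )
  out <- merge(full, df, by = c("model", "predicted_label"), all.x = TRUE)
  # Labels the user did not supply get probability zero
  out$predicted[is.na(out$predicted)] <- 0
  out$predicted_label <- factor(out$predicted_label, levels = labels)
  out <- out[order(out$model, out$predicted_label), ]
  rownames(out) <- NULL
  out
}

sparse <- data.frame(
  model = c("a", "a", "b"),
  predicted_label = c("low", "high", "moderate"),
  predicted = c(0.4, 0.6, 1),
  stringsAsFactors = FALSE
)
completed <- complete_nominal_sketch(sparse, c("low", "moderate", "high"))
```

Whether zero is the right fill value, and whether the helper should also check that probabilities sum to one, would be decisions for the actual implementation.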

One current code example:

# remotes::install_github("epiforecasts/scoringutils@multiclass")
library(dplyr)
library(hubExamples)
library(scoringutils)

# Keep only the categorical ("pmf") forecasts and the matching observations
pred <- hubExamples::forecast_outputs |>
  dplyr::filter(output_type == "pmf")
obs <- hubExamples::forecast_target_observations |>
  dplyr::filter(output_type == "pmf")
hubex <- dplyr::full_join(pred, obs)  # joins on all shared columns

categories <- c("low", "moderate", "high", "very high")

hubex |>
  dplyr::group_by(
    model_id, location, reference_date, horizon,
    target_end_date, target, output_type
  ) |>
  dplyr::mutate(
    # Replace the 0/1 indicator with the label of the observed category
    observation = output_type_id[observation == 1],
    observation = factor(observation, levels = categories),
    output_type_id = factor(output_type_id, levels = categories)
  ) |>
  as_forecast(
    model = "model_id", observed = "observation",
    predicted = "value", predicted_label = "output_type_id"
  ) |>
  score()
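
The mutate() step above turns the 0/1 indicator in the observation column into the label of the category that was observed, once per forecast group. The following base-R sketch shows the same recoding on a small made-up data frame. The toy data are invented for illustration and only mimic the column names used above.

```r
# Toy data (invented): two models, three categories, "moderate" was observed
toy <- data.frame(
  model_id = rep(c("m1", "m2"), each = 3),
  output_type_id = rep(c("low", "moderate", "high"), times = 2),
  value = c(0.2, 0.5, 0.3, 0.6, 0.3, 0.1),
  observation = rep(c(0, 1, 0), times = 2),
  stringsAsFactors = FALSE
)
lvls <- c("low", "moderate", "high")

# Within each group, replace the indicator with the observed category label
recoded <- do.call(rbind, lapply(split(toy, toy$model_id), function(g) {
  g$observation <- factor(g$output_type_id[g$observation == 1], levels = lvls)
  g$output_type_id <- factor(g$output_type_id, levels = lvls)
  g
}))
```

After this step every row of a group carries the same observed label, which is the layout passed to as_forecast() via observed and predicted_label in the example above.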

Checklist

My PR is based on a package issue and I have explicitly linked it.
I have included the target issue or issues in the PR title as follows: issue-number: PR title
I have tested my changes locally.
I have added or updated unit tests where necessary.
I have updated the documentation if required.
I have built the package locally and run rebuilt docs using roxygen2.
My code follows the established coding standards and I have run lintr::lint_package() to check for style issues introduced by my changes.
I have added a news item linked to this PR.
I have reviewed CI checks for this PR and addressed them as far as I am able.

nickreich

I didn't closely review much of the code related to the formal S3 class setup because I'm not that familiar with the structure and functions used there. But I reviewed the tests and the general setup of the nominal forecast type, and things look good to me, give or take a few very small optional suggested changes.

seabbs

This looks really good, I think, and also appears correct to me. I don't have substantive comments about this PR aside from one instance of missing docs.

I did, however, use it to review the changes currently needed to add a new class. This has improved by splitting out as_forecast, but there are still a few pain points. It looks like we can deal with nearly all of them using a bit more S3, which is great.

I think we have discussed this before, but I think this would be much easier to review and parse, and easier for someone new to do, if all the bits that define a specific as_forecast_<type> were in the same file rather than being split by generic method.
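
As a rough illustration of the per-type file layout suggested above, the sketch below keeps the constructor and all S3 methods for one toy forecast type together. Every name in it (`assert_forecast_sketch`, `score_sketch`, `as_forecast_toy`, the class `forecast_toy`, the file name in the comment) is hypothetical and not the scoringutils API. The absolute error is used only as a stand-in metric.

```r
# Generics: in a real package these would live in a shared file
assert_forecast_sketch <- function(x, ...) UseMethod("assert_forecast_sketch")
score_sketch <- function(x, ...) UseMethod("score_sketch")

# --- hypothetical file R/forecast-toy.R: everything for one type together ---

# Constructor: attach the class, then validate
as_forecast_toy <- function(df) {
  x <- structure(df, class = c("forecast_toy", class(df)))
  assert_forecast_sketch(x)
  x
}

# Validation method for this type
assert_forecast_sketch.forecast_toy <- function(x, ...) {
  stopifnot(all(c("observed", "predicted") %in% names(x)))
  invisible(x)
}

# Scoring method for this type (absolute error as a placeholder metric)
score_sketch.forecast_toy <- function(x, ...) {
  abs(x$observed - x$predicted)
}

fc <- as_forecast_toy(data.frame(observed = c(1, 2), predicted = c(1.5, 2)))
res <- score_sketch(fc)
```

With this layout, adding a new forecast type would mean adding one new file rather than touching one file per generic.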

nikosbosse · 2024-08-04T14:08:37Z

@seabbs some excellent points in your review here. I think moving towards as_forecast_<type>() really was the right call and should allow us to simplify things here quite a bit.

I suggest addressing your points before implementing ordinal forecasts (pinging @nickreich and @elray1 for awareness) as that will make it easier to create the new ordinal class. Since Nick and Evan care about the ordinal forecasts more than the nominal ones I also suggest addressing your points before merging this.

seabbs · 2024-08-06T08:43:56Z

> I also suggest addressing your points before merging this.

I don't mind either way here but I agree it would be a good idea to use the ordinal forecasts as a test case. If it were me I think I would look to merge this, make a new issue with the pain points identified, address in a PR, and then implement ordinal?

nikosbosse added 10 commits January 17, 2024 11:42

Add skeleton for a score method for categorical forecasts

d930c6a

skeleton for validate_forecast method for categorical forecasts

be05422

add skeleton for default scoring rules for categorical forecasts

b1ac284

empty skeleton for check functions for categorical forecasts

d0f57fb

fix merge conflict

b0ee467

Merge remote-tracking branch 'origin/main' into multiclass # Conflicts: # R/default-scoring-rules.R # R/validate.R

implement nominal forecast class

3c5c23e

fix issues

34be31a

make code work

bf71982

add example data

5eadf04

fix warnings

5e80e29

nikosbosse changed the title ~~Multiclass DON'T MERGE~~ Draft for supporting nominal forecasts Jun 7, 2024

nikosbosse marked this pull request as draft June 7, 2024 14:47

nikosbosse and others added 16 commits June 7, 2024 22:04

add tests

907511d

Refine tests and docs

4681f92

improve tests

b5ef322

fix linting issues

d666e64

fix issues

9aa1e0a

try fix for failing test

0f15226

try fixing test again...

fec609f

update test to work with old R version

14a5fb7

round and round and round it goes

dfc6c50

Merge branch 'main' into multiclass

13cd0f9

Require R4.0

1cad15a

remove R3.6 from CI checks

de851e3

update NEWS file

be1eab0

add CI check for 4.0 back in

efa1ec7

fix typo in news file

3bd0db6

update manual figure

dd6cc28

nikosbosse changed the title ~~Draft for supporting nominal forecasts~~ Issue #604 - Add support for nominal forecasts Jun 14, 2024

nikosbosse requested a review from nickreich June 14, 2024 05:25

nikosbosse and others added 8 commits July 21, 2024 22:59

update docs

7b65d58

fix linter issue

bac88d9

fix tests

7ff58b8

update tests

708ad74

update tests

9082a0a

use magrittr pipe

746b3c8

update docs

340e9d5

Merge branch 'main' into multiclass

4adee5a

nikosbosse requested a review from seabbs July 23, 2024 08:13

nickreich reviewed Jul 26, 2024

nickreich reviewed Jul 26, 2024

nickreich approved these changes Jul 26, 2024

nikosbosse and others added 4 commits July 27, 2024 11:00

address comments from Nick

92c8d80

Merge branch 'main' into multiclass

b800007

fix test

fd36025

Merge branch 'main' into multiclass

dedbc97

seabbs approved these changes Jul 30, 2024

seabbs mentioned this pull request Jul 30, 2024

evaluation for ordinal predictions #846

Open

Merge branch 'main' into multiclass

7ac15f1

This was referenced Aug 10, 2024

Get rid of get_forecast_type() #887

Closed

Rework get_duplicate_forecasts() to S3 to avoid hard-coding columns and types #888

Open

Make get_protected_columns() S3 to avoid hard-coding things #889

Open

update docs

013816a

nikosbosse mentioned this pull request Aug 10, 2024

Issue #481: Update pkgdown structure #886

Merged

9 tasks

nikosbosse and others added 2 commits August 10, 2024 13:33

Merge branch 'main' into multiclass

8d78327

Update docs

f453895

nikosbosse merged commit 867a2ff into main Aug 10, 2024
9 checks passed

nikosbosse deleted the multiclass branch August 10, 2024 11:42
