Allow workflows with formulas #59

topepo · 2024-01-29T19:37:15Z

Closes #56

The main API change is that a fitted workflow would now be required. That made the changes a little more invasive than I hoped they would be since it affects documentation and testing.

I could make the testing code a little more simple with some changes that would avoid a bunch of expect_type() and expect_s3_class() calls. Now sure how much more you would like me to make more changes.

On the bright side:

library(tidymodels)
library(workboots)

car_subset <- mtcars[, c("mpg", "disp", "wt")]
lm_wflow <- workflow(mpg ~ ., parsnip::linear_reg())
lm_fit <- fit(lm_wflow, car_subset)
new_car <- data.frame(disp = 150.0, wt = 2.5)

set.seed(1)
new_car_pred <-
  predict_boots(
    workflow = lm_fit,
    n = 2000,
    training_data = car_subset,
    new_data = new_car
  ) %>% 
  summarise_predictions()

new_car_pred
#> # A tibble: 1 × 5
#>   rowid .preds               .pred .pred_lower .pred_upper
#>   <int> <list>               <dbl>       <dbl>       <dbl>
#> 1     1 <tibble [2,000 × 2]>  23.9        17.7        29.8

^{Created on 2024-01-29 with reprex v2.0.2}

topepo · 2024-01-29T19:39:05Z

R/standalone-input-names.R

@@ -0,0 +1,83 @@
+# ---
+# repo: tidymodels/workflows


This "standalone" file is in the next version of workflows and is designed to be added to other packages. It is tested there and the nocov directives prevent it from decreasing code coverage here.

topepo · 2024-01-29T19:39:55Z

tests/testthat/test-predict-boots.R

@@ -1,19 +1,17 @@
-# read in data to use in tests


I moved these to a "helper.R" file. Since it starts with "helper" it gets executed prior to each test file.

topepo · 2024-01-29T19:41:10Z

tests/testthat/test-predict-boots.R

  )

  # predictors missing from new_data
-  expect_error(
+  expect_snapshot(


I moved most of the expect_error() code to expect_snapshot() since it tends to be more robust to unrelated changes to the package.

topepo · 2024-01-29T19:42:22Z

tests/testthat/test-predict-boots.R

  )

+  # tests
+  expect_s3_class(x, c("tbl_df", "tbl", "data.frame"))


We could reduce a lot of these by making a "ptype" list which is just a zero-row slice of the data (e.g. x[0,]). That contains the classes of the overall object as well as the columns.

topepo · 2024-01-29T19:43:08Z

vignettes/Estimating-Linear-Intervals.Rmd

@@ -235,7 +237,7 @@ ames_boot_conf_int %>%
  geom_point(aes(y = Sale_Price),
             alpha = 0.25) +
  scale_x_log10(labels = scales::comma_format()) +
-  scale_y_log10(labels = scales::label_number(scale_cut = scales::cut_short_scale())) +


These were causing an error. I, for the life of me, could not find a way to resolve it. I might be able to ask Thomas about it.

topepo added 6 commits January 29, 2024 13:46

use column name API from workflows

db1dada

remove cut_short_scale() to resolve errors

1433ccb

update docs and add cli import

69da438

use fitted workflow in vignettes

fcae8d4

more to snapshot testing and help file

b248f8e

add tests for formulas

388e278

topepo commented Jan 29, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow workflows with formulas #59

Allow workflows with formulas #59

topepo commented Jan 29, 2024

topepo Jan 29, 2024

topepo Jan 29, 2024 •

edited

Loading

topepo Jan 29, 2024

topepo Jan 29, 2024

topepo Jan 29, 2024

Allow workflows with formulas #59

Are you sure you want to change the base?

Allow workflows with formulas #59

Conversation

topepo commented Jan 29, 2024

topepo Jan 29, 2024

Choose a reason for hiding this comment

topepo Jan 29, 2024 • edited Loading

Choose a reason for hiding this comment

topepo Jan 29, 2024

Choose a reason for hiding this comment

topepo Jan 29, 2024

Choose a reason for hiding this comment

topepo Jan 29, 2024

Choose a reason for hiding this comment

topepo Jan 29, 2024 •

edited

Loading