Optimize Shinytest2: Speed Up Your Shiny App Tests
Learn how to speed up Shiny app tests using shinytest2 by reusing application instances and refreshing the browser.
A good rule of thumb when testing is to make sure that each test case is independent of the others.
Having setup, execution, assertion, and teardown in each test case ensures that:
- Tests are easier to understand – all required code is contained inside a test case.
- Tests are easier to maintain – we can refactor tests without affecting others.
- Tests won’t fail due to side effects – each test case is an independent unit.
When testing Shiny applications or Shiny modules with shinytest2
, a test case is independent is when we:
- start the application in each test case,
- stop the application at the end of each test case.
Starting up the application can be time-consuming, especially if there is an expensive setup, like loading data or connecting to a database.
To reduce the time it takes to run tests, we could reuse the same application instance between test cases. But how do we ensure that the application is in a clean state before each test case? How do we ensure that each test case is independent?
In some cases, it should be enough to refresh the browser with the server still running.
This is how Cypress works, you just visit the page where it’s served and do your tests. You can refresh the page to restore the initial state and move on to the next test case.
There are other optimization techniques, like reducing the scope of the test or parallelization, but let’s focus on the effect of reusing the same application instance between test cases of what is the potential impact.
Let’s see how we can introduce this approach to test a simple counter app.
Let’s describe this app with tests.
library(testthat)
describe("CounterPage", {
it("should increment the counter once", {
# Arrange
page <- open_page()
# Act
page$increment()
# Assert
page$expect_value("1")
})
it("should increment the counter many times", {
# Arrange
page <- open_page()
# Act
page$increment()
page$increment()
# Assert
page$expect_value("2")
})
})
There are three distinct actions in each test case:
- Arrange – open the page,
- Act – increment the counter,
- Assert – check if the counter has the expected value.
We don’t know the implementation of those methods yet, we don’t even have the code of the app yet, but we’ve described the behavior of the page.
Having at least one test, we can move to the implementation of the page object, and then the app.
To abstract common operations made on any page, we can create a class that represents a page, it’s a page that can only be reloaded or closed. We will be able to reuse this object for all module tests as our project grows. It also bundles shinytest2
driver and selenider
session in one structure, so that they’re easier to access.
The reloading of the page can be implemented with reload
method of the browser or via a dedicated function from {selenider}
.
Page <- R6::R6Class(
public = list(
driver = NULL,
selenider = NULL,
initialize = function(app) {
self$driver <- shinytest2::AppDriver$new(app)
self$selenider <- selenider::selenider_session(
driver = self$driver,
local = FALSE
)
},
# Refresh the page
reload = function() {
self$driver$run_js("window.location.reload()")
# Or we can use:
# selenider::reload(self$selenider)
},
# Close the connection, free resources
stop = function() {
self$driver$stop()
}
)
)
Let’s add page-specific methods.
Extending the base page object, we add methods to interact with the specific app/module we’re developing. We introduce methods that we described the page with using tests. The two things we can do on that page is increment the counter and expect that it bumps the number.
We assume there will be an element with a data-test
attribute that increments the counter and an element with a data-test
attribute that displays the counter value.
We can use functions from {selenider}
to interact with the page. It provides functions that allow more control over interactions with the browser than shinytest2
.
CounterPage <- R6::R6Class(
inherit = Page,
public = list(
increment = function() {
self$selenider |>
selenider::find_element("[data-test=increment]") |>
selenider::elem_click()
},
expect_value = function(target) {
self$selenider |>
selenider::find_element("[data-test=increment-value]") |>
selenider::elem_expect(\(x) selenider::has_exact_text(x, target))
}
)
)
💡 Notice how this approach is separated from components IDs and Shiny namespaces.
It doesn’t matter if it’s a nested module or a standalone app. Targeting elements in tests is separated from IDs in production code. This approach allows us to grow the production code, without having to go back and refactor selectors in tests.
See how to do it for any component in Shiny.
The last function we need in tests is open_page
function. It only initializes the drivers and stops the app just before the test case finishes running.
open_page <- function(app = test_app(), page_object = CounterPage) {
page <- page_object$new(app)
withr::defer(page$stop(), parent.frame(2))
page
}
Having code that drives the app in place, we can implement the app itself.
In this case, it’s a simple Shiny app, but it could also be a Shiny module.
test_app <- function() {
shiny::shinyApp(
ui = shiny::fluidPage(
shiny::actionButton(
"btn",
label = "Calculate",
`data-test` = "increment"
),
shiny::textOutput(
"text",
container = function(...) {
shiny::div(
`data-test` = "increment-value",
...
)
}
)
),
server = function(input, output, session) {
output$text <- shiny::renderText(input$btn)
}
)
}
Putting this all together satisfies tests. 🎉
- Each test case initializes its own app.
- Each test cleans up after itself.
- It works as expected, tests are green 🟢.
Now let’s see how we can optimize it.
By modifying the open_page
function we can make it initialize the page only once.
We can create a closure that will initialize the driver only on the first run and refresh the page on subsequent runs before returning the page object.
make_page <- function(app = test_app(), page_object = CounterPage) {
page <- NULL
function() {
if (is.null(page)) {
page <<- page_object$new(app)
} else {
page$reload()
}
page
}
}
With this approach, we can initialize the page once for a series of test cases. Notice how the code inside test cases doesn’t change, we only added 2 lines in the common part of the setup.
describe("CounterPage", {
open_page <- make_page()
withr::defer(open_page()$stop())
it("should increment the counter once", {
# Arrange
page <- open_page()
# Act
page$increment()
# Assert
page$expect_value("1")
})
it("should increment the counter many times", {
# Arrange
page <- open_page()
# Act
page$increment()
page$increment()
# Assert
page$expect_value("2")
})
})
After this change, all test cases reuse the same application instance, but each test case is still independent. The page is refreshed in the setup part of each test case. There is only one cleanup, just before we exit the describe
block.
Let’s compare the performance of the two approaches.
Benchmark code
result <- bench::mark(
iterations = 10,
unoptimized = describe("CounterPage", {
it("should increment the counter once", {
# Arrange
page <- open_page(test_app())
# Act
page$increment()
# Assert
page$expect_value("1")
})
it("should increment the counter many times", {
# Arrange
page <- open_page(test_app())
# Act
page$increment()
page$increment()
# Assert
page$expect_value("2")
})
}),
optimized = describe("CounterPage", {
open_page <- make_page(test_app())
withr::defer(open_page()$stop())
it("should increment the counter once", {
# Arrange
page <- open_page()
# Act
page$increment()
# Assert
page$expect_value("1")
})
it("should increment the counter many times", {
# Arrange
page <- open_page()
# Act
page$increment()
page$increment()
# Assert
page$expect_value("2")
})
})
)
result |>
dplyr::select(expression, min:total_time) |>
knitr::kable()
expression | min | median | itr/sec | mem_alloc | gc/sec | n_itr | n_gc | total_time |
---|---|---|---|---|---|---|---|---|
unoptimized | 2.89s | 3.39s | 0.2984827 | 207MB | 1.999834 | 10 | 67 | 33.5s |
optimized | 1.63s | 2.09s | 0.4800020 | 114MB | 2.160009 | 10 | 45 | 20.8s |
We can see that the initialization takes an additional second of running, even for this simple, small example. With more complex applications/modules and more test cases added, the startup of the app can significantly increase the time it takes to finish running the tests.
The quicker the feedback from tests, the more useful they are.
This technique can be an easy starting point for tests optimization.
- Writing tests first made us abstract each action. Tests don’t know how the page is provided, it can be started up for each case, or reloaded. It doesn’t impact the test code, it only knows that the page is opened.
- It takes a moment to start a Shiny server, a page refresh can be a quicker option if it ensures clean state of the app.
- Refreshing the page instead of starting up a new application is a simple change if the test setup code is abstracted well. It can be a good starting point for tests optimization.
- After refreshing the page, some methods of
shinytest2::AppDriver
no longer work (or at least I couldn’t get them to work), but functions from {selenider} work. This might have something to do with howAppDriver
methods communicates with the browser:
Error in
app_get_values()
: ! Shiny server returned 404 for values URL: http://127.0.0.1:6228/session/2bc7bb3e48e6a94ec7f3b3a61f914eb2/dataobj/shinytest?w=&nonce=ad001b462&input=1&output=1&export=1&format=rds&sortC=1 ℹ Isshiny::runApp(test.mode = TRUE)
enabled?
- When running
testthat
tests in parallel, test files are run in parallel. Sharing an app instance between test files introduces a risk of tests interfering with each other, leading to flaky tests. - If there is logic that restores the state of the app after a refresh implemented, when the app stores some persistent data that will be displayed during the next session, you won’t obtain a clean state by only refreshing the page.