add tests that verify behavior of generated code + generator errors/warnings #1156

Open
eli-bl wants to merge 15 commits into main from live-generated-code-tests

Conversation

@eli-bl (Collaborator) commented Oct 31, 2024

This creates two new categories of end-to-end tests that I'm calling "functional" tests:

  1. Tests that run the generator against a valid spec (defined inline in the test code, specific to each test class, rather than in large spec files) and then import and execute pieces of the generated code. Unlike the "golden record"-based tests, these don't make assertions about the exact implementation of the generated code; they treat it as a black box and verify its behavior. These are in functional_tests/generated_code_execution.
  2. Tests that run the generator against an invalid spec (again defined inline) and make assertions about its error/warning output. These are in functional_tests/generator_failure_cases. The reason I've put these in a separate directory is that they do not involve executing generated code; they're basically black-box tests of the generator.

I've added a bunch of tests in both categories, to cover generated model classes. There aren't currently any tests of endpoint behavior, but I think those could be pretty easily implemented using a mock HTTP client.
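
To make the first category concrete, here is a rough sketch of what such a test could look like. The names are illustrative rather than taken from this PR: `generate_and_import_model` is a hypothetical fixture standing in for whatever helper actually runs the generator against the inline spec and imports the result.

```python
# Illustrative sketch of a "generated code execution" test (names are
# hypothetical, not the actual helpers in this PR). The fixture is assumed to
# run the generator against the inline spec in a temporary project and return
# the generated model class.
MODEL_SPEC = """
openapi: "3.0.0"
info: {title: test, version: "1.0"}
paths: {}
components:
  schemas:
    MyModel:
      type: object
      properties:
        name: {type: string}
        count: {type: integer}
"""


def test_model_round_trip(generate_and_import_model):
    MyModel = generate_and_import_model(MODEL_SPEC, "MyModel")

    instance = MyModel.from_dict({"name": "a", "count": 3})
    assert instance.name == "a"
    assert instance.count == 3
    # behavior, not implementation: we only care that round-tripping works
    assert instance.to_dict() == {"name": "a", "count": 3}
```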

The intended benefits are:

  • These tests prove that the generated code behaves as intended from the point of view of an application using the client. That's unlike the existing end-to-end tests, where we verify what the code looks like and only infer what it will do, and unlike the unit tests of the generator, where we verify the logic of the internal data structures that will be used to generate the code.
  • They're more convenient for rapid development of a feature, as tests are lighter-weight and self-contained.
  • They can replace many (most?) of the current low-level unit tests, which are too tightly coupled to internal implementation details and difficult to maintain.
  • In case of a regression in the generator, failures in these tests can be more specific and easier to interpret. The "golden record"-based tests would also fail, but the output from those is often quite verbose and it can be hard to see which part of the diff is relevant, especially if the error is duplicated in many model/endpoint files.

This PR removes quite a few low-level unit tests that are now redundant; the coverage check in CI confirms that the relevant code paths are still covered.

I believe we could also greatly reduce the size of the "golden record"-based tests, shrinking the current very large specs so they cover a narrower range of things, like the overall code structure of the generated client. I think it is safe to say that tests for questions like "does a property with such-and-such schema get correctly translated into a model class attribute, with the right serialization/deserialization logic?" would be better handled in the new functional test style. But I haven't removed any of those tests in this PR, since that's a bigger task.
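
As an illustration of the second category (generator failure cases), a test might look roughly like the following. Again the names are hypothetical: `run_generator_expecting_errors` stands in for a helper that invokes the generator against the inline spec and returns the captured error/warning messages.

```python
# Illustrative sketch of a "generator failure case" test (names are
# hypothetical). The generator is run against an intentionally broken spec and
# the test asserts on its error output; no generated code is executed.
BROKEN_SPEC = """
openapi: "3.0.0"
info: {title: test, version: "1.0"}
paths: {}
components:
  schemas:
    BadModel:
      type: object
      properties:
        value:
          $ref: "#/components/schemas/DoesNotExist"
"""


def test_invalid_reference_is_reported(run_generator_expecting_errors):
    errors = run_generator_expecting_errors(BROKEN_SPEC)
    assert any("DoesNotExist" in message for message in errors)
```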

@@ -168,18 +145,17 @@ def test_literal_enums_end_to_end():
)
)
def test_meta(meta: str, generated_file: Optional[str], expected_file: Optional[str]):
@eli-bl (Collaborator, Author) commented:

I've rewritten this and other functions in this file to use the same generate_client context manager, even though the logic in these tests is fairly simple, for two reasons:

  1. It enforces consistent state cleanup. Previously, failures in some of these tests would leave generated files behind in the project directory.
  2. I think it makes the tests a bit easier to read by reducing boilerplate, so that what's left is the logic specific to each test (see the usage sketch after this list).
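
A minimal sketch of the pattern, assuming a shape for generate_client that may differ from the real helper in end_to_end_tests:

```python
# Minimal sketch of the generate_client pattern described above; the real
# context manager's signature and return value may differ. The point is that
# the generated project is always cleaned up, even if an assertion fails.
import shutil
import tempfile
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def generate_client(extra_args):
    output_path = Path(tempfile.mkdtemp())
    try:
        _run_generator(extra_args, output_path)  # hypothetical helper that invokes the CLI
        yield output_path
    finally:
        # cleanup runs on failure too, so no generated files are left behind
        shutil.rmtree(output_path, ignore_errors=True)


def test_meta_poetry():
    with generate_client(["--meta", "poetry"]) as project_dir:
        assert (project_dir / "pyproject.toml").exists()
```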

eli-bl requested a review from dbanty on October 31, 2024 at 22:44
@eli-bl (Collaborator, Author) commented Oct 31, 2024

By the way, I think this mechanism would be especially useful for testing something like my discriminator PR, #1156, since that is ultimately about validation behavior.

eli-bl force-pushed the live-generated-code-tests branch 2 times, most recently from 58b3950 to 27478a5 on November 8, 2024 21:18
eli-bl force-pushed the live-generated-code-tests branch 2 times, most recently from 2370ba2 to e57aa39 on November 8, 2024 22:43
eli-bl force-pushed the live-generated-code-tests branch 5 times, most recently from e87dca8 to b21d2e6 on November 11, 2024 23:13
eli-bl changed the title from "add tests that verify actual behavior of generated code" to "add tests that verify behavior of generated code + generator errors/warnings" on Nov 11, 2024
eli-bl force-pushed the live-generated-code-tests branch 2 times, most recently from 52a235c to e369b2e on November 12, 2024 02:29
```diff
@@ -11,7 +11,7 @@ jobs:
   test:
     strategy:
       matrix:
-        python: [ "3.8", "3.9", "3.10", "3.11", "3.12", "3.13" ]
+        python: [ "3.9", "3.10", "3.11", "3.12", "3.13" ]
```
@eli-bl (Collaborator, Author) commented:

I know there's another open PR for dropping 3.8, but I had to remove it from the testing matrix here because 3.8 has some differences in the typing module that would make some of the new test assertions a pain to implement.
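
For illustration, this is the kind of 3.8-vs-3.9 gap involved (an assumption on my part, not necessarily the exact issue these tests hit): PEP 585 built-in generics such as list[str] only exist at runtime on 3.9+, so assertions that compare generated type annotations would need version-specific branches on 3.8.

```python
# Hypothetical illustration of a typing difference between 3.8 and 3.9+; an
# assumed example, not necessarily the one that affected these tests.
# On 3.9+ a test can build built-in generics directly; on 3.8, evaluating
# list[str] raises TypeError, so the typing.List spelling is required.
import sys


def expected_list_of_str():
    if sys.version_info >= (3, 9):
        return list[str]  # PEP 585 generic, available from Python 3.9
    from typing import List

    return List[str]  # the runtime-valid spelling on 3.8
```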
