Contributing to dbt-bouncer
#
dbt-bouncer
is open source software. Whether you are a seasoned open source contributor or a first-time committer, we welcome and encourage you to contribute code, documentation, ideas, or problem statements to this project.
About this document#
There are many ways to contribute to the ongoing development of dbt-bouncer
, such as by participating in discussions and issues.
The rest of this document serves as a more granular guide for contributing code changes to dbt-bouncer
(this repository). It is not intended as a guide for using dbt-bouncer
, and some pieces assume a level of familiarity with Python development (virtualenvs, Poetry
, etc). Specific code snippets in this guide assume you are using macOS or Linux and are comfortable with the command line.
If you get stuck, we're happy to help! Just open an issue or draft PR and we'll do our best to help out.
Note#
- Branches: All pull requests from community contributors should target the
main
branch (default).
Getting the code#
Installing git#
You will need git
in order to download and modify the dbt-bouncer
source code. On macOS, the best way to download git is to just install Xcode.
Contributors#
You can contribute to dbt-bouncer
by forking the dbt-bouncer
repository. For a detailed overview on forking, check out the GitHub docs on forking. In short, you will need to:
- Fork the
dbt-bouncer
repository. - Clone your fork locally.
- Check out a new branch for your proposed changes.
- Push changes to your fork.
- Open a pull request against
godatadriven/dbt-bouncer
from your forked repository.
Setting up an environment#
There are some tools that will be helpful to you in developing locally. While this is the list relevant for dbt-bouncer
development, many of these tools are used commonly across open-source python projects.
Tools#
These are the tools used in dbt-bouncer
development and testing:
click
to create our CLI interface.- GitHub Actions for automating tests and checks, once a PR is pushed to the
dbt-bouncer
repository. make
to run multiple setup or test steps in combination.mypy
for static type checking.Poetry
to manage our python virtual environment.pre-commit
to easily run those checks.Pydantic
to validate our configuration file.pytest
to define, discover, and run tests.Ruff
to lint and format python code.
A deep understanding of these tools in not required to effectively contribute to dbt-bouncer
, but we recommend checking out the attached documentation if you're interested in learning more about each one.
Virtual environments#
We strongly recommend using virtual environments when developing code in dbt-bouncer
. We recommend creating this virtualenv in the root of the dbt-bouncer
repository. To create a new virtualenv, run:
This will create a new Python virtual environment.
Setting environment variables#
Set required environment variables by copying .env.example
to .env
and updating the values.
Running dbt-bouncer
in development#
Installation#
First make sure that you set up your virtualenv
as described in Setting up an environment. Next, install dbt-bouncer
, its dependencies and pre-commit
:
When installed in this way, any changes you make to your local copy of the source code will be reflected immediately in your next dbt-bouncer
run.
Running dbt-bouncer
#
With your virtualenv activated, the dbt-bouncer
script should point back to the source code you've cloned on your machine. You can verify this by running which dbt-bouncer
. This command should show you a path to an executable in your virtualenv. You can run dbt-bouncer
using the provided example configuration file via:
Testing#
Once you're able to manually test that your code change is working as expected, it's important to run existing automated tests, as well as adding some new ones. These tests will ensure that: - Your code changes do not unexpectedly break other established functionality - Your code changes can handle all known edge cases - The functionality you're adding will keep working in the future
Note#
- Generating dbt artifacts: If you change the configuration of the dbt project located in
dbt_project
then you will need to re-generate the dbt artifacts used in testing. To do so, run:
Test commands#
There are a few methods for running tests locally.
makefile
#
There are multiple targets in the makefile
to run common test suites, most notably:
# Runs unit tests
make test-unit
# Runs integration tests
make test-integration
# Runs all tests
make test
pre-commit
#
pre-commit
takes care of running all code-checks for formatting and linting. Run poetry run pre-commit install
to install pre-commit
in your local environment. Once this is done you can use the git pre-commit hooks to ensure proper formatting and linting.
pytest
#
Finally, you can also run a specific test or group of tests using pytest
directly. With a virtualenv active and dev dependencies installed you can do things like:
# run all unit tests in a file
poetry run pytest ./tests/unit/checks/catalog/test_columns.py
# run a specific unit test
poetry run pytest ./tests/unit/checks/catalog/test_columns.py::test_check_columns_are_documented_in_public_models
See pytest usage docs for an overview of useful command-line options.
Assorted development tips#
- Append
# type: ignore
to the end of a line if you need to disablemypy
on that line, preferably with the specific rule to ignore such as# type: ignore[union-attr]
.
Adding a new check#
To add a new check follow the below steps:
- In
./src/dbt_bouncer/checks
choose the appropriate directory for your check. For example, if your check only requires themanifest.json
then use themanifest
directory, if your check requires thecatalog.json
then use thecatalog
directory. - Within the chosen directory assess if a suitable file already exists. For example, if your check applies to a model then
manifest/check_models.py
is a suitable location. -
Within the chosen file, add a Pydantic model, this object must meet the following criteria:
- Start with "Check".
- Inherit from
dbt_bouncer.check_base.BaseCheck
. - Have a
name
attribute that is a string whose value is the snake case equivalent of the class name. - A
default
value provided for optional input arguments and arguments that are received at execution time. - Have a doc string that includes a description of the check, a list of possible input parameters and at least one example.
- A clear message in the event of a failure.
-
After the check is added, add the check to
dbt-bouncer-example.yml
and rundbt-bouncer --config-file dbt-bouncer-example.yml
to ensure the check succeeds. - (Optional) If the dbt project located in
./dbt_project
needs to be updated then do so and also runmake build-artifacts
to generate the new test artifacts. - Add at least one happy path and one unhappy path test to
./tests
. The appropriate test file will be the one matching the directory of the check. For example, if the check is in./src/dbt_bouncer/checks/catalog/check_columns.py
then the test file will be./tests/unit/checks/catalog/test_columns.py
. - Run
make test
to ensure the tests pass. - Open a PR 🎉!
Submitting a Pull Request#
Code can be merged into the current development branch main
by opening a pull request. If the proposal looks like it's on the right track, then a dbt-bouncer
maintainer will review the PR. They may suggest code revision for style or clarity, or request that you add unit or integration test(s). These are good things! We believe that, with a little bit of help, anyone can contribute high-quality code. Once merged, your contribution will be available for the next release of dbt-bouncer
.
Automated tests run via GitHub Actions. If you're a first-time contributor, all tests will require a maintainer to approve.
Once all tests are passing and your PR has been approved, a dbt-bouncer
maintainer will merge your changes into the active development branch. And that's it! Happy developing :tada: