T-SQL coverage monitoring

TL;DR

  • Add a child sqlproj with tSQLt tests.

  • When building a pull request, we build both projects and get dacpacs that take the changes from the current branch into account.

  • We create a database clone on the fly and deploy the test project's dacpac into this clone.

  • We run tests and collect coverage through Extended Events using the SqlCover utility.

  • We convert the collected data into the formats expected by TeamCity and SonarQube.


Building CI/CD for databases is not a trivial task in any case. If the database project contains nothing but tables, then, in principle, we could stop at introducing sqlpackage. But if stored procedures, triggers and functions are written in the database, then we would like to test that code before publishing it to production. In this article I would like to share some details of implementing unit tests in the pipeline for databases with T-SQL code. We really do run the tests while building every pull request in our sqlproj repositories, but it was not all that simple.

Project preparation

In the world of T-SQL there are no alternatives: the only living framework is tSQLt. Even the folks from Microsoft already openly refer to it in conversations about a testing framework for SSDT. So we focus on tSQLt.

A sensible organization within the solution is very similar to what is done in .NET and other languages and frameworks: there is a main project and an additional, child project with tests that references the main one.

So we create two sqlproj projects. To keep the test code from turning into a meaningless standalone database, we set its reference to the main project as Same Database. This achieves two goals at once: the test code can reference the main project's objects directly, and on deployment everything still lands in a single database.

The dialog for adding a new database reference in an SSDT project. Note the value chosen in the Database location field.

The tSQLt framework itself has to be referenced from the same test project. The framework will be needed in every project with tests, and most likely you will be making your own improvements to its code. Therefore I recommend forking the source, packaging it as a local SSDT project just like all the other databases, building it into a dacpac and sharing it across all your T-SQL projects. The framework code obviously has to live in the same database as the tests, so we reference it the same way as the main project, with the Same Database option, but not as a project with sources: as a dacpac file.

This is what the dependencies in a test project should look like. On the right is a fragment of the AdventureWorksTest.sqlproj file.

After building the test project you get a dacpac containing everything: your objects, the tests and the tSQLt framework itself.

This fat dacpac will be deployed to the environments where these tests are supposed to run.

To understand whether the main project is in a deployable state, it is better to deploy its own dacpac first, and only then deploy the dacpac with tests on top of that first deployment. Otherwise a dacpac that nobody has ever tried to deploy will end up in production, and nobody needs that kind of surprise.

Test isolation

T-SQL code differs noticeably from code in JS, .NET, Python and many other languages in that it can only be executed on the DBMS side. That is, the code first has to be deployed somewhere, and testing is obviously possible only on a running database instance. One could speculate about spinning up an instance on the fly in some Docker container, if only the T-SQL coding tradition were not to slice the system into many databases and then work with all of those databases back and forth and diagonally from a single stored procedure. A database with that kind of spaghetti cannot be deployed in isolation to an empty instance: it needs all of its familiar surroundings, and every reference must resolve.

Under such circumstances, tests can only run on a pre-prepared server with the set of databases that make up a full system under test. Here anyone who wants to do anything runs into the fact that their actions affect other users and developers, while they themselves constantly feel the presence and activity of other people in the database. If we return to the original desire to test pull requests in parallel, it becomes clear that different programmers can affect not only the data but also the code and the structure of the database, and such changes cannot coexist in the same database at the same time.

Searching for a solution to this problem, we arrived at database cloning, namely the DBCC CLONEDATABASE command. Cloning is very fast even for large databases. The mechanism has a number of nuances, but overall it has performed well. We also considered creating a database on the fly from scratch, but sqlproj projects lack many of the properties and settings of a decent database blank set up under a DBA's supervision, and rolling out something from a dacpac that is reasonably similar to the real database is hard. But who knows, maybe we will come back to this option.
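
As a rough illustration, this is the kind of T-SQL the cloning step boils down to; the database names and options here are made up, not our actual job:

```sql
-- Illustrative sketch: clone the source database for a pull-request build.
-- Names and options are examples, not our exact pipeline code.
DBCC CLONEDATABASE (AdventureWorks, AdventureWorks_PR1234)
    WITH NO_STATISTICS, NO_QUERYSTORE;

-- The clone comes out read-only; it has to be switched to read-write before the
-- dacpac with tests can be deployed into it.
ALTER DATABASE AdventureWorks_PR1234 SET READ_WRITE WITH ROLLBACK IMMEDIATE;
```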

Regarding data: writing tests against whatever happens to be stored right now in a database taken from production, or even in a sandbox, is naive and futile. We found a suitable (albeit anonymized) client with the required tariff, balance and orders, and hardcoded his client_id into the test. Tomorrow that client is no longer a client, or has switched to a different tariff, or the product in the order has changed category, and the test falls apart. A reliable test does not depend on such circumstances. A decent tSQLt test first fakes its dependencies, inserts the minimum required set of data into the faked tables, turns the "extra" stored procedures and functions into fake stubs, and checks only what it is meant to check. Here clones again turn out to be a good fit: the tables in a clone are pristinely empty, so the naive variant of the test simply has nothing to work with.
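
Here is a minimal sketch of what such a test can look like; the test class, table and procedure names (OrderTests, dbo.Orders, dbo.GetClientBalance) are invented for illustration:

```sql
-- Hypothetical names throughout; the pattern is what matters.
EXEC tSQLt.NewTestClass @ClassName = N'OrderTests';
GO
CREATE PROCEDURE OrderTests.[test GetClientBalance sums unpaid orders]
AS
BEGIN
    -- Fake the dependency: the clone's table is empty anyway, but faking also
    -- strips constraints, defaults and triggers that the test does not care about.
    EXEC tSQLt.FakeTable @TableName = N'dbo.Orders';

    -- Arrange: the minimum data the procedure under test needs.
    INSERT INTO dbo.Orders (ClientId, Amount, IsPaid)
    VALUES (42, 100.00, 0), (42, 50.00, 0), (42, 999.00, 1);

    -- Act.
    DECLARE @Actual decimal(18, 2);
    EXEC dbo.GetClientBalance @ClientId = 42, @Balance = @Actual OUTPUT;

    -- Assert.
    EXEC tSQLt.AssertEquals @Expected = 150.00, @Actual = @Actual;
END;
GO
```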

However, preparing an isolated blank in which tests for many pull requests can run in parallel does not end with cloning. Broadly, the scenario looks like this (a sketch of the discovery-and-run steps follows the list):

- Create a clone
- Deploy the project with tests
- Grant the tSQLt assembly access to the clone
- Start coverage collection
- Discover the list of test classes
- Run an individual test or test class
- Re-run the failed tests
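
A small sketch of the discovery-and-run steps, assuming the test class from the example above; the real pipeline scripts also wrap this with coverage start/stop, timeouts and logging:

```sql
-- Discover the test classes that the deployed dacpac brought into the clone.
SELECT Name
FROM tSQLt.TestClasses
ORDER BY Name;

-- Run one class (or one test) at a time; the CI script iterates over the list above.
EXEC tSQLt.Run @TestName = N'OrderTests';
```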

Collecting results

Coverage can be collected with the SqlCover utility by Ed Elliott. The mechanics are as follows: before the tests are executed, an Extended Events session is created that tracks the well-known SP:StmtStarting. The details of each such event contain an object_id and a line number. If you parse the sources of the stored procedures and other programmability objects, you can match the events against the sources and work out which statements were executed during the tests.
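
Roughly, the session looks like the sketch below; the real definition lives in SqlCover's source, and the session and database names here are illustrative. SP:StmtStarting from the SQL Trace world corresponds to sp_statement_starting in Extended Events:

```sql
-- Illustrative sketch of the coverage session; SqlCover creates the real one.
-- The sp_statement_starting payload carries object_id and line_number, which is
-- what gets matched against the parsed sources.
CREATE EVENT SESSION [SqlCover_PR1234] ON SERVER
ADD EVENT sqlserver.sp_statement_starting (
    WHERE sqlserver.database_name = N'AdventureWorks_PR1234'
)
ADD TARGET package0.event_file (SET filename = N'SqlCover_PR1234.xel');
GO
ALTER EVENT SESSION [SqlCover_PR1234] ON SERVER STATE = START;
```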

Statement-level detail is so-so. Imagine a kilometre-long query with a pile of CTEs feeding into a MERGE with several actions and, say, an OUTPUT DELETED.* INTO @tbl on top. All of that is a single statement, and SP:StmtStarting will only tell us that we entered it. Whether a particular CTE did any work, which MERGE action we fell into, whether OUTPUT DELETED produced anything at all: none of these details will be known, let alone which branch of a CASE WHEN THEN ELSE was taken and which was not. It's a shame: so much effort for such clumsy statistics. Still, having tests and even this rough understanding of code coverage is much better than having neither.

The result of running the tests from one pull request in the TeamCity interface. Many tests passed, one failed.

Besides coverage, you need to know which tests failed and which passed. It is very desirable to know the specific error that occurred, and it would also be nice to see how long each test took. So you cannot simply call tSQLt.Run and relax: the information has to be pulled out of the framework and converted into the formats of the CI runner (TeamCity in our case) and of Sonar.
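
The per-test outcomes sit in tSQLt.TestResult after a run, and this is roughly what we export and convert into TeamCity service messages and a SonarQube report (the timing columns exist in recent tSQLt versions):

```sql
-- Roughly what gets exported for the CI runner and for Sonar.
SELECT Class,
       TestCase,
       Result,                                            -- Success / Failure / Error
       Msg,                                               -- error text for failed tests
       DATEDIFF(millisecond, TestStartTime, TestEndTime) AS DurationMs
FROM tSQLt.TestResult
ORDER BY Class, TestCase;
```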

SonarQube tracks how well the changed code is covered by tests, and watches for copy-paste.

The retry step mentioned in the scenario above is also a custom solution. Tests can fail because of a deadlock, a timeout and similar nonsense that is not necessarily related to the code of the test or of the module under test. After such an error it makes sense to run the test again within the same CI build: re-running the whole build for every timeout or deadlock means you will grow tired of waiting for a lucky combination of circumstances. But, I repeat, to understand why a test failed you first have to pull the results out of SQL Server and analyze them.
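
A hedged sketch of that retry step; the real script is more careful about what counts as a retryable error:

```sql
-- Re-run only the tests that failed, within the same CI build, in case the failure
-- was a deadlock or timeout rather than a genuine bug.
DECLARE @Failed TABLE (FullTestName nvarchar(max));

INSERT INTO @Failed (FullTestName)
SELECT QUOTENAME(Class) + N'.' + QUOTENAME(TestCase)
FROM tSQLt.TestResult
WHERE Result <> N'Success';

DECLARE @TestName nvarchar(max);
DECLARE retry CURSOR LOCAL FAST_FORWARD FOR SELECT FullTestName FROM @Failed;
OPEN retry;
FETCH NEXT FROM retry INTO @TestName;
WHILE @@FETCH_STATUS = 0
BEGIN
    EXEC tSQLt.Run @TestName = @TestName;   -- a second failure here is a real failure
    FETCH NEXT FROM retry INTO @TestName;
END;
CLOSE retry;
DEALLOCATE retry;
```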

Polishing the tools

The tools mentioned above seem to have been created specifically for these tasks; all that remains is to take them off the shelf and bolt them onto the pipeline… But there are nuances.

SqlCover

#1 In this one spot the slowdown is terrible:

  The set of batches can be huge, and scanning it in every iteration is not the best approach

We loop over the statements parsed from the code, and for each one we search the so-called batches extracted from the collected Extended Events to see whether our object is there. Nested Loops x Table Scan. I had to go looking for this spot when the test run time on a relatively large project exceeded an hour; a quick check showed that the same tests finished in fifteen minutes without coverage collection. This was one of the suspects, and it turned out to be the culprit: a large project and many tests mean a large trace and, accordingly, a large _batches collection. The cure was to bluntly repack _batches into a Dictionary keyed by ObjectId.

#2 The trace file will be large in any case, and you need to reserve space for it by explicitly specifying a few gigabytes in the EVENT SESSION definition. If you don't, the trace gets cut off at the default size of 1GB. For a long time we could not understand why coverage was not growing even though new tests were being actively written; it turned out that 60GB+ of trace was simply not being saved anywhere, because we had not allocated space for it. The actual trace volume we discovered also became a reason to look for ways to reduce it. The tracing session is started by SqlCover and its definition lives in that utility's source code, so that is where the changes go too. The most obvious way to shrink the trace is to exclude the tSQLt framework objects from the session definition right away. Any effort to improve the session definition pays off: these volumes land on the disk of your own server, and the time to collect the data and compute coverage is part of your own builds.
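
Expanding the earlier session sketch, the relevant knobs are on the event_file target; the path and sizes below are illustrative, and in the real definition it is also worth filtering the tSQLt framework's own modules out of the event predicate:

```sql
-- Illustrative sizing, not SqlCover's exact code; without an explicit size the
-- trace was cut off at the default 1GB.
CREATE EVENT SESSION [SqlCover_PR1234] ON SERVER
ADD EVENT sqlserver.sp_statement_starting (
    WHERE sqlserver.database_name = N'AdventureWorks_PR1234'
)
ADD TARGET package0.event_file (
    SET filename           = N'D:\xe\SqlCover_PR1234.xel',
        max_file_size      = 10240,   -- MB per rollover file
        max_rollover_files = 6        -- roughly 60GB in total for a large test run
);
```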

#3 If there are triggers in your databases and you want to measure their coverage, you will have to figure out trigger tracking yourself. After a table is faked its object_id changes, and tSQLt.ApplyTrigger attaches the trigger to a table that did not exist before the tests started and will not exist afterwards. Triggers are not moved from one table to another, so a new temporary object is created, and SqlCover will not find such an identifier in sys.objects. You have to match the trace events produced by the temporary trigger on the fake table with the real trigger on the original table.

The author of the utility is, of course, a great fellow, and the tool is unique and undoubtedly useful. But the code is rough and unkempt. There is room for small optimizations, and in general it would be worth going over all the source code with an experienced eye, wiring up StyleCop and adding tests.

tSQLt

#1 If your databases actively look into each other, you will have to add stored procedures for faking synonyms yourself: remote tables, functions and procedures hidden behind local synonyms have to be turned into full-fledged local fake objects.
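
A hedged sketch of the manual workaround for a single synonym; dbo.RemoteOrders and its columns are invented, and our real solution wraps this pattern in a helper procedure:

```sql
-- Replace a synonym that points at a remote table with a local stand-in,
-- so that tSQLt.FakeTable has something local to operate on.
IF OBJECT_ID(N'dbo.RemoteOrders', N'SN') IS NOT NULL
    DROP SYNONYM dbo.RemoteOrders;

-- Minimal local copy of the remote table's shape (columns are illustrative).
CREATE TABLE dbo.RemoteOrders (OrderId int, ClientId int, Amount decimal(18, 2));

EXEC tSQLt.FakeTable @TableName = N'dbo.RemoteOrders';
```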

#2 If the SQL Server version allows it, it is worth rewriting all the string aggregation from FOR XML to STRING_AGG. Concatenating through XML is not exactly fast, and at these volumes it shows. The framework assembles dynamic queries from metadata, so there is a lot of string aggregation in the code.
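
For example, the classic FOR XML PATH concatenation and its STRING_AGG equivalent (dbo.Orders is just the example table again):

```sql
-- The old pattern, which works on any supported version: aggregate through XML.
SELECT STUFF((
    SELECT ',' + c.name
    FROM sys.columns AS c
    WHERE c.object_id = OBJECT_ID(N'dbo.Orders')
    FOR XML PATH(''), TYPE).value('.', 'nvarchar(max)'), 1, 1, '');

-- The same thing on SQL Server 2017+; the cast keeps STRING_AGG from being
-- truncated at 8000 bytes.
SELECT STRING_AGG(CAST(c.name AS nvarchar(max)), ',')
FROM sys.columns AS c
WHERE c.object_id = OBJECT_ID(N'dbo.Orders');
```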

#3 If the test author forgot to fake a large table and called tSQLt.AssertEqualsTable to compare the result with a small test set of rows, the procedure starts printing a delta with as many millions of rows as there were in the table that was never faked. Nobody needs those few hours of pointless spam, so just add a limit. Somewhat arbitrarily we chose tSQLt.Private_Print for this. Nobody expects thousands of prints from the framework, so this one change covered all variations of the described scenario at once.
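
The idea of the guard looks roughly like this; the actual patch inside tSQLt.Private_Print differs in detail, and the ceiling is arbitrary:

```sql
-- Sketch of the idea only: cap an oversized message before it gets printed.
DECLARE @Message nvarchar(max) = N'...imagine a multi-million-row delta here...';
DECLARE @MaxLen  int = 100000;   -- arbitrary ceiling, enough for any sane diff

IF LEN(@Message) > @MaxLen
    SET @Message = LEFT(@Message, @MaxLen) + NCHAR(10) + N'... output truncated ...';

EXEC tSQLt.Private_Print @Message = @Message;
```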

"I'm a simple fellow," it says: "however much you give me, that's how much I'll print."

#4 Preparing a decent test with faked dependencies takes a lot of effort. It is worth writing a stored procedure that generates a test blank with the dependencies already faked. The dependencies of the procedure under test can be extracted from sys.sql_expression_dependencies.
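
A hedged sketch of the discovery query such a generator can start from; dbo.GetClientBalance is the hypothetical procedure from the earlier example:

```sql
-- List what the procedure under test touches, so a generator can emit
-- tSQLt.FakeTable / tSQLt.SpyProcedure / tSQLt.FakeFunction calls for each item.
SELECT d.referenced_schema_name,
       d.referenced_entity_name,
       o.type_desc                      -- USER_TABLE, SQL_STORED_PROCEDURE, ...
FROM sys.sql_expression_dependencies AS d
LEFT JOIN sys.objects AS o
       ON o.object_id = d.referenced_id
WHERE d.referencing_id = OBJECT_ID(N'dbo.GetClientBalance');
```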

#5 In several places the framework's tests put something into permanent tables, and part of that information is lost once the transaction wrapping the test session is rolled back. Most of this information is meaningless outside of a specific test run anyway, and when several sessions work actively in parallel, tests even start overlapping in those tables, cleaning up or reading something that is not theirs. Roughly all such places need to be cut out and the persistent tables replaced with temporary ones. Although the pipeline runs tests inside clones, programmers do their debugging in the original copy of the database on the sandbox server, so it is definitely worth eliminating this unwanted contention on shared resources.

The authors of the framework are, of course, great fellows, and the tool is unique and undoubtedly useful. But the code is rough and unkempt. It seems odd that the guys clearly know about tests but have apparently never heard of formatters and linting. It is highly advisable to go over the code with an experienced eye and a confident hand.

In short, you will have to do quite a lot of finishing with the proverbial file.

Pitfalls

  • There are a lot of nuances associated with database clones, including bugs like this one: a DacFx bug that in turn depends on a SQL Server bug.

  • Before creating crowds of clones you need to come to an agreement with the DBAs, and write a job that will clean up the abandoned leftovers of interrupted sessions.

  • Programmers need the ability to run tests not only on the CI side but also during development. This may require a separate server, and a period of getting used to the fact that developers sometimes step on each other in procedures and tables.

  • Coverage via Extended Events cannot provide the same detail as in "normal" programming languages. There is no line-by-line detail, only the level of individual statements.

  • A lot of time will have to be spent reworking SqlCover and tSQLt.

  • First you need to build a CI/CD for the database, at least without tests.

  • Everything around the SqlCover, tSQLt and SqlPackage calls is a custom solution. All of it has to be written by hand. Kilometers of scripts.


In any case, the game is worth the candle. A T-SQL development process with tests in the pipeline and Sonar is far more mature and decent than one without them. That becomes clear even to skeptics the first time a test fails.
