Skip to content

Latest commit

 

History

History
127 lines (76 loc) · 5.38 KB

README.md

File metadata and controls

127 lines (76 loc) · 5.38 KB

DBI-compliance reports for different database systems

Scope

The programming language R offers a standardized interface to access many different database systems via an interface called DBI.

This github repository is intended as

  • a collection of reports
  • that visualize how good different databases support DBI-based database access
  • by measuring the DBI compliance with the unit tests of the package DBItest.

The source code to generate the reports is also included in this repository (not separated).

Example reports

For an example of a result report see this first real-life report and the correspondig raw data (based on SQLite).

This is still pre-mature alpha status. Don't draw any conclusions from this example!

Example: DBI-compliance summary for database xy

This diagram shows the comliance maturity of a database xy (total and per test case group):

DBI is...

"... a common interface between [the programming languages] R/S and RDBMS that would allow users to access data stored on database servers in a uniform and predictable manner irrespective of the database engine. The interface defines a small set of classes and methods similar in spirit to Python’s DB-API, Java’s JDBC, Microsoft’s ODBC, Perl’s DBI, etc."

Source: https://cran.r-project.org/web/packages/DBI/vignettes/DBI-proposal.html

You can find the full DBI specification at CRAN: https://cran.r-project.org/web/packages/DBI/vignettes/spec.html

Just remember: DBI is a common database interface for R to access different databases via the same API.

Use cases of the reports

  1. Decide if a DBI driver covers the requirements of a project for a specific database
  2. Estimate the impact of different configurations for the same database (e. g. using the odbc vs. RPostgreSQL packages) to identify the best configuration for your project
  3. Help developers of DBI drivers to find gaps and improve their drivers toward full DBI-compliance
  4. Help database vendors (their developers) to close gaps in the database engines required to be DBI-compliant (e. g. for new big-data or noSQL databases)
  5. Find DBItest unit tests that are unrealistic to work with any database and improve them

Current status

Last status update: Feb 10, 2018

The work is still in progress based on the embedded database SQLite on Linux for easier development without a DB server (see TODO list)...

For a list of currently supported test configurations (database/driver combinations) see the test configuration file:

https://github.com/aryoda/R_DBI_compliance_reports/raw/master/test_configs.xlsx

My (personal) goal is the check the Microsoft SQL Server 2017 using the DBI-ODBC "bridge" provided by the odbc package to identify more open issues to be solved in the odbc package (or the SQL server).

How to run the tests

  1. Clone or download the repository
  2. Install the database drivers (native, ODBC...) on the client computer
  3. Install docker
  4. Prepare the required docker container by using the install* scripts in the sub folder docker
  5. Adjust the test configuration file test_configs.xlsx using LibreOffice Calc or Excel
  6. Run the tests by executing (sourcing) main.R
  7. See the results in the sub folder results
    • Comparative_report.html
    • one report as hmtl file per test configuration
    • one Excel file containing the raw test results (for manual filtering and examination)
    • one CSV file containing the raw test results (for automatic processing)

I have a dream...

... to provide a central DBI-compliance test data collection that

  1. is showing the DBI-compliance status of all database/DBI-driver/client-OS combinations
  2. is updated whenever a new version of a database is released
  3. is updated whenever a new version of DBI-driver is published (e. g. via continuous integration with docker)
  4. is finally showing 100 % DBI-compliance for all database/DBI-driver/client-OS combinations ;-)

OK, to be realistic: I can't do this alone - so please contribute if you want the dream to become true!

References

  1. Programming language R
  2. DBI package for R: Declares the formal DBI interface
  3. DBItest package for R: Contains unit tests to check the DBI compliance
  4. odbc package for R: DBI-based driver to use the ODBC interface for database access
  5. Docker containerization platform
  6. Connection string syntax and examples for many database vendors
  7. R driver packages that implement the DBI interface (see Reverse depends)
  8. DBI documentation from RStudio (containing a short history too)