DB Tester Specification - Database Operations
Operation Enum
Location: io.github.seijikohara.dbtester.api.operation.Operation
Available Operations
| Operation | Description | Use Case |
|---|---|---|
| NONE | No database operation | Read-only verification |
| UPDATE | Update existing rows by primary key | Modify existing data |
| INSERT | Insert new rows | Empty tables or append |
| UPSERT | Upsert (insert or update) | Mixed insert and update |
| DELETE | Delete specific rows by primary key | Selective removal |
| DELETE_ALL | Delete all rows from tables | Clear without sequence reset |
| TRUNCATE_TABLE | Truncate tables | Clear with sequence reset |
| CLEAN_INSERT | Delete all then insert (default) | Fresh test data |
| TRUNCATE_INSERT | Truncate then insert | Fresh data with sequence reset |
Operation Descriptions
NONE
Performs no database operation.
Behavior:
- Skips all dataset tables
- Does not modify database state
Use Case:
- ExpectedDataSet-only tests where the data under verification already exists from a previous test or setup step
- Manual setup scenarios
UPDATE
Updates existing rows identified by primary key.
SQL Generated: UPDATE table SET columns = ? WHERE pk_columns = ?
Behavior:
- Matches rows by primary key
- Updates non-key columns
- Ignores rows not found in database
Use Case:
- Modifying existing test data
- State transitions within tests
Requirements:
- Tables must have primary keys defined
- Dataset must include primary key columns
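The generated statement above can be illustrated with a minimal sketch. Note that buildUpdateSql is a hypothetical helper written for this document, not part of the published API; the framework's actual SQL assembly may differ.

```java
import java.util.List;
import java.util.stream.Collectors;

public class UpdateSqlSketch {

    // Hypothetical helper: assembles the parameterized UPDATE described above.
    // Non-key columns go into the SET clause; primary key columns form the WHERE clause.
    static String buildUpdateSql(String table, List<String> columns, List<String> pkColumns) {
        String setClause = columns.stream()
                .filter(c -> !pkColumns.contains(c))
                .map(c -> c + " = ?")
                .collect(Collectors.joining(", "));
        String whereClause = pkColumns.stream()
                .map(c -> c + " = ?")
                .collect(Collectors.joining(" AND "));
        return "UPDATE " + table + " SET " + setClause + " WHERE " + whereClause;
    }

    public static void main(String[] args) {
        String sql = buildUpdateSql("USERS", List.of("ID", "NAME", "EMAIL"), List.of("ID"));
        System.out.println(sql); // UPDATE USERS SET NAME = ?, EMAIL = ? WHERE ID = ?
    }
}
```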
INSERT
Inserts new rows without modifying existing data.
SQL Generated: INSERT INTO table (columns) VALUES (?)
Behavior:
- Inserts each row from the dataset
- Fails on duplicate primary key violations
- Does not affect existing rows
Use Case:
- Appending to tables with existing data
- Tests requiring specific row additions
Constraints:
- Dataset rows must not conflict with existing primary key or unique constraint values
UPSERT
Performs upsert operations (insert or update).
Behavior:
- Checks if row exists by primary key
- Updates existing rows
- Inserts new rows
Use Case:
- Mixed scenarios with partial existing data
- Incremental data setup
Requirements:
- Tables must have primary keys defined
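The decision at the heart of the behavior above can be sketched as follows. The table, column names, and chooseStatement helper are hypothetical illustrations, not the framework's internals.

```java
import java.util.Set;

public class UpsertSketch {

    // Hypothetical illustration: pick UPDATE or INSERT depending on whether
    // the row's primary key value already exists in the target table.
    static String chooseStatement(Set<String> existingPks, String pk) {
        return existingPks.contains(pk)
                ? "UPDATE USERS SET NAME = ? WHERE ID = ?"
                : "INSERT INTO USERS (ID, NAME) VALUES (?, ?)";
    }

    public static void main(String[] args) {
        Set<String> existing = Set.of("1", "2");
        System.out.println(chooseStatement(existing, "1")); // existing row -> UPDATE
        System.out.println(chooseStatement(existing, "9")); // new row      -> INSERT
    }
}
```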
DELETE
Deletes specific rows identified by primary key.
SQL Generated: DELETE FROM table WHERE pk_columns = ?
Behavior:
- Matches rows by primary key from dataset
- Deletes only matching rows
- Other rows remain unchanged
Use Case:
- Removing specific test records
- Cleanup of individual rows
Requirements:
- Dataset must include primary key columns
DELETE_ALL
Deletes all rows from referenced tables.
SQL Generated: DELETE FROM table
Behavior:
- Deletes all rows from each table in dataset
- Does not reset identity or sequence columns
- Subject to foreign key constraints (deletion may fail while other tables still reference the rows)
Use Case:
- Clearing tables while preserving sequences
- Setup for insert-only operations
Table Processing Order:
- Tables processed in reverse foreign key order
TRUNCATE_TABLE
Truncates tables, resetting identity columns where supported.
SQL Generated: TRUNCATE TABLE table
Behavior:
- Removes all rows from tables
- Resets identity and auto-increment columns
- Database-dependent behavior for foreign keys
Use Case:
- Complete table reset including sequences
- Performance-critical cleanup
Database Support:
| Database | Identity Reset | FK Handling |
|---|---|---|
| H2 | Yes | Requires CASCADE |
| MySQL | Yes | Requires disable FK checks |
| PostgreSQL | Yes | Requires CASCADE |
| SQL Server | Yes | Requires no FK references |
| Oracle | Yes | Requires CASCADE |
CLEAN_INSERT
Deletes all rows then inserts dataset rows.
Equivalent To: DELETE_ALL followed by INSERT
Behavior:
- Delete all rows from each table (reverse FK order)
- Insert all rows from dataset (FK order)
Use Case:
- Standard test data setup (default operation)
- Deterministic starting state
Execution Order:
- DELETE phase: Child tables first
- INSERT phase: Parent tables first
TRUNCATE_INSERT
Truncates tables then inserts dataset rows.
Equivalent To: TRUNCATE_TABLE followed by INSERT
Behavior:
- Truncate each table (may require CASCADE)
- Insert all rows from dataset
Use Case:
- Test data setup with sequence reset
- Performance-optimized setup
Execution Flow
DataSet Phase
- Load dataset files from configured location
- Filter rows by scenario marker
- Resolve table ordering based on configured strategy
- Execute configured operation
- Commit transaction
ExpectedDataSet Phase
- Load expected dataset files
- Filter rows by scenario marker
- Read actual data from database
- Compare expected vs actual data
- Report mismatches as test failures
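The comparison step can be sketched with rows modeled as column-to-value maps. This is an illustrative simplification (the diff helper and its report format are hypothetical); the framework's actual comparison and failure reporting may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class ComparisonSketch {

    // Hypothetical illustration of the verification step: each row is a
    // column-name -> value map; mismatched columns are reported per row.
    static List<String> diff(List<Map<String, String>> expected,
                             List<Map<String, String>> actual) {
        List<String> mismatches = new ArrayList<>();
        for (int i = 0; i < expected.size(); i++) {
            Map<String, String> exp = expected.get(i);
            Map<String, String> act = actual.get(i);
            for (Map.Entry<String, String> e : exp.entrySet()) {
                String actualValue = act.get(e.getKey());
                if (!e.getValue().equals(actualValue)) {
                    mismatches.add("row " + i + ", column " + e.getKey()
                            + ": expected=" + e.getValue() + ", actual=" + actualValue);
                }
            }
        }
        return mismatches;
    }

    public static void main(String[] args) {
        List<Map<String, String>> expected = List.of(Map.of("ID", "1", "NAME", "Alice"));
        List<Map<String, String>> actual = List.of(Map.of("ID", "1", "NAME", "Bob"));
        diff(expected, actual).forEach(System.out::println);
    }
}
```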
Operation Execution
Table Ordering Strategy
TableOrderingStrategy Enum
Location: io.github.seijikohara.dbtester.api.operation.TableOrderingStrategy
The TableOrderingStrategy enum controls how the framework determines the order in which tables are processed during database operations.
Available Strategies
| Strategy | Description |
|---|---|
| AUTO | Automatically determine best ordering (default) |
| LOAD_ORDER_FILE | Use load-order.txt file (error if not found) |
| FOREIGN_KEY | Use JDBC metadata for FK-based ordering |
| ALPHABETICAL | Sort tables alphabetically by name |
Strategy Details
AUTO (Default)
The framework attempts strategies in the following order:
1. LOAD_ORDER_FILE - If load-order.txt exists in the dataset directory, use it
2. FOREIGN_KEY - Query JDBC metadata to resolve foreign key dependencies
3. ALPHABETICAL - Fall back to case-insensitive alphabetical ordering
This strategy provides the most flexible behavior and is suitable for most use cases.
LOAD_ORDER_FILE
Requires a load-order.txt file in the dataset directory. If the file does not exist, the framework throws a DataSetLoadException.
Use Case: When you need explicit control over table ordering and want to guarantee the order is always specified.
@DataSet(tableOrdering = TableOrderingStrategy.LOAD_ORDER_FILE)
void testWithExplicitOrder() { }

FOREIGN_KEY
Uses JDBC database metadata (DatabaseMetaData.getExportedKeys()) to analyze foreign key dependencies and performs a topological sort.
Behavior:
- Parent tables (those referenced by foreign keys) are processed before child tables
- If foreign key metadata cannot be retrieved, falls back to original table order
- If circular dependencies are detected, logs a warning and uses declaration order
Use Case: Databases with well-defined foreign key constraints where automatic ordering is desired.
@DataSet(tableOrdering = TableOrderingStrategy.FOREIGN_KEY)
void testWithFkOrdering() { }

ALPHABETICAL
Tables are sorted in ascending alphabetical order (case-insensitive).
Use Case: When table ordering does not matter (no FK constraints) or for deterministic ordering in scenarios without dependencies.
@DataSet(tableOrdering = TableOrderingStrategy.ALPHABETICAL)
void testWithAlphabeticalOrder() { }

Usage in Annotations
// Default AUTO strategy
@DataSet
void testDefault() { }
// Explicit strategy on @DataSet
@DataSet(tableOrdering = TableOrderingStrategy.FOREIGN_KEY)
void testWithFkOrder() { }
// Strategy on @ExpectedDataSet (affects verification order)
@ExpectedDataSet(tableOrdering = TableOrderingStrategy.ALPHABETICAL)
void testExpectedDataSetOrder() { }
// Combined usage
@DataSet(operation = Operation.CLEAN_INSERT, tableOrdering = TableOrderingStrategy.LOAD_ORDER_FILE)
@ExpectedDataSet(tableOrdering = TableOrderingStrategy.ALPHABETICAL)
void testBothPhases() { }

Table Ordering
Manual Ordering with load-order.txt
The load-order.txt file provides the preferred method for controlling table processing order. This file specifies the exact order in which the framework processes tables.
For detailed information about the file format and usage, see Data Formats - Load Order.
Foreign Key Awareness
When no load-order.txt file exists, the framework resolves table dependencies using database metadata:
1. Query DatabaseMetaData.getExportedKeys() for each table
2. Build dependency graph
3. Topologically sort tables
Ordering Rules
| Operation | Order |
|---|---|
| INSERT, UPSERT | Parent tables first (FK order) |
| DELETE, DELETE_ALL | Child tables first (reverse FK order) |
| TRUNCATE_TABLE | Child tables first |
| CLEAN_INSERT | Delete reverse, Insert forward |
| TRUNCATE_INSERT | Truncate reverse, Insert forward |
Order Resolution Priority
The TableOrderingStrategy determines table order. When using AUTO (the default), the priority is:
1. Manual ordering: load-order.txt file in dataset directory
2. FK-based ordering: Automatic resolution using database metadata
3. Alphabetical ordering: Fallback when no other ordering is available
Note: Unlike other frameworks, DB Tester does not automatically generate the load-order.txt file. See Table Ordering Strategy for details.
Circular Dependencies
For tables with circular foreign key references:
- Detect cycles in dependency graph
- Log warning
- Process in dataset declaration order
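The parent-first ordering with a declaration-order fallback for cycles can be sketched as a topological sort (Kahn's algorithm) over the FK graph. This is an illustrative sketch, not the framework's implementation; deps maps each child table to the parent tables it references.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class TableOrderingSketch {

    // Hypothetical sketch: returns tables in parent-first (FK) order.
    // Falls back to declaration order when a cycle prevents a complete sort.
    static List<String> parentFirstOrder(List<String> declared,
                                         Map<String, Set<String>> deps) {
        // Count unresolved parent references per table.
        Map<String, Integer> remaining = new LinkedHashMap<>();
        for (String t : declared) {
            remaining.put(t, deps.getOrDefault(t, Set.of()).size());
        }
        List<String> ordered = new ArrayList<>();
        Deque<String> ready = new ArrayDeque<>();
        remaining.forEach((t, n) -> { if (n == 0) ready.add(t); });
        while (!ready.isEmpty()) {
            String parent = ready.poll();
            ordered.add(parent);
            // Release every child that references the processed parent.
            for (String t : declared) {
                if (deps.getOrDefault(t, Set.of()).contains(parent)
                        && remaining.merge(t, -1, Integer::sum) == 0) {
                    ready.add(t);
                }
            }
        }
        // Cycle detected: not every table was sorted, use declaration order.
        return ordered.size() == declared.size() ? ordered : declared;
    }

    public static void main(String[] args) {
        List<String> order = parentFirstOrder(
                List.of("ORDER_ITEMS", "ORDERS", "USERS"),
                Map.of("ORDERS", Set.of("USERS"), "ORDER_ITEMS", Set.of("ORDERS")));
        System.out.println(order); // [USERS, ORDERS, ORDER_ITEMS]
    }
}
```

Reversing the result gives the child-first order used by the delete and truncate phases.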
Transaction Handling
Default Behavior
- AutoCommit disabled during operation
- Single transaction per dataset
- Commit on success, rollback on failure
Transaction Boundaries
| Phase | Transaction Scope |
|---|---|
| DataSet | All tables in single transaction |
| ExpectedDataSet | Read-only (no transaction) |
Connection Management
1. Obtain connection from DataSource
2. Save current AutoCommit setting
3. Set AutoCommit = false
4. Execute operations
5. Commit or rollback
6. Restore AutoCommit setting
7. Close connection
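The save/restore pattern above can be sketched as follows. FakeConnection is a simplified stand-in for java.sql.Connection, used only to make the pattern executable without a real database; the framework itself operates on real JDBC connections.

```java
import java.util.ArrayList;
import java.util.List;

public class TransactionSketch {

    // Simplified stand-in for java.sql.Connection that records each call.
    static class FakeConnection {
        boolean autoCommit = true;
        boolean closed = false;
        final List<String> log = new ArrayList<>();
        void setAutoCommit(boolean v) { autoCommit = v; log.add("autoCommit=" + v); }
        boolean getAutoCommit() { return autoCommit; }
        void commit() { log.add("commit"); }
        void rollback() { log.add("rollback"); }
        void close() { closed = true; log.add("close"); }
    }

    // Sketch of the connection-management steps listed above.
    static void runInTransaction(FakeConnection conn, Runnable operations) {
        boolean saved = conn.getAutoCommit();  // save current setting
        try {
            conn.setAutoCommit(false);         // single transaction
            operations.run();                  // execute dataset operations
            conn.commit();                     // commit on success
        } catch (RuntimeException e) {
            conn.rollback();                   // rollback on failure
            throw e;
        } finally {
            conn.setAutoCommit(saved);         // restore setting
            conn.close();                      // close connection
        }
    }

    public static void main(String[] args) {
        FakeConnection conn = new FakeConnection();
        runInTransaction(conn, () -> conn.log.add("insert rows"));
        System.out.println(conn.log); // [autoCommit=false, insert rows, commit, autoCommit=true, close]
    }
}
```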
Error Recovery
On exception:
- Rollback transaction
- Close connection
- Wrap exception in DatabaseOperationException
- Propagate to test framework
SQL Identifier Validation
Overview
The framework validates all SQL identifiers (table names and column names) before interpolating them into SQL statements. This validation prevents malformed inputs from causing SQL syntax errors or security issues.
Validation Rules
Identifiers must match the following pattern:
^[a-zA-Z_][a-zA-Z0-9_]*(\.[a-zA-Z_][a-zA-Z0-9_]*)?$

Valid identifiers:
| Example | Description |
|---|---|
| USERS | Simple uppercase name |
| user_accounts | Lowercase with underscores |
| _temp_table | Starting with underscore |
| public.users | Schema-qualified name |
| Table123 | Mixed case with digits |
Invalid identifiers:
| Example | Reason |
|---|---|
| 123table | Starts with digit |
| user-accounts | Contains hyphen |
| user name | Contains space |
| table;DROP | Contains semicolon |
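The validation rule is a single regular expression match, which can be demonstrated directly with the pattern given above (the class and method names here are illustrative):

```java
import java.util.regex.Pattern;

public class IdentifierValidationSketch {

    // The validation pattern from this section: an identifier with an
    // optional schema qualifier, each part starting with a letter or underscore.
    static final Pattern IDENTIFIER = Pattern.compile(
            "^[a-zA-Z_][a-zA-Z0-9_]*(\\.[a-zA-Z_][a-zA-Z0-9_]*)?$");

    static boolean isValidIdentifier(String name) {
        return IDENTIFIER.matcher(name).matches();
    }

    public static void main(String[] args) {
        System.out.println(isValidIdentifier("public.users")); // true
        System.out.println(isValidIdentifier("table;DROP"));   // false
    }
}
```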
Error Handling
When the framework detects an invalid identifier, it throws an IllegalArgumentException with a descriptive message:
Invalid SQL identifier: 'user-accounts'. Identifiers must start with a letter or underscore and contain only letters, digits, and underscores.

The framework wraps this exception in a DatabaseOperationException during database operations.
Identifier Sources
The framework derives identifiers from:
| Source | Example |
|---|---|
| CSV/TSV filename | USERS.csv → table USERS |
| CSV/TSV header row | ID,NAME,EMAIL → columns |
| @DataSetSource attributes | Custom paths and names |
Ensure all data files follow the naming conventions to avoid validation errors.
Type Conversion
String to SQL Type
The framework converts string values from CSV or TSV to appropriate SQL types:
| Target SQL Type | Conversion Method |
|---|---|
| VARCHAR, CHAR, TEXT | Direct string |
| INTEGER, SMALLINT | Integer.parseInt() |
| BIGINT | Long.parseLong() |
| DECIMAL, NUMERIC | new BigDecimal() |
| REAL, FLOAT | Float.parseFloat() |
| DOUBLE | Double.parseDouble() |
| BOOLEAN, BIT | Boolean.parseBoolean() |
| DATE | LocalDate.parse() |
| TIME | LocalTime.parse() |
| TIMESTAMP | LocalDateTime.parse() |
| BLOB, BINARY | Base64 decode |
| CLOB | Direct string |
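A subset of the conversions in the table above can be sketched as a type-keyed dispatch. This is a hedged illustration (the convert method and its dispatch structure are hypothetical); the framework's actual conversion logic may differ.

```java
import java.math.BigDecimal;
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.util.Base64;

public class TypeConversionSketch {

    // Hypothetical sketch of string-to-SQL-type conversion, following the
    // conversion methods listed in the table above.
    static Object convert(String value, String sqlType) {
        return switch (sqlType) {
            case "INTEGER", "SMALLINT" -> Integer.parseInt(value);
            case "BIGINT" -> Long.parseLong(value);
            case "DECIMAL", "NUMERIC" -> new BigDecimal(value);
            case "BOOLEAN", "BIT" -> Boolean.parseBoolean(value);
            case "DATE" -> LocalDate.parse(value);
            case "TIMESTAMP" -> LocalDateTime.parse(value,
                    DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss"));
            case "BLOB", "BINARY" -> Base64.getDecoder().decode(value);
            default -> value; // VARCHAR, CHAR, TEXT, CLOB: direct string
        };
    }

    public static void main(String[] args) {
        System.out.println(convert("42", "INTEGER"));      // 42
        System.out.println(convert("2024-01-15", "DATE")); // 2024-01-15
    }
}
```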
NULL Handling
- An empty CSV or TSV field maps to SQL NULL
- NULL values are bound with PreparedStatement.setNull() using the appropriate SQL type
Date and Time Formats
| Type | Accepted Formats |
|---|---|
| DATE | yyyy-MM-dd |
| TIME | HH:mm:ss, HH:mm:ss.SSS |
| TIMESTAMP | yyyy-MM-dd HH:mm:ss, yyyy-MM-dd HH:mm:ss.SSS |
LOB Handling
| Type | Conversion |
|---|---|
| BLOB | Base64 string → byte[] |
| CLOB | String → Reader |
Related Specifications
- Overview - Framework purpose and key concepts
- Public API - Operation enum reference
- Data Formats - Source file structure
- Configuration - OperationDefaults
- Error Handling - Database operation errors