View a utility transaction database file with the transaction database Viewer (SPMF documentation)

Utility transaction databases are a type of data taken as input by data mining algorithms offered in SPMF such as FHM, EFIM, UP-Growth and Two-Phase .

SPMF offers a tool to view the content of a utility transaction database. This tool is called the SPMF utility transaction database viewer.

This page explains how to use this tool with an example.

How to run this example?

If you want to run this example from the graphical interface of SPMF, (1) choose the algorithm "Open_utility_transaction_database_file_with_viewer", (2) choose the DB_Utility.txt file as input, and then (3) click "run algorithm" .

graph viewer open

What is displayed?

After running the example, the content of the file will be displayed by the tool. The picture below shows the user interface of this viewer.

The window A) show in the picture below is the main window. It displays the utility transaction database using a table. The table has four rows in this example. Each row (except the last one) is a transaction from the utility transaction database.

Imagine that each transaction represents the items purchased by a customer. .

Take the first row as example.
The cell in the first column indicates that the ID of this transaction is 0.
The cell in the second column indicates that this transaction 0 contains the item 1 and the utility was 5 $.
The cell in the third column indicates that this transaction 0 contains the item 2 and the utility was 10 $.
The cell in the fourth column indicates that this transaction 0 contains the item 3 and the utility was 1 $.
The cell in the fifth column indicates that this transaction 0 contains the item 4 and the utility was 6 $.
The cell in the sixth column indicates that this transaction 0 contains the item 5 and the utility was 3 $.
The cell in the fifth column indicates that this transaction 0 contains the item 6 and the utility was 5 $.
The cell in the sixth column indicates that this transaction 0 does not contain the item 7.
The cell in the seventh column indicates that the total amount of money (utility) spent in this transaction is 1 + 3 + 5 + 10 + 6 + 5 = 30 $

The other transactions follow the same format.

Then the last line of the table provides the sum of each column. For example, the cell in the last row and second column indicates that the total amount of utility for item 1 in this database is 5 + 5 + 10 = 20 $.

This view as a table can be useful to understand the content of a utility transaction database file.

Besides, there are buttons that provides additional features:

graph viewer database graph

What is the input?

The algorithm takes as input a utility transaction database in SPMF format, as used by algorithm such FHM, EFIM and UP-Growth .

The database used in this example is provided in the text file "DB_Utility.txt" in the package ca.pfv.spmf.tests of the SPMF distribution.

The input file format is defined as follows. It is a text file. Each lines represents a transaction. Each line is composed of three sections, as follows.

For example, this is the content of the example file "DB_Utility.txt":

3 5 1 2 4 6:30:1 3 5 10 6 5
3 5 2 4:20:3 3 8 6
3 1 4:8:1 5 2
3 5 1 7:27:6 6 10 5
3 5 2 7:11:2 3 4 2

Consider the first line. It means that the transaction {3, 5, 1, 2, 4, 6} has a total utility of 30 and that items 3, 5, 1, 2, 4 and 6 respectively have a utility of 1, 3, 5, 10, 6 and 5 in this transaction. The following lines follow the same format.