Skip to content

Advanced settings

In table extraction view, you will find the menu item Settings in the upper action bar (make sure that "Training Mode" is activated). If you click on the gear icon, a window will open in which you will find Advanced Settings.

These functionalities are available in "Settings":#

Header row count#

Here you can define the number of lines of a table header. For example, the table header line can be two lines:

Accordingly, the value in "Header row count" is set to two:

Why is this needed? It might be that DOC² does not recognize the second line in the table header as header line. In this case, it incorrectly inserts it into the table as an extracted value. This can easily be prevented with this function.

Example before:

Example after:

Move Extra Rows to#

In this example, the item description in the table spans several rows, but you only need the first one. To only extract this and include it in the "DESCRIPTION" column, select Move Extra Rows to Trash.

After naming the columns and mapping them to a position, you get the following result:

Below functionalities are available in the advanced settings:#

Minimum grouped rows#

Enter the minimum number of rows in your grouped column here.

In this table you see six rows of which only three would be relevant to you. In the first two columns there are two criteria that have to be extracted separately. These will be your mapped columns, all the other ones have to be trained as custom columns.
It works as follows:

Select the two header rows as well as two minimum grouped rows as these should be grouped to one row.

Also select the Move extra rows to Trash option to be able to train all the other columns as custom columns.

Name the first column "Position" and group on that one.

After naming all the columns and training the values, this should be your result:

Reverse grouping#

If you want to combine all the rows above the grouped attribute, check the box here.

In this example, the table starts with a row that is above all other information but also needs to be extracted along with the information below it. DOC² might extract this row as an additional row and then the grouping of the information, e.g. by position, will not work properly.

After grouping on net amount, checking the box, selecting the Move extra rows to Trash option

and naming all columns this is your result: