1. C-3 Plus Componentsadd chapter
C-3 Plus consists of four essential components:
- Rule (C-3 Plus rule)
- Items (C-3 Plus items)
- Start, end and global conditions (C-3 Plus conditions)
- Predicates (C-3 Plus predicates)
This order also represents the hierarchy of the rule-elements: A rule consists of several search items, which each of them contains three options as start, end or global condition. Each condition can include any number of predicates.
Each component of C-3 Plus has a set of parameters, which can influence their individual functionalities. Parameter changes of a C-3 Plus component will only have an effect on the respective C-3 component.
1.1 Basic Functionadd section
C-3 Plus runs through all components of a rule in the order of the items. Each rule will be used on the before selected areas of the table of contents. So C-3 Plus uses the OCR results of those areas.
An item normally searches for a text part in an area. The item can be specifically defined by setting predicates in start, end and global conditions.
Therefore, start and end conditions are defining the beginning or the end of an item while the global conditions define the whole condition itself.
Basically, C-3 Plus runs through each area word by word. The selected area shows words in an useful order which perhaps could not be delivered by the OCR for the whole document. C-3 Plus tests each predicate on each word. If all results are correct, the program continues with the next check of start, end and global condition. If they are correct the next item will be tested and so on.
While testing between the conditions the word order will be maintained but not while testing between the items. So it is possible that one word can be used as start or as end condition. But a single word belongs only to one item.