4. Itemsadd chapter
An item is an element of a table of content entries. For example, the categories author, title and page number are items.
Each item has three conditions (start, end, global). Those can be defined by selecting predicates. Each item has its own parameters.
Items with all parameters and predicates can be saved easily as modules. These modules are used to simplify the creation of a rule by reuse well-known patterns.
The current C-3 Plus version contains the following items:
4.1 Authoradd section
If the item author needs to be extracted from a found text, stop-word and separator lists can be defined in the parameter field. Furthermore, separator and stop-word lists can be defined in BCS-2 settings. So, new items have those lists as default.
PARAMETER:
- Separators – List of separating signs or word between authors
- Stop-word – List of words which are not a part of the author
- Stop-word Case Sensitivity – Big and small letters are distinguished (on/off)
- One Author per Line – The authors are listed separately line for line
- If this function activated, already defined separators won’t be noticed
- Turn Author Names around – Swith first and last name of an author separated by a comma (on/off).
- For example: “Frank Mueller” is “Mueller, Frank”
- Resolve Hyphens – Delete separating hyphens between words and merge it together (on/off). As a consequence, hyphens in names will not be considered. Exception: If an author is written in big letters the rule accepts it as nonbreaking hyphen. It will not be deleted.
4.2 Titleadd section
In order to find the item title, parameters can be defined as for example the language of the title as well as stopp-words. Apart from that it can be defined that this title is a subtitle. So it is possible to list two titles in one rule. They are ranked correctly and considererd as independant items one of them marked as subtitle.
Parameter
- Language – Language of the title
- Resolve Hyphens – Delete hyphens between words and merge them (on/off)
- Stop-word – Stop-word lists
- Stop-word Case Sensitivity – Big and small letters are distinguished (on/off)
- Item is Subtitle – The title is considered as a subtitle
4.3 Page Numberadd section
If the item page is found in the text a stop-word list can be added to the parameter field.
Parameter
- Stop-word – Stop-word lists
- Stop-word Case Sensitivity – Big and small letters are distinguished (on/off)
- Normalize Numbers – OCR mistakes will be corrected automatically
- Space bars will be deleted
- Small letter l and i and big letter i will become figure 1
- Small and big letter o will become figure 0
- Respect Roman Numbers: OCR mistakes will be corrected automatically
- Number 1, small i and small l will become I
4.4 Abstractadd section
If the item abstract was found in the text the abstract language can be set.
Parameter:
- Language – Language of the title
4.5 Add Items to a Ruleadd section
To add an item you have to mark the first point in the rule editor. Click right and select Add item … . After that a tab opens up and you can select the item:
Optionally, use the item buttons for adding items to the editor field.