Adding new Content-Based view rules

Instructions and examples of adding rules to define the site's logical structure.

Before you begin

In Content-Based view of the Scan Configuration dialog box (Explore > Content-Based), you can define content-based structure using two types of definition:

Logical Content Path (such as breadcrumbs)
Custom (use regexps to define your own custom nodes)

Doing this enables AppScan® to display a logical application tree, instead of listing large parts of the site under a single node. When more than one rule is defined, AppScan® attempts to match each URL to the rules in the order they are listed. As soon as it finds a match it includes the URL in the Content-Based tree following that rule and proceeds to the next URL.

Procedure

Add new definitions by clicking

(or edit existing definitions by selecting a definition and clicking

). Refer to the table below for field descriptions.


Setting	Description
Name	Name for this rule.
Description	(Optional) Description for this rule.
Rule Type	Select Logical Content Path or Custom. The remaining fields vary depending on your selection.
Logical Content Path
Content Path	Regexp that matches the content path in the HTML.
Separator	Regexp that matches the hierarchy separators in the content path in the HTML.
Node Display Name	Regexp that matches the node names (that appear to the application user and that will be used in the Application Tree).
Condition	(Optional) Regexp that defines the responses that are to be included in this node location.
Custom
Node Location	Use standard URL format to describe where nodes that fulfil the condition for this rule should be placed in the application tree (for example: /Home/Buy/Books). If the path does not exist it will be created.
Condition	Regexp that defines the responses that are to be included in this node location. If the page content matches this condition it will be included in the Content-Based tree.

Note: Fields that require or accept a regular expression have the Expression Test button:

, which opens the Expression Test PowerTool, to help you verify the syntax of the regular expression.

If you need additional help you may find the following link useful: http://www.regular-expressions.info/quickstart.html

Use the up/down arrows to arrange the definitions in the order they are to be applied.
Verify that the check box next to each rule that is to be applied, is selected.
Click OK to save changes.

Example

The following tables shows sample content for the two rule types.

Table 1. Content-Based View rule
Setting	Sample	Meaning
Content Path	<title>(.*?)</title>	All text between the title tags will be used as the basis for the node names.
Separator	[:\->\|]	These five characters will be treated as separators. Therefore: <title>Home:Accounts</title> and <title>Home:Plans</title> will appear in the application tree as a parent node Home with two child nodes: Accounts and Plans
Node Display Name	^\s(.?)\s*$	Spaces before and after the text will be removed when naming the node.
Condition	Log out	Only pages containing the words "Log out" will be included in the application tree under this rule.

Table 2. Custom rule
Setting	Sample	Meaning
Node Location	/root/child/grandchild	All pages that fulfil the Condition are added to the application tree under /root/child/grandchild Note: If the parent nodes in the specified branch do not exist, they will be created.
Condition		As this field is empty, all content to which the rule is applied will be added at this node location. Therefore, if this rule appears after the rule above, all pages that do not contain the words "Log out" will be added under the node /root/child/grandchild