site crawler

1.2.1. Page TSconfig Reference (tx_crawler.crawlerCfg)

Property:

Data type:

Description:

Default:

paramSets.[key]

string

Get Parameter configuration. The values of GET variables are according to a special syntax. From the code documentation (class.tx_crawler_lib.php):

 

  • Basically: If the value is wrapped in [...] it will be expanded according to the following syntax, otherwise the value is taken literally

  • Configuration is splitted by "|" and the parts are processed individually and finally added together

  • For each configuration part:

    • "[int]-[int]" = Integer range, will be expanded to all values in between, values included, starting from low to high (max. 1000). Example "1-34" or "-40--30"

    • "_TABLE:" in the beginning of string indicates a look up in a table. Syntax is a string where [keyword]:[value] pairs are separated by semi-colon. Example "_TABLE:tt_content; _PID:123"

      • Keyword "_TABLE" (mandatory, starting string): Value is table name from TCA to look up into.

      • Keyword "_PID": Value is optional page id to look in (default is current page).

      • Keyword "_FIELD": Value is field name to use for the value (default is uid).

    • - Default: Literal value

 

Examples:

  &L=[|1|2|3]

  &L=[0-3]

  &L=[0-3]&contentId=[_TABLE:tt_content]

 

paramSets.[key].procInstrFilter

string

List of processing instructions, eg. "tx_indexedsearch_reindex" from indexed_search

 

paramSets.[key].pidsOnly

list of integers (pages uid)

List of Page Ids to limit this configuration to

 

paramSets.[key].userGroups

list of integers (fe_groups uid)

User groups to set for the request.

 

paramSets.[key].cHash

boolean

If set, a cHash value is calculated and added to the URLs.

 

paramSets.[key].baseUrl

string

If not set, t3lib_div::getIndpEnv('TYPO3_SITE_URL') is used to request the page.

MUST BE SET if run from CLI (since TYPO3_SITE_URL does not exist in that context!)

 

[Page TSconfig: tx_crawler.crawlerCfg]

Example

 

  tx_crawler.crawlerCfg.paramSets.test = &L=[0-3]&contentId=[_TABLE:tt_content]
  tx_crawler.crawlerCfg.paramSets.test {
      procInstrFilter = tx_indexedsearch_reindex
  }

To top


Valid XHTML 1.0!