Theme list

The Theme List work board offers two ways to list themes:

  • Right-click and choose List from the pop-up menu. All themes on the MetaCamp server are listed, with pagination.
  • Type a theme name into the text box below the theme list and press RETURN. All themes matching the query are listed, with pagination. The wildcard character "*" may be used; for example, "Com", "*Com*" and "*" are all valid queries.
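
The matching performed by the query box can be sketched roughly as follows. This is only an illustration: it assumes that "*" matches any run of characters (including none), that matching is case-sensitive, and that a query without "*" must match the whole name. The manual does not state these details. The function name match_themes and the sample theme names are hypothetical.

```python
from fnmatch import fnmatchcase


def match_themes(query, theme_names):
    """Return the theme names matching a query such as "Com", "*Com*" or "*"."""
    # Assumption: "*" is the only wildcard. fnmatch also gives "?" and "["
    # a special meaning, so they are wrapped in brackets to keep them literal.
    pattern = "".join("[" + ch + "]" if ch in "?[" else ch for ch in query)
    return [name for name in theme_names if fnmatchcase(name, pattern)]


themes = ["Com", "Computer", "TeleCom", "Books"]  # hypothetical theme names
print(match_themes("Com", themes))
print(match_themes("*Com*", themes))
print(match_themes("*", themes))
```

Under these assumptions the query "*" lists every theme, which corresponds to what the List menu item does.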

The column Host of the theme list shows whether the user has generated the Data and Clue Extraction Instruction Files for a data schema belonging to this theme and stored them on the currently connected DataStore server. If so, an image and the character Y are shown. If not, an image and the character N are shown. If this cannot be determined, an image and the character U are shown.

The column status can take the following three values:

  • ready: at least one data schema has been defined for this theme.
  • torecognize: no data schema has been defined for this theme, although DataScraper has already extracted at least one clue belonging to it. The Recognize operation can be performed on a theme in this state to define data schemas. It is a shortcut that replaces loading a sample page manually, as described in the chapter Load a sample page.
  • reserved: the name has been used by a data schema stored on the MetaCamp server, although no clues belonging to this theme have been extracted yet. For example, the user gave this name to a new target theme when defining a data schema. Once DataScraper has extracted one clue for this theme, the status changes to torecognize.
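
The three values can be read as a simple decision rule. The sketch below is only a way to summarize the list above. It assumes that a theme record already exists, and the counters schema_count and clue_count are hypothetical. MetaCamp stores the status as a field of the theme record and is not known to compute it this way.

```python
def theme_status(schema_count, clue_count):
    """Derive the status of an existing theme from two hypothetical counters."""
    if schema_count >= 1:
        # At least one data schema is defined for the theme.
        return "ready"
    if clue_count >= 1:
        # Clues were extracted, but no data schema is defined yet.
        return "torecognize"
    # The name is in use, but there are neither schemas nor clues for the theme.
    return "reserved"


print(theme_status(schema_count=0, clue_count=3))
```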

Besides List, the right-button pop-up menu has two more items:

  • Info: Opens a pop-up window with detailed information on the selected theme, for example its data schemas, the owner, the DataStore server hosting the instruction files, the contributor who modified the instruction files, the modification date, etc. These items are described further in MetaStudio Senior User's Handbook#Theme List.
  • Recognize: When crawling the Web and extracting data, DataScraper extracts new clues according to SCE files. If it extracts clues belonging to a theme that does not exist yet, DataScraper inserts a new theme record into the MetaCamp database with the status field set to torecognize, which means that the new theme has no data schemas defined. When the Recognize menu item is clicked, MetaStudio selects and loads one page as a sample page, and the operator can then define a data schema for this theme. Once all the steps described in the previous chapter have been completed, the status of this theme changes to ready. In short, this menu item is a convenient way to define the first data schema for a new theme.
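
The life cycle of a theme record described on this page can be modelled as a small state machine. The following sketch is an in-memory illustration only. The class ThemeRegistry, its method names, and the theme name "Camera" are hypothetical and do not correspond to any MetaCamp or DataScraper interface.

```python
class ThemeRegistry:
    """In-memory stand-in for the theme records kept by MetaCamp (hypothetical)."""

    def __init__(self):
        self.status = {}  # theme name -> status string

    def reserve(self, name):
        # A data schema names this theme as a target before any clue exists.
        self.status.setdefault(name, "reserved")

    def clue_extracted(self, name):
        # DataScraper extracted a clue: create the record if it is missing,
        # or promote a reserved theme. Other states are left unchanged.
        if self.status.get(name, "reserved") == "reserved":
            self.status[name] = "torecognize"

    def schema_defined(self, name):
        # The operator finished defining a data schema, e.g. after Recognize.
        self.status[name] = "ready"


registry = ThemeRegistry()
registry.clue_extracted("Camera")   # clue for a theme that does not exist yet
print(registry.status["Camera"])
registry.schema_defined("Camera")   # operator defines the first data schema
print(registry.status["Camera"])
```

The model covers only the transitions stated here: a new or reserved theme becomes torecognize when its first clue is extracted, and a theme becomes ready once a data schema has been defined for it.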