result files

Why are there control codes in results?

One might expect that the control codes LF and CR (0x0a and 0x0d) would not appear in data extracted from an HTML page. In fact, the HTML standard, which specifies the permitted characters, treats both as whitespace, so they can occur in the source of a page. These control codes are found in the result files generated by MetaSeeker, mainly when data snippets are extracted for a property whose block attribute is of type text.
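
If these control codes get in the way of downstream processing, they can be located or collapsed after extraction. The following is a minimal Python sketch, not part of MetaSeeker: the function names are hypothetical, and it assumes the extracted text has already been read into a string.

```python
import re

def find_control_codes(text):
    """Return (offset, name) pairs for every CR or LF found in text."""
    names = {"\r": "CR (0x0d)", "\n": "LF (0x0a)"}
    return [(m.start(), names[m.group()]) for m in re.finditer(r"[\r\n]", text)]

def collapse_line_breaks(text):
    """Replace each run of CR/LF characters with one space and trim the ends."""
    return re.sub(r"[\r\n]+", " ", text).strip()

# Hypothetical snippet standing in for an extracted text block.
snippet = "first line\r\nsecond line\n"
print(find_control_codes(snippet))
print(collapse_line_breaks(snippet))
```

Whether to collapse the line breaks or keep them depends on whether the layout of the original text block matters for the data being collected.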

Is MetaSeeker a screen scraper or a web scraper?

There is no authority that defines what a screen scraper and a web scraper are. The following description is based on my own understanding of the difference: a screen scraper is integrated with a Web browser engine, and a web scraper is not. By that definition, MetaSeeker is a screen scraper.
