Extracting the semantic content of web pages via repeated structures | IEEE Conference Publication | IEEE Xplore