Information Elicitation Using Web Feeds
Barkha Bhadana, Jay Shankar
KEYWORDS: Web News Article, Information extraction, RSS FEED, Triggers.
ABSTRACT: This paper discusses the new algorithm to extract only significant information from dynamic news websites. Earlier wrapper were used for extraction of news but their use is full of complexities because of two reasons-first one is wrapper generation and wrapper maintenance. Our approach uses triggers such as AND and OR to extract only meaningful information from web pages. This approach is applicable to the general types of news RSS feeds and independent of news page layout.
. NASCIO Research Brief:Think Before You Dig:Privacy Implications of Data Mining & Aggregation.
. Schema-Guided Wrapper Maintenance for web-data Extraction.
. Detecting and Partitioning of data objects in complex web pages.
. Google news.http://news.google.com.
. American Newspapers and the Internet.:Threat and opportunity?Technical report,The Bivings Group,july 2007.
. Full Text RSS.Http://echodittolabs.org/fulltextrss.
. Ashish N,Knoblock C A.Wrapper generation for semi-structured Internet sources.SIGMOD,1997,26(4):8-15.
. Gupta A.,Harinarayan V.,Quass D.,and Rajaram A.Method and apparatus for structuring the querying and interpretation of semistructued information .United States Patent number 5,826,258,1998.
. yahoo News:http://news.yahoo.com
. yahoo Shopping:http://shopping.yahoo.com.