Discovering relevant scientific literature on the Web

Online Available
Authors: Bollacker, K. D.; Lawrence, Steve; Lawrence, S.; Giles, C. L.;
Publishing Info: Intelligent Systems and Their Applications , 15(2), 42--47.
Year: 2000
Everyone's Keywords: automatic generator;   Data mining;   disorganized database;   knowledge discovery;   incoming records;   Information retrieval;   information filtering system;   World Wide Web;   presentation methods;   digital libraries;   
 
Abstract: Scientific literature on the Web makes up a massive, noisy, disorganized database. Unlike large, single-source databases such as a corporate customer database, the Web database draws from many sources, each with its own organization. Also, owing to its diversity, most records in this database are irrelevant to an individual researcher. Furthermore, the database is constantly growing in content and changing in organization. All these characteristics make the Web a difficult domain for knowledge discovery. To quickly and easily gather useful knowledge from such a database, users need the help of an information filtering system that automatically extracts only relevant records as they appear in a stream of incoming records. To this end, we have developed the CiteSeer. CiteSeer is an automatic generator of digital libraries of scientific literature. It uses sophisticated acquisition, parsing, and presentation methods to eliminate most of the manual effort of finding useful publications on the Web
 
 
Close
Discuss This Article
 


 
Close
You may also find these articles interesting
 
 
Share:Yahoo! My Web Google Bookmarks StumbleUpon Digg del.icio.us Facebook Technorati Diigo
Users having this article in their library
 
 
 
 

Feedback | About Memento