Web->KB dataset Web pages partitioned into classes, with hyperlink data. The dataset has been used for text categorization and learning to extract symbolic knowledge from the World Wide Web. http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/ Top/Computers/Artificial_Intelligence/Machine_Learning/Datasets