Inventors:
Paul Harrison - Keller TX, US
James Oliphant - Pleasant Grove UT, US
Hal Fulton - Austin TX, US
Armin Roehrl - Koetzting, DE
Brenden Grace - Charlottesville VA, US
Assignee:
Collective Media, Inc. - New York NY
International Classification:
G06F 17/30
US Classification:
707/741, 707/748, 707/E17.083, 707/E17.061, 707/E17.015
Abstract:
A content classification system, method, and computer product are presented. In connection with the invention, a data structure is created by identifying a plurality of words and mapping each word to one or more categories. The data structure is indexed. An item of content is identified and classified based on the data structure. The classification includes identifying all one-or-more-word combinations in the item of content; for each word of at least a pre-determined number of characters in length in each of the word combinations, identifying each of the categories to which it is mapped; and determining a weight for each of the words based on an inverse proportion to the number of categories to which it is mapped.
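
The weighting step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (build_index, word_weights, classify), the default minimum length of 3 characters, whitespace tokenization, and the choice to sum word weights into per-category scores are all assumptions made for illustration. The abstract states only that each word's weight is inversely proportional to the number of categories it maps to. For brevity the sketch handles single words only, not multi-word combinations.

```python
from collections import defaultdict


def build_index(word_categories):
    """Build the word -> categories data structure (hypothetical helper).

    word_categories is an iterable of (word, categories) pairs.
    """
    index = defaultdict(set)
    for word, categories in word_categories:
        index[word.lower()].update(categories)
    return dict(index)


def word_weights(text, index, min_length=3):
    """Weight each indexed word by 1 / (number of categories it maps to).

    Words shorter than min_length (an assumed threshold) or absent from
    the index are skipped. A repeated word is counted once.
    """
    weights = {}
    for word in text.lower().split():
        categories = index.get(word)
        if len(word) >= min_length and categories:
            weights[word] = 1.0 / len(categories)
    return weights


def classify(text, index, min_length=3):
    """Sum word weights per category (the aggregation rule is an assumption)."""
    scores = defaultdict(float)
    for word, weight in word_weights(text, index, min_length).items():
        for category in index[word]:
            scores[category] += weight
    return dict(scores)


# Illustrative data only; these words and categories are not from the patent.
index = build_index([
    ("goal", ["sports"]),
    ("bank", ["finance", "geography"]),
    ("river", ["geography"]),
])
scores = classify("the river bank", index)
print(scores)
```

In this example "river" maps to one category and receives weight 1.0, while "bank" maps to two and receives 0.5, so an ambiguous word contributes less to any single category than an unambiguous one.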