Inventors:
Thomas E. Raffill - Sunnyvale CA, US
Shunhui Zhu - San Jose CA, US
Roman Yanovsky - Los Altos CA, US
Boris Yanovsky - Saratoga CA, US
John Gmuender - San Jose CA, US
Assignee:
SonicWall, Inc. - San Jose CA
International Classification:
G06F 17/28
US Classification:
704 7, 704 10, 707258, 707754
Abstract:
Some embodiments of an efficient string search have been presented. In one embodiment, a string of bytes representing content written in a non-delimited language is received, wherein the content has been classified into a predetermined category. In a single pass through the string of bytes, a set of N-grams is searched for simultaneously. Statistical information on occurrences of the N-grams, if any, in the string of bytes is collected. In some embodiments, a model is generated based on the statistical information, where the model is usable by a content filter to classify content.