Inventors:
David G Harris - Laurel MD, US
N. Oksana Lassowsky - Riva MD, US
Assignee:
The United States of America as represented by the National Security Agency - Washington DC
International Classification:
G06F017/24
Abstract:
A method of summarizing a text by the following steps. Identifying the textual units in the text. Selecting a first set of textual units and identifying its textual units. Selecting a second set of textual units and identifying its textual units. Determining how many textual units are shared between the first and second sets of textual units. Selecting a third set of textual units between the first and second set of textual units and identifying its unique textual units. Determining the frequency of occurrence of the textual unit in the third set of textual units. Determining the frequency of occurrence of the textual unit in the text. Determining the proximity of the results of the last two steps. Calculating a score for the first set of textual units. Assigning the highest score to the first set of textual units.