4406 entries. 94 themes. Last updated December 26, 2016.

Hans Peter Lund of IBM Develops an Automatic Document Indexing Program (1958)


In 1958 Hans Peter Luhn of IBM developed an automatic document indexing program for the production of literature abstracts.

"The complete text of an article in machine-readable form is scanned by an IBM 704 data-processing machine and analyzed in accordance with a standard program. Statistical information derived from word frequency and distribution is used by the machine to compute a relative measure of significance, first for individual words and then for sentences. Sentences scoring highest in significance are extracted and printed out to become the "auto-abstract."