Computational Linguistics
About

Frederick Jelinek

Frederick Jelinek (1932–2010) led the IBM speech recognition group that transformed natural language processing from a rule-based discipline into a statistical science, famously declaring that every time he fired a linguist, the performance of the speech recognizer went up.

P(W) = ∏ P(wᵢ | wᵢ₋₁, ..., wᵢ₋ₙ₊₁)

Frederick Jelinek was a Czech-American researcher whose leadership of the IBM Continuous Speech Recognition group from the 1970s to the 1990s catalysed the statistical revolution in natural language processing. Under his direction, the IBM group developed n-gram language models, the noisy channel framework for speech recognition, and statistical approaches to machine translation that displaced earlier rule-based paradigms.

Early Life and Education

Born in Kladno, Czechoslovakia, in 1932, Jelinek fled the communist regime and eventually settled in the United States. He studied electrical engineering at MIT, where he earned his PhD in 1962 working on information theory under Robert Fano. He joined IBM's Thomas J. Watson Research Center, where he built the speech recognition group that would become legendary in computational linguistics.

1932

Born in Kladno, Czechoslovakia

1962

Completed PhD at MIT under Robert Fano

1972

Began leading IBM's speech recognition research

1976

IBM group demonstrated first large-vocabulary continuous speech recognition system

1993

Moved to Johns Hopkins University as director of the Center for Language and Speech Processing

2010

Died in Baltimore, Maryland

Key Contributions

Jelinek and his IBM colleagues formalised speech recognition as a statistical decoding problem using Bayes' theorem: find the word sequence W that maximises P(W|A) = P(A|W)P(W)/P(A), where A is the acoustic signal. The language model P(W) provides prior probabilities over word sequences, the acoustic model P(A|W) models the mapping from words to sounds, and decoding searches for the best W. This framework required estimating n-gram language models from large text corpora, developing smoothing techniques, and building hidden Markov models for acoustic modelling.

At Johns Hopkins, Jelinek established the annual Summer Workshop on Language Engineering (later the Frederick Jelinek Memorial Workshop), which became the premier venue for intensive collaborative research in speech and language processing. He also authored the influential textbook Statistical Methods for Speech Recognition.

"Every time I fire a linguist, the performance of the speech recognizer goes up." — Frederick Jelinek (attributed, circa 1988)

Legacy

Jelinek's group at IBM trained an entire generation of researchers who went on to lead NLP at major technology companies and universities. The statistical paradigm they established — using probability theory and large datasets rather than hand-crafted rules — became the dominant approach to NLP and remains the foundation on which modern deep learning methods are built. The JHU summer workshops continue to produce influential research.

Interactive Calculator

Enter a CSV of publications: year,title,citations_count. The calculator computes total citations, h-index, peak year, and a per-decade breakdown of scholarly output.

Click Calculate to see results, or Animate to watch the statistics update one record at a time.

Related Topics

References

  1. Jelinek, F. (1997). Statistical Methods for Speech Recognition. MIT Press.
  2. Jelinek, F., Mercer, R. L., Bahl, L. R., & Baker, J. K. (1977). Perplexity — A measure of the difficulty of speech recognition tasks. Journal of the Acoustical Society of America, 62(S1), S63.
  3. Jelinek, F. (2009). The dawn of statistical ASR and MT. Computational Linguistics, 35(4), 483–494. doi:10.1162/coli.2009.35.4.35404
  4. Church, K. (2011). A tribute to Frederick Jelinek. Computational Linguistics, 37(1), 1–13.

External Links