Ira Pohl
Artificial Intelligence
The identification and extraction of technical terms is one of the better understood and most robust Natural Language Processing (NLP) technologies within the current state of the art of language engineering. In generic information management contexts, terms have been used primarily for procedures seeking to identify a set of phrases that is useful for tasks such as text indexing, computational lexicology, and machine-assisted translation: such tasks make important use of the assumption that terminology is representative of a given domain. This paper discusses an extension of basic terminology identification technology for the application to two higher level semantic tasks: domain description, the specification of the technical domain of a document, and content characterisation, the construction of a compact, coherent and useful representation of the topical content of a text. With these extensions, terminology identification becomes the foundation of an operational environment for document processing and content abstraction. © 1999, Cambridge University Press. All rights reserved.
Ira Pohl
Artificial Intelligence
Arthur Nádas
IEEE Transactions on Neural Networks
Saeel Sandeep Nachane, Ojas Gramopadhye, et al.
EMNLP 2024
P.C. Yue, C.K. Wong
Journal of the ACM