Semantic Topology

Jussi Karlgren Gabriel Isheden Martin Bohman Ariel Ekgren Emelie Kullmann David Nilsson
Gavagai and KTH

A reasonable requirement (among many others) for a lexical or semantic component in an information system is that it should be able to learn incrementally from the linguistic data it is exposed to, that it can distinguish between the topical impact of various terms, and that it knows if it knows stuff or not.

We work with a specific representation framework – semantic spaces – which well accommodates the first requirement; in this short paper, we investigate the global qualities of semantic spaces by a topological procedure – mapper – which gives an indication of topical density of the space; we examine the local context of terms of interest in the semantic space using another topologically inspired approach which gives an indication of the neighbourhood of the terms of interest. Our aim is to be able to establish the qualities of the semantic space under consideration without resorting to inspection of the data used to build it.

In: Proceedings of the 23d ACM international conference on Conference on information & knowledge management (CIKM '14) in Shanghai, Nov 3-7. ACM, New York, NY, USA, 2014.