Named-Entity Chunking for Norwegian Text using Support Vector Machines

Bjarte Johansen, Department of Information Science and Media Studies, University of Bergen, Norway
Keywords: Named Entity Chunking, NEC, Chunking, Named Entity Recognition, NER, Natural Language Processing, NLP


Named-Entity Chunking is the part of the Named-Entity Recognition (NER) process that identifies which parts of a text are names. This task is usually done implicitly inside the recognizer. Because previous attempts at NER for Norwegian text focus only on recognition, this research develops an explicit chunker. The results show that when the task is limited to demarcating names, without also determining their type, names in Norwegian text can be found accurately (F1-score above 95%) using Support Vector Machines.
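To make the evaluation criterion concrete, the following is a minimal sketch of how a chunk-level F1-score can be computed when names are only demarcated and not typed. It assumes a B/I/O token encoding (B begins a name, I continues it, O is outside any name) and exact-match scoring of spans; neither assumption is stated in the abstract, and the function names `bio_to_spans` and `chunk_f1`, as well as the example sentence and its tags, are hypothetical.

```python
from typing import List, Optional, Set, Tuple

Span = Tuple[int, int]  # (start, end) token indices, end exclusive


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect name spans from a sequence of B/I/O tags (assumed encoding)."""
    spans: Set[Span] = set()
    start: Optional[int] = None
    for i, tag in enumerate(tags):
        if tag == "B" or (tag == "I" and start is None):
            # A new name begins here; close any name that is still open.
            if start is not None:
                spans.add((start, i))
            start = i
        elif tag == "O" and start is not None:
            # The open name ended on the previous token.
            spans.add((start, i))
            start = None
    if start is not None:
        spans.add((start, len(tags)))
    return spans


def chunk_f1(gold: List[str], predicted: List[str]) -> float:
    """F1 over exactly matching name spans (no entity types involved)."""
    gold_spans = bio_to_spans(gold)
    pred_spans = bio_to_spans(predicted)
    correct = len(gold_spans & pred_spans)
    if correct == 0:
        return 0.0
    precision = correct / len(pred_spans)
    recall = correct / len(gold_spans)
    return 2 * precision * recall / (precision + recall)


# Illustrative sentence (not taken from the paper's data):
# "Bjarte Johansen arbeider ved Universitetet i Bergen ."
gold = ["B", "I", "O", "O", "B", "I", "I", "O"]
predicted = ["B", "I", "O", "O", "B", "O", "B", "O"]
score = chunk_f1(gold, predicted)
```

In this illustration the prediction recovers one of the two gold names exactly and splits the other into two fragments, so precision is 1/3 and recall is 1/2. Under exact-match scoring a partially demarcated name counts as an error on both sides, which is why boundary accuracy matters even when entity types are ignored.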