Welsh Parts-of-Speech Tagger API

The Welsh Parts-of-Speech tagger API allows users to tag Welsh words in a text with their parts of speech (e.g. verbs and mutations) automatically without needing to install any special software on their computer. It’s easy to use, and it’s free.

Go to our API Services in order to register and receive your personal API key in order to use this service. Here is the guide on how to get an API key.

You can enter any word, phrase or text segment into the API and receive a detailed analysis of each part of speech. For example, if you input “mae hen wlad fy nhadau” into the API, the following analysis will be returned to you:

mae/VBF/- hen/ADJP/- wlad/NF/TM fy/PRONOUN/- nhadau/NPL/TT

In full:

mae (verb form) hen (adjective preceeding the noun) wlad (feminine noun/soft mutation) fy (pronoun) nhadau (plural noun/nasal mutation)

This is the only service of its kind which currently exists through the medium of Welsh on the web, and tags specifically Welsh features, such as mutations.

The service has possible applications in several domains:

  • Marketing: identifying personal and place names, and providing further valuable marketing data to businesses that conduct research into customer attitudes on Twitter
  • Education: educational applications for schools and coding clubs
  • Research: use in academic projects needing to analyse large corpora
  • Development: a resource for Welsh medium software developers, including for games and apps

For more specific information on how to use the service, and example code, follow the link below to the Part-of-Speech Tagger API page on GitHub :

PorthTechnolegauIaith/postagger

 

Acknowledgements

Articles or software based on the use of this API should cite:

Jones, D. B., Robertson, P., Prys, G. (2015) Welsh language Parts-of-Speech Tagger API Service [http://techiaith.cymru/api/parts-of-speech-tagger-api/?lang=en]