Skip to main content
  • Cymraeg (Welsh)
Welsh National Language Technologies Portal

Welsh National Language Technologies Portal

Menu

Skip to content
  • The Resources
    • ELG
    • GitHub
    • Docker
    • Books
  • Packages
    • Macsen – Welsh Language Voice Assistant
    • Trawsgrifiwr (Welsh Transcriber)
    • Lleisiwr (Voicer)
  • Translation
    • Aligning
    • Localisation
    • Machine Translation
      • Advice Note
      • Demo
    • Sharing TMs
  • Speech
    • Text-to-Speech
    • Speech Recognition
    • Forced Aligner
  • Cloud
    • API Services
      • How to register for an API key
      • Machine Translation API
      • Text-to-Speech API
      • Cysill Ar-lein (spelling and grammar checker) API
      • Parts-of-Speech Tagger API
      • Lemmatizer API
    • Widgets
      • Vocab
      • Terminology Dictionaries
      • Cysill Ar-lein Widget
      • Welshify
      • Treigl
  • Data
    • Place-Names
    • Lexicons
    • Corpora
  • News
    • Coding
    • Machine Translation
    • Speech
    • API Services
    • Corpora
  • Events
    • Conference 2023
      • Conference Programme 2023
      • Software Roadshow
      • Presentations 2023
    • Show & Tell 25 March 2022
    • Stronger Together: Sharing translation resources in Wales
      • Agenda
      • Presentations
    • Conference 2021
      • Conference Programme 2021
      • Presentations 2021
    • Hacio’r Iaith presents Hacio Bach
    • National Language Technologies Portal Workshop 18 June 2019
    • Conference 2019
      • Presentations 2019
      • Conference Programme
      • Software demos
      • Scientific committee
      • Conference Accommodation
      • Management Centre Location and Parking
    • Conference 2017
      • Recordings
      • Call for papers
      • Conference Programme
      • Software demos
      • Scientific committee
      • Conference Accommodation
      • Pontio Location and Parking arrangement
    • Conference 2015
      • Background
      • Guest Speakers
      • Conference Papers
      • Hacio’r Iaith 2015
  • Sitemap

New Speech Resources

December 20, 2017
techiaith
Resources Speech Speech Recognition No Comments

New speech resources have just been published by us under the Macsen project, funded by the Welsh Government. See details below. Enjoy!

HTK Acoustic Model

http://techiaith.cymru/htk/paldaruo-16kHz-2017-12-08.tar.gz

Lexicon

http://techiaith.cymru/htk/lexicon-2017-12-08.tar.gz

Prosodylab Aligner

There are also new HTK acoustic models included in the Welsh Prosodylab Aligner:

https://github.com/techiaith/Prosodylab-Aligner/tree/v2.0_paldaruo_4

Kaldi Acoustic Model

http://techiaith.cymru/kaldi/decoders/paldaruo_macsen/tri3-2017-12-18.tar.gz

Training code in GitHub

https://github.com/techiaith/kaldi-cy

Post navigation

RoboLlywydd
Introducing Lleisiwr – Welsh Open Source Voice Banking and Text to Speech

National Portals

  • Welsh National Terminology Portal
  • Welsh National Corpora Portal
  • Welsh National Language Technologies Portal

Search

Latest from the Blog

  • New Deepspeech Training Scripts and Models for Welsh
  • KESS MRes Scholarship – Welsh Ontology of Family Law
  • Mozilla CommonVoice, Paldaruo ac Adnabod Lleferydd Cymraeg
  • Introducing Lleisiwr – Welsh Open Source Voice Banking and Text to Speech
  • New Speech Resources
  • RoboLlywydd
  • Darlith Cymdeithas Wyddonol Gwynedd

Categories

  • API Services
  • Coding
  • Conference 2015
  • Corpora
  • Cysill
  • Grammar Checking
  • Machine Translation
  • Raspberry Pi
  • Resources
  • Speech
  • Speech Recognition
  • Spell Checking
  • Text to Speech
  • Translation
  • Uncategorized @en

Useful Links

Language Technologies Network
Language Technologies Unit
Cysill Ar-lein
DECHE : Digitising, E-Publishing and Electronig Corpora
Canolfan Bedwyr

Information Security Policy

Connect

techiaith@bangor.ac.uk

 @techiaith
   techiaith
Canolfan Bedwyr
Bangor University,
Bangor, Gwynedd, LL57 2DG

Bangor University is a Registered Charity: No. 1141565
Copyright © 2001–2021