Macsen

Macsen app on a phone

Mae Macsen yn feddalwedd cynorthwyydd personol Cymraeg cod agored tebyg i Alexa neu’r Google Assistant.

Ystyr cod agored yw y gall unrhyw un weld, addasu a dosbarthu’r cod fel yr hoffen nhw. Mae’n gweithio fel ap ar ffôn neu dabled, ac mae ar gael ar gyfer dyfeisiau iOS ac Android. Mae fersiwn ar-lein o Macsen ar gael hefyd. Mae’n bosib siarad gyda Macsen mewn Cymraeg naturiol er mwyn gofyn iddo gwblhau tasgau neu ofyn am wybodaeth.

Polisi preifatrwydd

Tair sgrin ffôn symudol, gyda phob un yn arddangos un o sgriniau macsen.

Macsen’s skills

By now, Macsen has a number of skills, including the ability to play Welsh music on Spotify, play programs on S4C Clic, turn the light on or off, report the latest news from Golwg360 headlines and provide weather forecasts from the OpenWeatherMap website. Macsen also uses ChatGPT, translating English speech into Welsh text and transcribing Welsh speech into text. So it is possible to type as well as use the voice to ask questions and give orders to Macsen.

Assistant

The latest developments in the field of artificial intelligence have transformed the field, and caused us to reconsider what is possible for a computer to achieve. Macsen now uses the ChatGPT-4 Language Model to answer questions, reason and converse through the medium of Welsh.

Translation

Mae sgil cyfieithu Macsen yn galluogi defnyddwyr i gyfieithu lleferydd Saesneg i destun Cymraeg. Erbyn hyn, mae’r dechnoleg hefyd wedi’i gysylltu â geiriadur Y Termiadur Addysg, sy’n darparu nodwedd defnyddiol er mwyn rhestru termau cysylltiedig i’r hyn a lefarwyd. Mae adnoddau eraill a grëwyd i hwyluso cyfieithu ar gael, megis aliniwr a’n hadnodd ar gyfer rhannu atgofion cyfieithu.

Transcription

Mae sgil trawsgrifio Macsen yn galluogi defnyddwyr i drawsgrifio unrhyw leferydd Cymraeg yn destun. Ar ôl llefaru eich neges, gallwch gopio’r testun mewn i unrhyw ap, boed yn neges destun, e-bost neu’n restr siopa! Yn ogystal â bod yn sgil yn Macsen, mae Trawsgrifiwr ar gael fel fersiwn ar-lein. Mae fersiwn Windows o Trawsgrifiwr ar gael hefyd.

Download Macsen

Be part of the Welsh language’s digital revolution and download the Macsen app for iOS or Android devices today!

Language technologies within Macsen

Macsen uses a number of different technologies to operate. It uses Mozilla DeepSpeech speech recognition to translate what you say into text. Then it uses intent parsing to recognise whether you asked for the news, weather, music or one of the other options. When Macsen needs to reply orally, it uses text-to-speech technology to speak the the appropriate response.

We are still improving the speech features, and if you would like, you can help us improve it in the future by contributing recordings of your voice. You can do this by clicking on Training within the app. This will guide you to read specific sentences aloud in the app. We will use these recordings to create development sets and test sets for training the speech recognition. If you want to contribute more than this, visit Mozilla’s CommonVoice website to record sentences for the large collection of recordings. More information about these technologies and the Welsh language is available in the Language Technologies Handbook published by the Coleg Cymraeg Cenedlaethol.

Macsen is funded by the Welsh Government, and we thank them and the volunteers who have been contributing their voices to improve speech technology.

Thanks also to Golwg360 and OpenWeatherMap for permission to use their online services.

Open source resources for developers

We are using this project to show what we can create when developing Welsh language speech technology and artificial intelligence and all relevant components and resources are shared below under an open source license:

techiaith/macsen-flutter

techiaith/macsen-sgwrsfot

Documentation showing how Macsen can be used to expand a digital service is available within the app code: https://github.com/techiaith/macsen-flutter/blob/master/docs/README.md

Macsen’s research publications

Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language, Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), pages 194-201 Language Resources and Evaluation Conference (LREC 2020), Marseille, France Paper

Building Intelligent Assistants for Speakers of a Lesser-Resourced Language,CCURL 2016 2nd Workshop on Collaboration and Computing for Under-Resourced Languages ‘Towards an Alliance for Digital Language Diversity’ (LREC 2016), Portoroz, Slovenia. Paper

Tuag at Gynorthwyydd Personol Deallus Cymraeg, Astudiaeth Fer o APIs ar gyfer Gorchmynion Llafar, Systemau Cwestiwn ac Ateb a Thestun a Lleferydd ar gyfer Llywodraeth Cymru. Report