The difficulties of transcribing tone. Or, what's the goal of transcribing IPA with Machine Learning?

this-word-does-not-exist

Mentions: 33 | Stars: 1,009 | Activity: 0.0 | Language: Python

This Word Does Not Exist

I'm a software engineer by profession and occasionally have reason to play with so-called Machine Learning (ML). I think the best showcase of what's possible nowadays is the "This X Does Not Exist" fashion for generating new instances of an arbitrary category: say, human faces or even English words. Imagine a word that seemingly possesses all the natural characteristics of a word but does not actually exist, for example: trichurid. ML can produce an endless supply of these.
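To make the idea of a plausible but nonexistent word concrete, here is a minimal sketch that swaps in a much simpler technique than the neural models behind the "This X Does Not Exist" projects: a character-level Markov chain. The word list, function names, and parameters are all assumptions made for illustration, and nothing here reflects how this-word-does-not-exist is actually implemented.

```python
import random
from collections import defaultdict

# Tiny illustrative training list (an assumption); a real system would
# learn from a large lexicon.
WORDS = ["transcribe", "phoneme", "machine", "language", "linguist",
         "syllable", "learning", "generate", "consonant", "phonetic"]


def build_model(words, order=2):
    """Map each `order`-character context to the characters seen after it."""
    model = defaultdict(list)
    for word in words:
        padded = "^" * order + word + "$"  # ^ pads the start, $ marks the end
        for i in range(len(padded) - order):
            model[padded[i:i + order]].append(padded[i + order])
    return model


def generate(model, rng, order=2, max_len=12):
    """Sample one string character by character until $ or max_len."""
    context, out = "^" * order, []
    while len(out) < max_len:
        nxt = rng.choice(model[context])
        if nxt == "$":
            break
        out.append(nxt)
        context = context[1:] + nxt
    return "".join(out)


def pseudo_words(n, seed=0):
    """Return n generated strings that do not appear in the training list."""
    rng = random.Random(seed)
    model = build_model(WORDS)
    known, found = set(WORDS), []
    while len(found) < n:
        word = generate(model, rng)
        if len(word) >= 4 and word not in known:
            found.append(word)
    return found


if __name__ == "__main__":
    print(pseudo_words(5))
```

Because the model only recombines letter sequences it has already seen, the output tends to look word-like while rarely matching a real word. A neural generator does the same job with far richer context.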

fairseq

Mentions: 89 | Stars: 29,262 | Activity: 6.0 | Language: Python

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

It would seem ML is well placed to handle the world of phonemes (fundamental categories) and phones (variations on those categories). Indeed, Facebook has a mature project and a set of pre-trained ML models for something similar, if not identical: wav2vec (v2.0). If it's not identical, I think the remaining gap would be trivial to close. Wav2vec is trained to map the spoken word of a language to that language's particular writing system. However, we already have plenty of software that can convert writing systems to IPA. Whilst all that does connect a lot of dots, it's not exactly what I think the goal should be.
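The second half of that pipeline, converting a writing system to IPA, can be sketched with a rule table and greedy longest-match lookup. This is a toy illustration under stated assumptions: the `RULES` table and the `to_ipa` function are hypothetical, cover only a handful of Spanish-like spelling rules, and do not use or reproduce the API of epitran or any other real tool.

```python
import unicodedata

# Hypothetical, heavily simplified rule table for Spanish-like spelling.
# It ignores context-dependent rules (c and g before e or i, word-initial r),
# stress, and dialect variation, so it is an illustration only.
RULES = {
    "ch": "tʃ", "ll": "ʝ", "rr": "r", "qu": "k",
    "ñ": "ɲ", "j": "x", "h": "", "v": "b", "r": "ɾ", "y": "ʝ",
}


def to_ipa(text, rules=RULES):
    """Greedy longest-match transliteration.

    Characters with no rule are passed through unchanged, which is only
    reasonable for letters whose IPA symbol matches the spelling.
    """
    longest = max(len(key) for key in rules)
    text = unicodedata.normalize("NFC", text.lower())
    out, i = [], 0
    while i < len(text):
        # Try the longest possible chunk first so "ch" wins over "c" + "h".
        for size in range(longest, 0, -1):
            chunk = text[i:i + size]
            if chunk in rules:
                out.append(rules[chunk])
                i += len(chunk)
                break
        else:
            out.append(text[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    for word in ["mucho", "queso", "hola", "perro", "pero"]:
        print(word, "->", to_ipa(word))
```

A table like this works only because the spelling is fairly regular. It also shows why chaining speech-to-text with text-to-IPA is limited: the output can contain only what the orthography encodes, so detail such as tone that the spelling omits never reaches the transcription.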

stylegan2-pytorch

Mentions: 1,989 | Stars: 3,613 | Activity: 0.0 | Language: Python

Simplest working implementation of StyleGAN2, a state-of-the-art generative adversarial network, in PyTorch. Enabling everyone to experience disentanglement.

epitran

Mentions: 2 | Stars: 579 | Activity: 7.5 | Language: Python

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

This page summarizes the projects mentioned and recommended in the original post on /r/linguistics
Tags: Machine Learning, PyTorch, Artificial intelligence, Python, generative-adversarial-network
Post date: 14 Apr 2022
