Advanced Tools


Semantics Scoring Application

Employs Word Embeddings to score semantic distance between two words. Requires a CSV file with "target" and "response" columns. Provides two semantic measures:
a. Semantic Measure 1: Based on pre-trained glove vectors from 2B tweets, 27B tokens, 1.2M vocab, uncased.
b. Semantic Measure 2: Based on trained vectors from the English Wikipedia trained by us.

Spelling Application and Phonology Application
What is the Spelling and Phonology Score?

The Spelling and Phonology Score provides composite measurements that quantify phonological variations between a target word and a response word. These scores are calculated after converting both words into their International Phonetic Alphabet (IPA) representations, which encapsulate the phonological attributes of the words.

Spelling Score vs. Phonology Score

While the Phonology Score is calculated using IPA conversions for both words, the Spelling Score diverges in its approach. Specifically, it converts only non-words into their IPA forms, not actual words. This is predicated on the notion that writers are likely familiar only with the phonological representation of non-words, not their written forms. However, users have the flexibility to modify this behavior by choosing to treat non-words as words, or vice versa, for IPA conversion.

Algorithmic Methodology

The algorithm employs a customized version of the Levenshtein distance metric that accounts for transpositions. In essence, the Levenshtein distance measures the minimal number of operations (additions, deletions, substitutions, and transpositions) required to transform one word into another.

2. **Phonetic Comparisons**:

- For non-words, the tool employs their phonetic transcriptions to gauge their similarity to real words.

3. **Spelling Distance for Words and Non-Words**:

- The tool accommodates both real and fabricated words, providing comprehensive metrics for each word pair in a given list.

Updated Features

1. **Levenshtein Score with Operations**:

- This tool now offers a nuanced breakdown, detailing not just the Levenshtein Score but also the types and counts of operations, Transpositions, Deletions, Insertions, and Substitutions, needed to equate two words.

Why Transpositions, Deletions, Insertions, and Substitutions Matter

These operations are noteworthy for various reasons:

- **Typing or Phonological Errors**: often occur when people mistype or mispronounce words.

- **Language Learning**: Learners frequently make mistakes in the sequence of adjacent sounds or letters.

- **Clinical Assessments**: Errors like transpositions can signal specific language impairments or cognitive conditions.

Expanded Explanation of Operations

1. **Insertions**:

- Adding a new letter to a word, e.g., transforming "cat" into "cart" by inserting an 'r'.

2. **Deletions**:

- Removing a letter from a word, e.g., reverting "cart" back to "cat" by deleting the 'r'.

3. **Substitutions**:

- Replacing one letter with another, e.g., changing "cat" to "bat" by substituting 'c' with 'b'.

4. **Transpositions**:

- Swapping adjacent letters, e.g., converting "flow" to "fowl" by transposing 'l' and 'o'.

Target Audience

This tool is geared towards researchers, linguists, and educators focusing on language and spelling. It has applications in:

- **Spelling Research**: To analyze frequent types of misspellings.

- **Language Learning**: To assess the proximity of a learner's pronunciation to the target word.

- **Healthcare**: To monitor language capabilities in fields like speech therapy.

Scoring Phonological Errors

Scores phonological distance. Requires a CSV file with "target" and "response" columns. Application will create a new column with phonemic scores. Provides a score of phonological errors on a scale of 0.0 to 1.0, where 1.0 indicates highest degree of error. Supported languages are listed below..

Scoring Spelling Responses

Scores spelling errors. Requires a CSV file with "target", "response", and "type" columns. Application will create a new column with spelling scores. Provides a score of spelling errors on a scale of 0.0 to 1.0, where 1.0 indicates highest degree of error. Supported languages are listed below. See the preparing the CSV section on how to prepare the CSV file for this application.

International Phonetic Alphabet (IPA) Transcription Tool

This tool allows the transcription of a text written in standard alphabet into the International Phonetic Alphabet (IPA). In addition, the tool provides measures about the phoneme distribution found in the provided text.

Using the Phonological, Spelling Error Scoring, and IPA Transcription Tools

The applications cover the following languages.

Supported Languages
  • Afrikaans
  • Albanian
  • Aragonese
  • Armenian (West)
  • Armenian
  • Bosnian
  • Bulgarian
  • Catalan
  • Chinese (Cantonese)
  • Chinese (Mandarin)
  • Croatian
  • Czech
  • Danish
  • Dutch
  • English (North UK)
  • English (Received Pronunciation)
  • English (Scottish)
  • English (UK)
  • English (US)
  • English (West Indies)
  • English (West Midlands)
  • Esperanto
  • Estonian
  • Finnish
  • French (Belgium)
  • French
  • Georgian
  • German
  • Greek (Ancient)
  • Greek
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Irish
  • Italian
  • Kannada
  • Kurdish
  • Latin
  • Latvian
  • Lingua Franca Nova
  • Lithuanian
  • Macedonian
  • Malay
  • Malayalam
  • Nepali
  • Norwegian
  • Persian (Pinglish)
  • Persian
  • Polish
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Punjabi
  • Romanian
  • Russian
  • Serbian
  • Slovak
  • Spanish (Latin America)
  • Spanish
  • Swahili
  • Swedish
  • Tamil
  • Turkish
  • Vietnamese (Hue)
  • Vietnamese (Sgn)
  • Vietnamese
  • Welsh
Preparing CSV Files

For the Semantics Scoring Application, Scoring Phonological Errors, and Scoring Spelling Responses, you need to provide a CSV file with specific columns. The IPA transcription works with plain text.

For the Semantics Scoring Application: CSV file should have "target" and "response" columns.

For the Phonological Errors Scoring Application: CSV file should have "target" and "response" columns.

For the Spelling Responses Scoring Application: CSV file should have "target", "response", and "type" columns. "type" column should have values "nonword" or "word".

Make sure to spell words in lowercase and without spaces. If not, you will not get an output. You can keep additional data columns, but the required columns are necessary to generate a score.

Also, make sure you have selected the appropriate target language from the dropdown menu

Processing Time

Note that long files may take longer to process.


After uploading your CSV file, the application will create a new column with the respective scores at the end of your file.

Application Usage

The tools provided can be employed in clinics and classrooms to assess patients' and students' linguistic functioning across various parameters such as semantics, phonology, and spelling.