Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support metadata with vowel and consonant or pronunciation properties #18

Open
grhoten opened this issue Mar 14, 2024 · 0 comments
Open
Labels
discuss Discussion item

Comments

@grhoten
Copy link
Member

grhoten commented Mar 14, 2024

There are several languages that change their choice of articles or prepositions depending on whether it starts or ends in a vowel or consonant. As an example, the word "apple" has IPA information for pronouncing words in Wikidata. The vowel and consonant properties can be derived from that information. Properly supporting the English indefinite article requires this information to handle all of the edge cases. For example, in English you say "an apple" and not "a apple". You can make default guesses with a UnicodeSet to check the base character being in "[aeiou]" for the front of the word, but you have to handle such edge cases with exceptions, such as "an LED light", or "a unicorn".

@grhoten grhoten added the discuss Discussion item label Mar 14, 2024
grhoten pushed a commit that referenced this issue Oct 30, 2024
…CENSE.txt for copyright and permission details.

This contribution should resolve the following issues: #5, #6, #7, #11, #12, #13, #15, #17, #18, #19
This contribution is also related to the following issues without fully resolving the issues: 3, 4, 8, 10, 21, 23, 24, 25
This contribution also has an implementation that addresses these CLDR issues: 13025, 13563
grhoten added a commit that referenced this issue Oct 30, 2024
…CENSE.txt for copyright and permission details.

This contribution should resolve the following issues: #5, #6, #7, #11, #12, #13, #15, #17, #18, #19
This contribution is also related to the following issues without fully resolving the issues: 3, 4, 8, 10, 21, 23, 24, 25
This contribution also has an implementation that addresses these CLDR issues: 13025, 13563
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discuss Discussion item
Projects
Status: Todo
Development

No branches or pull requests

1 participant