Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stress on multi-word expressions #2

Open
reynoldsnlp opened this issue Oct 19, 2020 · 1 comment
Open

Stress on multi-word expressions #2

reynoldsnlp opened this issue Oct 19, 2020 · 1 comment
Assignees

Comments

@reynoldsnlp
Copy link
Collaborator

Taken from reynoldsnlp/udar#19

The lexical underlying form needs to have a persistent stress mark that survives the two-level rule that reduces stresses to the right-most one. For example,...

красно-жёлтых
так как
так что
то есть

Search through an fst2strings version of a stressed transducer for any words with stresses on both sides of spaces and hyphens. Something like this: egrep ":.*[ё́̀].*(% |-).*[ё́̀]"

@reynoldsnlp
Copy link
Collaborator Author

See giellatekno subversion revision 181941. Most of these have been fixed by changing preceding stress marks that should persist into secondary stress marks ('\u0300').

Those like красно-жёлтых came from the RNC addendum to Zalizniak, so they don't have stress marked at all. They already had a TODO comment to work with a native to place stress. I added a comment that this should be done using secondary stress marks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant