-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Overuse of INTJ #429
Comments
Agreed about good, great, fine; I think "well" is established as a discoursy interjection in the sense that doesn't mean 'good' in any way (sentence initial well, xpos UH), I would keep that INTJ (not the same lexical item). For "Christ" we get into the general area of profanities, which often do not behave like their morphological POS - I'm not sure we want them not to be INTJ just because of etymology. They are definitely UH in xpos. |
Agreed, I wasn't thinking that one though
https://universaldependencies.org/u/pos/INTJ.html specifically rules out "God". Not to sound religious but I don't think we can distinguish "God" from "Christ". |
Hm, it's hard with that being in the guidelines the way it is, but do we think that's true for swearwords as well? I mean, we could argue about what POS some of them are but it would lead to a colorful GH issue ;) If we stick to PTB UH for them, which I think is standard, we would need a conversion table to know what their 'etymological POS' is, and I'm not sure how much sense that makes. Morphosyntactically, profanities and other oathes do fit the definition of emotional, syntactically unintegrated language... |
There may be some borderline cases but here's an interpretation that makes sense to me: If a word is mainly used for swearing, and it's syntactically extrinsic to the semantics-bearing part of the sentence (not a predicate or argument etc.), then it's INTJ. Same for discourse particles from which verbs have been derived (INTJ for the main use of "OK" even though it can also be a VERB), and discourse particles whose meaning is quite distinct from the non-discourse one ("well", "like" as INTJ). If it's a word that is mainly an ADV or NOUN etc. and also has a secondary use as a discourse particle, then it's not INTJ. We can tell that it is being used as a discourse particle because it attaches as "Please", "sorry", "right" feel kinda borderline. I guess "Sorry/discourse" just expresses that you're sorry/ADJ so we can call it ADJ. "Right?" is like saying "Is that right/ADJ?", so ADJ as well. "Please/discourse" is farther from evoking an act of pleasing, so I'd call it INTJ. |
I have to say it's not that I necessarily think the above is a bad way to slice up the space, but it seems like 'just another arbitrary system', where we already have one (PTB UH). I would probably be happier just deciding "if it's UH, it's INTJ" since that's an established practice in English and be done with it. Having/maintaining another lexicalized list of items just for upos seems rather unappealing... We could just decide that all of those swear and discourse items have a second sense/lexical entry which deserves tagging as INTJ - if it's good enough for "well" and "ok" then why not also for the gander? |
For "[Oo]h [Gg]od", ON has 50 UH : 5 NNP, and three of the latter are actually referring to God ("Oh God, we ask you for...") I get the issue with crosslinguistic comparison, but I feel like language internal consistency should also not be overlooked, and I worry about chaos/arbitrary decisions and incompatibility between TBs. I think ultimately people will decide on a language-specific basis whether there is a separate lexical item for an intj version of something. For example, some UD Russian TBs tag the archaic "боже" (vocative of "God") as an INTJ and lemmatize it to itself (UD_Russian-Taiga). But others still consider it to be a form of lemma God, and annotate it as a noun with vocative case (even though modern Russian has no productive vocative), for example in UD_Russian-SynTagRus, with the nominative lemma бог "God". So at the end of the day, if Russian can choose for there to be a 'special' interjection use of God, I think it's likely other languages will vary too, and I'm not sure that's wrong (though I am sure it shouldn't oscillate within the same UD language...) |
I agree that flexibility in the universal guidelines is sometimes necessary. If we want to say that it should be up to the language, then the guidelines shouldn't articulate a hard-and-fast rule. @dan-zeman do you think this calls for a more flexible guideline? (Come to think of it, why does https://universaldependencies.org/u/pos/INTJ.html say "God" is a NOUN and not a PROPN?) |
In PTB non-INTJ usage, it's determined just by captialization it seems... I suppose both tags are possible, certainly for common noun uses ("a/some god"). |
I believe we need |
OK, then the Russians should really make up their minds whether the frozen vocative боже can still be analyzed as a noun; I cannot judge its frequency (when I speak Russian at all, it's not to a god) and I'm probably biased, being a speaker of a language where the vocative is still productive and bože sounds absolutely normal. In English, I understand that gosh is probably an interjection. But I don't think its an argument for god/God to not be a noun (common or proper). |
Well, I'm saying it could be used as an argument for there being two lexical items "god", one of which is a noun (and is not in a paradigm with gosh), and one of which has the same POS as "gosh", with which it is completely interchangeable paradigmatically. |
Per today's Core Group discussion, the wording on the INTJ page was probably a bit too specific; the intention was to emphasize the general guidelines about prototypical vs. productively extended usages. Revised to make this clearer: https://universaldependencies.org/u/pos/INTJ.html |
Perfect. Thanks, Nathan.
Joakim
Skickat från Outlook för iOS<https://aka.ms/o0ukef>
…________________________________
Från: Nathan Schneider ***@***.***>
Skickat: Wednesday, December 13, 2023 4:52:21 AM
Till: UniversalDependencies/UD_English-EWT ***@***.***>
Kopia: Subscribed ***@***.***>
Ämne: Re: [UniversalDependencies/UD_English-EWT] Overuse of INTJ (Issue #429)
Per today's Core Group discussion, the wording on the INTJ page was probably a bit too specific; the intention was to emphasize the general guidelines about prototypical vs. productively extended usages. Revised to make this clearer: https://universaldependencies.org/u/pos/INTJ.html
—
Reply to this email directly, view it on GitHub<#429 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABZ7ZVRPEZII5M4IRRZDJPLYJEQ7LAVCNFSM6AAAAAA5ERWSQWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNJTGIZDINJVG4>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
VARNING: Klicka inte på länkar och öppna inte bilagor om du inte känner igen avsändaren och vet att innehållet är säkert.
CAUTION: Do not click on links or open attachments unless you recognise the sender and know the content is safe.
När du har kontakt med oss på Uppsala universitet med e-post så innebär det att vi behandlar dina personuppgifter. För att läsa mer om hur vi gör det kan du läsa här: http://www.uu.se/om-uu/dataskydd-personuppgifter/
E-mailing Uppsala University means that we will process your personal data. For more information on how this is performed, please read here: http://www.uu.se/en/about-uu/data-protection-policy
|
LGTM, thanks!
�
Amir
�
From: Joakim Nivre ***@***.***>
Sent: Wednesday, December 13, 2023 2:38 AM
To: UniversalDependencies/UD_English-EWT ***@***.***>
Cc: Amir Zeldes ***@***.***>; Comment ***@***.***>
Subject: Re: [UniversalDependencies/UD_English-EWT] Overuse of INTJ (Issue #429)
�
Perfect. Thanks, Nathan.
Joakim
Skickat från Outlook för iOS<https://aka.ms/o0ukef>
…________________________________
Från: Nathan Schneider ***@***.***>
Skickat: Wednesday, December 13, 2023 4:52:21 AM
Till: UniversalDependencies/UD_English-EWT ***@***.***>
Kopia: Subscribed ***@***.***>
Ämne: Re: [UniversalDependencies/UD_English-EWT] Overuse of INTJ (Issue #429)
Per today's Core Group discussion, the wording on the INTJ page was probably a bit too specific; the intention was to emphasize the general guidelines about prototypical vs. productively extended usages. Revised to make this clearer: https://universaldependencies.org/u/pos/INTJ.html
—
Reply to this email directly, view it on GitHub<#429 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABZ7ZVRPEZII5M4IRRZDJPLYJEQ7LAVCNFSM6AAAAAA5ERWSQWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNJTGIZDINJVG4>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
VARNING: Klicka inte på länkar och öppna inte bilagor om du inte känner igen avsändaren och vet att innehållet är säkert.
CAUTION: Do not click on links or open attachments unless you recognise the sender and know the content is safe.
När du har kontakt med oss på Uppsala universitet med e-post så innebär det att vi behandlar dina personuppgifter. För att läsa mer om hur vi gör det kan du läsa här: http://www.uu.se/om-uu/dataskydd-personuppgifter/
E-mailing Uppsala University means that we will process your personal data. For more information on how this is performed, please read here: http://www.uu.se/en/about-uu/data-protection-policy
—
Reply to this email directly, view it on GitHub <#429 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAQWKW2UBIGTZTMV7A3N7RTYJFLL7AVCNFSM6AAAAAA5ERWSQWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNJTGQYDMMRQHA> .
You are receiving this because you commented.Message ID: ***@***.***>
|
Per https://universaldependencies.org/u/pos/INTJ.html, INTJ should not be used for words that come from another category, like adjectives/adverbs.
https://universal.grew.fr/?custom=650f87d83dd5e - includes "good", "great", "fine", "well", "Christ", ...
The text was updated successfully, but these errors were encountered: