-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Head of TIME + AM/PM #107
Comments
Agreed, am/pm (as well as time zones) would be better treated as dependents of the hour. In the domain of value expressions I don't think there are any great options for the deprel. Not sure whether Another option would be |
Forgot to answer this - this is actually kind of a big deal, since if we concede this case has the number as the head, then it's a chip in the widespread EWT tendency to treat numbers as
Now let's try this for February 2nd, 2 February, etc. First we check the distribution of months vs. days:
Now the controversial cases:
If |
I still think the number as head is a stronger case for am/pm because of optionality: you can simply omit am/pm without adding an article, unlike the ordinal interpreted as a monthless date. "on" selects semantically for dates, not months—so while "2nd" is arguably the semantic head of "February 2nd", it still seems plausible that MONTH+DATE is a syntactically headless construction (which we are forced to assign a formal head in UD, hence That said, I don't particularly like such a broad use of |
That sounds interesting, I'd also be interested in something like subtypes to deal with NP syntax too, but I'd want to ensure we don't diverge from TBs that won't do such expansions, and keep target labels for parsers manageable too. Maybe feat or misc expansions could do the trick? I'm happy to brain storm on proposals. For the dates, if you go with |
My point is that there are many syntactic formulations of dates, and while they are semantically equivalent that's not sufficient to assume they are syntactically equivalent. in/on select for the semantics of the NP, so it follows that if "4" is the semantic head, "September 4" requires "in" rather than "on"; I don't think this tells us anything about syntactic headedness, and of course there are other constructions where the syntactic head is not the semantic head ("a lot of cookies" comes to mind). The relations we have are best equipped for "normal" NP syntax, where if the NP has multiple words the head is a noun, and if it is a singular count common noun it must have a determiner and so forth. With "the 4th of September" it's easy enough to say this evolved from eliding "day", so we promote "4th" to be the head, and "of September" is a normal PP modifier. With "4 September" and "September 4(th)" and "September the 4th" our usual understanding of NPs breaks down, it seems to me, so we need to do something special one way or another. (And likewise for minor or archaic constructions like "Richard the Lionheart".) Let me try to formulate a syntactic argument that "4th" in "September 4th" is not a head, or at least not a "normal" head: If "September" was a modifier, shouldn't it be expected to license its own modifiers in between "September" and "4th"? I don't think those are natural ways to express a date:
It's a bit tricky because semantically a temporal modifier that could applies to the month could usually apply to the date as well, since dates are contained within months. We can try adjectives modifying the month specifically e.g.
Maybe this tells us only that the month-date construction doesn't allow internal modification of the month. In any case, it sure doesn't feel to me that "September" is modifying "4th" syntactically. I suppose others feel the same way which is why the guidelines currently prescribe |
That's interesting - it actually does feel like modification to me (I read "September 5" as something like an elliptical possessive "September('s) 5(th day)" and "5 September" as "5(th of) September". That must be the source of our disagreement... In fact, "5 September" said out loud is ungrammatical for me, I'd prob. read it "September fifth", like I read "$500" as "five hundred dollars". I think the possessive reading of Sep. 5 and problems omitting Sep. are similar to normal possessives: the NP in "John's dog barked" is headed by "dog", despite the fact we can't say "*dog barked". I also think the "on" government argument is a morphosyntactic one and not a semantic one, since comparable semantics does not guarantee comparable government ("at sea" but "in the ocean", "wait for someone" but "await someone"). As for the modifiers for September, I think they are possible, but due to heavy phonological weight of the phrase, they're postponed (in transformational terms, "*September of last year 4th" ->"September t 4th of last year"), and in any case as you point out, hard to distinguish from the entire "September 4th" getting that modifier. In fact, we can use phrase syntax for another argument for days as heads: if months are the head, then they project both a month and a day phrase, which seems odd to me. Compare: (September (fourth)) The day analysis seems easier to reconcile with compositional phrases (even if compositionality is a semantic fact, I think syntax should map onto semantics if possible). In the second analysis, we can explain compositionality by "fourth" projecting a phrase corresponding to a day, and leave "September" to project a subordinate "month" phrase, which is semantically consistent. Finally I don't know about the guidelines, but EWT in practice doesn't use flat, it has |
Re: possessives—I think you're saying that September in September 4 has a determinative function (acts as a specifier in lieu of an determiner). Currently determinatives are covered by OTOH positing a new determinative construction absent clearer syntactic evidence of headedness may be forcing it if we could simply say the month-date construction is special and lacks a normal head.
You can also say "in the sea". "At sea/*at ocean" is a multiword expression phenomenon (determinerless PP); so is "wait for" (prepositional verb). Yes there are many many idiosyncrasies in prepositions within multiword expressions, but "in"-month vs. "on"-day is quite productive based on semantics (we say not only "on September 4", but also "on Tuesday", "on Yom Kippur", and "on the third day of the week"). If we were abbreviating the date we would write "on 9/4" but "in 9/2020". This can all be explained by saying that the distribution of prepositions is sensitive to the semantic type of the temporal expression; whether the semantic head bearing that type lexically is also the syntactic head is another question. Overall, at this point I think we should hear what other people think. |
Continuing on the determinative idea: FWIW, looking at the potential for "the", demonstratives, and quantifiers, it appears months pattern more like quantity modifiers than possessives, and match the distribution of month entities:
I don't think this makes a convincing case one way or another about headedness, but it calls into question whether "September" should be considered determinative. |
What to do with dates is a really difficult issues with seemingly conflicting facts in different languages. There was thoughts of having a multilingual UD study of this issue. I won't weigh in here. But what you suggest for "2 am" does seem reasonable, and so we could change EWT to do things like you do in GUM! |
@manning - thanks, this is now consistently |
Related: UniversalDependencies/docs#893 |
EWT has e.g. "2 pm" as:
nummod(pm,2)
GUM has the time as head and AM/PM as
nmod:tmod
:nmod:tmod(2,pm)
Syntactically I think you can drop "pm", but not "2", and semantically it makes more sense to me that pm is an expansion on what kind of "2" this is. Is there a good argument for making "pm" the head? If not, should EWT be changed?
The text was updated successfully, but these errors were encountered: