-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add UP/dependencies white paper #51
Comments
Agreed. We need a document that does this ASAP.
…On Wed, Oct 24, 2018 at 12:46 PM Andy Halterman ***@***.***> wrote:
It would be really useful as we're debugging UP on Arabic (and for users
more generally) to have a white paper or algorithm describing how UP uses
the dependency parse, similar to the Petrarch2 white paper. Some questions
that have come up on Monday and before include:
- Does UP only look at the root verb of the sentence, or at all the
verbs in the sentence? e.g.
<https://github.com/openeventdata/UniversalPetrarch/blob/1fe09dd35d36ce1f850925aa9f1dfbad8960e8b1/UniversalPetrarch/PETRgraph.py#L632>
- Does UP determine the event and actor phrases first, and then check
the dictionaries, or is there an iterative process of using the
dictionaries to determine the spans?
- How does UP decide which direct objects are part of the event and
which are the target actor? (Consolidating the final work addressing some
previous issues)
- How does UP handle prepositional phrases as part of the event or
target?
- To what extent does UP rely on part-of-speech tags in addition to
dependency parses? e.g.
<https://github.com/openeventdata/UniversalPetrarch/blob/1fe09dd35d36ce1f850925aa9f1dfbad8960e8b1/UniversalPetrarch/PETRgraph.py#L632>
- Does UP match the longest/most specific found verb phrase or does it
stop after it finds the first?
- How does UP handle passive constructions?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#51>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AJrP1r5-Sgd8Y3VYYmqj9GPteh-8CqlUks5uoKd-gaJpZM4X4dds>
.
--
Patrick T. Brandt
Professor
Political Science
School of Economic, Political and Policy Sciences
University of Texas at Dallas
Personal site: http://www.utdallas.edu/~pbrandt
MSBVAR site: http://yule.utdallas.edu
|
Just wanted to update this issue with the second round of documentation, along with my comments on it. Some questions to address:
|
Updated documentation. I've marked the issues above that were resolved, but most are still outstanding. The documentation addressed two issues (the al-Shabaab and "Gondor opposition" examples), describing very different behavior from the previous version of the documentation. I don't see any changes to the code, though. Has the code been updated to reflect the new behavior described in the documentation? |
Agreed. This needs more documentation.
…On Thu, Dec 6, 2018, 10:28 Andy Halterman ***@***.*** wrote:
Updated documentation. I've marked the issues above that were resolved,
but most are still outstanding.
The documentation addressed two issues (the al-Shabaab and "Gondor
opposition" examples), describing very different behavior from the previous
version of the documentation. I don't see any changes to the code, though.
Has the code been updated to reflect the new behavior described in the
documentation?
ud_petrarch_documentation_v3.pdf
<https://github.com/openeventdata/UniversalPetrarch/files/2653914/ud_petrarch_documentation_v3.pdf>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#51 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AJrP1nkunkZNS1HLP3GhZ4eSncwXq7wxks5u2UXKgaJpZM4X4dds>
.
|
Or more importantly, more code! |
I'll also emphasize that the “{Ukraine, ratified, agreement}” problem still requires comments (and major work on the coder). This is a fundamental problem to overcome. |
Concur with Andy's comment: situations like that are why event coding is a problem distinct from the standard "event-triple" NLP issue (as numerous projects vastly larger and better funded than ours have learned over the past three decades to their dismay when they try to adapt generic NLP software to do event coding): depending on context, the target of an action can be in a variety of different places in the sentence/parse structure (the source is usually the subject, though depending on the clause structure of the sentence, sometimes not even that is true. But usually it is), and that's why verb dictionaries have 10K or so distinct patterns, and why outfits like Raytheon/BBN developed separate software just for event coding even though long ago they'd developed NLP software for just extracting triples. |
It would be really useful as we're debugging UP on Arabic (and for users more generally) to have a white paper or algorithm describing how UP uses the dependency parse, similar to the Petrarch2 white paper. Some questions that have come up on Monday and before include:
The text was updated successfully, but these errors were encountered: