Assignment I: PCFGs Cognitive Models of Language and Beyond 0440949 Andreas van Cranenburgh 1. S | VP / \ / \ / NP / / \ | / \ | NP \ | / \ \ | | \ \____ | NP PP PP | / \ / \ / \ | | | | NP | NP | | | | | | | V DET N P N P N List the sales of products in 2008 List the sales-of-products in 2008 S | VP / \ / \ / NP / / \_____ | | \ | NP PP | / \ / \ | | | / NP__ | | | / | \ | | | | NP PP | | | | | | \ | | | | | | NP | | | | | | | V DET N P N P N List the sales of products in 2008 List the sales of products-in-2008 S | VP _____/ | \______________ / | \ / NP PP / / \ / \ | | \___ | | | NP PP | | | / \ / \ | NP | | | / NP | | | | | | | | | V DET N P N P N List the sales of products in 2008 List the-sales-of-products and do it in 2008 2a. The first tree is the preferred interpretation, where "products" and "2008" both modifiy "sales". b. S -> VP 1 VP -> V NP 1 NP -> N 2/5 NP -> NP PP 2/5 NP -> DET N 1/5 PP -> P NP 1 V -> List 1 DET -> the 1 N -> sales 1/3 N -> products 1/3 N -> 2008 1/3 P -> of 1/2 P -> in 1/2 3. First tree: S -> VP VP -> V NP NP -> NP PP NP -> DET N PP -> P NP NP -> N 1 1 2/5 1/5 1 2/5 PP -> P NP NP -> N 1 2/5 joint probability: 0.0128 Second tree: S -> VP VP -> V NP NP -> NP PP NP -> DET N PP -> P NP 1 1 2/5 1/5 1 NP -> NP PP NP -> N PP -> P NP NP -> N 2/5 2/5 1 2/5 joint probability: 0.00512 Third tree: S -> VP VP -> V NP PP NP -> NP PP NP -> DET N PP -> P NP 1 0 2/5 1/5 1 NP -> N PP -> P NP NP -> N 2/5 1 2/5 joint probability: 0 (ungrammatical, due to PP attaching to VP) 4. Recently activated rules might temporarily receive higher probabilities, taking probability mass from the rules that have not been recently activated. Additionally, recently parsed chunks might be memorized temporarily, so that if they re-occur, they will receive consistent parses. Eg., perhaps "sales of products" has been mentioned previously, if it is mentioned again there is a high probability it should be parsed in the same way, which could resolve potential ambiguities. Instead of probabilities for single rules we could use probabilities for rules given their parent constituent.