Quantitative affixes: related to the measure, calculation or standard of measurement performed by the noun. Links are established between adjectives and nouns indicating the unit of measure: ApNt.
For example, the link in the phrase “two thousand meters deep” is shown in figure 2.12 below Comparative affix: This affix is taken up by a comparative adjective. For example “more beautiful than fairies”. The link between “beautiful” and “more” is the ApAr bond.
Description Addendum. Then the two adjectives go together: “beautiful”, “extensive”… The former indicates properties, the latter belongs to the relational subtype (comparative subcategory).
This suffix does not appear with the degree suffix on the left or the quantity suffix on the right, for example it cannot say “very wide”, “two meters high”
Hình 2.12. Liên kết trong cụm từ “ sâu hai ngàn mét”
So the association formula in this case for the prime is:
(RlAp- xor ApRl+) or (RlAp- xor ApAr+) or (ApAr+ xor ApNt+)
Maybe you are interested!
- Vietnamese linking grammar model - 10
- Vietnamese linking grammar model - 11
- Link Building Based On Adjective Structure
- Application Algorithm To Expand Vietnamese Dictionary
- Test Results For Analyzing Simple Sentences And Simple Compound Sentences
- Vietnamese linking grammar model - 16
Similarly, degree affixes can be used after adjectives such as “extremely”, “extremely”.
2.1.5. Link clauses in simple compound sentences
With compound sentences consisting of many clauses, it is necessary to separate to analyze each clause separately and then combine. However, with two-clause compound sentences, it is possible to use conjunctions for the conjunctions and components in the sentence. The core of compound sentences in Vietnamese is divided into two types: main and minor and parallel [28]. The link building for core core grafting is simpler. Conjunctions such as: “because”, “but”, “however” … all have a CL+ connection to connect to the clause that follows it. Their linking formula is as follows:
however, but, should, should, because, because: CL+
The CL connection is connected to the predicate element in the sentence. The reason for connecting conjunctions to a predicate and not a subject is because the following clauses sometimes do not have a subject. Therefore verbs and adjectives are added CL-.
Most conjunctions can come between two clauses. To connect these conjunctions with the preceding clause, the thesis adds the EV- connection for these conjunctions. Like the CL+ conjunction, the EV- conjunction connects to the predicate of the preceding clause.
however, but, should, should, because, because: CL+ & EV
Figure 2.13 below shows a two-clause compound sentence that only promises a “but” conjunction. This conjunction joins the verb “like” of the main clause to the left with the EV- conjunction and with the adjective “expensive” on the right with the CL+ conjunction.
Figure 2.13. Connect a sentence that combines two clauses with a conjunction in the middle
Some conjunctions (“because”, “because”, “after”, etc.) can even go first in the first clause. At this time, we use the CO+ link instead of the EV- link to connect the conjunction with the second clause (we still keep the CL+ connection to connect to the clause immediately after the conjunction). There can also be a comma between two clauses. To connect to this comma the conjunctions also have a PH+ connection and this connection is optional.
because, because, though: CL+ & {PH+} & (EV- or CO+)
In Figure 2.14 is an example of a sentence with a conjunction at the beginning of the first clause, a comma separating the two clauses. Analysis shows that the sentence is syntactically correct. In addition to the links with nouns, verbs, and adjectives, there are also links with the conjunction “because” and the comma. These are the CO, CL, PH linkages, in which CO, CL connect the conjunction “because” with the predicate of the two clauses, and the PH link connects the conjunction “because” with a comma.
Figure 2.14. Linking of two-clause compound sentences with a conjunction at the beginning and a comma
Similarly, in the case of a compound sentence, there are conjunctions at the beginning of the two clauses (because … should…, although … however…), which are pairs of conjunctions that have a relationship with each other. This case is handled as follows: join the two conjunctions with a QHT link. The conjunction at the beginning of the sentence has a QHT+ connection, the conjunction at the beginning of the second clause has a QHT- connection. Conjunctions are still connected to the following clause by a CL+ connection.
Combined with the connections built above, we have:
however, but, should, should, because, because: CL+ & (EV- or QHT-) because, because, though: CL+ & {PH+} & (EV- or (CO+ or QHT+))
Figure 2.15. Links in compound sentences with conjunctions present in both clauses
In Figure 2.15, the sentence contains two related conjunctions “because” and “should”. The link that connects them is QHT.
When analyzing compound sentences according to the algorithm mentioned in Chapter 3, the analyzer can choose whether or not to analyze each proposition separately with this type of sentence.
2.2. Expanded linked grammar dictionary
Initially, a dictionary with more than 150 large formulas and more than 1000 entries was built manually by the thesis. The words that appear in the dictionary are divided into groups, each containing the same word type, with the same association requirements as shown in Figure 2.16 below:
Figure 2.16. A passage from a linked grammar dictionary
Because the linking grammar model was first applied to Vietnamese parsing, all linking formulas were proposed by the thesis, the first dictionary was limited to experimental texts. In order for the analyzer to work effectively on any Vietnamese text, the thesis has researched to expand the associated grammar dictionary for Vietnamese.
Out of all the documents published on the link grammar website, only two refer to dictionary building. These are the articles of Protassov’s Russian linking grammar building [132] and Szolovits’ article [113]. [132]’s method allows searching in a large Russian corpus to filter out the necessary words for a certain formula. For example, find verbs that require patterning (way 5) by finding all verbs that are followed by a noun in way 5. This is much easier than Vietnamese because verbs are as well as nouns. from the 5th way in Russian can be distinguished by the suffix. Therefore, the thesis has followed the method of Szolovits used to gradually add words from a medical lexical system to the linked grammar dictionary. That vocabulary system is the UMLS Specialist Lexicon which contains 235,197 entries (of which 75,121 short phrases – generally phrases are also treated as words).
Each Specialist entry includes the following information:
- Word type (part of speech).
- Conformity / inflection code (agreement / inflection code). 1st, 2nd and 3rd person personal pronouns, singular and plural, tense and negative (for verbs, modal verbs and auxiliary verbs); countable and uncountable nouns and determiners.
- Complements. A complex system for describing the types of modifiers for verbs, nouns and adjectives, including different types of modifiers, prepositions, etc.
- Position and modification types for adjectives and adverbs.
- Other features.
The thesis used a dictionary of over 40,000 Vietnamese words of the topic KC 01.01/06-10 with the design including the following descriptions:
1. Morphology (one or more morphemes)
2. Syntax
– Word type
– Child type
– Form of the verb
3. Semantics
– Logical binding
+ Meaning of type
+ Synonyms
+ Antonyms
– Semantic binding
+ Sub-element
+ Direct complements
4. Definition (describe the meaning of the word)
5. Examples (one or several example sentences using the word described)