Afrikaans (AF) Language Data

Inflectional Morphology Data

The Lexical Resource for Afrikaans contains all the standard inflectional forms for nouns, verbs, adjectives, prepositions, conjunctions, etc.

Derivational Morphology Data

Contains all the standard derivational forms including superlatives and common compound words.

 

Extended Morphology Data

Contains the result of extending the inflectional and derivational forms lists as a result of considering additional morphological phenomena such as common contractions.

Frequency Indication

Contains the data regarding the relative frequency of appearance for the words in the above lists in the given language.

Each word has been assigned a frequency group, where the frequency group corresponds to a normalized logarithmic scale from 0 to 255. The most frequent word in the corpus has been assigned frequency group 255, and words not appearing in the corpus have been assigned frequency group 0. 

 

 

 

Complementary Semantic Annotations

 

Named Entities Morphology Data

Contains the data regarding named entities comprising person names, places, companies and organizations.

Offensive Language Flag

Contains information per word indicating if the word might be considered offensive in certain contexts. 

Volume of Language Data

lexical-forms-afrikaans

Total number of forms

38,000 forms 

    • Verbs: 7,000 forms (18%) 
    • Nouns: 23,000 forms (61%) 
    • Adjectives: 7,000 forms (18%) 
    • Other: 1,000 forms (3%) 
    number-of-lemmas-arabic-lexical

    Total number of lemmas

    20,000 lemmas 

    Features

    Each form will be annotated with the lemma (root form), POS, and morphological attributes (tense, mood, person, number, gender and entity type). 

    h

    Lemma

    The canonical form for the inflected word.

    {

    POS

    Part of Speech such as noun, verb, adjective, etc.

    v

    Voice

    Not applicable.

    +

    Tense

    Specifies when the action takes place such as past, present, future, etc.

    Aspect

    Indicates whether the action is complete, ongoing, habitual, etc.

    Mood

    Not applicable.

    Person

    Verb or pronoun refers to the first, second or third person.

    Number

    State of being singular, dual or plural.

    Gender

    Noun, verb or adjective forms are provided, masculine, feminine, neuter, etc.

    Case

    The function that the noun or adjective plays within a sentence.

    R

    Degree

    An adjective is specified as in its positive, comparative or superlative form.

    l

    Definiteness State

    Not applicable.

    O

    Negative

    Not applicable.

    |

    Contractions

    Shortened form of a word or group of words.

    Pronominal Clitics

    Not applicable.

    w

    Formality

    Not applicable.

    Frequency

    Relative frequency of the form based on a large general-purpose corpus.

    Named Entities

    Pre-defined entities are tagged as person names, places, organization, etc.

    r

    Offensive

    Indicates whether the form might be considered offensive in certain contexts.

    MADRID, SPAIN

    Camino de las Huertas, 20, 28223 Pozuelo
    Madrid, Spain

    SAN FRANCISCO, USA

    541 Jefferson Ave Ste 100, Redwood City
    CA 94063, USA