Information about Lemma (linguistics)

The word lemma has a different meaning in logic and in mathematics.


In linguistics a lemma is the canonical form of a lexeme.

Specifically, in lexicography, "lemma" is a synonym for headword, q.v.

In morphology, a lemma is the canonical form of a lexeme. Lexeme, in this context, refers to the set of all the forms that have the same meaning, and lemma refers to the particular form that is chosen by convention to represent the lexeme. Lemmas have special significance in highly inflected languages such as Czech. In this sense, a lemma can also be called a citation form. The process of determining the lemma for a given word is called lemmatisation.

In psycholinguistics, the term lemma has a more restricted use: it is an abstract form of a word that is used in speech production. In the best accepted psycholinguistic models, speech production has several stages, and the lemma occurs after the word has been selected mentally, but before any information has been accessed about the sounds in it (and thus before the word can be pronounced). It therefore contains information concerning only meaning and the relation of this word to others in the sentence.

Morphology

In a dictionary, the lemma "go" represents the inflected forms "go", "goes", "going", "went", and "gone". The relationship between an inflected form and its lemma is usually denoted by an angle bracket, e.g. "went" < "go". The disadvantage of such simplifications is, of course, the inability to look up a declined or conjugated form of the word, although some dictionaries, like Webster's, will list "went". Multilingual dictionaries vary in how they deal with this issue: the Langenscheidt dictionary of German does not list ging (< gehen); the Cassell does.

The form that is chosen to be the lemma is usually the least marked form. There are significant exceptions; e.g. in Finnish, the dictionaries lists verbs not under the verb root, but under the first infinitive marked with -(t)a, -(t)ä.

Lemmas are used often in corpus linguistics for determining word frequency. In such usage the specific definition of "lemma" is flexible depending on the task it is being used for.

Lemmas in different languages

In English, the citation form of a noun is the singular: e.g. mouse rather than mice. For multi-word lexemes which contain possessive adjectives or reflexive pronouns, the citation form uses a form of the indefinite pronoun one: e.g. do one's best, perjure oneself. In languages with grammatical gender, the citation form of regular adjectives and nouns is usually the masculine singular. If the language additionally has cases, the citation form is often the masculine singular nominative.

In many languages, the citation form of a verb is the infinitive: French aller, German gehen. In English we can use either the bare infinitive go or the full infinitive to go. In Latin and Greek, however, the first person singular present tense is normally used, though occasionally the infinitive may also be seen. (For contracted verbs in Greek, an uncontracted first person singular present tense is used to reveal the contract vowel, e.g. φιλέω philéō for φιλῶ philō "I love" [implying affection]; αγαπάω agapáō for αγαπῶ agapō "I love" [implying regard]).

In Arabic, which has no infinitives, the third person singular of the past tense is the least-marked form, and is used for entries in modern dictionaries. In older dictionaries, which are still commonly used today, the triliteral of the word, either a verb or a noun, is used. Hebrew often uses the 3rd person masculine qal perfect, e.g. ברא bara' create, כפר kaphar cover. For Korean, -da is attached to the stem.

Some phrases are cited in a sort of lemma, e.g. Carthago delenda est (literally, "Carthage is destroyed") is a common way of citing Cato, although what he said was more like, Ceterum censeo Carthaginem esse delendam ("As to the rest, I hold that Carthage must be destroyed").

Psycholinguistics

When we produce a word, we are essentially turning our thoughts into sounds (a process known as lexicalisation). In many psycholinguistic models this is considered to be at least a two-stage process. The lemma is thus intermediate between the semantic level (where meaning is specified) and the phonological level (where the sounds of the word are specified). It is an abstract form containing syntactic information (about how the word can be used in a sentence), but no information about the pronunciation of the word. In this context, the lexeme is the phonologically specified form that is selected after the lemma.

This two-staged model is the most widely supported theory of speech production in psycholinguistics[1], although it has been recently challenged.[2] For example, there is some evidence to indicate that the grammatical gender of a noun is retrieved from the word's phonological form (the lexeme) rather than from the lemma.[3] This is easily explained by Caramazza's Independent Network model, which does not assume a distinct level between the semantic and the phonological stages (so there is no lemma representation); in this model, syntactic information about the word in this model is activated in the semantic or phonological level (so gender would be activated in the latter).[4]

See also

References

1. ^ Harley, T. (2005) The Psychology of Language. Hove; New York: Psychology Press: 359
2. ^ e.g. Caramazza, A. (1997) How many levels of processing are there in lexical access? Cognitive Neuropsychology, 14, 177-208.
3. ^ e.g. Starreveld, P. A. and La Heij, W. (2004) Phonological facilitation of grammatical gender retrieval. Language and Cognitive Processes, 19 (6), 677-711.
4. ^ Caramazza (1997)

External links

In informal logic and argument mapping, a lemma is simultaneously a contention for premises below it and a premise for a contention above it.

See also

  • Co-premise
  • Objection
  • Inference objection
  • Lemma (mathematics)
  • Lemma (linguistics)

..... Click the link for more information.
lemma is a proven proposition which is used as a stepping stone to a larger result rather than an independent statement, in and of itself. A good stepping stone leads to many others, so some of the most powerful results in mathematics are known as lemmata, such as Zorn's lemma,
..... Click the link for more information.
Linguistics is the scientific study of language, which can be theoretical or applied. Someone who engages in this study is called a linguist.
..... Click the link for more information.
A lexeme is an abstract unit of morphological analysis in linguistics, that roughly corresponds to a set of words that are different forms of the same word. For example, in the English language, run, runs, ran and running
..... Click the link for more information.
''For the term in mathematics, see Lexicographical order
The pursuit of lexicography is divided into two related disciplines:
  • Practical lexicography is the art or craft of compiling, writing and editing dictionaries.

..... Click the link for more information.
A headword, head word, lemma, or sometimes catchword is the word under which a set of related dictionary or encyclopaedia entries appears. The headword is used to locate the entry, and dictates its alphabetical position.
..... Click the link for more information.
Morphology is the field within linguistics that studies the internal structure of words. (Words as units in the lexicon are the subject matter of lexicology.
..... Click the link for more information.
A lexeme is an abstract unit of morphological analysis in linguistics, that roughly corresponds to a set of words that are different forms of the same word. For example, in the English language, run, runs, ran and running
..... Click the link for more information.
inflection or inflexion is the modification or marking of a word (or more precisely lexeme) to reflect grammatical (that is, relational) information, such as gender, tense, number or person.
..... Click the link for more information.
Czech}}} 
Official status
Official language of:  Czech Republic
 European Union
Regulated by: Czech Language Institute
Language codes
ISO 639-1: cs
ISO 639-2: cze (B)  ces (T)
ISO 639-3: ces
..... Click the link for more information.
In computing, lemmatisation is the process of determining the lemma for a given word. Since the process involves determining the part of speech of a word in a sentence, it requires knowledge of the grammar of a language, and it can therefore be a great deal of work to implement a
..... Click the link for more information.
Psycholinguistics or psychology of language is the study of the psychological and neurobiological factors that enable humans to acquire, use, and understand language.
..... Click the link for more information.
In linguistics, meaning is the content carried by the words or signs exchanged by people when communicating through language. Restated, the communication of meaning is the purpose and function of language.
..... Click the link for more information.
Webster's Dictionary is the common title given to English language dictionaries in the United States, derived from American lexicographer Noah Webster. In the United States, the phrase Webster's has become a genericized trademark for dictionaries.
..... Click the link for more information.
Markedness is a linguistic concept that developed out of the Prague School (also known as the Prague linguistic circle).

A marked form is a non-basic or less natural form. An unmarked form is a basic, default form.
..... Click the link for more information.
Finnish ( suomi  , or suomen kieli) is the language spoken by the majority of the population in Finland (91.
..... Click the link for more information.
Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language.
..... Click the link for more information.
Examples
A proper or common noun can co-occur with an article or an attributive adjective. Verbs and adjectives can't. As usual, a `*' in front of an example means that this example is ungrammatical.
..... Click the link for more information.
grammatical number is grammatical category of nouns, pronouns, and verb agreement that expresses count distinctions (such as "one" or "more than one").[1]
..... Click the link for more information.
possessive adjectives — in linguistic analyses possessive pronouns, possessive determiners or genitive pronouns — are a part of speech that prototypically modifies a noun by attributing possession to someone or something (but see below).
..... Click the link for more information.
reflexive pronoun is a pronoun that is preceded by the noun or pronoun to which it refers (its antecedent) within the same clause. In generative grammar, a reflexive pronoun is an anaphor that must be bound by its antecedent (see binding).
..... Click the link for more information.
An indefinite pronoun is a pronoun that does not refer to a specific person, place or thing. If an indefinite pronoun is used as the subject when forming a sentence, the verb of the sentence must agree with the pronoun.
..... Click the link for more information.
In linguistics, grammatical genders, sometimes also called noun classes, are classes of nouns reflected in the behavior of associated words; every noun must belong to one of the classes and there should be very few which belong to several classes at once.
..... Click the link for more information.
This article or section may be confusing or unclear for some readers.
Please [improve the article] or discuss this issue on the talk page. This article has been tagged since August 2007.
..... Click the link for more information.
verb is a word belonging to the part of speech that usually denotes an action (bring, read), an occurrence (decompose, glitter), or a state of being (exist, stand).
..... Click the link for more information.
In grammar, infinitive is the name for certain verb forms that exist in many languages. In the usual (traditional) description of English, the infinitive of a verb is its basic form with or without the particle to: therefore, do and to do, be
..... Click the link for more information.
French (français, pronounced [fʁɑ̃ˈsɛ]) is a Romance language originally spoken in France, Belgium, Luxembourg, and Switzerland, and today by about 300 million people around the world as either
..... Click the link for more information.
German language (Deutsch, ] ) is a West Germanic language and one of the world's major languages.
..... Click the link for more information.
Latin}}} 
Official status
Official language of: Vatican City
Used for official purposes, but not spoken in everyday speech
Regulated by: Opus Fundatum Latinitas
Roman Catholic Church
Language codes
ISO 639-1: la
ISO 639-2: lat
..... Click the link for more information.
Greek}}} 
Writing system: Greek alphabet 
Official status
Official language of:  Greece
 Cyprus
 European Union
recognised as minority language in parts of:
 European Union
 Italy
 Turkey
Regulated by:
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter