Difference between revisions of "Tagger"

From Glottopedia
Jump to navigation Jump to search
(Marked as {{ref}})
 
(5 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
==Definition==
 
==Definition==
A tagger is a device which assigns symbolic labels (''tags'') to linguistics units. The labels are taken from a predefined set of symbols (''tag-set'').
+
A '''tagger''' is a device which assigns symbolic labels (''tags'') to linguistics units. The labels are taken from a predefined set of symbols (the so-called ''tag-set'').
  
 
==Comments==
 
==Comments==
In most cases, a tagger assigns tags representing morpho-syntactic information to single word-forms or token. But there are tagger which have been designed to identify semantic role of noun phrases or prepositional phrases (''sense tagging'') and sometimes identiying the discourse structure of a text is considered as a king of tagging.
+
In most cases, a tagger assigns tags representing morpho-syntactic information to single word-forms or tokens. But there are also taggers which have been designed to identify semantic roles of noun phrases or prepositional phrases ([[sense tagging]]), and sometimes identifying the structure of a text is considered as a kind of tagging ([[discourse structure tagging]]).
  
Conceptually, tagging can be considered as a three step process: (i). identification of the relevant units (ii). assigning all possible labels to the units (e.g. by lexical look-up, applying heuristics, etc.) (iii). disambiguation.
+
Conceptually, tagging can be considered as a three step process: (i). identification of the relevant units (ii). assigning all possible labels (e.g. by lexical look-up, applying heuristics, etc.) (iii). disambiguation.
  
It is common practice to distinguish between rule-based and stochastic tagger, though some tagger combine rules and stochastic information.  
+
It is common practice to distinguish between rule-based and stochastic taggers, though some taggers combine rules and stochastic information.  
 
   
 
   
State-of-the-art tagger achieve a precision of at least 95% for morpho-syntactic tagging.
+
In general, state-of-the-art taggers achieve a precision of at least 95% for morpho-syntactic tagging.
  
 
==Subtypes==
 
==Subtypes==
Line 21: Line 21:
 
* German [[Tagger (de)]]
 
* German [[Tagger (de)]]
  
{{dc}}
+
{{dc}}{{ref}}
 
[[Category:Computational Linguistics]]
 
[[Category:Computational Linguistics]]

Latest revision as of 16:30, 27 July 2014

Definition

A tagger is a device which assigns symbolic labels (tags) to linguistics units. The labels are taken from a predefined set of symbols (the so-called tag-set).

Comments

In most cases, a tagger assigns tags representing morpho-syntactic information to single word-forms or tokens. But there are also taggers which have been designed to identify semantic roles of noun phrases or prepositional phrases (sense tagging), and sometimes identifying the structure of a text is considered as a kind of tagging (discourse structure tagging).

Conceptually, tagging can be considered as a three step process: (i). identification of the relevant units (ii). assigning all possible labels (e.g. by lexical look-up, applying heuristics, etc.) (iii). disambiguation.

It is common practice to distinguish between rule-based and stochastic taggers, though some taggers combine rules and stochastic information.

In general, state-of-the-art taggers achieve a precision of at least 95% for morpho-syntactic tagging.

Subtypes

Other Languages

REF This article has no reference(s) or source(s).
Please remove this block only when the problem is solved.