All Topics  
Stochastic grammar

 

   Email Print
   Bookmark   Link






 

Stochastic grammar



 
 
A stochastic grammar (statistical grammar) is a grammar framework
Grammar framework

In theoretical linguistics, the following fundamental approaches towards constructing grammar frameworks for natural languages are distinguished:...
 with a probabilistic notion of grammaticality
Grammaticality

In theoretical linguistics, grammaticality is the quality of a linguistic utterance of being grammar Gradient well-formedness.Lyons 1968 defines the concept as "that part of the acceptability of utterances which can be accounted for in terms of the rules," the complement criterion for acceptability being semantic soundness....
:

Statistical natural language processing
Natural language processing

Natural language processing is a field of computer science concerned with the interactions between computers and human languages. Natural language generation systems convert information from computer databases into readable human language....
 uses stochastic
Stochastic

Stochastic means random.A stochastic process is one whose behavior is non-Deterministic system in that a system's subsequent state is determined both by the process's predictable actions and by a random element....
, probabilistic and statistical methods, especially to resolve difficulties which arise because longer sentences are highly ambiguous when processed with realistic grammars, yielding thousands or millions of possible analyses. Methods for disambiguation often involve the use of corpora
Corpus linguistics

Corpus linguistics is the study of language as expressed in samples or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language....
 and Markov models.






Discussion
Ask a question about 'Stochastic grammar'
Start a new discussion about 'Stochastic grammar'
Answer questions from other users
Full Discussion Forum



Encyclopedia


A stochastic grammar (statistical grammar) is a grammar framework
Grammar framework

In theoretical linguistics, the following fundamental approaches towards constructing grammar frameworks for natural languages are distinguished:...
 with a probabilistic notion of grammaticality
Grammaticality

In theoretical linguistics, grammaticality is the quality of a linguistic utterance of being grammar Gradient well-formedness.Lyons 1968 defines the concept as "that part of the acceptability of utterances which can be accounted for in terms of the rules," the complement criterion for acceptability being semantic soundness....
:
  • Stochastic context-free grammar
    Stochastic context-free grammar

    A stochastic context-free grammar is a context-free grammar in which each production is augmented with a probability. The probability of a derivation is then the product of the probabilities of the productions used in that derivation; thus some derivations are more consistent with the stochastic grammar than others....
  • Statistical parsing
    Statistical parsing

    Statistical parsing is a group of parsing methods within natural language processing. The methods have in common that they associate grammar rules with a probability....
  • Data-oriented parsing
    Data-oriented parsing

    Data-oriented parsing is a probabilistic parsing grammar formalism in computational linguistics. DOP was conceived by Remko Scha in 1990 with the aim of developing a Linguistic_performance-oriented grammar framework....
  • Hidden Markov model
    Hidden Markov model

    A hidden Markov model is a statistical model in which the system being modeled is assumed to be a Markov process with unknown parameters; the challenge is to determine the hidden parameters from the observable data....
  • Estimation theory
    Estimation theory

    Estimation theory is a branch of statistics and signal processing that deals with estimating the values of parameters based on measured/empirical data....


Statistical natural language processing
Natural language processing

Natural language processing is a field of computer science concerned with the interactions between computers and human languages. Natural language generation systems convert information from computer databases into readable human language....
 uses stochastic
Stochastic

Stochastic means random.A stochastic process is one whose behavior is non-Deterministic system in that a system's subsequent state is determined both by the process's predictable actions and by a random element....
, probabilistic and statistical methods, especially to resolve difficulties which arise because longer sentences are highly ambiguous when processed with realistic grammars, yielding thousands or millions of possible analyses. Methods for disambiguation often involve the use of corpora
Corpus linguistics

Corpus linguistics is the study of language as expressed in samples or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language....
 and Markov models. The technology for statistical NLP comes mainly from machine learning
Machine learning

Machine learning is the subfield of artificial intelligence that is concerned with the design and development of algorithms that allow computers to improve their performance over time based on data, such as from sensor data or databases....
 and data mining
Data mining

Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data doubling every three years, data mining is becoming an increasingly important tool to transform this data into information....
, both of which are fields of artificial intelligence
Artificial intelligence

Artificial intelligence is the intelligence of machines and the branch of computer science which aims to create it. Major AI textbooks define the field as "the study and design of intelligent agents,"...
 that involve learning from data.

Literature

  • Christopher D. Manning, Hinrich Schutze Foundations of Statistical Natural Language Processing, MIT Press (1999), ISBN 978-0262133609.
  • Stefan Wermter, Ellen Riloff, Gabriele Scheler (eds.) Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing, Springer (1996), ISBN 978-3540609254.


See also

  • Colorless green ideas sleep furiously
    Colorless green ideas sleep furiously

    "Colorless green ideas sleep furiously" is a sentence composed by Noam Chomsky in 1957 as an example of a sentence whose grammar is correct but whose meaning is Nonsense....
  • Computational linguistics
    Computational linguistics

    Computational linguistics is an interdisciplinary field dealing with the Statistics and/or rule-based modeling of natural language from a computational perspective....