Towards the Design of a Syntactico-Semantic Lexicon for Polish

Adam Przepiórkowski

In the proceedings of Intelligent Information Systems 2004 (New Trends in Intelligent Information Processing and Web Mining). This report is an introduction to this article.


This article presents the design of a syntactico-semantic dictionary for Polish, i.e., a valence dictionary enriched with certain semantic informations. Valence dictionaries, specifying the number and morphosyntactic form of arguments of verbs, are useful in many Natural Language Processing applications, including deep parsing, e.g., for the purpose of machine translation, shallow parsing, e.g., for the purpose of information extraction, and rule-based morphosyntactic disambiguation, e.g., for the purpose of corpus annotation. An approach based on recent results in formal and computational linguistics is proposed, which takes into consideration the morphosyntactic and syntactic structure of Polish and which avoids various known problems of previous valence dictionaries, some of them stemming from their impoverished theoretical framework, unable to take proper care of the syntax-semantics interface, case variations and raising predicates. An implementation of a grammar of Polish deploying the ideas presented here is currently under development.

Electronically available formats:

BibTeX entry:

@string{sv =     "Springer-Verlag"}
  author =       "Adam Przepiórkowski",
  title =        "Towards the Design of a Syntactico-Semantic Lexicon
                  for {P}olish",
  crossref =     "klo:etal:04:ed",
  pages =        "237--246"}
  editor =       "Mieczysław A. Kłopotek and Sławomir T. Wierzchoń and
                  Krzysztof Trojanowski",
  title =        "Intelligent Information Processing and Web Mining",
  booktitle =    "Intelligent {I}nformation {P}rocessing and {W}eb {M}ining",
  publisher =    sv,
  year =         2004,
  series =       "Advances in Soft Computing",
  address =      "Berlin"}

Valid XHTML 1.0! Valid CSS!

Creation Date: Tuesday, December 23, 2003
Last Modified: Tue Jun 7 22:23:54 CEST 2005