|
Data
- SemCor. 23,346 lemmas, 234,113 instances. Inventory: WordNet 1.6, 1.7, 1.7.1, 2.0.
Text sources: 80% Brown corpus, 20% a novel, The Red Badge of Courage.
http://www.cs.unt.edu/~rada/downloads.html.
- SemiSusanne, a semantically tagged and structurally annotated corpus formed from the union of the SUSANNE and SemCor corpora. 33 documents annotated with WordNet 1.6 senses.
http://www.grsampson.net/Resources.html
- line, hard and serve corpora.
3 instances, 12,000+ instances. Inventory: WordNet 1.5. Text sources: Wall Street Journal,
American Printing House for the Blind, San Jose Mercury. Leacock, Towell, and Voorhees 1998.
http://www.d.umn.edu/~tpederse/data.html.
- interest corpus.
1 lemma, 2,369 instances. Inventory: LDCOE
Text sources: Wall Street Journal.
Rebecca Bruce and Jan Wiebe.
http://crl.nmsu.edu/cgi-bin/Tools/CLR/clrcat#I9
- Hector is an Oxford University Press and DEC dictionary research project.
ca 300 lemmas, 200,000 instances. Inventory: Hector. (Atkins 1993)
Text source: A 20M-word pilot for the British National Corpus.
- DSO Corpus
191 lemmas, 192,800 instances. Inventory: WordNet 1.5.
Text sources: Brown Corpus, Wall Street Journal. (Ng and Lee 1996)
http://www.ldc.upenn.edu/Catalog/LDC97T12.html
- Open Mind Word Expert
230 lemmas, 70,000 instances. Includes duplicates but the number is growing
daily; this is an on-line resource that web-users can add to at any time.
Inventory: WordNet 1.7.
Text source: Penn treebank, LA Times, others.(Chklovski and Mihalcea 2002)
http://www.teach-computers.org/word-expert.html
- HKUST-Chinese
38,725 sentences. Inventory: Hownet.
Text source: Sinica corpus. (Gan Kwok-Wee and Wong Ping-Wai)
http://godel.iis.sinica.edu.tw/CKIP/hk/index.html
http://www.keenage.com
- Swedish corpus
179,151 instances tagged. Inventory: Gothenburg lexical database.
Text source: The SUC Corpus.(Jaerborg, Kokkinakis, and Toporowska).
http://svenska.gu.se/~svedk/SENSEVAL/senseval.html
- Image captions
2,304 lemmas, 8,816 instances. Inventory: WordNet 1.5.
Text source: Image captions of an image collection.
(Smeaton and Quigley 1996)
http://www.computing.dcu.ie/~asmeaton/SIGIR96-captions/
- Senseval-1
Senseval-1 resources
- Senseval-2
Senseval-2 resources
|
|
|