r110667 MediaWiki - Code Review archive

Repository:MediaWiki
Revision:r110666‎ | r110667 | r110668 >
Date:17:24, 3 February 2012
Author:oren
Status:deferred
Tags:
Comment:
Documentation apt error
Modified paths:
  • /trunk/lucene-search-3/src/site/apt/development.apt (modified) (history)

Diff [purge]

Index: trunk/lucene-search-3/src/site/apt/development.apt
@@ -2,15 +2,13 @@
33 Development
44 -------------
55
6 -* Requirements
 6+Requirements
77
88 Java Jdk version 1.6
99 Maven 2
1010
11 -
1211 To build you need to download the source from the SVN repository.
1312
14 -
1513 NLP Data Build
1614
1715
@@ -20,42 +18,43 @@
2119
2220 * Preliminary Processing
2321
24 - TODO: describe how convert a Wikipedia dump to a corpus
 22+ TODO: describe how convert a Wikipedia dump to a corpus
2523 setting chunker
26 - TODO: describe how convert a Wikitionary to compatible lexicon format
 24+ TODO: describe how convert a Wikitionary to compatible lexicon format
2725
2826 * Phonology
2927
30 - TODO: describe how to extend text to IPA conversion for "sound like" searching
 28+ TODO: describe how to extend text to IPA conversion for "sound like" searching
3129
3230 * Morphology
3331
34 - TODO: describe how to generate a Monolingual word list
35 - TODO: describe how to generate a N-gram model from a Monolingual lexicon for language detection
36 - TODO: describe how to bootstrap a morphology
37 - TODO: describe how to induct a morphology from a Monolingual lexicon
 32+ TODO: describe how to generate a Monolingual word list
 33+ TODO: describe how to generate a N-gram model from a Monolingual lexicon for language detection
 34+ TODO: describe how to bootstrap a morphology
 35+ TODO: describe how to induct a morphology from a Monolingual lexicon
3836
39 - TODO: describe use the morphology for tagging a corpus
40 - TODO: describe use the morphology for tagging a corpus
 37+ TODO: describe use the morphology for tagging a corpus
 38+ TODO: describe use the morphology for tagging a corpus
4139
4240
4341 * Semantics
4442
45 - TODO: describe how to build a lexical thesaurus.
46 - TODO: describe how to bootstrap a cross language thesaurus.
47 - TODO: describe how to induct a cross language thesaurus.
 43+ TODO: describe how to build a lexical thesaurus.
 44+ TODO: describe how to bootstrap a cross language thesaurus.
 45+ TODO: describe how to induct a cross language thesaurus.
4846
49 - TODO: describe how to import/localize an ontology/merology indexing data.
 47+ TODO: describe how to import/localize an ontology/merology indexing data.
 48+
5049 * Cross Language
5150
52 - TODO: describe how to get and update bi-lingual dictionaries from Apertium.
53 - TODO: describe how to specify (Personal) names
54 - TODO: describe how to override entries import
55 - TODO: describe how to build
 51+ TODO: describe how to get and update bi-lingual dictionaries from Apertium.
 52+ TODO: describe how to specify (Personal) names
 53+ TODO: describe how to override entries import
 54+ TODO: describe how to build
5655
5756 * Named Entities
5857
59 - TODO: describe how train Named Entity detection model.
 58+ TODO: describe how train Named Entity detection model.
6059
6160 SOLR Build
6261

Status & tagging log