Index: branches/ariel/xmldumps-backup/README.config |
— | — | @@ -87,10 +87,13 @@ |
88 | 88 | (wikiproject) being dumped, in subdirectories by date |
89 | 89 | Default value: /dumps/public |
90 | 90 | private -- full path to directory under which all dumps of private wikis |
91 | | - and all private tables will be created, in subdirs by project |
92 | | - name and underneath that in subdirs by date, similar to the |
93 | | - public dumps |
| 91 | + and all private tables will be created, in subdirs by project |
| 92 | + name and underneath that in subdirs by date, similar to the |
| 93 | + public dumps |
94 | 94 | Default value: /dumps/private |
| 95 | +temp -- full path to directory under which temporary files will be created; |
| 96 | + this should not be the same as the public or private directory. |
| 97 | + Default value: /dumps/temp |
95 | 98 | index -- name of the top-level index file for all projects that is |
96 | 99 | automatically created by the monitoring process |
97 | 100 | Default value: index.html |
— | — | @@ -103,7 +106,7 @@ |
104 | 107 | are found |
105 | 108 | Default value: home |
106 | 109 | perdumpindex -- name of the index file created for a dump for a given project |
107 | | - on a given date |
| 110 | + on a given date |
108 | 111 | Default value: index.html |
109 | 112 | |
110 | 113 | The above options do not have to be specified in the config file, |
— | — | @@ -158,6 +161,11 @@ |
159 | 162 | Default value: /bin/cat |
160 | 163 | grep -- Location of the grep binary |
161 | 164 | Default value:/bin/grep |
| 165 | +checkforbz2footer -- Location of the checkforbz2footer binary |
| 166 | + This is part of the mwbzutils package. If |
| 167 | + the package is not installed, this value will |
| 168 | + be ignored. |
| 169 | + Default value: /usr/local/bin/checkforbz2footer |
162 | 170 | |
163 | 171 | The above options do not have to be specified in the config file, |
164 | 172 | since default values are provided. |
— | — | @@ -197,6 +205,18 @@ |
198 | 206 | pagesPerChunkAbstract -- as pagesPerChunkHistory but for the abstract |
199 | 207 | generation phase |
200 | 208 | Default value: False |
| 209 | +checkpointTime -- save checkpoints of files containing revision text |
| 210 | + (articles, metacurrent, metahistory dumps) |
| 211 | + every checkpointTime minutes. This involves closing |
| 212 | + the current output file, renaming it to a filename |
| 213 | + which includes in the filename the first and last page |
| 214 | + written, and opening a new file for the next portion |
| 215 | + of the XML output. This can be useful if you want |
| 216 | + to produce a large number of smaller files as input |
| 217 | + to XML-crunching scripts, or if you are dumping |
| 218 | + a very large wiki which has a tendency to fail |
| 219 | + somewhere in the middle (*cough*en wikipedia*cough*). |
| 220 | + Default value: 0 (no checkpoints produced) |
201 | 221 | |
202 | 222 | The above options do not have to be specified in the config file, |
203 | 223 | since default values are provided. |