r104140 MediaWiki - Code Review archive

Repository:MediaWiki
Revision:r104139‎ | r104140 | r104141 >
Date:08:46, 24 November 2011
Author:ariel
Status:deferred
Tags:
Comment:
more disclaimers, bolder disclaimers, info on the other files in each dir
Modified paths:
  • /branches/ariel/xmldumps-backup/incrementals/incrs-index.html (modified) (history)

Diff [purge]

Index: branches/ariel/xmldumps-backup/incrementals/incrs-index.html
@@ -65,18 +65,20 @@
6666 for documentation on the provided data formats.
6767 </p>
6868 <p>
69 - Here's the big fat disclaimer.
 69+ <strong>Here's the big fat disclaimer.</strong>
7070 </p>
7171 <p>
72 - This service is experimental. At any time it may not be working, for a day, a week or a month.
 72+ <em>This service is experimental.</em> At any time it may not be working, for a day, a week or a month.
7373 It is not intended to replace the full XML dumps. We don't expect users to be able to construct
74 - full dumps of a given date from the incrementals and an older dump.
 74+ full dumps of a given date from the incrementals and an older dump. We don't guarantee that the data
 75+ included in these dumps is complete, or correct, or won't break your Xbox. In short: don't blame us (but
 76+ do get on the email list and send mail: see <a href="https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l">xmldatadumps-l</a>).
7577 </p>
7678 <p>
7779 The data provided in these files is ''partial data''. To be precise:
7880 <ul>
7981 <li>* Revisions included in these dumps are not up to the minute. We write out those that were
80 - created up to 18 hours ago; this gives local editing communities time to delete revisions
 82+ created up to 12 hours ago; this gives local editing communities time to delete revisions
8183 with sensitive information, vulgarities and other vandalism, etc.</li>
8284 <li>* New pages entered for the first time during the time interval are included</li>
8385 <li>* Revisions of undeleted pages will be included only if new revision IDs need to be assigned to
@@ -85,7 +87,8 @@
8688 <li>* Imported revisions will be included if they were imported during the time interval, since they
8789 will have new revisions IDs.</li>
8890 <li>* As with all dumps, hidden revisions or more generally revisions not readable by the general public
89 - are not provided.</li>
 91+ are not provided.</li>
 92+ <li>* When a wiki is closed, it no longer shows up in this list.</li>
9093 </ul>
9194 </p>
9295 <p>
@@ -104,7 +107,19 @@
105108 you get articles, user pages, discussion pages, etc. If you want articles only, you will need to write a
106109 filter to grab just those entries.
107110 </p>
108 - <h2>Adds/changes dump listing</h2>
 111+ <p>
 112+ The md5sums.txt file contains the md5 hash of the stubs file and the revs file, so that downloaders can verify
 113+ the integrity of the files after download.
 114+ </p>
 115+ <p>
 116+ The file maxrevid.txt contains the largest revision ID on the project at the time we checked, which should be the
 117+ same as the timestamp of that file.
 118+ </p>
 119+ <p>
 120+ The file status.txt, if it exists, will contain the value "done" in cases where the run is complete and was
 121+ successful.
 122+ </p>
 123+ <h2>Adds/changes dump listing (links to latest complete run)</h2>
109124 <ul>
110125 %(items)s
111126 </ul>

Status & tagging log