Index: branches/ariel/xmldumps-backup/incrementals/incrs-index.html |
— | — | @@ -65,18 +65,20 @@ |
66 | 66 | for documentation on the provided data formats. |
67 | 67 | </p> |
68 | 68 | <p> |
69 | | - Here's the big fat disclaimer. |
| 69 | + <strong>Here's the big fat disclaimer.</strong> |
70 | 70 | </p> |
71 | 71 | <p> |
72 | | - This service is experimental. At any time it may not be working, for a day, a week or a month. |
| 72 | + <em>This service is experimental.</em> At any time it may not be working, for a day, a week or a month. |
73 | 73 | It is not intended to replace the full XML dumps. We don't expect users to be able to construct |
74 | | - full dumps of a given date from the incrementals and an older dump. |
| 74 | + full dumps of a given date from the incrementals and an older dump. We don't guarantee that the data |
| 75 | + included in these dumps is complete, or correct, or won't break your Xbox. In short: don't blame us (but |
| 76 | + do get on the email list and send mail: see <a href="https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l">xmldatadumps-l</a>). |
75 | 77 | </p> |
76 | 78 | <p> |
77 | 79 | The data provided in these files is <em>partial data</em>. To be precise: |
78 | 80 | <ul> |
79 | 81 | <li>* Revisions included in these dumps are not up to the minute. We write out those that were |
80 | | - created up to 18 hours ago; this gives local editing communities time to delete revisions |
| 82 | + created up to 12 hours ago; this gives local editing communities time to delete revisions |
81 | 83 | with sensitive information, vulgarities and other vandalism, etc.</li> |
82 | 84 | <li>* New pages entered for the first time during the time interval are included</li> |
83 | 85 | <li>* Revisions of undeleted pages will be included only if new revision IDs need to be assigned to |
— | — | @@ -85,7 +87,8 @@ |
86 | 88 | <li>* Imported revisions will be included if they were imported during the time interval, since they |
87 | 89 | will have new revision IDs.</li> |
88 | 90 | <li>* As with all dumps, hidden revisions or more generally revisions not readable by the general public |
89 | | - are not provided.</li> |
| 91 | + are not provided.</li> |
| 92 | + <li>* When a wiki is closed, it no longer shows up in this list.</li> |
90 | 93 | </ul> |
91 | 94 | </p> |
92 | 95 | <p> |
— | — | @@ -104,7 +107,19 @@ |
105 | 108 | you get articles, user pages, discussion pages, etc. If you want articles only, you will need to write a |
106 | 109 | filter to grab just those entries. |
107 | 110 | </p> |
108 | | - <h2>Adds/changes dump listing</h2> |
| 111 | + <p> |
| 112 | + The md5sums.txt file contains the md5 hash of the stubs file and the revs file, so that downloaders can verify |
| 113 | + the integrity of the files after download. |
| 114 | + </p> |
| 115 | + <p> |
| 116 | + The file maxrevid.txt contains the largest revision ID on the project at the time we checked; that time should be the |
| 117 | + same as the timestamp of that file. |
| 118 | + </p> |
| 119 | + <p> |
| 120 | + The file status.txt, if it exists, will contain the value "done" in cases where the run is complete and was |
| 121 | + successful. |
| 122 | + </p> |
| 123 | + <h2>Adds/changes dump listing (links to latest complete run)</h2> |
109 | 124 | <ul> |
110 | 125 | %(items)s |
111 | 126 | </ul> |
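
Note on the added md5sums.txt and status.txt paragraphs: a downloader-side check might look like the sketch below. It is not part of this changeset. It assumes md5sums.txt uses the usual md5sum layout (hex digest, whitespace, filename), which the page text does not spell out; the helper names parse_md5sums, md5_of_file and run_is_complete are hypothetical.

```python
import hashlib


def parse_md5sums(text):
    """Parse md5sum-style lines ("<hex digest>  <filename>") into a dict.

    Assumption: md5sums.txt follows the common md5sum output layout;
    the page only says it holds the hashes of the stubs and revs files.
    """
    sums = {}
    for line in text.splitlines():
        parts = line.split(None, 1)
        if len(parts) == 2:
            digest, name = parts
            # md5sum marks binary-mode entries with a leading "*"
            sums[name.strip().lstrip("*")] = digest.lower()
    return sums


def md5_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so a large dump need not fit in memory."""
    md5 = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return md5.hexdigest()


def run_is_complete(status_text):
    """True when the contents of status.txt are the value "done"."""
    return status_text.strip() == "done"
```

A caller would read status.txt first, skip the run unless run_is_complete returns True, then compare md5_of_file(path) against the matching entry from parse_md5sums for each downloaded file.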