Wikipedia:WikiProject Disambiguation/Database dump analysis
A database dump is a backup of all Wikipedia pages, which can then be downloaded. Once downloaded, extensive analysis can performed on the dump (this can't be done by scraping live from the servers because it creates excessive load).
Database dump analysis can help WikiProject Disambiguation achieve its goals by providing editors with extra information.
Currently run dump analyses
- The Disambiguation pages maintenance report is refreshed by a dump occasionally. Generated by RussBlau.
- Generated by Bo Lindbergh (details):
- The Disambiguation pages with links report. Status - updated every couple of months, as needed.
- From portals, a variation on link repair
- From templates, a variation on link repair
- Malplaced disambiguation pages report.
- Statistics, example:
- The Disambiguation pages with links report. Status - updated every couple of months, as needed.
| ∑ | articles | categories | portals | templates | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| pages | links | pages | links | pages | links | pages | links | pages | links | |
| 2005-11-13 | 33102 | 412194 | 32166 | 410987 | 936 | 1207 | ||||
| 2005-12-13 | 34475 | 425520 | 34126 | 425120 | 349 | 400 | ||||
| 2006-03-03 | 39928 | 465726 | 38238 | 463507 | 578 | 836 | 429 | 495 | 683 | 888 |
Proposal: tracking down dab pages with suspect style
At WP:DAB wangi expressed interest in using the dumps to aid dab page style (by tracking down suspect dab pages). One could argue that Category:Disambiguation pages in need of cleanup is always plentifully stocked and that a dump analysis to find more troublesome dabs is unnecessary. But then again, who could have perceived the activity around From templates that resulted in completion of that report.
Ideas
- Image and template checks...
Dab pages are checked for:
- Images
- Templates (other than dab templates naturally, including stubs templates etc)
- Images and templates indicate that a dab page is verging on article status. An expert can examine the dab and perform merging, start discussion etc.
- Talk page is a redirect?
- If a page has a dab template then it should have its own talk page. Due to page moves, often a dab's talk page redirects elsewhere (no redirect should be present). A listing of dab pages without their own talk pages would be helpful.
- Link checking...
- Check the ratio of wikilinks to number of lines for page. The idea being that the higher the value the more in need of cleanup a page is (generally).
- Check for piping of links. Generally piping should not be present on dab pages. Perhaps check the gross number of piped links
Content Disclaimer
Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.
- The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
- There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
- It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
- Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
- Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.