[Xmldatadumps-l] [Wikitech-ambassadors] Queries regarding Wikimedia's Data Dumps.
Nick Wilson (Quiddity)
2018-06-07 16:51:49 UTC
(Fwding to Xmldatadumps-l@ list -
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l , CCing
Hi there,
This is Nikhil, an undergraduate student from India. I'm trying to
understand the Wikipedia data dumps provided by Wikimedia.
I'm working with the 20180520 dumps. They contain many sections, each
holding different data, and I would like to know what each section's data
represents. Each section has a brief description, but I don't find it clear.
For example, in the section "All pages, current versions only.": is the
current version of each and every article present in this data? I ask
because I just downloaded
"enwiki-20180520-pages-meta-current1.xml-p10p30303.bz2", and the
first page's information is for "AccessibleComputing", but it does not
contain the complete article.
Hoping to get a quick reply.
Thanks.
Nikhil
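
For anyone wanting to check the first page of such a file themselves, here
is a minimal sketch in Python. It assumes the file follows the usual
MediaWiki XML export layout (a <page> element holding <title>, an optional
<redirect title="..."/> and a <revision> with <text>). The function names
first_page and first_page_of_dump are made up for this illustration and are
not part of any Wikimedia tooling. If the first page turns out to carry a
<redirect> element, its text would be only a short redirect stub, which
might explain why it looks incomplete.

```python
import bz2
import xml.etree.ElementTree as ET


def local(tag):
    """Drop the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def first_page(stream):
    """Return title, redirect target (or None) and text length of the first <page>.

    `stream` is any binary file-like object containing export XML.
    Returns None if the stream holds no <page> element.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local(elem.tag) != "page":
            continue
        info = {"title": None, "redirect": None, "text_length": 0}
        for node in elem.iter():
            name = local(node.tag)
            if name == "title":
                info["title"] = node.text
            elif name == "redirect":
                # Assumed layout: <redirect title="Target page" />
                info["redirect"] = node.get("title")
            elif name == "text":
                info["text_length"] = len(node.text or "")
        return info  # stop after the first page; the rest is never parsed
    return None


def first_page_of_dump(path):
    """Hypothetical convenience wrapper: open a .bz2 dump and inspect its first page."""
    with bz2.open(path, "rb") as stream:
        return first_page(stream)
```

Calling first_page_of_dump("enwiki-20180520-pages-meta-current1.xml-p10p30303.bz2")
on the downloaded file would show whether the first page is a redirect and
how long its stored text is, without decompressing the whole dump.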
_______________________________________________
Wikitech-ambassadors mailing list
https://lists.wikimedia.org/mailman/listinfo/wikitech-ambassadors
--
Nick Wilson (Quiddity)
Community Liaison, Wikimedia Foundation