Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Define and document the output format #4

Open
Klortho opened this issue Jun 2, 2012 · 8 comments
Open

Define and document the output format #4

Klortho opened this issue Jun 2, 2012 · 8 comments

Comments

@Klortho
Copy link
Collaborator

Klortho commented Jun 2, 2012

I've just been really looking at the "MediaWiki XML" for the first time. I created a wiki page where we can define and document it.

I guess I knew this before, but forgot, but it seems that the XML is really just incidental -- just a wrapper around the wiki text.

@Klortho
Copy link
Collaborator Author

Klortho commented Jun 2, 2012

Interestingly, I just found that a page that I exported using Special:Export is not valid! It had two validation errors:

  1. The value of /mediawiki/page/ns is "0", but it should be a positive integer
  2. comes after , but it should come before.

Of course, what we produce should work with whatever importers are used, regardless of what the XSD says. Presumably, this invalid export file format is correct, and the XSD is wrong.

@jgmorse
Copy link
Collaborator

jgmorse commented Jun 2, 2012

Yeah, the XSLT to generate the wrapper XML was fairly trivial, although it
did require a bit of reverse-engineering since that XML schema is not well
documented. I've got a few pertinent comments in the XSLT that might be
useful.

On Fri, Jun 1, 2012 at 11:58 PM, Chris Maloney <
[email protected]

wrote:

I've just been really looking at the "MediaWiki XML" for the first time.
I created a wiki page
where we can define and document it.

I guess I knew this before, but forgot, but it seems that the XML is
really just incidental -- just a wrapper around the wiki text.


Reply to this email directly or view it on GitHub:
#4

@Klortho
Copy link
Collaborator Author

Klortho commented Jun 2, 2012

I guess the output format should not be limited to the format of the main MediaWiki document, but also include a spec for how images and other media files are handled. I saw you mention some file/directory naming convention in an couple of recent emails -- was that a conversation you had offline? Do you have any pointers to those specs?

@jgmorse
Copy link
Collaborator

jgmorse commented Jun 2, 2012

That was in a conversation with Daniel. To make the XSLT generally useful,
we decided that renaming the image files, and accordingly modifying the
links in the wikicode, would happen outside the XSLT, meaning the WIkiMedia
XML will be post-processed to modify the links to match the new filenames.
You should talk to Daniel about how it would be implemented.

On Sat, Jun 2, 2012 at 12:59 AM, Chris Maloney <
[email protected]

wrote:

I guess the output format should not be limited to the format of the main
MediaWiki document, but also include a spec for how images and other media
files are handled. I saw you mention some file/directory naming convention
in an couple of recent emails -- was that a conversation you had offline?
Do you have any pointers to those specs?


Reply to this email directly or view it on GitHub:
#4 (comment)

@Daniel-Mietchen
Copy link
Collaborator

There are a few bugs reported around the xsd already:
https://bugzilla.wikimedia.org/buglist.cgi?quicksearch=XSD&list_id=119661
dunno whether any of these refer to what you were talking about,
though.

Daniel

On Sat, Jun 2, 2012 at 5:58 AM, Chris Maloney
[email protected]
wrote:

I've just been really looking at the "MediaWiki XML" for the first time.  I created a wiki page where we can define and document it.

I guess I knew this before, but forgot, but it seems that the XML is really just incidental -- just a wrapper around the wiki text.


Reply to this email directly or view it on GitHub:
#4

@Klortho
Copy link
Collaborator Author

Klortho commented Jun 2, 2012

There are a few bugs reported around the xsd already ... dunno whether any of these refer to what you were talking about, though.

One of them does -- 35264. For the other issue, I created 37305

@Daniel-Mietchen
Copy link
Collaborator

Thanks!

On Sat, Jun 2, 2012 at 4:41 PM, Chris Maloney
[email protected]
wrote:

There are a few bugs reported around the xsd already ... dunno whether any of these refer to what you were talking about, though.

One of them does -- 35264.  For the other issue, I created 37305


Reply to this email directly or view it on GitHub:
#4 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants