Revision as of 12:57, 19 March 2014 by Richajain1912 (talk | contribs) (Tools available for conversion between Wiki to Docbook and Vice-Versa:)

Personal Information

Name: Richa Jain
IRC Nick: richa (#brlcad on freenode)
Github: richajain

Background Information

Hi, I am Richa Jain, a third-year undergraduate student at the Indian Institute of Technology Roorkee. I am currently a developer at the Software Development Section of my institute. I have mostly worked on projects based on PHP and JavaScript, although I also use C and C++ regularly. I was also an intern at the Wikimedia Foundation last year, and I am now looking forward to working with BRL-CAD.

Project Information

Project Title

Synchronize wiki with Docbook

Brief project summary

BRL-CAD’s documentation is available in a variety of formats, but the best way of maintaining it is DocBook XML. However, a general user finds it easier to edit a wiki page than a DocBook file. Conversion between XML markup (DocBook) and wiki syntax is not easy: with tools like pandoc, some markup is lost and the data is ultimately ruined. So the main aim of this project is to synchronize the wiki and DocBook so that any change made to the wiki appears in the DocBook source and vice versa, without any loss of data. The project will be implemented as a separate MediaWiki extension that shows the DocBook XML in the edit window; after the user has edited it, the XML is processed to produce the HTML web output. If a user prefers to edit wiki markup, the DocBook XML can be downloaded after editing.

Detailed project description

Why is Docbook format important?

DocBook is an XML language that provides a large number of semantic element tags. These tags give structure to the document, and the structured document can be published in a variety of formats, including HTML, XHTML and PDF, without any change to the source. Many organisations maintain their documentation in DocBook format because it is easy to maintain. BRL-CAD also focuses on maintaining its documentation in DocBook format so that it can be managed more easily by a revision control system.
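To make the idea of semantic tags concrete, here is a minimal sketch that builds a tiny DocBook-style article with Python's standard library. The helper name `build_article`, the section title and the sample text are assumptions made up for illustration, and the DocBook 5 namespace and version attributes are left out for brevity.

```python
import xml.etree.ElementTree as ET


def build_article(title, paragraphs):
    """Build a tiny DocBook-style <article> (hypothetical helper, not BRL-CAD code)."""
    article = ET.Element("article")
    ET.SubElement(article, "title").text = title
    # One section holding all paragraphs; real documents nest sections as needed.
    section = ET.SubElement(article, "section")
    ET.SubElement(section, "title").text = "Overview"
    for text in paragraphs:
        ET.SubElement(section, "para").text = text
    return article


article = build_article("Example manual", ["First paragraph.", "Second paragraph."])
xml_text = ET.tostring(article, encoding="unicode")
print(xml_text)
```

The tags describe what each piece of text is (a title, a section, a paragraph) rather than how it looks, which is what lets the same source be published as HTML or PDF.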

Tools available for conversion from Wiki to Docbook and Vice-Versa:

Some existing tools convert DocBook to wiki markup or vice versa and could be used for this project:

1. Pandoc converts from one markup format to another. However, when converting between wiki markup and DocBook, some of the tags are lost, so the data is not preserved.

2. MediaWiki’s Collection extension allows users to convert wiki markup to DocBook XML. It is configured by default to use the server . For high traffic, we need to set up mwlib and our own render server, which will generate the DocBook XML output.

3. Fedora’s Beacon project, which was done as a GSoC 2009 project, is an XML editor that allows users to upload, create or edit DocBook, parses the XML and returns the HTML web output.

Basic Layout of the project:

1. The user will be given two edit options: wiki markup or DocBook XML.
2. If a user chooses to edit DocBook, the DocBook XML will be displayed in the editing window instead of wiki syntax. When the user clicks “save”, the entire XML is processed and converted to wikitext, and the HTML output is displayed.
3. If a user prefers to edit wiki markup, then after the user clicks “save” the wiki markup is converted to DocBook and the user is given an option to download the XML file.
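The save flow in the layout above can be sketched roughly as follows. This is only an illustration in Python, not MediaWiki extension code: `handle_save`, `SaveResult` and the two stand-in converters are hypothetical names, and the real converters would come from whichever approach is chosen.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class SaveResult:
    wikitext: str                    # what the wiki stores and renders as HTML
    docbook_download: Optional[str]  # XML offered for download, if any


def handle_save(mode: str, submitted: str,
                docbook_to_wiki: Callable[[str], str],
                wiki_to_docbook: Callable[[str], str]) -> SaveResult:
    """Dispatch a save according to the edit option the user picked."""
    if mode == "docbook":
        # The edit window held XML: convert it to wikitext for rendering.
        return SaveResult(docbook_to_wiki(submitted), None)
    if mode == "wiki":
        # The edit window held wikitext: keep it and offer the XML for download.
        return SaveResult(submitted, wiki_to_docbook(submitted))
    raise ValueError("unknown edit mode: %r" % mode)


# Stand-in converters for illustration only; they do no real conversion.
def fake_docbook_to_wiki(xml: str) -> str:
    return "wikitext derived from: " + xml


def fake_wiki_to_docbook(wikitext: str) -> str:
    return "<article><para>" + wikitext + "</para></article>"


result = handle_save("wiki", "Hello", fake_docbook_to_wiki, fake_wiki_to_docbook)
print(result.docbook_download)
```

Passing the converters in as parameters keeps the flow independent of the conversion approach, which is still an open question at this stage.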

Possible approaches for the project

There are basically two approaches for this project:
1. MediaWiki's Collection extension[1] converts wiki markup to DocBook XML, while projects like Beacon[2] go the other way and convert DocBook to web output. To keep the two kinds of edits synchronized, we can combine both technologies.
2. Instead of using both of these technologies, we can write our own MediaWiki extension that maps wiki markup to DocBook markup and serves both purposes at once. This can be done with MediaWiki parser functions.

Deciding which approach is more beneficial needs some research, so I will use the community bonding period to find a reliable method.
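As a rough illustration of the second approach, the sketch below maps a very small subset of wiki markup to DocBook elements. It is written in Python only to show the idea and is not the proposed extension. The function names are hypothetical, and it handles only `== Heading ==` lines, plain paragraphs, `'''bold'''` and `''italic''`; lists, links, tables, templates and deeper heading levels are not covered.

```python
import re
from xml.sax.saxutils import escape


def inline_to_docbook(text):
    """Escape XML specials, then map '''bold''' and ''italic'' to <emphasis>."""
    text = escape(text)
    # Bold must be handled first, otherwise ''' would be read as '' plus '.
    text = re.sub(r"'''(.+?)'''", r'<emphasis role="bold">\1</emphasis>', text)
    text = re.sub(r"''(.+?)''", r"<emphasis>\1</emphasis>", text)
    return text


def wiki_to_docbook(wikitext):
    """Turn '== Heading ==' lines and plain text lines into DocBook sections."""
    out = []
    open_section = False
    for line in wikitext.splitlines():
        line = line.strip()
        if not line:
            continue
        heading = re.fullmatch(r"==\s*(.+?)\s*==", line)
        if heading:
            if open_section:
                out.append("</section>")
            out.append("<section><title>%s</title>" % escape(heading.group(1)))
            open_section = True
        else:
            out.append("<para>%s</para>" % inline_to_docbook(line))
    if open_section:
        out.append("</section>")
    return "<article>" + "".join(out) + "</article>"


doc = wiki_to_docbook("== Intro ==\nThis is '''bold''' and ''italic'' text.")
print(doc)
```

A real mapping would also need the reverse direction and a way to preserve DocBook elements that have no wiki equivalent, which is exactly where generic converters lose data.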


  • A MediaWiki extension with two edit options: wiki and DocBook.
  • Providing XML markup instead of wiki markup when a user chooses "Edit via Docbook".
  • Mapping between DocBook and wiki markup, using one of the approaches described above, so that the two kinds of edits stay synchronized.
  • Adding an option to download the XML file once a user has finished editing wiki markup.
  • Creating documentation that describes the technologies used, for future developers (documentation for the code will be written as I write the code).

Development Schedule

I will start working on my project once my exams are over (on 30th April). The basic plan and estimated timeline of my project are as follows (the timeline may change a little during the project):

  • May 1 to May 18 (Community bonding period): Investigating the two approaches described above, running a few experiments with Beacon and the Collection extension, and deciding which of the two will be better.
  • May 19 to May 31: Making a skeleton for the extension and providing the two edit options, wiki and DocBook.
  • June 1 to July 15: Implementing the backend of the extension. This involves fetching the XML into the edit window and mapping XML to wiki markup and vice versa, using one of the two approaches.
  • July 16 to July 31: Testing and bug fixing
  • August 1 to August 10: Writing relevant Documentation
  • August 11 to August 18: Final Evaluations

Time availability

  • Time Zone: UTC+5:30 (IST-India)
  • Typical working hours: 12 noon to 1 am before July, 6pm to 3am after July

I am quite flexible with the timings and will try to dedicate most of my time to this project.

Past Experiences

  • I was a Google Summer of Code 2013 student with the Wikimedia Foundation. I worked on a MediaWiki extension (Annotator) that allows users to make inline comments on a wiki article. The project was based on PHP and JavaScript. The source code can be seen here.
  • My open source projects can be seen on GitHub.

Why me

I will work hard and bring this project to completion. I am always logged in to IRC (#brlcad) whenever I work, and I respond to emails quickly whenever I am awake. I will keep a repository of this project on GitHub and make frequent commits. I will always stay in touch with my mentor and the community, and I will ask the people around me for help whenever I am stuck.