Skip to content

MatrixMrsCatb

FrancisBond edited this page Aug 5, 2008 · 17 revisions

Cathedral and the Bazaar

[#candb This] is an early essay on Open Source. It is a little difficult (a lot of parentheticals and run-on sentences), but quite fun to read. It is about 800 sentences, which is small, but there are more essays if we want more data. There are several good translations (not all linked to the main page: e.g. a Spanish translation at <http://es.tldp.org/Otros/catedral-bazar/cathedral-es-paper-00.html%3E. It is freely available, but I (FCB) checked with the author anyway as a matter of courtesy and he was enthusiastic about us using it. There will be some clean up work involved in getting the translations aligned (there are several versions of the essay).

Translations

Wikipedia has a number of links to different translations: http://en.wikipedia.org/wiki/The_Cathedral_and_the_Bazaar (see on the left) (AF)

Thai version: http://linux.thai.net/~thep/catb/cathedral-bazaar/index.html

Language Participant Group URL Version
Catalan (ca) Barcelona http://www.danielclemente.com/apuntes/asai/recensio/catb.html ?
Chinese (zh) traditional Saarbrücken http://www.linux.org.tw/CLDP/OLD/doc/Cathedral-Bazaar.html (big5) 1.42
Chinese (zh) simplified Saarbrücken http://www.angeloliu.org/read-37.html ?
English (en) Stanford/Oslo http://www.catb.org/~esr/writings/cathedral-bazaar/cathedral-bazaar/ 1.57
French (fr) Toulouse http://www.linux-france.org/article/these/cathedrale-bazar/cathedrale-bazar.html 1.4
German (de) Saarbrücken http://gnuwin.epfl.ch/articles/de/Kathedrale/ 1.45
Greek, Modern (el) Saarbrücken/Athens http://howto.hellug.gr/howto/pub/html/cathedral-bazaar.html ?
Japanese (ja) Kyoto http://cruel.org/freeware/cathedral.html 1.40
Korean (ko) Seoul http://wiki.kldp.org/wiki.php/DocbookSgml/Cathedral-Bazaar-TRANS 1.32
Norwegian (no) Trondheim NTNU TBA
Portuguese (pt) Lisbon http://www.geocities.com/CollegePark/Union/3590/pt-cathedral-bazaar.html 1.42
Spanish (es) Barcelona http://es.tldp.org/Otros/catedral-bazar/cathedral-es-paper-13.html 1.28
Swedish (sv) Linköping http://home.swipnet.se/swi/KatB-se.html 1.51

At NiCT we also have a 201 sentence aligned subset of en,ko,zh,de,pt,it,fr which we use for MT testing.

Treebanking this text leads to several interesting issues with text cleansing: italics, embedded quotations, list numbers and so forth, that it would be goodto discuss more generally:

  1. Treating your users as co-developers is your least-hassle route to rapid code improvement and effective debugging.

  2. When I expressed this opinion in his presence once, he smiled and quietly repeated something he has often said: "I'm basically a very lazy person who likes to get credit for things other people actually do."

  3. And loosely-coupled collaborations enabled by the Internet, a la Linux, were frequent.

Clone this wiki locally