Note: An updated version of this review has been published in eLanguage on July 15th, 2008.
A common dashboard sticker in Ghanaian taxi’s has it that “If it must be done, it must be done well”, where ‘done well’ cleverly doubles as a brand name. This is largely irrelevant except by way of introducing WALS Online, the web version of the World Atlas of Language Structures, which really has been done well.
The massive 2005 volume and the somewhat bumpy interface of the interactive maps on the accompanying CDROM have been transformed into a slick web interface with all sorts of clever stuff going on behind the scenes. In a time where an increasing number of print sources is thrown online simply in the form of scans or huge PDF files, it is refreshing to see what true adaptation to the medium of hypertext can bring us. One consequence of this is that WALS Online, rather than a reprint or a second edition of WALS 2005, has become a separate publication, edited by the same authors but published by the Max Planck Digital Library.
Features and languages
WALS Online is a website consisting of five main parts. The first part, Features, functions as an index to the 142 maps and chapters of the original edition. The opening page of each feature is merely a configuration screen from where one can navigate to the chapter text or map, change the indicators used on the map, or select another feature for combined display. The chapter text is beautifully laid out, with an eye for good web typography. A minor issue is that after using the atlas for some time, the configuration screen starts to feel as an unnecessary barrier between the index and the texts and maps. It might have been better to make the content more directly accessible from the main index of features.
The second part, Languages, provides multiple interfaces to the languages that comprise the WALS dataset. Languages are indexed by name, by language family, and by country. Moreover, under ‘Choose by region’, a nice implementation of Google Maps enables the user to display the languages from any rectangular piece of the earth’s surface simply by dragging sliders on a world map (or by manually inputting latitudes and longitudes). For unclear reasons the display is limited to the first 100 results of the query; this limitation seems a bit out of place since several other types of requests (listing all Niger-Congo languages, for example) easily result in more than 100 languages being displayed on the map. Things missing here are (1) numbers of languages — it would be trivial to display the number of languages included in WALS from a given country, family, or region on the relevant pages — and (2) a big overview map of all the languages included.
References and authors
The third part is a database of all 5584 references perused in extracting the feature values for the individual languages. This part of the website is extensively cross-referenced from both the Feature and the Language pages. My only gripe with it is that apart from the cross-referencing, the sole interface offered to explore this part of the website is a search screen. Even if a list of all the references would be rather too long, it would have been nice if references could at least be browsed by language family and by language.1 The search screen, additionally, isn’t very generous in the hints it provides, blurting ‘You have not provided enough search criteria!’ at various creative search attempts. But that’s just me wanting too much, probably; for general purposes, the search works excellent and the display of search results and individual citations is also perfectly fine. (More on that below.)
One consequence of putting all the references in a central database is that users are dependent on the quality of the cross-referencing job. On the whole, this job seems to have been done very well. Nonetheless, it was not too difficult to find an unlinked reference included in the database (Darwin 1878, mentioned but not linked in chapter 142) and worse, an unlinked reference which is wholly absent from the database (Zeschan 2004 in chapter 140). Hopefully these glitches can be corrected in due course.
The fourth part is simply an index of all the authors that coded features and wrote the chapter texts, with links to the features. No biographical information is provided. The fifth part is called Newsblog. The link leads to messages in the category ‘News’ on a weblog that at the same time functions as a place where comments pertaining to individual Features/Chapters can be left. To that end, every feature page includes a link ‘discuss’ which leads to a post on the blog. This is an innovative way of soliciting comments. There are two more links in the main navigation bar: one leads to a contact page, and one leads to an online Help feature.
So much for the interface. What about stuff behind the scenes?
Under the hood
Behind the scenes of any web application is interesting stuff that average users need not worry about, but that is the foundation of usability and extensibility. First a small gold nugget: a downloadable KML file is provided for each page that includes a map. This is a nice touch which really characterizes the great attention to detail that makes using WALS Online such a pleasant experience.
The reference database is an example of how an online bibliography should be done.2 Not only is it fully searchable, but every single citation can also be exported to various formats. A very neat feature, invisible to most users, is the embedding of bibliographic data (in the COinS format) on individual reference pages. This allows OpenURL resolvers to look up the citation online or in specified databases. It also allows users of clever research tools like Zotero (review here) to directly save the citation to their library. A logical extension of this feature would have been to provide a COinS span for each individual feature/chapter, to make those as easily citable as the references from the database.
Another nice feature which may go unnoticed (though not unused) by many visitors is the URL layout, i.e. the shape of the web addresses of the individual pages. Most of the pages have nice, clean URLs without meaningless clutter like
index.php?&page=bla&id#=etc. Even individual citations in the reference database have their own URLs (e.g. http://wals.info/refdb/record/Ameka-1991). This is a big plus, since web-savy users tend to think of the URL as another interface to the website (see URL as UI).
HTML and CSS
(Most readers will want to skip this section, which nags about unimportant details.) Amidst all the greatness, I could identify one missed opportunity. (X)HTML provides the possibility to ‘tag’ language data with the corresponding ISO code of the language. This is a nice feature which, if more widely and rigorously used, would bring us closer to usable web corpora. Regrettably, WALS Online does not code its language data like that. Instead, non-semantic class attributes are used in the CSS to specify widespread Unicode fonts. That part of the CSS, for that matter, is hugely redundant and could have been done a lot more economical and transparent.
Talking about transparency, most pages are coded in reasonably clean HTML 4.01 Transitional markup. There are several markup problems however, such as empty elements resulting from sloppy template coding (e.g. an empty
ul ‘altnames’ if an alternate name is actually not available, as in the row ‘Routledge’ on the page for Ossetic); invalid attributes either deriving from the underlying Turbogears application or from AJAX-related stuff (like
div ex:role="viewPanel"); and simple coding oversights such as the fact that
id attributes always must start with a letter rather than a digit. The ‘Alternate names’ markup by the way is very messy: it uses a
table for unclear reasons, and within that table a very redundant unordered list is used. And while we’re at it, it is also a bit unsatisfactory that
h3 elements are used with a class attribute
heading-2. But I probably shouldn’t be nitpicking like that; overall, the HTML is perfectly functional and good for what it needs to do: deliver a compelling interface to the users of WALS Online.
Part of this compelling interface is also the use of AJAX in many places to minimize whole page reloads and to streamline the UI experience. AJAX is used for example in providing realtime search suggestions; in retrieving the languages for a given feature value; and in pulling together the feature list on the main Features page.3
In conclusion, then, I simply want to reiterate what I started with: WALS Online is a formidable linguistic resource done well. It bears all the hallmarks of a well-executed web application that is here to stay for years to come. The blurb on the book version read ‘I suspect that many linguists will not be able to resist curling up with this massive volume on rainy days just for the fun facts.’ I suspect the same holds for this online version. Why not make yourself a nice cup of tea and enjoy the World Atlas of Language Structures Online?
(Hat tip: Lev Michael at Greater Blogazonia)
- Haspelmath, Martin, Matthew S. Dryer, David Gil, and Bernard Comrie, eds. 2005. The World Atlas of Language Structures. Oxford/New York: Oxford University Press.
- Haspelmath, Martin, Matthew S. Dryer, David Gil, and Bernard Comrie, eds. 2008. The World Atlas of Language Structures Online. Max Planck Digital Library.
- Links to the references are provided on individual language, but this is not the same as browsing a list of references for a given language. ↩
- It is not fully free from errors, though; for example, the editor of the 2004 volume Coordinating Constructions is cited in one reference as “Haspelmath, M.” and in another as “Haspelmath, Martin”. You’ll see what I mean when you search for ‘Haspelmath’. ↩
- That one is a bit over the top BTW, as there is now a visible delay before the list of features is displayed on the main Features page. It would have been better to just pull the list together with the HTML as the list is relatively static anyway. If it were simply coded in a
table, the browser would be displaying the first rows before the user could say AJAX. ↩