Making and breaking iconicity

Making and breaking iconicity was the theme of a plenary lecture I gave at the 6th conference of the Scandinavian Association for Language and Cognition (SALC VI) in Lund. Here’s the opening slide:

Research on iconicity and sound symbolism has long focused on how iconic associations are made — finding universal crossmodal associations using pseudowords like bouba and kiki, and trying to understand what makes ideophones (and aspects of sign language) iconic and how they may help learning. This is great, but to understand underlying processes, we also need to understand what happens when we break iconicity. When you break a system (carefully, in a controlled way), you can get a new view of its internals. We need more work on breaking iconicity. What makes it dissipate from the lexicon? Which cues do we need to remove to make ideophones harder to guess? What happens to learning if we mess with the link between form and meaning? Under which circumstances is iconic sensitivity disrupted? Is iconicity more ‘broken’ in some people than in others, and what can we learn from individual differences? If we want to understand the linguistic mechanisms and cognitive processes involved in lexical iconicity, breaking iconicity is as important methodologically as making it.

When preparing this lecture, I noticed my own destructive streak: a lot of recent work by me and collaborators can be construed as focusing not just on making, but also breaking iconicity to understand underlying processes and mechanisms. A selection (bibliographic details and more papers here):

  1. In a corpus study of Japanese, we found that ideophones lose their prosodic foregrounding (and thereby a significant part of their iconic cues) when they become more deeply integrated in the utterance.
    (Akita & Dingemanse 2016 Journal of Linguistics PDF)
  2. In a corpus study of Siwu, I found that frequently used ideophones are more likely to lose their expressive prosody, syntactic independence, and iconic features, essentially turning into ordinary words over time
    (Dingemanse 2017 STUF — Language Typology and Universals PDF)
  3.  In a forced choice task with 203 ideophones from 5 languages, we found that iconicity in ideophones becomes nearly impossible to detect when you remove either segmental or prosodic cues, showing it relies on both. We basically tried to see how much was left of iconicity when you strip away various features of the signal. (Dingemanse, Schuerman, Reinisch, Tufvesson, Mitterer 2016 Language PDF)
  4. In a bouba/kiki task comparing people with and without dyslexia, we found that dyslexia disrupts sound-symbolic sensitivity, implicating cross-modal abstraction processes. (Drijvers, Zaadnoordijk & Dingemanse 2015 CogSci Proceedings PDF)
  5. In a learning study, we found that Japanese ideophones become harder to learn when you present them with their opposite meanings, breaking iconic links. This was not the case for adjectives, supporting their arbitrariness and by implication the iconicity of ideophones (Lockwood, Dingemanse & Hagoort 2016 JEP:LMC PDF)
  6. In a replication of the learning study with added EEG measures, we found consistent differences in the time course of neural signatures that fit with what others have found for multisensory integration, and that co-varied with independently assessed sound-symbolic sensitivity. I personally think that tapping into individual differences to find out what makes and breaks iconicity is one of the most promising ways forward for studies of iconicity. (Lockwood, Hagoort & Dingemanse 2016 Collabra PDF)

Here’s to more work trying to break iconicity!

Facts and and fiction about iconicity: the story of ideophones

Here’s the abstract for the keynote lecture I’ll be giving at the 11th Symposium on Iconicity in Language and Literature in Brighton, April 6-8, 2017 (site).

The notion of iconicity has seen a remarkable increase in prominence in recent years. No longer the marginal phenomenon it once was, it has become a canvas upon which we paint our wildest dreams about lexical structure, language learning, and the evolution of communication. Amid the flurry of exciting work it is sometimes hard to see what exactly iconicity is. Some divide it into subtypes, treating it as a semiotic relationship that comes in kinds. Others measure it by graded judgements, treating it as a substance that comes in degrees. Yet others use it as a predictor variable in experimental work, treating it as a property that can be present or absent. These diverse operationalizations point to a need for clarity about the empirical foundations of iconicity. Here I approach this goal from the perspective of research on ideophones, vivid sensory words found in many of the world’s spoken languages. Studies of ideophones have it all: daring claims of pervasive iconicity down to the level of speech sounds, counterarguments positing the utter irrelevance of anything iconic, and a variety of approaches trying to chart a middle way between these extremes. I report on a series of linguistic and experimental investigations of iconicity in ideophones. Starting from the use of ideophones in conversation —the primordial ecology of language and verbal art— I show that they are best understood as multimodal depictions: communicative acts that invite us to imagine what it is like to perceive the scene depicted. This basic fact helps explain a range of cross-linguistic observations about ideophones: from their marked forms to their special morphosyntax, and from their sensory semantics to their uses as direct appeals to experience. Careful psycholinguistic experimentation allows us to see how phonemes and prosody can come to function as iconic cues, why iconic ideophones are easier to learn than arbitrary adjectives, and how the cross-modal associations they thrive on may be related to synaesthesia. Ideophones challenge us to take a fresh look at language and consider how it is that our communication system combines multiple modes of representation.

Some relevant readings (a larger selection of papers is here):

Ideophones, expressiveness and grammatical integration

Ideophones —vivid sensory words found in many of the world’s languages— are often described as having little or no morphosyntax. That simple statement conceals an interesting puzzle. It is not often that we can define a word class across languages in terms of its syntax (or lack thereof). After all, most major types of word classes show intriguing patterns of cross-linguistic variation. There is no particular reason to expect that the morphosyntactic position or degree of embedding of, say, noun-like or verb-like words will be similar across unrelated languages. Indeed that is why typologists define comparative concepts primarily by reference to semantic rather than grammatical or morphosyntactic properties (Croft 2003; Haspelmath 2007).  Continue reading

Slides for a hands-on Zotero workshop

One of the key tasks scientists need to master is how to manage bibliographic information: collecting relevant literature, building a digital library, and handling citations and bibliographies during writing.

This tutorial introduces Zotero (, an easy to use reference management tool made by scholars for scholars. The tutorial covers the basics of using Zotero for collecting, organizing, citing and sharing research. Zotero automates the tasks of managing bibliographic data, storing and renaming PDFs, and formatting references. It also integrates with widely used text processors, and can synchronize your library across devices. There is no more need to search through disorganized file folders full of inscrutably named PDF files, to copy and paste references across documents, or to manually deal with pointless differences in citation styles. Ultimately, the point of using a reference manager is to free more time for real research.

Note: these are slides made for a hands-on workshop. They may not work well outside the context of a live Zotero demonstration. I share them because they may still contain some useful information.

What do you really need on this earth?

Natural conversations are a great source of data for all sorts of linguistic research. Linguists and conversation analysts usually study them primarily for their structure, not their content. This is not out of disinterest, but out of empirical prudence. Talk tends to support a wide range of interpretations. It is empirically safest to stick to observable structures and practices, or at most to interpretations furnished by the interlocutors themselves.

The excerpt below is translated from a corpus of natural conversations in Siwu, a language spoken in Ghana. Two elderly men are sitting in front of their house and chatting. They’ve just been talking about a fellow villager whose children are “giving him problems”. The long silence before Adom’s “So now.” signifies, among other things, that what comes now is likely a new topic. The exchange that follows is beautifully poetic both in terms of structure and topic.

Adom So now.
You have a keyboard.
Ben Mm.
A You have an uh. (1.5) this thing
B Mm.
A You have uh (0.8) radio.
B Mm.
A You have electricity.
B Mm.
A You have water.
B Mm.
A So then what really- what do you really need on this earth?
B What I need?
As for me, I don’t need anything except-
Except my bodily health.
A Just your bodily health.
B Mm.

One is tempted to talk about Maslow’s pyramid, material culture, and a whole lot of other things — but it is probably best to let the exchange speak for itself. (Translated from Siwu.)

Morning clouds in Akpafu-Mempeasem, 2009


How promotes poor metadata and plays to our vanity

giphyA while back some low quality citations started showing up on Google Scholar. They had titles like “CHAPTER 2 draft — email” and it was hard find actual bibliographic metadata. Google Scholar seemed to have scraped random PDFs uploaded on and decided it was worth counting the citations in them even in the absence of proper metadata. I shared this on Twitter and promptly forgot about it.

Then I got an email from someone asking me to say a bit more about my concerns with poor metadata. I decided to write it up in a blog post. I’m afraid it turned into a a bit of rant about how seems built not so much for sharing scientific information as for playing to our vanity. Sorry about that. Let’s start with the poor metadata issue, which turns out to be rather pervasive. Continue reading

Sound symbolism boosts novel word learning: comparing ideophones and adjectives

We have a new paper out. It’s actually been available since February in an online-first version, but for those of us who love page numbers and dead trees, the journal has now printed it in its August issue on pages 1274-1281. Citation:

Lockwood, G., Dingemanse, M., & Hagoort, P. (2016). Sound-symbolism boosts novel word learning (PDF). Journal of Experimental Psychology: Learning, Memory, and Cognition, 42(8), 1274-1281. doi:10.1037/xlm0000235

This is another one for which we’ve made available the stimuli —word lists and sound files— through OSF, contributing to our mission to make research from our lab replicable. Also, we have since replicated the results in a follow-up study where we also took EEG and individual difference measures.

I provide a quick summary in 2×3 points below. For a write-up that’s much more fun and has great illustrations, check out Gwilym Lockwood’s Sound symbolism boosts novel word learning: the MS Paint version. Continue reading

Why PLOS ONE needs page proofs

Note: I prepared this posting in August 2015, when PLOS ONE was due to publish a paper by us and I wanted to make sure they’d avoid the stupid typesetting errors they made in our 2013 paper. I used the numbers to convince them to show us proofs beforehand. To my surprise, they did, and I never got around to finishing the draft piece I had in the making. This week the issue flared up again following a comment by Dorothy Bishop, so I’ve decided to unearth my draft blog post and put it online.

Update: thanks Retraction Watch for giving some attention to this issue: PLOS ONE’s correction rate is higher than average. Why? Continue reading

What sound symbolism can and cannot do: new paper in Language

What sound symbolism can and cannot doWe have a new paper out in Language:

Dingemanse, Mark, Will Schuerman, Eva Reinisch, Sylvia Tufvesson, and Holger Mitterer. 2016. “What Sound Symbolism Can and Cannot Do: Testing the Iconicity of Ideophones from Five Languages.” Language 92 (2): e117–33. doi:10.1353/lan.2016.0034

The basic finding is this: people are sensitive to the meaning of ideophones they’ve never heard, even when they are produced out of context by a computer voice in a difficult forced choice task. Yet there is also reason for caution: the effect is not nearly as strong as what people have found for pseudowords like bouba and kiki.

As we note in the introduction, “there appears to be a tendency to either underplay or exaggerate the significance of iconicity in the study of language and mind”. In this paper we chart a middle way between these extremes. Here’s a quick summary in 3×3 points:

What we did:

  1. Sound symbolism (iconicity in spoken language) is usually studied using hand-crafted pseudowords in binary forced choice experiments (think bouba and kiki, as reviewed here), but there are three problems with such experimental designs: (i) they run the risk of inflating effect sizes, (ii) it is unclear how they relate to natural languages, and (iii) they usually don’t control for prosody.
  2. We designed a study to tackle these problems by (i) adjusting the binary choice task to be more realistic and harder, (ii) using real words and meanings from natural languages, and (iii) teasing apart prosody and segmental features. Essentially, we bring linguistic insights to bear on the psychological study of sound symbolism.
  3. We take 203 ideophones —lexical sound-symbolic words— from 5 languages and 5 semantic domains and present them to 80 participants in 4 versions: (i) full original recording, (ii) full speech synthesized version, (iii) prosody-only condition and (iv) phonemes-only condition. The versions help us control for variation due to different speakers and help us examine the contributions of prosody and segmental features.

What we found:

  1. People can choose the correct translation of ideophones at a level significantly above chance. So ideophones in Japanese, Korean, Semai, Ewe and Siwu are not fully arbitrary, as is normally assumed of words; they contain iconic cues that even people who don’t speak these languages can pick up.
  2. Sound ideophones are easiest to guess, but the other semantic domains (movement, texture, color/visual appearance, and shape) come out significantly above chance as well. However, the effect is much more modest than most bouba/kiki studies: in the best versions, people score 57.2% on average (where 50% would be chance level) — quite different from the 95% that has sometimes been claimed for pseudoword studies.
  3. Performance for the original and resynthesised stimuli is indistinguishable, so our speech synthesis method works. Performance is significantly better for the full versions (i-ii) than for the reduced versions (iii-iv), so both prosody and phonemes contribute to the effect (and neither alone is sufficient).

What we conclude:

  1. Findings based on pseudowords like bouba/kiki cannot be automatically translated into claims about natural languages. Ideophones combine iconicity and arbitrariness, and lexical iconicity in ideophones is best characterised as a weak bias, which is supported by multimodal performances in actual use and which may be amplified in cultural evolution (cf our TiCS paper).
  2. Prosody is just as important as segmental information in supporting iconic interpretations (as predicted here). Prior work, which has rarely controlled for prosody, likely overestimates the role of speech sounds and underestimates the role of intonation, duration and other prosodic cues.
  3. Speech synthesis offers a viable way to achieve experimental control in the study of sound symbolism. To stimulate its wider use we’re making available all stimulus materials, including the diphone synthesis source files we used to create them. Get them at MUSE or OSF.
Here’s the abstract:

Sound symbolism is a phenomenon with broad relevance to the study of language and mind, but there has been a disconnect between its investigations in linguistics and psychology. This study tests the sound-symbolic potential of ideophones—words described as iconic—in an experimental task that improves over prior work in terms of ecological validity and experimental control. We presented 203 ideophones from five languages to eighty-two Dutch listeners in a binary-choice task, in four versions: original recording, full diphone resynthesis, segments-only resynthesis, and prosody-only resynthesis. Listeners guessed the meaning of all four versions above chance, confirming the iconicity of ideophones and showing the viability of speech synthesis as a way of controlling for segmental and suprasegmental properties in experimental studies of sound symbolism. The success rate was more modest than prior studies using pseudowords like bouba/kiki, implying that assumptions based on such words cannot simply be transferred to natural languages. Prosody and segments together drive the effect: neither alone is sufficient, showing that segments and prosody work together as cues supporting iconic interpretations. The findings cast doubt on attempts to ascribe iconic meanings to segments alone and support a view of ideophones as words that combine arbitrariness and iconicity. We discuss the implications for theory and methods in the empirical study of sound symbolism and iconicity.

Some things you need to know about Google Scholar

Summary: Google Scholar is great, but its inclusiveness and mix of automatically updated and hand-curated profiles means you should never take any of its numbers at face value. Case in point: the power couple Prof. Et Al and Dr. A. Author, whose profiles I created following Scholar’s recommended settings (and a bit of manual embellishment). If you have a Scholar profile, make sure you don’t let Scholar update the publication list automatically without checking and cleaning up regularly. If you’re looking at somebody else’s profile, take it with a big pinch of salt, especially when they have a reasonably common name or when duplicate entries or weird citation distributions indicate that it is being automatically updated.  Continue reading