探花直播sounds of languages that died thousands of years ago have been brought to life again through technology that uses statistics in a revolutionary new way.

As a word is uttered it vibrates air, and the shape of this soundwave can be measured and turned into a series of numbers

John Aston

No matter whether you speak English or Urdu, Waloon or Waziri, Portuguese or Persian, the roots of your language are the same. Proto-Indo-European (PIE) is the mother tongue 鈥 shared by several hundred contemporary languages, as well as many now extinct, and spoken by people who lived from about 6,000 to 3,500 BC on the steppes to the north of the Caspian Sea.

They left no written texts and although historical linguists have, since the 19th century, painstakingly reconstructed the language from daughter languages, the question of how it actually sounded was assumed to be permanently out of reach.

Now, researchers at the Universities of Cambridge and Oxford have developed a sound-based method to move back through the family tree of languages that stem from PIE. They can simulate how certain words would have sounded when they were spoken 8,000 years ago.

Remarkably, at the heart of the technology is the statistics of shape.

鈥淪ounds have shape,鈥 explains Professor John Aston, from Cambridge鈥檚 Statistical Laboratory. 鈥淎s a word is uttered it vibrates air, and the shape of this soundwave can be measured and turned into a series of numbers. Once we have these stats, and the stats of another spoken word, we can start asking how similar they are and what it would take to shift from one to another.鈥澨

A word said in a certain language will have a different shape to the same word in another language, or an earlier language. 探花直播researchers can shift from one shape to another through a series of small changes in the statistics. 鈥淚t鈥檚 more than an averaging process, it鈥檚 a continuum from one sound to the other,鈥 adds Aston, who is funded by the Engineering and Physical Sciences Research Council (EPSRC). 鈥淎t each stage, we can turn the shape back into sound to hear how the word has changed.鈥

Rather than reconstructing written forms of ancient words, the researchers triangulate backwards from contemporary and archival audio recordings to regenerate audible spoken forms from earlier points in the evolutionary tree. Using a relatively new field of shape-based mathematics, the researchers take the soundwave and visualise it as a spectrogram 鈥 basically an undulating three-dimensional surface that represents the shape of that sound 鈥 and then reshape the spectrogram along a trajectory 鈥榮ignposted鈥 by known sounds.

While Aston leads the team of statistician 鈥榮hape-shifters鈥 in Cambridge, the acoustic-phonetic and linguistic expertise is provided by Professor John Coleman鈥檚 group in Oxford.

探花直播researchers are working on the words for numbers as these have the same meaning in any language. 探花直播longest path of development simulated so far goes backwards 8,000 years from , and likewise for other numerals. They have also 鈥榞one forwards鈥 from the PIE penkwe to the modern Greek pente, modern Welsh pimp and modern Englishfive, as well as simulating change from Modern English to Anglo-Saxon (or vice versa), and from Modern Romance languages back to Latin.

(Other audio demonstrations are available )

鈥淲e鈥檝e explicitly focused on reproducing sound changes and etymologies that the established analyses already suggest, rather than seeking to overturn them,鈥 says Coleman, whose research was funded by the Arts and Humanities Research Council.

They have discovered words that appear to correctly 鈥榝all out鈥 of the continuum. 鈥淚t鈥檚 pleasing, not because it overturns the received wisdom, but because it encourages us that we are getting something right, some of the time at least. And along the way there have also been a few surprises!鈥 探花直播method sometimes follows paths that do not seem to be etymologically correct, demonstrating that the method is scientifically testable and pointing to areas in which refinements are needed.

Remarkably, because the statistics describe the sound of an individual saying the word, the researchers are able to keep the characteristics of pitch and delivery the same. They can effectively turn the word spoken by someone in one language into what it would sound like if they were speaking fluently in another.

They can also extrapolate into the future, although with caveats, as Coleman describes: 鈥淚f you just extrapolate linearly, you鈥檒l reach a point at which the sound change hits the limit of what is a humanly reasonable sound. This has happened in some languages in the past with certain vowel sounds. But if you asked me what English will sound like in 300 years, my educated guess is that it will be hardly any different from today!鈥

For the team, the excitement of the research includes unearthing some gems of archival recordings of various languages that had been given up for dead, including an Old Prussian word last spoken by people in the early 1700s but 鈥榖orrowed鈥 into Low Prussian and discovered in a German audio archive.

Their work has applications in automatic translation and film dubbing, as well as medical imaging (see panel), but the principal aim is for the technology to be used alongside traditional methods used by historical linguists to understand the process of language change over thousands of years.

鈥淔rom my point of view, it鈥檚 amazing that we can turn exciting yet highly abstract statistical theory into something that really helps explain the roots of modern language,鈥 says Aston.

鈥淣ow that we鈥檝e developed many of the necessary technical methods for realising the extraordinary ambition of hearing ancient sounds once more,鈥 adds Coleman, 鈥渢hese early successes are opening up a wide range of new questions, one of the central being how far back in time can we really go?鈥

Audio demonstrations are available here:

Inset image: Spectrograms showing how the shape of the sound of a word in one language can be morphed into the sound of the same word in another language; credit: John Aston.


Medical imaging reshaped

探花直播statistics of shape are not just being used to show how different languages relate to each 鈥 they are also being used to improve the analysis of medical images.

Just as soundwaves have a shape that can be analysed using statistics, so do the patterns of neurons interacting with each other or the dimensions of the surface of a tumour. Now a new research Centre will develop tools that use the mathematics of the shapes found in medical images to improve diagnosis, prognosis and treatment planning for patients.

探花直播, one of five 鈥榤aths鈥 centres recently funded by 拢10 million from EPSRC, is co-led by Aston and Dr Carola-Bibiane Sch枚nlieb from the Department of Applied Mathematics and Theoretical Physics in Cambridge.

鈥 探花直播new methodologies will allow clinical medicine to move beyond one person reading single scans, to automated systems capable of analysing populations of images,鈥 explains Sch枚nlieb. 鈥淎s a result, clinicians will have far greater scope to ask complex questions of the medical image.鈥

It鈥檚 already possible to extract statistical information from an image of a patient鈥檚 thigh bone, turn the data into a template for comparison with those from other people in the population, and then ask whether a particular shape of bone is more prone to being broken than others in the elderly.

Most organ scans split the image into many elements, which are then analysed voxel by voxel. 鈥淏ut complex structures like the heart and the brain should be analysed holistically,鈥 explains Dr James Rudd, from the Department of Medicine, who leads the clinical interaction with the Centre. 鈥 探花直播tools we are developing will enable the analysis of organs like the brain as single objects with millions of connections.鈥

探花直播Centre brings together researchers and clinicians from applied and pure maths, engineering, physics, biology, oncology, clinical neuroscience and cardiology, and involves industrial partners Siemens, AstraZeneca, Microsoft, GSK and Cambridge Computed Imaging.



探花直播text in this work is licensed under a . For image use please see separate credits above.