Dan's Fabulous and Implausible Blog

This is worth considering when trying to transfer genomic findings into a clinical test.

March 26, 2015

This is worth considering when trying to transfer genomic findings into a clinical test. Unfortunately no solution is provided.
http://simplystatistics.org/2015/03/19/a-surprisingly-tricky-issue-when-using-genomic-signatures-for-personalized-medicine/

Some unusual science for the day: Using modern computer science to understand just where Rock n' Roll came from.

March 10, 2015

Originally shared by Yonatan Zunger

Some unusual science for the day: Using modern computer science to understand just where Rock n' Roll came from. Normally, people who talk about the history of music break it down into genres which have a lot to do with marketing, country of origin, and so on, and talk about individual bands as historical influences on each other. But these boundaries can be quite arbitrary: for example, "gospel" and "rock" are considered very far apart, but if you go back to the 1950's, rock was so influenced by gospel that it was hard to tell them apart at times.

So these researchers tried something else. They analyzed the songs which topped the US charts from 1960 to 2010, about 17,000 in all. For each song, they examined features not of the marketing around the music, but of the music itself: instrumentation, chord changes, timbre, types of harmony. They then used a technique called "k-means" to find the natural clusters into which the songs fell by these measures, and found thirteen natural groupings. To understand these groupings better, they adapted a technique from molecular genetics which is used to understand the functions of genes: they took song tags from last.fm, and did a mathematical analysis to see which song tags were most strongly associated with each cluster. (For example, if one cluster had songs tagged "R&B" far more often than the other clusters did, it's a good sign that this tag describes the cluster)

They came up with 13 clusters -- what you might call the "purely musical" genres of the music, since they're based entirely on the songs' musical qualities, not on the politics or marketing around them. These ranged from cluster #2 (hip hop / rap / gangsta rap / old school) to #9 (classic rock / country / rock / singer-songwriter) to #8 (dance / new wave / pop / electronic).

The image you see a bit of below is the history of the popularity of these genres over time, with 1960 at the bottom of the graph and 2010 at the top. You can see the sudden rise of rap (leftmost column), the gradual vanishing of jazz and the blues from the charts (the dwindling figure center-right), and the coming and going of hard rock (the dark blue bubbly thing at the center).

Interestingly, they have answered one important historical question, about the significance of the British Invasion: apparently no, this was not the key catalyst of the revolution in American music; the revolution was already well underway before the Beatles arrived in 1964. (Which shouldn't really surprise people too much, given that this is where rock came from)

If you look at the bottom of the image, you'll notice a tree structure which the summary on the arXiv blog doesn't talk about; you'll have to read the article itself (http://arxiv.org/pdf/1502.05417v1.pdf) for that. It's basically a genetic tree of these genres of music. This is constructed using the same techniques of "genetic relatedness" which are used to create modern evolutionary trees of species, only instead of being based on DNA snippets, they're based on those underlying musical features like chord changes which were the basis of the clustering. So you can see (for example) that hip-hop comes from a completely different ancestry than all the other observed genres, while pairs like country and classic rock are close relatives.

Why is this interesting? Apart from the obvious fun of studying music history using the methods of molecular biology, it shows the ways in which these techniques can be used to describe a whole host of things. To make this work, what you need is a large sample of items to classify (here, songs); for each item, a large collection of features to measure (a few hundred at least; in this case, things like chord changes and instrumentation); and if you want to be able to describe the function of these features, have functional labels (here, song tags) for at least a good collection of the items you want to classify. Then you can do a "genetic analysis," grouping them into families, observing family trees, and (if you have additional data, like the year of release in this case) understand things like the evolution of these groups over time or space.

What's marvelous is that you can do this sort of analysis with all sorts of things. Do it on news articles, with the features being words, and you'll discover that they cluster into stories, which in turn cluster into subjects. (Why? Because you'll see, say, a bunch of stories with the word "Brezhnev" which also include references to the USSR, and these come and go over time, and at later times start to also include stories about "Andropov," "Chernenko," and "Gorbachev." Depending on how finely you slice these, you can either see the life of a politician, or the history of the Soviet Union.) Do it on a city's road network, with features involving the number of cars on each chunk of the road at a given time, and you'll discover... well, I'm not sure what you'll discover. I don't know if anyone's ever done that analysis. But you could do it and find out.

This is the real magic of data analysis: it gives you new ways to stare at what seem like hopelessly complex piles of data, and see meaningful patterns.
https://medium.com/the-physics-arxiv-blog/genetic-data-tools-reveal-how-pop-music-evolved-in-the-us-48ad60bf495b

Around 7 min mark Prof. Cooper talks about our recently published work.

March 05, 2015

Around 7 min mark Prof. Cooper talks about our recently published work.
http://www.bbc.co.uk/iplayer/episode/b05398fn/look-east-east-04032015

Professor Ros Eeles briefly talks about our research.
https://vimeo.com/121053321

Prostate Cancer Phylogeny paper press releases

March 02, 2015

Prostate Cancer Phylogeny paper press releases

* UEA: https://www.uea.ac.uk/mac/comm/media/press/2015/mar/colin-cooper-prostate-cells
* Cancer Research UK: http://www.cancerresearchuk.org/about-us/cancer-news/press-release/2015-03-02-healthy-looking-prostate-cells-mask-cancer-causing-mutations
* http://www.icr.ac.uk/news-archive/healthy-looking-prostate-cells-mask-cancer-causing-mutations

Here's my quote from the UEA one:
Co-author Daniel Brewer, from Norwich Medical School and The Genome Analysis Centre (TGAC) at Norwich Research Park, said: “This study has sequenced the whole genetic sequences of multiple samples from the prostate for the first time - both from tumours and apparently normal tissue."

“Surprisingly there were a large number of abnormal genetic changes found in the normal prostate tissue, suggesting that the prostate as a whole is a hot bed of genetic instability and is primed and ready for tumours to develop. This gives us important clues to how prostate cancer develops and has potential consequences to how it is treated.”
https://www.uea.ac.uk/mac/comm/media/press/2015/mar/colin-cooper-prostate-cells

Our work on the front page of Nature Genetics website. Not the best figure to choose, but who am I to complain.

March 02, 2015

Our work on the front page of Nature Genetics website. Not the best figure to choose, but who am I to complain.

Analysis of the genetic phylogeny of multifocal prostate cancer identifies multiple independent clonal expansions in...

March 02, 2015

Analysis of the genetic phylogeny of multifocal prostate cancer identifies multiple independent clonal expansions in neoplastic and morphologically normal prostate tissue

This bit of work has been my main focus for a long time and it is really great that it has finally been published in Nature Genetics. This study, for the first time, has sequenced the whole genetic sequences of multiple samples from the prostate, both from tumours and apparently normal tissue. Surprisingly there were a large number of abnormal genetic changes found in the normal prostate tissue, suggesting that the prostate as a whole is a hot bed of genetic instability and is primed and ready for tumours to develop. This gives us important clues to how prostate cancer develops and has potential consequences to how it is treated.
http://www.nature.com/ng/journal/vaop/ncurrent/full/ng.3221.html

Search This Blog

Dan's Fabulous and Implausible Blog

Posts

This is worth considering when trying to transfer genomic findings into a clinical test.

Some unusual science for the day: Using modern computer science to understand just where Rock n' Roll came from.

Around 7 min mark Prof. Cooper talks about our recently published work.

Our work also appeared in the local Norfolk press.

An article on our research in The Times.

Independent article on our Phylogeny prostate cancer paper.

Professor Ros Eeles briefly talks about our research.

Prostate Cancer Phylogeny paper press releases

Our work on the front page of Nature Genetics website. Not the best figure to choose, but who am I to complain.

Analysis of the genetic phylogeny of multifocal prostate cancer identifies multiple independent clonal expansions in...