Submitted by millionsong on Mon, 07/25/2011 - 10:57
Following a few questions we received (most recently from Sam Ferguson, thanks!), here is a somewhat detailed account of how loudness is computed in the Million Song Dataset. What follows is a (slightly modified) answer from Tristan Jehan:
Submitted by millionsong on Fri, 07/15/2011 - 14:45
Today happens to be the acceptance date for both ISMIR (nope, delayed!) and WASPAA, and we are very excited to see publications using the MSD finally being released.
Submitted by millionsong on Sat, 06/25/2011 - 06:39
3 hours before hack/reduce, I've decided to write down a few ideas for the participants who want to play with the MSD. Half of this is a list of resources, half of this is a crash course on the MSD. I reserve the right to update this info during the day.
Submitted by millionsong on Mon, 04/11/2011 - 17:32
Quick reminder: 237,662 bags-of-words (restricted to the top 5,000 words), out of ~779K MSD tracks matched with the musiXmatch API. http://millionsongdataset.com/musixmatch
Submitted by millionsong on Sun, 03/27/2011 - 12:42
Following advice from our Quality Assessment office (i.e. Dan), we added the information from the SecondHandSongs dataset to the track_metadata.db SQLite database. You can download the new version from this site (not from infochimps!).