Submitted by millionsong on Fri, 03/25/2011 - 19:59
This is not really an MSD post, but I just want to remind everyone of what we've been hearing for some time now: Google is about to enter the music business!
Submitted by millionsong on Wed, 03/16/2011 - 16:29
As was recently announced, we were very fortunate to partner with the website www.secondhandsongs.com to identify covers in the Million Song Dataset.
Submitted by millionsong on Tue, 03/15/2011 - 00:06
We took a deeper look at the problem of duplicate songs in the MSD. Take a task like cover song recognition. If your algorithm performs incredibly well but keeps returning some unknown song A as the closest cover, ahead of the known cover B, it might get frustrating. Especially if it turns out that A is a duplicate of B!
Submitted by millionsong on Wed, 03/09/2011 - 13:18
Since the beginning of the MSD project, Brian has questioned the choice of HDF5 because it's... weird and unknown, I guess?
It makes me wonder, what else could I have chosen? And now that it's done, what converter should I build?
Submitted by millionsong on Tue, 02/15/2011 - 16:42
Matt Hoffman recently drew my attention to this fact: there are many songs with the same title and artist name in the dataset. Why did we allow that?
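To get a feel for how such entries could be spotted, here is a minimal Python sketch that groups tracks by artist name and title. It is only an illustration: the function name, the `(track_id, artist, title)` tuple layout, and the sample records are all hypothetical and are not taken from the MSD or its code, and the matching (case-insensitive, whitespace-trimmed) is an assumption rather than the method we actually used.

```python
from collections import defaultdict


def find_same_artist_title(tracks):
    """Group track IDs by (artist name, title), ignoring case and
    leading/trailing whitespace. Returns only groups with 2+ tracks."""
    groups = defaultdict(list)
    for track_id, artist, title in tracks:
        key = (artist.strip().lower(), title.strip().lower())
        groups[key].append(track_id)
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


# Hypothetical sample records, not actual MSD entries.
sample = [
    ("TR0001", "Artist A", "Song One"),
    ("TR0002", "artist a", "Song One "),
    ("TR0003", "Artist B", "Song Two"),
]

duplicates = find_same_artist_title(sample)
for (artist, title), ids in duplicates.items():
    print(artist, "/", title, "->", ids)
```

Matching on exact strings like this only flags candidates; two tracks with the same artist and title may still be different recordings (live versions, remixes), so a real check would need more than metadata.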
The idea for the Million Song Dataset came to us a couple of years ago, when we were discussing possible ideas with the Echo Nest for an NSF GOALI (Grant Opportunities for Academic Liaison with Industry) grant. We were looking for an idea that wouldn't be possible without an academic-industrial collaboration, and that would appeal to the NSF as contributing to scientific progress.