<< Back
Last.fm Dataset - 1K users
DOWNLOAD lastfm-dataset-1K.tar.gz (~642Mb)
======
README
======
Version 1.0, May 2010
. What is this?
This dataset contains <user, timestamp, artist, song> tuples collected from Last.fm API,
using the user.getRecentTracks() method.
This dataset represents the whole listening habits (till May, 5th 2009) for nearly 1,000 users.
. Files:
userid-timestamp-artid-artname-traid-traname.tsv (MD5: 64747b21563e3d2aa95751e0ddc46b68)
userid-profile.tsv (MD5: c53608b6b445db201098c1489ea497df)
. Data Statistics:
File userid-timestamp-artid-artname-traid-traname.tsv
Total Lines: 19,150,868
Unique Users: 992
Artists with MBID: 107,528
Artists without MBDID: 69,420
. Data Format:
The data is formatted one entry per line as follows (tab separated, "\t"):
userid-timestamp-artid-artname-traid-traname.tsv
userid \t timestamp \t musicbrainz-artist-id \t artist-name \t musicbrainz-track-id \t track-name
userid-profile.tsv:
userid \t gender ('m'|'f'|empty) \t age (int|empty) \t country (str|empty) \t signup (date|empty)
. Example:
userid-timestamp-artid-artname-traid-traname.tsv:
user_000639 \t 2009-04-08T01:57:47Z \t MBID \t The Dogs D'Amour \t MBID \t Fall in Love Again?
user_000639 \t 2009-04-08T01:53:56Z \t MBID \t The Dogs D'Amour \t MBID \t Wait Until I'm Dead
...
userid-profile.tsv:
user_000639 \t m \t Mexico \t Apr 27, 2005
...
. License:
The data contained in lastfm-dataset-1K.tar.gz is distributed with permission of Last.fm.
The data is made available for non-commercial use.
Those interested in using the data or web services in a commercial context should contact:
partners [at] last [dot] fm
For more information see Last.fm terms of service
. Acknowledgements:
Thanks to Last.fm for providing the access to this data via their web services.
Special thanks to Norman Casagrande.
. References:
When using this dataset you must reference the Last.fm webpage.
Optionally (not mandatory at all!), you can cite Chapter 3 of this book
@book{Celma:Springer2010,
author = {Celma, O.},
title = {{Music Recommendation and Discovery in the Long Tail}},
publisher = {Springer},
year = {2010}
}
. Contact:
This data was collected by Òscar Celma @ MTG/UPF