World’s largest shadow library brags it scraped 300TB of Spotify music, metadata

HiroTheProtagonist

Ars Tribunus Angusticlavius
6,542
Subscriptor++
[at the risk of being off topic]

How do you get paid when people listen to music on YouTube Music?

I pay for YT Premium so the YT Music is my default app, and I've always wondered how the artists get paid since so much on YouTube seems randomly uploaded -- and I'd assume it's often not by the creators.
IIRC YTM usually tries to find official uploads first, but in the absence of official channels they'll often pull audio from random channels' uploads. It's usually most noticeable by the bitrate drops.

As for payment, it's probably less than Spotify.
 
Upvote
9 (10 / -1)
So can anyone tell me why Anna´s Archive even exist? You can download copies of the three books I have published, multiple formats, for free. Absolutely free. No barriers whatsoever. How the hell this is online or remotely legal?
The servers including DNS are probably in some country that ignores IP laws, because (other people's) "information wants to be free"
 
Upvote
24 (24 / 0)

lwdj905

Wise, Aged Ars Veteran
125
It kinda wild looking through the genres represented.
https://annas-archive.org/blog/backing-up-spotify.html

I recognize that Spotify pays nothing per stream and make a reasonable effort to purchase my rando Dream Theater vinyl off their website so that they receive the most margin possible. (I'm also one of those nuts who buys the Blu-ray for the Atmos mixes, thanks Steven Wilson!!)

Instrumental Convergence comes to mind with this move by Anna's Archive. They posit some benign, selfless goal but in actuality create a greater harm. I'm not really sure what their end goal is, and they might not either. That's the danger, justifying this move creating the hazard of others repeating it.
 

Attachments

  • Screenshot 2025-12-22 172740.png
    Screenshot 2025-12-22 172740.png
    139.3 KB · Views: 120
Upvote
12 (12 / 0)
What kind of NAS setup do you have where you have 300TB of free space?

I mean, I just got a new NAS setup a few months ago... and I'm a tiny fraction of that space.
My Plex server is approaching filling up my total of 100tb and I’m starting to sweat what free space I have. I’ve upgraded the drives over the last few years and if I can find an economical way to get over the 20tb per drive cost I’m definitely going to. My limit right now is that I can only have 8 drives at a time in my synology rack and I’m not really that interested in buying their expansion rack for it. I’m also too lazy to try and figure a way to move it all to a larger rack with a different OS. Having that amount of space is not something I’d want I don’t think, but I’m at 100tb total space without a lot of work and not a terrible amount of money having waited for sales and finding drives to shuck to get really good deals.
 
Upvote
12 (12 / 0)
Anna's Archive's ethics are dodgy, but are they worse than funding the river of money that goes to Josh Rogan? Spotify subscribers might now have a choice between two bad options.

Is anything that could de-fund Rogan all bad?

Also, I looked in vain for what a certain group would perceive as the lede. Sorry Ashley, but are the torrents mp3 or lossless flac? For that group, this may not be an opportunity at all.
You could just subscribe to Apple Music or YouTube music? You do realize that the large majority of these songs will be on non-Spotify platforms?
 
Upvote
22 (22 / 0)
This is infringement regardless of what Anna's Archive calls it.

A backup is permitted only for legally obtained information. None of this was legally obtained. It's pure piracy regardless of Spotify and the music publishing industry's dubious practices. No one is getting hurt here except the artists and perhaps the users of the Archive when they (the Archive) get sued out of existence. 300 TB of willfully pirated music at scale? Yeah, we're talking what's likely to be the second largest copyright infringement lawsuit to date. (The largest is likely to be Author's Guild v. OpenAI where Altman's plagerism machine initially scraped millions of books from shadow libraries instead of obtaining materials legally.)
 
Upvote
23 (26 / -3)

Griking

Ars Scholae Palatinae
721
Spotify? The Archive?

Both?

Artists should be paid appropriately for their work.


Artists weren't complaining though when the average album or CD was $18 and if you were only interested in a song or two on it then oh well....

Damaged the CD? Oh well, buy another.

I'm curious what people think the solution is. Raise Spotify monthly fees to $50 giving the increase to musicians? Ok, which musicians? If distributed by plays you'd just be making Weekend and Taylor Swift richer. Nevermind that most subscribers would cancel at that rate anyway. Or do you think CDs are ready to make a comeback?
 
Upvote
-15 (21 / -36)
I was a bit curious about the 37% of songs representing 99% of streams...

From this site: https://www.gearnews.com/spotify-streaming-report-2024-tech/

It seems like the cutoff is probably somewhere in the 50-100 total streams.

It's crazy to me how many songs have zero streams (25% or so?) and how much things are concentrated in a few 'billionaires' at the top.

Like with a lot of things, it seems like there should be a better distribution of wealth...
That's the findability/too much information problem. How do you find music in genres and styles you like when there's probably petabytes worth of chaff to search through? I'm no fan of GenAI, but this happens to be a field where useful AI tooling would be helpful. I just don't think we'll ever get AI tools that do anything but reinforce and expand already existing divides. We see the same problem with "viral" and shallow, astroturfed influencer highlighted points of interests and eateries showing up and crowding out in navigation apps and search engines instead of equally deserving mom and pop places.
 
Upvote
9 (9 / 0)
Post content hidden for low score. Show…
Post content hidden for low score. Show…

Marcus Andreus

Ars Scholae Palatinae
888
Subscriptor
Lossy. No point. The aural equivalent - when you have the audio equivalent of a 4K TV setup - of watching DVD instead of 4K on that system.
Apple Music has a ton of lossless stuff.

And when you consider how much an audio file weighs, a ton is quite a lot. :D
 
Upvote
34 (36 / -2)

KChat

Ars Scholae Palatinae
810
Subscriptor
[at the risk of being off topic]

How do you get paid when people listen to music on YouTube Music?

I pay for YT Premium so the YT Music is my default app, and I've always wondered how the artists get paid since so much on YouTube seems randomly uploaded -- and I'd assume it's often not by the creators.
Pretty sure that's where YT's Content ID system comes in - if it flags a song as belonging to an artist, it gives the monetization to that artist, regardless of who uploaded it. Part of the deal they made with the RIAA to not copyright strike everything into oblivion...
 
Upvote
32 (32 / 0)
So, I actually really liked Anna’s Archive, you could find books on many things that were useful in the communities I ran with, textbooks on more obscure topics that allowed us to learn how to keep us safe and gave us new ways of approaching projects. When you are extremely cash poor, it was a great resource to share an AA link and give someone who wouldn’t otherwise have the access the ability to really dive into the topic past what you could find on Wikipedia.

This is not that. It’s sad, but what felt like a genuinely countercultural resource has gone the way of many others aspects of the Internet that, when given an opportunity, will trade mission for profit.
 
Upvote
32 (35 / -3)

Marlor_AU

Ars Tribunus Angusticlavius
7,670
Subscriptor
That's the findability/too much information problem. How do you find music in genres and styles you like when there's probably petabytes worth of chaff to search through?
There used to be an elegant solution to this: regional markets.

A band would start out and develop a local following in their home town, playing live music venues like pubs, clubs and university campuses. They would hone their craft, work out what audiences like, and develop their style. If they gathered enough attention, they might start to get airplay on local radio or a larger indie station. They might land a record deal on a smaller label, then from there strive for national recognition. If they were really lucky, and had sufficient talent, they might "break out" and achieve international success.

This may not have been an entirely fair system, with some bands with clear talent being stuck in a local rut, but it at least offered a viable pathway to success. Or, at least, to a solid side-income as a "locally famous band". We had one band in my home town that spent decades on the verge of breaking out, occasionally making a showing in the national charts, but never achieving widespread fame. But they still managed to make a living from local gigs, events, support tours and album sales.

That's all much harder now. The live music scene for smaller acts is barely hanging on, and local radio and TV are being supplanted by global platforms, so the traditional "apprenticeship" of becoming locally-notable before achieving wider success is far more difficult than it was in the past.

It seems the more practical path now is to attempt to find fame on social media. But that's more of an all-or-nothing deal. With feeds curated by international platforms, there's a much smaller chance of getting local attention, and a greater chance you're joining in with the many other performers who are just playing into the void.
 
Upvote
28 (29 / -1)

idontevenexercise

Smack-Fu Master, in training
12
Subscriptor
Straight up theft. Nobody should be excusing this, or wondering if the motives were perhaps benevolent.

Yes, Spotify is a shitty platform. Streaming royalties are exploitative. However, two wrongs don't make a right. All that Anna's Archive have done here is pile on to fuck the artists some more and further diminish the monetary value of the music that we all enjoy.

This is why it's so important to support artists. Go see them in concert if they are still performing. Actually buy albums. Vinyl records are fun. Buy a turntable and enjoy yourself. If you like music, it shouldn't be about getting as much as you can, as conveniently as possible, for as little money as possible. Otherwise, the future is AI generated slop tunes with product placement from AI generated influencers.

I hope Anna's Archive gets sued into oblivion and someone actually goes to jail for this. I hope artists start to move away from companies like Spotify, Apple Music, Pandora, Tidal, and all the rest.

It's a sad day for music.
 
Upvote
11 (21 / -10)

vlam

Ars Tribunus Angusticlavius
7,131
Oh please, I know of zero people who think they're entitled to free music.

Have they pirated? Swapped cassettes, recorded songs off their boombox onto cassettes? Sure.

Not because they're entitled, but because dumb, desparate artists gave away their stuff for free.

You're like the guy who ran up to me on the sidewalk, shoved his CD on my face, and then my hands, yelling about being the next Kanye, and then started demanding money for it. I just shoved the CD back into his hands.

And, if you don't like people listening to your music for free, don't broadcast it for free. It's that simple. It's a model that worked for decades, if not centuries.

The problem is entitled artists who want to be the next STAR, do everything they can to get their music "out there" and prostitute themselves to the labels and money men. And then once they shoved their music in everyone's faces, they want to start demanding money for it.

Just play your music, and make people pay to listen. Don't shill to Spotify and the ad gods/demons, who want to sell ads not music (you have that backwards, btw)Don't shill to the radio (same model). Don't put your stuff on the Internet for free.

Musicians are not entitled to make a living. They can ask people to pay for their product instead of giving it away, and that's fine. But copyright is a very new idea, and some of the best music ever made was made before it existed. So it's not some divine right. It's a nice by-product of the legal structures in most modern societies, and if you leverage successfully, bully for you. But it is not a basic human right, just a nice-to-have in privileged societies.
Musicians cannot prevent the radio from playing their music. Which is honestly how it should be. The point of art is public enrichment. Sure, artists should receive compensation, but creating art to lock it away is fucking silly. If you've made it up share with the world, share it.

The problem is that art cannot sustain the livelihood of everyone who wishes to be an artist. At least not how we've gone about it for decades.
 
Upvote
6 (15 / -9)

Fatesrider

Ars Legatus Legionis
24,977
Subscriptor
Sounds like Anna's Archive's engineers were so preoccupied with whether or not they could that they didn't stop to think if they should.
Which is what will probably adorn humanity's gravestone.

It could be shortened to "They didn't stop to think" and still preserve the messaging.
 
Upvote
9 (11 / -2)
The same way they did before recorded music, the same way most still do..live
Sure if you're commercial enough in your own town. If you have to travel to tour, it's more expensive and complicated than ever (as are the tickets). Let's say you're in a well known metal band with a few million fans scattered around the world, as opposed to a pop star in a small country with far less followers, guess which one can make a living easier by playing live? Guess which one of them quite possibly still has a "real" job?

Apart from a few percent of artists, the only money in playing live comes from playing cover songs, weddings, and such. Getting paid to play someone else's music. Personally I think those people are even worse than pirates, but that's just me.
 
Upvote
-1 (6 / -7)

sir_trackmenot

Ars Scholae Palatinae
750
What kind of NAS setup do you have where you have 300TB of free space?

I mean, I just got a new NAS setup a few months ago... and I'm a tiny fraction of that space.
I've got 20 usable over 4 drives. Density has improved since so I could get 40 over 4 now.

It's not a stretch to imagine a 40 drive NAS in someone's home. Not everyone's home, but someone with a bit of cash and a tech interest.
 
Upvote
7 (7 / 0)

dark.jade

Smack-Fu Master, in training
28
Subscriptor
So can anyone tell me why Anna´s Archive even exist? You can download copies of the three books I have published, multiple formats, for free. Absolutely free. No barriers whatsoever. How the hell this is online or remotely legal?
I buy books digitally when they are available DRM free - so, mostly through libro.fm

When books are either:
  • only available as a physical copy
  • only available with DRM
  • only available on Amazon

I cannot access them

For scenarios like this, where it is not possible to buy a digital copy without DRM... either one downloads from a shadow library like this, or simply never reads the work in question
 
Upvote
7 (13 / -6)
Yes, they got a lot of files, but really, most people are not going to need or want to torrent a 300TB file. It's ridiculous. More interestingly and more important is the metadata. This article doesn't even mention the very interesting graphs and data they did from the analysis of the metadata. Like, do you even know what the most popular song of all time is? What are the genres with the most artists, etc. It's very interesting! It looks like currently you have to download the torrent to get to the metadata for now, but I assume they are working on making a search engine like they have for books.
  • This is by far the largest music metadata database that is publicly available. ..... Our data is well-annotated: MusicBrainz has 5 million unique ISRCs, while our database has 186 million.
 
Upvote
3 (9 / -6)

ngoncalves

Wise, Aged Ars Veteran
165
Subscriptor
Copyright is an outdated concept, like trying to hold back the tide forever. The cost of distribution is literally zero, the only reason people pay more is that laws (sometimes) enforce it.

If artists want to be paid fairly then they need to be paid before they release their art, not after. Like in centuries past.

Edit: Made a mistake there, distribution doesn't cost nothing, it costs negative dollars, in the sense that enforcing restrictions on distribution is costing more money than just doing nothing.
In the old days, only the rich could afford art. Commoners might have given a few cents to street performers, otherwise went to church for music and paintings. Furthermore, without steady financial support over the decades, art was mostly an amateur hobby and could not really transcend.

Anyway, piracy at this scale is obviously morally wrong. And the distribution cost is not zero: the consumer needs to pay for the infrastructure and the energy for reproducing the music/movie/whatever. And at this level of abundance, spending time sorting through a gazilion titles is also a cost.
 
Upvote
16 (19 / -3)

GaidinBDJ

Ars Scholae Palatinae
1,369
Subscriptor
Musicians cannot prevent the radio from playing their music. Which is honestly how it should be. The point of art is public enrichment. Sure, artists should receive compensation, but creating art to lock it away is fucking silly. If you've made it up share with the world, share it.

The problem is that art cannot sustain the livelihood of everyone who wishes to be an artist. At least not how we've gone about it for decades.

Uh, what?

Radio play is absolutely licensed by the artists. Where they hell are you getting this idea that radio stations aren't licensing the music they play?

Art, like all labor, belongs to the creator. They get to define the terms of consumption, not you. If I want to create the most beautiful painting ever created and then burn it, I can. It's mine. I'm not required to show it to you, and you're not entitled to see it. The same applies if I wanted to give a copy to everybody in the world except you. My labor, my terms.

Anything else, especially redefining the terms of your consumption unilaterally, is exploitation.
 
Last edited:
Upvote
17 (25 / -8)

The_Lamb

Wise, Aged Ars Veteran
199
One interesting thing from the AA blog:
sel_06_albums_timeline.png


Based on the accompanying text, it suggests that most music currently being added to Spotify is AI generated. I found their blog post to be a bit unclear on their statistics, but if it's true, the implications are interesting. Does it mean we're heading into a future where basically all music, art, and literature are just AI slop? Will it be impossible for future generations to find good art and music in a vast, eternal forest of neural network noise?
 
Upvote
23 (24 / -1)
[..] if I like something enough to keep listening to it I go to Qobuz and buy the album in lossless FLAC format. [..]
I'd do it more too, if the music industry respected online user libraries.
Removal after purchase should never happen (except for extreme cases, but with refunding). Delisted titles should always remain for previous owners.
Yes I can download the files, but I want my purchases to not silently disappear too. Otherwise what's the point.
 
Last edited:
Upvote
16 (17 / -1)
One interesting thing from the AA blog:
sel_06_albums_timeline.png


Based on the accompanying text, it suggests that most music currently being added to Spotify is AI generated. I found their blog post to be a bit unclear on their statistics, but if it's true, the implications are interesting. Does it mean we're heading into a future where basically all music, art, and literature are just AI slop? Will it be impossible for future generations to find good art and music in a vast, eternal forest of neural network noise?
I don't agree with the scraping (for tech-bros), but at least I wish they did it before 2022..
Btw yes, without some kind of regulation or limitation (if it's even possible at this point), any future real creation will be drown in an ocean of AI slop. There won't even be any point to create something new, as it will get cloned immediately.
Many people warned about these problems beforehand, but as usual, nobody listened.
 
Last edited:
Upvote
11 (12 / -1)