616 pointsby hgghMay 9, 2026

26 Comments

damnitbuildsMay 8, 2026
"Its efforts will initially focus on [...] and collecting the generative AI wave that is currently upon us all."

Why would they want to collect the AI wave ?!

But about time the Internet Archive had a US-independent backup.

kinowMay 8, 2026
> But about time the Internet Archive had a US-independent backup.

Agreed!

> The Internet Archive Switzerland, online at https://internetarchive.ch/, is a newly-formed Swiss non-profit foundation that will operate independently within its national context.

I think the Wikipedia Editors will have to decide whether they will add it to the existing page. The Operations section is still listing only U.S. data centers: https://en.wikipedia.org/wiki/Internet_Archive#Operations

rbanffyMay 9, 2026
I wonder how long does it take to back it up.
userbinatorMay 10, 2026
There are some real gems in the sea of slop; and as archivists and historians, they shouldn't moderate.
VasbarlogMay 9, 2026
Hugged to death? I can’t access the page.
alessandrobernaMay 9, 2026
Seems likely, same for me.
HendriktoMay 9, 2026
Same for me. I cannot access it either.
sixie6eMay 9, 2026
I am able to.
embedding-shapeMay 9, 2026
Have you tried just letting it load? Took maybe more than 30 seconds for the page to load for me, but it did load eventually.
KomoDMay 9, 2026
Yep, just loading forever.
AndroTuxMay 9, 2026
They just want everyone coming from archive.org to feel right at home
pedroneto3May 9, 2026
I am able too
insomMay 9, 2026
That website is really struggling. Very tempting to go to a mirror on archive.org to view it :)

This seems very distinct from Internet Archive in the US, I wonder how separate it is.

Internet Archive Canada (I worked there in 2024) operated like it was a subsidiary, even though I think it was technically an independent organization with some shared directors. Same Slack, same archive.org email domain, etc.

IA.ch has Brewster and Caslon on the board.

I suspect that for the political threats of the current decade the different Internet Archive organisations need to start operating more independently, especially when it comes to funding?

crossroadsguyMay 9, 2026
They use Slack? I am kind of surprised. But I am sure on the plus side, that would also mean having to worry about one less uptime.
insomMay 9, 2026
Slack, Zoom and Google Apps (but not for email) - otherwise basically everything was internally ran.

The Slack has (had?) hundreds of guest accounts due to volunteers and allied organizations. It’s an interesting (and cool) institution!

IntralexicalMay 9, 2026
Can you share more about your time at the Canadian one? I feel like there was a big hullabaloo about it years ago, but it's not really clear what they do.
insomMay 9, 2026
Not sure what hullabaloo -- they do provide a bunch of services to Canadian institutions (including Libraries and Archives Canada) and they perform physical services like book scanning and in the last few years I believe they are the parent organization for the physical Canadian datacentre _somewhere in BC_.

For my work, I worked in their Archiving & Data Services department, on https://archive-it.org/ -- I didn't know this before I joined, but Internet Archive offers various for-pay services to other cultural institutions, mostly around archiving their stuff or white-labelling playback of archives.

For example https://webarchiveweb.bac-lac.canada.ca/ (the Government of Canada's own Internet Archive) is actually outsourced to ADS within Internet Archive.

On one hand this is neat, as IA have expertise around this, but on the other hand (as a Canadian) I don't like that it's not actually sovereign and that it looks like it's run by our government but that it's not. Tradeoffs, I guess.

springtimesunMay 9, 2026
Ah, good, they are also mirroring the page load speed of the internet archive
trvzMay 9, 2026
Typical for something made in St. Gallen. A sensible web developer from Zurich interested in the topic would have created this website in just a single HTML and an optional CSS file.
4ggr0May 9, 2026
a dev from ZH would've added a blockchain, mobile app and hosted it on an over-allocated kubernetes cluster. 97% uptime and you need a macbook pro so the website doesn't stutter.
shermantanktopMay 9, 2026
A south-of-the-Limmat Migros shopper would use React and Vercel, but still use raw JS Date.
dangMay 10, 2026
Normally we'd reply with "please don't do regional flamewar on HN" but this sounds so good-humored to me that I've canceled the (no doubt well-intentioned) downvotes instead.

Edit: now someone is going to tell me how mean internecine Swiss conflict actually is...

slaterMay 10, 2026
There's an entire ditch between the French-speaking and German-speaking parts of Switzerland, filled with Röschti to keep the two apart. True story!
springtimesunMay 10, 2026
As someone who lives in Switzerland, but is not Swiss, I love this kind of thing. It’s an insight into an internal cultural understanding I didn’t get growing up and doesn’t really come up in the conversations I have day to day.
input_shMay 9, 2026
Relevant blog post: https://blog.archive.org/2026/05/06/internet-archive-switzer...

> Internet Archive Switzerland joins a growing group of mission-aligned organizations, alongside Internet Archive, Internet Archive Canada, and Internet Archive Europe. Together, these independent libraries strengthen a shared vision: building a distributed, resilient digital library for the world.

card_zeroMay 9, 2026
I was interested in the others, but https://www.internetarchive.eu is a horrible corporate-looking site with a hero image, a boast about AI, a carousel of news that won't scroll with doing its slow scroll animation, a huge "meet the team" section with mugshots and boring profiles, social media links, a newsletter signup form, and nothing to say where the actual archive is.
carlosjobimMay 9, 2026
Reading what little information they have there, they aren't a public facing or public serving organization. They seem to provide their services to institutions only:

"working with dozens of European libraries and government agencies to build web collections, Internet Archive Europe prioritized collaboration with cultural heritage organizations to safeguard our collective history."

badlibrarianMay 9, 2026
Internet Archive runs a completely separate version of their site for paying institutional clients. https://archive-it.org/

In a best case scenario, this eventually becomes the replacement for the (lets be honest) absurdly awful archive.org front and backend.

So: an expansion into the EU market. And yes, a honeypot for grant funds, because why not? Good for them.

ferongrMay 9, 2026
Looks like an "organization" tailor made to be awarded EU funds for their "mission".
CPLXMay 9, 2026
Mysteries abound.
vagesMay 9, 2026
The .eu branch that card zero criticized seems to be based in Amsterdam, the capital of the Netherlands (an EU member). Or am I missing something?
wongarsuMay 9, 2026
I think people are questioning the "Archive" part, not the "Europe" part of the name
justusthaneMay 9, 2026
I was excited to see there's a Canadian one, but it's just a Wordpress blog?
chorizoMay 9, 2026
They do exist and involved in archiving. Someone reached out to our amateur radio club and offered to archive any documents we might have. They even asked to archive the video recording of one of our monthly meetings.
ConceptJunkieMay 9, 2026
Somewhere there's a "create a random, soulless, corporate website generator", and these folks used it.
rbanffyMay 9, 2026
Also https://news.ycombinator.com/item?id=48068333, but got little traction.
dangMay 9, 2026
Thanks! Since the submitted URL https://internetarchive.ch/ seems to be down, I've put your link at the top and moved the other to the toptext.
red_admiralMay 9, 2026
Sankt Gallen's more physical archive is worth a visit too: https://www.stiftsbezirk.ch/de/stiftsbibliothek/
woodsonMay 9, 2026
Indeed. And the one in Admont, Austria: https://stiftadmont.at/en/about-the-abbey-library/
DeadEye2111May 9, 2026
Very proud of my alma mater town to be a place for this. It’s much needed infrastructure for Europe.
zkmonMay 9, 2026
Anything that is being built today, based on the assumptions about the future that extend into multiple years, is bound to fade away. Because the "future no longer what it used be". What's the envisaged future context and purpose where this would save the world?
consumer451May 9, 2026
Stop complaining about availability. Instead, create a solution.

If tpb dot org can still exist ...

At least these people tried. We need a p2p archive solution ASAP. Before our history is entirely re-written.

arjieMay 9, 2026
I don’t think the problem lends itself well to decentralization. People have tried to use IPFS et al for this. There were even IA attempts https://github.com/internetarchive/dweb-gateway

No one has cracked this one yet.

tylerchildsMay 9, 2026
It has been cracked.

The internet itself is the thing we want.

We’re just constantly in denial that the internet actually does the thing we want it to do.

The internet archive is an excellent demonstration of how to do it.

It’s primarily getting a ragtag group to pool resources and manage them and then gossip with other groups that are doing the same thing.

I’ve spent so much time around the archive that I plainly see a divide between internet people online that can’t connect the dots and internet people in real life that are confused as to why the dots aren’t connecting.

The easiest way to see the dots is to:

1. Stop trying to make money

2. Tally the things that cost money

3. Amortize the upkeep over time

E.g. where do we source resources from, where do we store resources and how do we secure them.

Like HTTP, but for physical materials, not digital.

zbentleyMay 10, 2026
That's not what is meant by "decentralization".

None of those things help with the problem of centralization. Centralization isn't limited to moneymaking enterprises, or the modern internet. A centralized server operated by donations for free can just as easily go down, be seized by law enforcement, have its domain or internet service taken offline by government action, and so on.

The internet is not the thing we want (or not sufficient alone), because the internet's resources, and the communication systems between them, are largely centralized.

tylerchildsMay 10, 2026
Yeah, I hear you.

Yeah, them as a single instance is centralized, but if you actually go (show up at 300 Funston on a Friday at 1pm) you can hear about the research into how to replicate and become the resiliency in the network to make it decentralized.

A lot of it is ancient Unix philosophy like “this massive text file is a seekable index” and “rsync does basically most of the heavy lifting” and you’ll quickly realize decentralization is a social problem and not a technical one.

They’re shifting more and better data than the centralized services we’re complaining about— we need better education, not innovation at this current juncture.

The technology exists, the will of the people is lacking in spirit.

tylerchildsMay 10, 2026
Also tacking on that ssh is a social network.

That’s the crucial social layer that powers all of the everything else on the decentralized internet.

Take git as a social platform.

SSH is the social protocol.

GitHub centralized most of the git+ssh net, but that was a choice and we use all these other git+ssh services to not give them a monopoly.

IntralexicalMay 9, 2026
They've been constantly trying to set up P2P solutions. Torrents, DWEB, IPFS, Filecoin, WebTorrent, YJS, whole bunch of tech acronyms. I'm not sure much of it has really caught on?

https://blog.archive.org/tag/decentralized-web/

https://github.com/internetarchive/dweb-transports

Third-party attempt:

https://wiki.archiveteam.org/index.php/INTERNETARCHIVE.BAK

Turns out it's hard! Or maybe just too niche. But you can also help them today, by seeding some of collections that are available as torrents.

arian_May 9, 2026
Finally a Swiss account I can afford to open.
idovmamaneMay 9, 2026
St Gallen has been archiving knowledge for over a thousand years. Now they are archiving AI models before they get retrained out of existence. The location is not a coincidence…
ok123456May 9, 2026
Where's the search bar at the top to search the archive?
ukanhaupaMay 9, 2026
cool!
imtomtMay 9, 2026
Huh. I can’t find the actual... archive. It mentions an AI archive less than 10 sentences in, and has a couple of links, but seems void of any actually archived content.
colinmegillMay 9, 2026
If you are running that thing, and reading this post: just do the right thing and get your own name.
moontearMay 9, 2026
colinmegillMay 10, 2026
I guess I stand corrected, but I maintain it was word salad :)
teewMay 9, 2026
The About Us section states:

> We are a team of change-makers who believe that every helping hand can raise a child and create a better future for them.

Which I found weird. And searching for this phrase yields many site-hits verbatim, which is even weirder. Anyone know what is up with that? Is it some kind of filler text?

Edit: I guess it's from a template, the Contact section is also mumbo-jumbo (address: 123 Fifth Avenue, NY and so on).

malickaMay 9, 2026
That doesn’t exactly instill confidence, honestly…
miki123211May 9, 2026
IA needs to do what Usenet has done. Have a bunch of mission-aligned but unrelated orgs (under different ownership and distributed around the world) that peer with each other, distribute all the content obtained by any of the orgs to each other, but that have no technical channel nor capability to distribute DMCA complaints and takedown requests.

This is (AFAIK) basically how Usenet piracy works. You send your warez to one provider, and that provider instantly replicates them to all the providers they peer with, recursively, until they eventually reach the entire network. When any of those providers get a DMCA complaint, they remove the offending files (as they're required to do by law), but they don't inform other providers that they've received a DMCA notice, so those providers keep serving those files. This makes it much harder to remove data from the network than it is to add it.

y3ahd0gMay 9, 2026
So they should use bit torrent.

IMO personal security would only be improved if we diversified away from "the open web".

"Flood the field" with protocols and pre-shared key networks where we have to generate keys together in meat space, make it too expensive to operate the panopticon.

Everyone putting their eggs in the open web basket, gathering in that public commons means all it takes is one bomb on us all, so to speak.

LocalHMay 9, 2026
BitTorrent allows untrusted users (read: industry plants) to connect and slurp down direct IP addresses to swarm participants. It's an unanswered legal question whether low-level uploading (such as the percentages one would get as a "leech", connecting to the torrent and then disconnecting immediately after completion) might fall under "fair use" or "fair dealing" statutes in various jurisdictions.

US-centric here: I feel that uploading a small percentage of a file as a condition of downloading the whole thing may very well fall under fair use - most BT traffic is noncommercial, the portion of the covered work uploaded by "leeches" is very small and probably would be covered by the "30-second" rule often quoted in fair use discussions. The only really arguable point is the "effect on the work's value", but then again an average leech is not uploading enough of the work to have that much of a material effect on the work's value.

fsfloverMay 9, 2026
Torrents in I2P allow fully anonymous data exchange.
y3ahd0gMay 9, 2026
Ok private 1:1 wireguard and syncthing or rsync all the way down then

Softlink data to the appropriate mount

The options are endless and tech nerds can 1:1 help friends and family

Locking the knowledge into corporate silos is a huge security risk. The masses should be just as competent and informed so they don't panic

Minority say over the economy and government is just fascism. These people are not deities. They're normal meat and bone

We have processes to replace politicians and workers; we need processes to replace the rich.

Free speech is a circular right and there is no freedom from consequences of speech. They can face consequences too

mafuyMay 10, 2026
In Germany at least, uploading even a single byte of content is illegal. We don't really have Fair Use here; there are only few, very narrow exceptions.

It is also not even required to show that that single byte was uploaded, your IP getting logged as part of the swarm suffices. The burden of proof is on you now. It was much, much worse than in the US.

While all this is technically still true today, a new law a few years ago luckily mostly blocked the path. It was badly needed, because the situation was horribly abused by law firms.

simondotauMay 10, 2026
> even a single byte of content is illegal

  10010110
Watch out die Deutschen, that’s the first byte of Super Mario Bros.
abc123abc123May 10, 2026
Woop, woop, it's the sound of da police!

I heard a rumour that this byte also exists in the Legend of Zelda! No go get em Mr Policeman!

defrostMay 10, 2026
In Australia it was determined that an ISP bears no responsibly to respond to allegations of copyright infringement by ISP users.

https://en.wikipedia.org/wiki/Roadshow_Films_Pty_Ltd_v_iiNet...

Of course Telco's can choose to be involved, perhaps accept payment to lookup and snitch, etc. but for the most part a number of ISPs in Au just wash their hands of devoting resources to play connect the dots for others.

cortesoftMay 10, 2026
You comment shares bytes with copyrighted content, does that mean you broke the law?
lxgrMay 10, 2026
Context matters.

“Here’s byte 0x67, which is at offset 0x729B1A38 of Copyrighted_Blockbuster.4k.mkv, as requested” is different from “here’s byte 0x67, and it’s the first byte of my text response to your comment”.

numpad0May 10, 2026
Same in Japan. There's allegedly someone making big bucks going after bittorrent users, straining ISP abuse teams and judicial systems. Interesting that Germany has laws against that.
pocksuppetMay 10, 2026
> It is also not even required to show that that single byte was uploaded, your IP getting logged as part of the swarm suffices

What if someone would release software that would connect to random swarms and not upload or download anything? Would they still be criminally liable? You could disguise the purpose by saying it's measuring swarm diversity.

cbdevidalMay 9, 2026
I like it in theory but the IA hosts over 175PB of data. Wonder how many other producers could replicate that data.
aryonocoMay 10, 2026
I don’t have hard data to back this up, but I estimate that plenty of main Usenet binary providers easily exceed that.
AnthonyMouseMay 10, 2026
Suppose you don't have ten hosts that each have 175PB of data but rather a million hosts that each have an average of 1.75TB, and therefore the equivalent of 10 full copies. And then something that periodically checks if there is any given subset of the data with too few copies and makes more.
pocksuppetMay 10, 2026
There are only 3-4 providers because the system is spammed with hundreds of terabytes of new data per day by actors seeking to destroy it. They can't moderate the spam because the pirated data is all encrypted so indistinguishable from random data, and because moderation would destroy their pretense of not knowing what content is being posted.
anthkMay 10, 2026
Spam is dead since Google Groups dissappeared and most people just use non-binary newsgroups for high tech/culture talks.
pocksuppetMay 10, 2026
The binary Usenet is the one that Internet Archive would be like. It receives hundreds of terabytes of new data every day. Most of it is just random bits designed to waste space on the providers.
topranksMay 10, 2026
Usenet is a distributed policy from the ground up.

It’s centralised in the way you describe now that it’s only used for large files / piracy, but it used to me much more diverse.

latenightcodingMay 9, 2026
>> Gen AI ARchive

isn't this a nightmare for privacy

addedGoneMay 9, 2026
Let's hope they don't use Google captcha and KYC everyone.
AnimatsMay 9, 2026
Oh, good. We need more backups.

The one in Egypt doesn't get updated.

kennykartmanMay 9, 2026
I'm so happy about this. Really. I cannot overstate how much important the internet archive is for all of us.
anant-singhalMay 10, 2026
The uncomfortable part is that “preserving knowledge” sounds universally good until copyright law present themselves.
jrochkind1May 10, 2026
> collecting the generative AI wave that is currently upon us all.

I don't understand what this means?

bl4ckneonMay 10, 2026
I had the same thought. Does that mean they are archiving a bunch of Ai stuff? Doesn't sound right to me
dopidopHN2May 10, 2026
I have a related question on the domain archive.org

I've noticed that this domain now host content subject to copyright.

As a example : entire season of startrek "voyager" are randomly hosted there in direct download.

Why? Is that not a liability?

hgghMay 10, 2026
Internet Archive is a library. Libraries host copyrighted content. Libraries are good.