Tell HN: The Wayback Machine is up, in read-only mode

89 points by jfil a year ago

The https://web.archive.org/ service is up again in "(provisional, read-only) service". The rest of the Archive.org services are still down.

yasser_kaddoura a year ago

I used to solely depend on Wayback machine to automate archiving pages. Now, I am archiving webpages using selenium python package on https://archive.ph/ and https://ghostarchive.org/.

This told me not to depend on 3rd party services. Might self-host https://archivebox.io/.

keyle a year ago

I was just fantasising earlier, daydreaming, about what a distributed warc or similar solution would look like, with peering and user or distributed server archiving. Either by browser plugin submission or passively sending the urls to servers to do the fetching and archiving (removes some of the privacy issues).
I think it's everyone's responsibility to make sure the web gets cached, not one org... and since Google has canned the Google cache.......
- nikisweeting a year ago
  
  ArchiveBox v0.8 is adding the beginnings of a content addressable store for P2P sharing! Stay tuned :)
- aspenmayer a year ago
  
  Here's some prior art:
  https://wiki.archiveteam.org/index.php/Main_Page
Cthulhu_ a year ago

There should be more internet archives, for various reasons, but it doesn't seem like anyone is willing to put in the effort and money involved, let alone the legal headaches.
- jfil a year ago
  
  I agree. And I am dismayed that government and academic institutions like to dance around the legal issues of archiving (outsourcing the legal risk to Internet Archive), instead of pushing for legal protections/exemptions for the act of archiving.
  
  razakel a year ago
  
  The UK and Portugal are both doing it for domestically published websites.
yamrzou a year ago

Would you mind sharing your script?
- yasser_kaddoura a year ago
  
  https://www.reddit.com/r/qutebrowser/comments/1g1zbel/usersc...
  
  yamrzou a year ago
  
  Thank you. It says: "Sorry, this post was removed by Reddit’s filters."
  
  yasser_kaddoura a year ago
  
  That's very odd. I can see it on my end while logged in. I tested it without being logged in, and I am seeing it being removed. Gotta be a bug.
  You can find it here: https://gist.github.com/YasserKa/9a02bc50e75e7239f6f0c8f04fe...

gnabgib a year ago

Two days ago (37 points) https://news.ycombinator.com/item?id=41836677

Alifatisk a year ago

Why did the points matter?
- r721 a year ago
  
  Just means that post wasn't unnoticed and 37 users found it interesting enough to upvote.
- keyle a year ago
  
  Maybe to highlight the fact that it already made the homepage.
  
  Alifatisk a year ago
  
  Oh okey, thanks!
- gingersnap a year ago
  
  https://www.youtube.com/watch?v=9KAGwNtI26w
- ramon156 a year ago
  
  In this show, the points don't matter
- quickuser71 a year ago
  
  They don't, they are informative.

M95D a year ago

Is there any searchable mirror (and tracker) of Internet Archive's torrents?

jakeogh a year ago

The Wayback Machine is critical infrastructure. It needs independent copies.

jfil a year ago

There's no grownup in the room who will fix this. "You're It". I recommend that you take this opportunity to download 1 favourite old website/article/piece of software from the Archive and rehost it on your own site. Reach out to me if you'd like help with getting started.