NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Show HN: A search engine for deleted YouTube videos (1.5B+ indexed since 2005) (tube.archivarix.net)
archivarix 1 days ago [-]
Search engine for YouTube content that's no longer on YouTube: deleted, removed, region-blocked, DMCA'd. ~1.5B videos indexed from 2005 onwards by aggregating archive sources Internet Archive Wayback Machine (CDX + HEAD-spread discovery), Common Crawl. What you get for any video ID: metadata (title, description, channel, upload date, duration, view counts, tags), thumbnails, original captions when the archive captured them, and reconstructed URLs to play the archived video file when available. Channel discovery reconciles legacy username/handle eras to a single canonical identity (lots of channels renamed themselves a dozen times — that part was painful).
1 days ago [-]
n1xis10t 17 hours ago [-]
Seems pretty cool. So this is a recent project, and you haven’t been working on this since 2005 right?

Have you considered also indexing videos that haven’t been deleted?

n1xis10t 5 hours ago [-]
Update: So I mustered the courage to try the search engine, because it was looking not very much like a scam, and it becomes very apparent as soon as you use it that non-deleted videos are also indexed.
archivarix 51 minutes ago [-]
Yes, the database contains all the videos, both deleted and active ones. Or rather, not the videos themselves, but the metadata and links to the video files in the web archive. I don't have servers large enough to store the videos themselves.
1 days ago [-]
archivarix 1 days ago [-]
[flagged]
privatedev 1 days ago [-]
[flagged]
hizihic 23 hours ago [-]
[flagged]
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 19:43:13 GMT+0000 (Coordinated Universal Time) with Vercel.