📜 Website Archive
Find a file
JMARyA 6d5c32ae91
Some checks failed
ci/woodpecker/push/test Pipeline failed
ci/woodpecker/push/container/1 Pipeline failed
ci/woodpecker/push/container-manifest unknown status
ci/woodpecker/push/container/2 Pipeline failed
ci/woodpecker/push/docs-build Pipeline was successful
feat: add indexer architecture and endpoints
- Introduce URI graph data model and layered indexer pipeline
- Create indexers for blob metadata, header metadata, graph, text, and
  documents
- Implement new API endpoints for graph queries, full-text search, and
  header metadata
- Add migrations for new tables
- Refactor code to support streaming HTTP serving and chunk-based blob
  processing
2026-03-07 20:31:44 +01:00
.woodpecker fix: ci 2025-12-31 15:20:03 +01:00
docs feat: add indexer architecture and endpoints 2026-03-07 20:31:44 +01:00
migrations feat: add indexer architecture and endpoints 2026-03-07 20:31:44 +01:00
src feat: add indexer architecture and endpoints 2026-03-07 20:31:44 +01:00
.dockerignore refactor: feature cleanup + sqlite 2025-11-15 20:48:51 +01:00
.gitignore feat: rework arch (postgres + s3) 2026-01-13 08:50:14 +01:00
build.sh wip: working async fs + fixes 2025-11-20 21:39:03 +01:00
Cargo.lock fix: broken pipe file 2026-01-14 12:30:52 +01:00
Cargo.toml fix: broken pipe file 2026-01-14 12:30:52 +01:00
cog.toml ci: releases 2025-11-25 21:21:14 +01:00
config.yml feat: implement outdated refetch 2025-11-17 08:01:54 +01:00
devenv.lock feat: rework arch (postgres + s3) 2026-01-13 08:50:14 +01:00
devenv.nix feat: rework arch (postgres + s3) 2026-01-13 08:50:14 +01:00
docker-compose.yml feat: rework arch (postgres + s3) 2026-01-13 08:50:14 +01:00
flake.lock fix: docs ci 2026-01-25 17:34:16 +01:00
flake.nix fix: docs ci 2026-01-25 17:34:16 +01:00
README.md docs: update readme 2025-12-31 14:48:24 +01:00
renovate.json Add renovate.json 2025-06-21 21:49:31 +00:00
zensical.toml add pacman docs 2026-01-07 17:07:42 +01:00

WebArc

webarc is a local-first web archiving system designed to capture, preserve, and replay HTTP content in an extensible way.

Unlike traditional crawlers that fetch pages in isolation, WebArc focuses on recording real HTTP traffic, storing it in an archive, and making that archive usable through multiple interfaces.

See the documentation.