managarten/services/mana-crawler-go/package.json
Till JS 64f7f768eb feat(infra): add Go web crawler (mana-crawler-go)
Goroutine-based crawler replacing NestJS mana-crawler:
- goquery for HTML parsing (title, content, links, metadata)
- robots.txt checker with 24h cache
- Worker pool with configurable concurrency + rate limiting
- PostgreSQL for job/result storage
- Same API surface: POST/GET/DELETE /api/v1/crawl

11 MB binary, ~15 MB Docker image vs ~200 MB NestJS.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-27 22:10:45 +01:00

11 lines
286 B
JSON

{
"name": "mana-crawler-go",
"version": "1.0.0",
"private": true,
"description": "Go web crawler replacing NestJS mana-crawler",
"scripts": {
"build": "go build -ldflags=\"-s -w\" -o dist/mana-crawler ./cmd/server",
"dev": "go run ./cmd/server",
"test": "go test ./..."
}
}