Mirror of https://github.com/Memo-2023/mana-monorepo.git, synced 2026-05-15 17:59:40 +02:00.
NestJS-based web crawler service for structured content extraction.

Features:
- Depth-controlled crawling with URL pattern filtering
- robots.txt compliance
- HTML/PDF/Markdown content extraction
- BullMQ job queue for async processing
- Redis caching layer
- Prometheus metrics

| Name |
|---|
| dto |
| crawler.controller.ts |
| crawler.module.ts |
| crawler.service.ts |
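
The description mentions depth-controlled crawling with URL pattern filtering. The sketch below illustrates one way such a check could look. It is not taken from `crawler.service.ts`: the `CrawlFilterOptions` interface, its field names, and the `shouldCrawl` function are hypothetical, and the actual DTOs in `dto` may be shaped differently.

```typescript
// Hypothetical options shape (an assumption, not the repository's DTO).
interface CrawlFilterOptions {
  maxDepth: number;          // deepest link level to follow; the start URL is depth 0
  includePatterns: RegExp[]; // if non-empty, a URL must match at least one
  excludePatterns: RegExp[]; // a URL matching any of these is skipped
}

// Decide whether a discovered URL should be queued for crawling.
// Patterns should not use the `g` flag, because RegExp.test is stateful with it.
function shouldCrawl(url: string, depth: number, opts: CrawlFilterOptions): boolean {
  // Stop once the depth budget is exhausted.
  if (depth > opts.maxDepth) return false;

  // Reject anything that is not a well-formed absolute URL.
  let parsed: URL;
  try {
    parsed = new URL(url);
  } catch {
    return false;
  }

  // Only follow web links (skips mailto:, javascript:, and similar schemes).
  if (parsed.protocol !== "http:" && parsed.protocol !== "https:") return false;

  // Exclusions win over inclusions.
  if (opts.excludePatterns.some((p) => p.test(parsed.href))) return false;

  // An empty include list means "allow everything not excluded".
  return (
    opts.includePatterns.length === 0 ||
    opts.includePatterns.some((p) => p.test(parsed.href))
  );
}

// Example usage with placeholder patterns.
const exampleOptions: CrawlFilterOptions = {
  maxDepth: 2,
  includePatterns: [/^https:\/\/example\.com\/docs\//],
  excludePatterns: [/\.zip$/],
};

console.log(shouldCrawl("https://example.com/docs/intro", 1, exampleOptions));
console.log(shouldCrawl("https://example.com/docs/archive.zip", 1, exampleOptions));
console.log(shouldCrawl("https://example.com/docs/deep/page", 3, exampleOptions));
```

Keeping this check as a pure function makes it easy to unit-test separately from the queue and cache layers. A robots.txt check would presumably run as an additional step before a URL is enqueued.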