mirror of
https://github.com/Memo-2023/mana-monorepo.git
synced 2026-05-17 03:19:39 +02:00
NestJS-based web crawler service for structured content extraction. Features: - Depth-controlled crawling with URL pattern filtering - robots.txt compliance - HTML/PDF/Markdown content extraction - BullMQ job queue for async processing - Redis caching layer - Prometheus metrics
20 lines
141 B
Text
20 lines
141 B
Text
# Dependencies
|
|
node_modules
|
|
|
|
# Build
|
|
dist
|
|
|
|
# Environment
|
|
.env
|
|
.env.local
|
|
|
|
# IDE
|
|
.idea
|
|
.vscode
|
|
|
|
# Debug
|
|
*.log
|
|
npm-debug.log*
|
|
|
|
# Test
|
|
coverage
|