
ContextBox

Personal knowledge assistant with OCR and semantic search.

Tesseract, pgvector, LLM, FastAPI

Knowledge Pipeline

Processing Pipeline

Automated: Screenshot ("Hello World") → OCR Text → Embedding → Stored in pgvector
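Between the OCR step and the embedding step, the extracted text has to be cut into chunks. The sketch below shows one common way to do that, using overlapping word windows. The function name `chunk_text` and the `max_words` and `overlap` defaults are illustrative assumptions, not values taken from ContextBox.

```python
def chunk_text(text: str, max_words: int = 50, overlap: int = 10) -> list[str]:
    """Split OCR text into overlapping word windows (hypothetical helper)."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be in [0, max_words)")

    words = text.split()
    step = max_words - overlap  # how far each window advances
    chunks: list[str] = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once a window has reached the end of the text.
        if start + max_words >= len(words):
            break
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("a b c d e f g", max_words=4, overlap=2):
        print(chunk)
```

The overlap keeps a sentence that straddles a chunk boundary retrievable from either side, at the cost of storing some words twice.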

Semantic Search

Search by concept, with <200ms latency.

Vector Space

[Figure: vector space of 100K+ chunks, showing a Query point alongside Doc A, Doc B, and Doc C]
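The vector-space view boils down to ranking stored vectors by their similarity to the query vector. Here is a minimal, dependency-free sketch using cosine similarity. The metric, the two-dimensional vectors, and the function names are assumptions for illustration; the page does not say which distance ContextBox uses.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank(query: list[float], docs: dict[str, list[float]], top_k: int = 3):
    """Return the top_k (name, score) pairs, most similar first."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in docs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    # Toy 2-D embeddings; real embeddings have hundreds of dimensions.
    docs = {"Doc A": [1.0, 0.0], "Doc B": [0.0, 1.0], "Doc C": [1.0, 1.0]}
    for name, score in rank([1.0, 0.1], docs):
        print(f"{name}: {score:.3f}")
```

In production the same ranking would be pushed into the database rather than computed in Python, which is what makes sub-second search over 100K+ chunks plausible.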

Processing Flow

Capture: Screenshot
Extract: Tesseract OCR
Embed: Transformers
Retrieve: pgvector
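For the final retrieve step, a nearest-neighbour lookup in pgvector can be written as an `ORDER BY` on a distance operator. The sketch below only builds the SQL and its parameters, so it runs without a database. The table name `chunks`, its columns, and the choice of the cosine-distance operator `<=>` are assumptions, and the `%s` placeholders follow the psycopg convention.

```python
def to_vector_literal(embedding: list[float]) -> str:
    """Format a Python list in pgvector's text form, e.g. '[0.1,1.0,-2.5]'."""
    return "[" + ",".join(repr(float(x)) for x in embedding) + "]"


def build_search_query(embedding: list[float], limit: int = 5):
    """Return (sql, params) for a cosine-distance search.

    Hypothetical schema: chunks(id, content, embedding vector).
    """
    sql = (
        "SELECT id, content "
        "FROM chunks "
        "ORDER BY embedding <=> %s::vector "  # <=> is pgvector's cosine distance
        "LIMIT %s"
    )
    return sql, (to_vector_literal(embedding), limit)


if __name__ == "__main__":
    sql, params = build_search_query([0.1, 1, -2.5], limit=3)
    print(sql)
    print(params)
```

The returned pair can be passed to a database cursor as `cursor.execute(sql, params)`, which leaves value quoting to the driver instead of string formatting.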
Screenshots: 1K+
Chunks Embedded: 100K+
Search Latency: <200ms
OCR Accuracy: 90%+
Your personal searchable memory