Scan-to-Stream: A Neural OCR and Compression Pipeline
An end-to-end pipeline that reads noisy scanned documents with a custom-built neural network and compresses the output using a custom Adaptive Huffman encoder; all from scratch, no external libraries.