Skip to content

Blake Rayvid

Menu
  • About
  • Portfolio
Menu

OCR Image Search

Posted on April 21, 2026May 16, 2026 by Blake

This Flask app scans a folder of images and makes the text inside them fully searchable, using open-source Tesseract OCR.

This could be helpful for someone who has a massive folder of screenshots or scanned documents and wants to find what they need without uploading everything to a cloud service. This app runs 100% locally on your own machine.

Instead of scrolling through hundreds of thumbnails to find a specific image, you can just search for the text contained in the image. You can use broad substring searches, or toggle “Exact Match” to look for specific whole words.

The dashboard includes a term frequency analyzer. It automatically displays the top 50 most frequently occurring words in all the extracted text, ignoring common words like the, and, or, etc.

As you add new images to the folder, you won’t need to rebuild the whole database. Simply click “Sync New Images” to process any new files, and make them searchable too.

🔗 View on GitHub

Tags: Flask, HTML/CSS/JS, Python, Tesseract OCR
Categories: Interactive, Utilities

Post navigation

← Data Science CV

Categories

  • Data Science
  • Exploration
  • Finance
  • Health
  • Interactive
  • Optimization
  • Utilities

Tags

D3.js Desmos Docker Express.js FFmpeg Flask Gemini API Google Maps API HTML/CSS/JS MATLAB Matplotlib Netlify NetworkX Next.js NLP Node.js NumPy P5.js Pandas Pillow PostgreSQL Python QuantConnect Railway React.js Scikit-Learn SciPy TensorFlow Tesseract OCR WeasyPrint YFinance API

© 2026 Blake Rayvid. All rights reserved.