Notes
Gemma OCR
A Streamlit application that extracts text from images using Google's Gemma 3-4B model via Clarifai.
Features
- Upload images in various formats (JPG, JPEG, PNG, BMP)
- Extract text from images using the Gemma 3-4B model
- Download extracted text as a file
- Responsive UI with image preview
Requirements
- Python 3.10+
- Streamlit
- Pillow
- Clarifai SDK
Setup
Clone this repository:
git clone https://github.com/Sumanth077/gemma_ocr cd gemma_ocr
Install the dependencies:
pip install streamlit pillow clarifai
Set your Clarifai Personal Access Token (PAT) as an environment variable:
export CLARIFAI_PAT="your_clarifai_pat"
Run the application:
streamlit run ocr_app/app.py
- Module IDgemma_ocr
- Latest Version ID0_0_1
- DescriptionAn OCR App using Google's Gemma-3 Model.
- Last UpdatedMar 20, 2025
- Repository
- Commit
- Share
- Badge
coming soon