Notes
Gemma OCR
A Streamlit application that extracts text from images using Google's Gemma 3-4B model via Clarifai.
Features
- Upload images in various formats (JPG, JPEG, PNG, BMP)
- Extract text from images using Gemma 3-4B model
- Download extracted text as a file
- Responsive UI with image display options
Requirements
- Python 3.7+
- Streamlit
- Pillow
- Clarifai
Setup
- Clone this repository
- Install dependencies:
pip install streamlit pillow clarifai
- Set your Clarifai PAT (Personal Access Token) as an environment variable:
export CLARIFAI_PAT="your_clarifai_pat"
- Run the application:
streamlit run ocr_app/app.py
Usage
- Upload an image containing text
- Click "Extract Text"
- View the extracted text
- Download the text if needed
- Module IDgemma_ocr
- Latest Version ID0_0_1
- DescriptionGemma OCR
- Last UpdatedMar 17, 2025
- Repository
- Commit
- Share
Copy URL Twitter Facebook Reddit LinkedIn Email
- Badge
coming soon