sumanth
AI-Apps
gemma_ocr

Gemma OCR

Notes

Gemma OCR

A Streamlit application that extracts text from images using Google's Gemma 3-4B model via Clarifai.

Features

Upload images in various formats (JPG, JPEG, PNG, BMP)
Extract text from images using Gemma 3-4B model
Download extracted text as a file
Responsive UI with image display options

Requirements

Python 3.7+
Streamlit
Pillow
Clarifai

Setup

Clone this repository
Install dependencies:
```
pip install streamlit pillow clarifai
```
Set your Clarifai PAT (Personal Access Token) as an environment variable:
```
export CLARIFAI_PAT="your_clarifai_pat"
```
Run the application:
```
streamlit run ocr_app/app.py
```

Usage

Upload an image containing text
Click "Extract Text"
View the extracted text
Download the text if needed

Module ID
gemma_ocr
Latest Version ID
0_0_1
Description
Gemma OCR
Last Updated
Mar 17, 2025
Repository
github.com/Sumanth077/gemma_ocr
Commit
e1008aa
Share
Badge