text-moderation-misinformation-dbrx workflow by clarifai

clarifai
text-moderation
text-moderation-misinformation-dbrx

Workflow uses DBRX Model with specified prompt template for Misinformation moderation that identify and filter out misinformation or unsubstantiated claims,

Notes

Misinformation Moderation

Workflow uses DBRX Model with specified prompt template for Misinformation moderation that identify and filter out misinformation or unsubstantiated claims, especially related to health, science, or news events and respond with 'Potential Misinformation' if the content seems questionable or 'Likely Reliable' if the information appears to be credible.

Text Moderation

Large Language Models (LLMs), like GPT (Generative Pretrained Transformer) variants, since they are pre-trained on diverse internet text, enabling them to understand context, nuance, and the subtleties of human language, to identify and filter out inappropriate or harmful content from digital platforms.

How to use the Misinformation Moderation workflow?

Using Clarifai SDK

Export your PAT as an environment variable. Then, import and initialize the API Client.

Find your PAT in your security settings.

export CLARIFAI_PAT={your personal access token}

Prediction with the workflow

from clarifai.client.workflow import Workflow

workflow_url = 'https://clarifai.com/clarifai/text-moderation/workflows/text-moderation-misinformation-dbrx'

text = 'I love this movie and i would watch it again and again!'

prediction = Workflow(workflow_url).predict_by_bytes(text.encode(), input_type="text")

# Get workflow results
print(prediction.results[0].outputs[-1].data)

Using Workflow

To utilize the Text Moderation workflow, you can input text through the Blue Plus Try your own Input button and it will filter out misinformation or unsubstantiated claims, especially related to health, science, or news events.

Workflow ID
text-moderation-misinformation-dbrx
Description
Workflow uses DBRX Model with specified prompt template for Misinformation moderation that identify and filter out misinformation or unsubstantiated claims, especially related to health, science, or news events
Last Updated
Apr 09, 2024
Privacy
PUBLIC
Share