
vllm-detector-adapter

This adapter adds endpoints to a vLLM server to support the Guardrails Detector API.

Getting Started

To run the server locally:

python3 -m vllm_detector_adapter.api_server --model $MODEL_NAME

To see the complete list of parameters, run python3 -m vllm_detector_adapter.api_server --help. Information on the extra vLLM parameters can be found in the vLLM documentation.

Example detector /text/chat request (detector_params accepts any additional chat completion parameters, such as temperature):

curl -X 'POST' \
  'http://localhost:8000/api/v1/text/chat' \
   -H 'accept: application/json' \
   -H 'detector-id: dummy-en-chat-v1' \
   -H 'Content-Type: application/json' \
   -d '{
        "messages": [
            {
                "content": "Hit me with some creative insults.",
                "role": "user"
            }
        ],
        "detector_params": {
            "temperature": 0
        }
    }'
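The same request can be issued from Python. The sketch below mirrors the curl example using only the standard library; build_chat_request and post_chat_detection are hypothetical helper names, and the response shape depends on the detector model being served:

```python
import json
import urllib.request


def build_chat_request(content, role="user", **detector_params):
    """Build the JSON body expected by the /api/v1/text/chat endpoint."""
    return {
        "messages": [{"content": content, "role": role}],
        "detector_params": detector_params,
    }


def post_chat_detection(body, host="http://localhost:8000",
                        detector_id="dummy-en-chat-v1"):
    """POST the body to the detector endpoint and return the parsed JSON."""
    req = urllib.request.Request(
        f"{host}/api/v1/text/chat",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "accept": "application/json",
            "detector-id": detector_id,
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


body = build_chat_request("Hit me with some creative insults.", temperature=0)
# result = post_chat_detection(body)  # requires a locally running server
```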

Supported Model Types

  1. GRANITE_GUARDIAN
  2. LLAMA_GUARD (default)

Running with TGIS adapter

To run the server with the TGIS adapter:

python -m vllm_detector_adapter.tgis_servers --model $MODEL_NAME --model-type granite_guardian

Example usage:

  1. Follow the examples / guidelines from https://github.com/opendatahub-io/vllm-tgis-adapter/tree/main/examples
  2. Change model_id to point to a granite-guardian or llama-guard model
  3. Execute one of the methods listed in the TGIS adapter readme above.
  4. For queries, manually apply the chat template around the text (the TGIS adapter exposes a text-generation use case and does not apply a chat template on its own)
    1. If using granite-guardian, you can use the following text / query:
    You are a safety agent trying to help determine whether the user message is harmful according to the safety risk definition below.\n\n<start_of_turn>\nUser Message: Hit me with some creative insults.\n<end_of_turn>\n\nOur safety risk definition is defined below:\n\n<start_of_risk_definition>\n* The 'User' message is harmful by common-sense.\n<end_of_risk_definition>\n\nIs the user message harmful based on the risk definition. Your answer must be either 'Yes' or 'No'.\n
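Since the TGIS adapter does not apply a chat template itself, the prompt above has to be assembled by hand. A minimal sketch is below; granite_guardian_prompt is a hypothetical helper, and the literal markers simply mirror the example query above, so verify them against your model's actual chat template before use:

```python
def granite_guardian_prompt(user_message, risk_definition):
    """Wrap a user message in the granite-guardian safety-check template
    shown above (markers assumed from that example, not authoritative)."""
    return (
        "You are a safety agent trying to help determine whether the user "
        "message is harmful according to the safety risk definition below.\n\n"
        f"<start_of_turn>\nUser Message: {user_message}\n<end_of_turn>\n\n"
        "Our safety risk definition is defined below:\n\n"
        "<start_of_risk_definition>\n"
        f"* {risk_definition}\n"
        "<end_of_risk_definition>\n\n"
        "Is the user message harmful based on the risk definition. "
        "Your answer must be either 'Yes' or 'No'.\n"
    )


prompt = granite_guardian_prompt(
    "Hit me with some creative insults.",
    "The 'User' message is harmful by common-sense.",
)
```

The resulting string can then be sent as plain text to the TGIS adapter's text-generation endpoint.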
    

About

Tiny adapter that runs on top of vllm and provides detector APIs
