KoreShield - LLM Security Platform

KoreShield is an open-source security platform designed to protect applications from prompt injection attacks and other LLM security vulnerabilities.

High-Level Overview

KoreShield is built around four core security components that process each request in sequence to protect LLM integrations:

Input Sanitizer: Cleans and validates incoming prompts to remove potentially malicious content
Attack Detector: Analyzes prompts and responses for signs of prompt injection and other attacks
Policy Engine: Enforces security rules and determines how to handle suspicious requests
Audit Logger: Records all security events and decisions for compliance and monitoring
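
To make the first two components more concrete, here is a minimal sketch in Python. It is an illustration only, not KoreShield's actual API: the function names `sanitize` and `detect`, the `Detection` type, and the two regular expressions are all assumptions made for this example. Real detection would need far more than a short pattern list.

```python
import re
import unicodedata
from dataclasses import dataclass, field

# Hypothetical patterns for illustration only; not KoreShield's actual rules.
SUSPICIOUS_PATTERNS = [
    re.compile(r"ignore (all )?(previous|prior) instructions", re.IGNORECASE),
    re.compile(r"reveal (your )?system prompt", re.IGNORECASE),
]


@dataclass
class Detection:
    """Result of a detection pass (hypothetical type)."""
    suspicious: bool
    matches: list[str] = field(default_factory=list)


def sanitize(prompt: str) -> str:
    """Input Sanitizer sketch: normalize Unicode and drop control characters."""
    normalized = unicodedata.normalize("NFKC", prompt)
    # Keep printable characters plus newlines and tabs; drop everything else.
    cleaned = "".join(
        ch for ch in normalized if ch.isprintable() or ch in "\n\t"
    )
    return cleaned.strip()


def detect(prompt: str) -> Detection:
    """Attack Detector sketch: flag prompts matching any known pattern."""
    matches = [p.pattern for p in SUSPICIOUS_PATTERNS if p.search(prompt)]
    return Detection(suspicious=bool(matches), matches=matches)
```

Calling `detect(sanitize(user_input))` returns a `Detection` whose `suspicious` flag a policy layer could act on. Sanitizing before detecting means hidden control characters cannot be used to split a phrase the detector is looking for.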

Security Pipeline

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Application   │───▶│ Input Sanitizer │───▶│ Attack Detector │───▶│  Policy Engine  │───▶│  Audit Logger   │
│                 │    │                 │    │                 │    │                 │    │                 │
│ User Request ──▶│    │ Clean Input ───▶│    │ Analysis ──────▶│    │ Decision ──────▶│    │ Log Event ─────▶│
└─────────────────┘    └─────────────────┘    └─────────────────┘    └─────────────────┘    └─────────────────┘
                                                        │                        │
                                                        ▼                        ▼
                                                 ┌─────────────────┐    ┌─────────────────┐
                                                 │   LLM Provider  │    │   Application   │
                                                 │   (OpenAI, etc) │    │   Response      │
                                                 └─────────────────┘    └─────────────────┘
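
The flow in the diagram can be sketched as a single function that passes a request through each stage. This is a hedged illustration, not KoreShield's implementation: `policy_decide`, `run_pipeline`, the `"allow"`/`"flag"`/`"block"` decision labels, and the JSON log format are hypothetical. The diagram does not say exactly when the LLM provider is called, so the sketch assumes the provider is contacted only when the policy does not block the request.

```python
import json
import time
from typing import Callable, Optional


def policy_decide(suspicious: bool, block_on_detection: bool = True) -> str:
    """Policy Engine sketch: map a detection result to an action label."""
    if suspicious and block_on_detection:
        return "block"
    if suspicious:
        return "flag"
    return "allow"


def run_pipeline(
    prompt: str,
    sanitize: Callable[[str], str],
    detect: Callable[[str], bool],
    call_llm: Callable[[str], str],
    audit_log: list,
) -> Optional[str]:
    """Run one request through sanitizer, detector, policy, and audit log."""
    clean = sanitize(prompt)
    suspicious = detect(clean)
    decision = policy_decide(suspicious)

    # Audit Logger sketch: record every decision, including allowed requests.
    audit_log.append(
        json.dumps(
            {"ts": time.time(), "suspicious": suspicious, "decision": decision}
        )
    )

    # Assumption: blocked requests never reach the LLM provider.
    if decision == "block":
        return None
    return call_llm(clean)
```

The stages are passed in as plain callables so that each one can be replaced or tested on its own. For example, `run_pipeline(text, str.strip, my_detector, my_client, log)` would work with any detector that returns a boolean and any client function that takes a prompt and returns a string.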