GitHub - rabbidave/StoopKid-Event-Driven-Input-Monitoring-for-Language-Models: A set of serverless functions designed to assist in the monitoring of inputs to language models, including routine and specific inspection of the message queue, as well as event-driven triggering of more complex metric calculation based on (what will eventually be) configurable environment variables; alongside a suite of other tools/functions

♫ The Dream of the 90's ♫ is alive in Portland "a weird suite of Enterprise LLM tools" named after Nicktoons

by some dude in his 30s

Utility 1) Stoop Kid: Serverless & Event Driven Input Monitoring for Language Models

Description:

A set of serverless functions designed to assist in the monitoring of inputs to language models, including routine and specific inspection of the message queue, and event-driven triggering of more complex metric calculation based on continually configurable values (assuming use of an environment variable); subsequently alerts to another SQS queue

Rationale:

Large Language Models are subject to various forms of prompt injection (indirect or otherwise); lightweight and step-wise alerting of similar prompts compared to a baseline help your application stay secure
User experience, instrumentation, and metadata capture are crucial to the adoption of LLMs for orchestration of multi-modal agentic systems; a high cosine similarity paired with a low rouge-L could indicate poor generalization, better prompt engineering for users, and/or an attack on the system

Intent:

The intent of this StoopKid.py is to efficiently spin up, store and retrieve messages from an ElastiCache instance, thereby affecting a windowing function as a means to monitor the inputs to a language model.

The goal being to detect if the model is starting to experience drift from the baseline loaded ROUGE-L (which should be regularly updated and stored in S3).

The ROUGE-L value is calculated intially from the baseline, and stored in memory. ROUGE-L is calculated for incoming messages only after comparing the cosine similarity of new messages in the dataframe to the last 5 minutes worth of messages; when complete the function spins down appropriately.

The cosine similarity is used as a heuristic to detect similar inputs within the incoming dataframes to the last 5 minutes worth of messages (ostensibly to identify either poor generalization or attack), and the ROUGE-L score is used to more precisely compare the inputs with a baseline dataset; as a means of validating the assumption of the first function.

If new inputs are found to be "too similar", and subsequently found to be drifting from the baseline, a message is posted to a second SQS queue for further analysis.

Note: Needs logging and additional error-handling; this is mostly conceptual and assumes the use of environment variables rather than hard-coded values for cosine similarity & ROUGE-L

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
GPT4_Eval_&_To-Do_List.md		GPT4_Eval_&_To-Do_List.md
LICENSE		LICENSE
README.md		README.md
StoopKid.py		StoopKid.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

♫ The Dream of the 90's ♫ is alive in Portland "a weird suite of Enterprise LLM tools" named after Nicktoons

by some dude in his 30s

Utility 1) Stoop Kid: Serverless & Event Driven Input Monitoring for Language Models

Description:

Rationale:

Intent:

Note: Needs logging and additional error-handling; this is mostly conceptual and assumes the use of environment variables rather than hard-coded values for cosine similarity & ROUGE-L

About

Releases

Packages

Languages

License

rabbidave/StoopKid-Event-Driven-Input-Monitoring-for-Language-Models

Folders and files

Latest commit

History

Repository files navigation

♫ The Dream of the 90's ♫ is alive in Portland "a weird suite of Enterprise LLM tools" named after Nicktoons

by some dude in his 30s

Utility 1) Stoop Kid: Serverless & Event Driven Input Monitoring for Language Models

Description:

Rationale:

Intent:

Note: Needs logging and additional error-handling; this is mostly conceptual and assumes the use of environment variables rather than hard-coded values for cosine similarity & ROUGE-L

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages