Roland Pihlakas open source projects

An independent AI safety researcher. I studied psychology, have 20 years of experience in modelling natural intelligence and in designing various AI algorithms.

Pinned Loading

bioblue bioblue Public

Notable runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLM-s with simplified observation format. The benchmark themes include multi-objec…

Python 1 2
ai-safety-gridworlds ai-safety-gridworlds Public

Forked from google-deepmind/ai-safety-gridworlds

Extended, multi-agent and multi-objective (MaMoRL / MoMaRL) environments based on DeepMind's AI Safety Gridworlds. This is a suite of reinforcement learning environments illustrating various safety…

Python 10 2
Manipulative-Expression-Recognition Manipulative-Expression-Recognition Public

MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. MER benchmarks language models for manipulative expressions,…

HTML 13 3
DataAnonymiser DataAnonymiser Public

Anonymises data inside text files and in sheet files. It recognises and removes various sorts of personally identifiable information (PII). Each removed part is replaced with a suitable generic tex…

Python 3 2
zoo_to_gym_multiagent_adapter zoo_to_gym_multiagent_adapter Public

Enables you to convert a PettingZoo environment to a Gym environment while supporting multiple agents (MARL). Gym's default setup doesn't easily support multi-agent environments, but this wrapper r…

Python 1
tcpoverudp2 tcpoverudp2 Public

Reliably forwards TCP connections using UDP over two network interfaces in parallel.

Perl 7 1

People

Top languages

Loading…

Most used topics

Loading…

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Roland Pihlakas open source projects

Pinned Loading

Repositories

People

Top languages

Most used topics