Skip to content

Latest commit

 

History

History
98 lines (74 loc) · 2.7 KB

regex.md

File metadata and controls

98 lines (74 loc) · 2.7 KB

Using regular expressions

Regular expressions format

The set of supported regular expressions by Orion is POSIX Extended (see details here).

Note that POSIX doesn't support lookarounds so you cannot use expressions like this one:

^(?!WeatherObserved).*

However, you can found POSIX equivalences. For instance, the above expression is equivalent to this one (more info in this post:

^(([^\nW].{14}|.[^\ne].{13}|.{2}[^\na].{12}|.{3}[^\nt].{11}|.{4}[^\nh].{10}|.{5}[^\ne].{9}|.{6}[^\nr].{8}|.{7}[^\nO].{7}|.{8}[^\nb].{6}|.{9}[^\ns].{5}|.{10}[^\ne].{4}|.{11}[^\nr].{3}|.{12}[^\ntv].{2}|.{13}[^\ne].|.{14}[^\nd]).*|.{0,14})$

They may be longer and uglier but they are functionally equivalent.

Regular expressions in payload

In some operations regular expressions can be used in payload, as shown below.

Example:

POST /v2/op/query

{
  "entities": [
    {
      "idPattern": ".*",
      "type": "Sensor"
    }
  ]
}

The regular expression language allows the usage of backslash (\). For example, SENSOR\d{2} matches strings containing SENSOR followed by two digits. However, note that \ is a special character in JSON, which, according to the JSON specification, needs to be encoded as \\.

Example:

POST /v2/op/query

{
  "entities": [
    {
      "idPattern": "SENSOR\\d{2}",
      "type": "Sensor"
    }
  ]
}

The issue could be even funnier if you use the shell to send the request. For example, if you use multi-line curl you need to escape \ at shell level. Eg:

curl localhost:1026/v2/op/query -s -S --header 'Content-Type: application/json' --header 'Accept: application/json' -d @- | python -mjson.tool) <<EOF
{
  "entities": [
    {
      "idPattern": "SENSOR\\\\d{2}",
      "type": "Sensor"
    }
  ]
}
EOF

In other words, the SENSOR\\\\d{2} string at shell level is transformed to SENSOR\\d{2} at payload level (i.e. the actual HTTP request that is sent over the wire) which, in sequence, corresponds to the SENSOR\d{2} regular expression after JSON parsing stage at Orion. However, note that if you use a file for your payload, the shell escaping isn't needed, i.e.:

curl localhost:1026/v2/op/query -s -S --header 'Content-Type: application/json' --header 'Accept: application/json' -d @payload.json

would use the following payload.json file:

{
  "entities": [
    {
      "idPattern": "SENSOR\\d{2}",
      "type": "Sensor"
    }
  ]
}

More details on this can be found in the following issue comment at github.