seq-log-parser

Have you ever noticed that the standard syslog or GELF entries that Seq ingests through sqelf and squiflog are quite bland, i.e. that they don't contain many of the useful properties that make Seq such a powerhouse? No need to fear.

seq-log-parser is an intermediate proxy to which sqelf, squiflog and any other ingestion tool can relay their log entries. seq-log-parser then extracts properties with the awesome power of regex and relays the entries straight on to the Seq server, even using the same API keys!

Oh, it's available as a Docker container as well!

Rationale

Consider the following docker-compose file:

version: '3.5'
services:
  seq:
    image: datalust/seq
    environment:
      ACCEPT_EULA: "Y"
  squiflog:
    image: datalust/squiflog
    environment:
      SEQ_ADDRESS: "http://seq:80"
    ports:
      - published: 514
        target: 514
        protocol: udp
        mode: ingress

Say you configure your server to send its logs there. A log entry you might get looks like this:

{
  "@t": "2020-05-03T20:41:39.0000000Z",
  "@mt": "kernel: [341690.053545] veth2c1b259: renamed from eth0",
  "@m": "kernel: [341690.053545] veth2c1b259: renamed from eth0",
  "@i": "5dceda5b",
  "facility": "kern",
  "hostname": "hypervisor1"
}

Not too useful, is it? Now say you put seq-log-parser in between, like this:

version: '3.5'
services:
  seq:
    image: datalust/seq
    environment:
      ACCEPT_EULA: "Y"
  seq-log-parser:
    image: smokserwis/seq-log-parser
    environment:
      SEQ_ADDRESS: "http://seq"
      REGEX: "(?P<source>.*): \\[(?P<uptime>\\d+\\.\\d+)\\] (?P<interface>.*): (?P<message>.*)"
      OVERWRITE_CONTENTS: "{message}"
  squiflog:
    image: datalust/squiflog
    environment:
      SEQ_ADDRESS: "http://seq-log-parser"
    ports:
      - published: 514
        target: 514
        protocol: udp
        mode: ingress

And now the log entry will look like this:

{
  "@t": "2020-05-03T20:41:39.0000000Z",
  "@mt": "renamed from eth0",
  "@m": "kernel: [341690.053545] veth2c1b259: renamed from eth0",
  "@i": "5dceda5b",
  "facility": "kern", 
  "Properties": {
    "source": "kernel",
    "uptime": "341690.053545", 
    "interface": "veth2c1b259",
    "message": "renamed from eth0"
  },
  "hostname": "hypervisor1"
}

Now isn't that much nicer?
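The extraction shown above can be tried out locally with Python's `re` module. This is only a minimal sketch and not seq-log-parser's actual code: the function name `extract_properties` is made up for illustration, and the pattern is written with `uptime` as a single named group.

```python
import re

# The example pattern, written as a raw Python string
# (single backslashes, unlike in a double-quoted YAML value).
PATTERN = re.compile(
    r"(?P<source>.*): \[(?P<uptime>\d+\.\d+)\] (?P<interface>.*): (?P<message>.*)"
)


def extract_properties(text):
    """Return the named groups as a dict, or None when the text does not match."""
    match = PATTERN.match(text)
    return match.groupdict() if match else None


line = "kernel: [341690.053545] veth2c1b259: renamed from eth0"
print(extract_properties(line))
```

Each key of the returned dict corresponds to one entry under Properties in the JSON above.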

Usage

The container is configured through the following environment variables:

Env name	Description	Required?	Default	Can postfix i?
SEQ_ADDRESS	Address of the real Seq server	True	N/A	no
REGEX	Regex to match entries against. It must be a valid Python regex with named groups	True	N/A	yes
OVERWRITE_CONTENTS	If this env is defined, the text will be overwritten with the value of this format string, filled in from the extracted fields	False	False	yes
FIELD_TO_PARSE	Name of the received field to parse against	False	@mt	no
BIND_ADDR	Address to bind the listening port on	False	0.0.0.0	no
BIND_PORT	Port to bind the listening port on	False	80	no
LOGGING_LEVEL	Default Python logging level to configure for seq-log-parser's own logs	False	WARNING	no
REGEX_PROPERTY	A key=value pair, a custom property to attach to entries	False	none	yes
SEQ_LOG_LEVEL	If this is defined, entries that match the regex will be assigned this severity level	False	none	yes
STORE_IN_ENTRY	If this is set to `True`, the message's body will be updated instead of its Properties	False	False	yes
DROP_ENTRY	If this is set to `True`, entries that match the regex will be dropped	False	False	yes

Make sure your regexes are valid Python regexes with named groups. Don't forget to escape the escape character if you're writing YAML for deployment: inside a double-quoted YAML string, write \\d rather than \d!
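Before deploying, a pattern can be checked locally. The helper below is a hypothetical sketch (the name `named_groups` is not part of seq-log-parser): it compiles the pattern and lists its named groups, so a typo shows up before the container starts. Pass the pattern as the container will see it, i.e. after YAML unescaping, with single backslashes.

```python
import re


def named_groups(pattern):
    """Compile a pattern and return its named groups in order of appearance.

    Raises ValueError when the pattern is invalid or has no named groups.
    """
    try:
        compiled = re.compile(pattern)
    except re.error as exc:
        raise ValueError(f"invalid regex: {exc}") from exc
    if not compiled.groupindex:
        raise ValueError("regex has no named groups, nothing would be extracted")
    # groupindex maps group name -> group number; sort by number to keep pattern order.
    return sorted(compiled.groupindex, key=compiled.groupindex.get)


print(named_groups(r"(?P<source>.*): \[(?P<uptime>\d+\.\d+)\]"))
```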

Whatever a named group matches will be added to the log entry's Properties.

If you are using a list of regexes and want to add a property depending on which regex matched, you can specify envs called REGEX_PROPERTYi, with i being the number of your regex. You specify them in the format key=value.

If you want to add a custom property but are using only a single regex, just name your env REGEX_PROPERTY and also specify it in form of key=value.
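As an illustration of the key=value format, here is a minimal sketch of how such a value could be split. The function name is hypothetical, and splitting on the first `=` (so that the value itself may contain `=`) is an assumption, not confirmed behaviour of seq-log-parser.

```python
def parse_regex_property(raw):
    """Split a 'key=value' string into a (key, value) tuple.

    Assumption: only the first '=' separates key from value.
    """
    key, separator, value = raw.partition("=")
    if not separator or not key:
        raise ValueError(f"expected key=value, got {raw!r}")
    return key, value


print(parse_regex_property("kind=kernel"))
```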

Please note that if you enable STORE_IN_ENTRY, custom properties handled by REGEX_PROPERTY will be added to the entry's body, and not to its Properties!

If an entry doesn't match any of the regexes, it will be sent as-is.

OVERWRITE_CONTENTS will update both the FIELD_TO_PARSE field and the MessageTemplate field, if the latter is present.

OVERWRITE_CONTENTS needs to be specified as a Python format string, like:

{message} {url}

The placeholders refer to the names of the extracted fields, which are used to construct the new message.
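The effect of OVERWRITE_CONTENTS can be sketched with plain `str.format`. The function below is illustrative only: its name and the way it copies the entry are assumptions, while the fields it touches follow the description above.

```python
def overwrite_contents(entry, template, properties, field_to_parse="@mt"):
    """Return a copy of entry with its text replaced by the formatted template."""
    new_text = template.format(**properties)
    updated = dict(entry)
    updated[field_to_parse] = new_text
    # MessageTemplate is only touched when the entry already carries it.
    if "MessageTemplate" in updated:
        updated["MessageTemplate"] = new_text
    return updated


entry = {"@mt": "kernel: [341690.053545] veth2c1b259: renamed from eth0"}
print(overwrite_contents(entry, "{message}", {"message": "renamed from eth0"}))
```

A placeholder that names a group the regex does not define raises `KeyError` in this sketch, so the format string and the regex have to agree on field names.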

Multiple regexes

Please note that regex matching will be done in the order you specify the regexes.

All variables listed above that are marked "Can postfix i" can be postfixed with a natural number. They will then relate only to the regex with that number.

If your input can be matched by multiple regexes, just specify them as environment variables REGEX1, REGEX2, REGEX3 instead of a single REGEX. You will then use REGEX_PROPERTYi to assign custom properties.

You may not leave REGEX_PROPERTYi blank. If you have, say, a REGEX2 that doesn't need a custom property, just don't define REGEX_PROPERTY2.

You can do the same with OVERWRITE_CONTENTS. Note that if only OVERWRITE_CONTENTS is set, it will apply to all regexes!
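The numbering scheme can be pictured with the sketch below. It is not the project's code: the function names are hypothetical, and it assumes that regexes are tried in numeric order, that the first matching regex wins, and that an unnumbered OVERWRITE_CONTENTS acts as the fallback for any regex without its own OVERWRITE_CONTENTSi.

```python
import re


def load_rules(env):
    """Collect REGEX or REGEX1..REGEXn plus per-regex options from an os.environ-like mapping."""
    if "REGEX" in env:
        suffixes = [""]
    else:
        numbers = sorted(
            int(name[5:]) for name in env
            if name.startswith("REGEX") and name[5:].isdecimal()
        )
        suffixes = [str(number) for number in numbers]
    rules = []
    for suffix in suffixes:
        rules.append({
            "regex": re.compile(env["REGEX" + suffix]),
            # Fall back to the unnumbered variable when no numbered one is set.
            "overwrite": env.get("OVERWRITE_CONTENTS" + suffix, env.get("OVERWRITE_CONTENTS")),
            "property": env.get("REGEX_PROPERTY" + suffix),
        })
    return rules


def first_match(rules, text):
    """Try the rules in order; return (rule, extracted groups) or (None, {})."""
    for rule in rules:
        match = rule["regex"].match(text)
        if match:
            return rule, match.groupdict()
    return None, {}


env = {
    "REGEX1": r"(?P<number>\d+)",
    "REGEX2": r"(?P<word>\w+)",
    "OVERWRITE_CONTENTS": "{number}",
    "REGEX_PROPERTY2": "kind=word",
}
rules = load_rules(env)
print(first_match(rules, "abc")[1])
```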

You can do the same with SEQ_LOG_LEVEL: just specify a Python format string referring to the extracted properties. It will update Seq's @l field. If you have multiple regexes, specify SEQ_LOG_LEVELi.

Note that the value of SEQ_LOG_LEVEL will be cast to uppercase!
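A minimal sketch of the level handling described above, assuming the format string is filled from the extracted properties and then uppercased. The function name is hypothetical.

```python
def apply_log_level(entry, template, properties):
    """Return a copy of entry with Seq's @l field set from the format string."""
    updated = dict(entry)
    updated["@l"] = template.format(**properties).upper()
    return updated


print(apply_log_level({"@mt": "disk almost full"}, "{level}", {"level": "warning"}))
```

With a regex that captures a `level` group, setting SEQ_LOG_LEVEL to `{level}` would carry the captured severity over to Seq; a fixed value such as `error` works the same way, since it contains no placeholders.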

You can do the same with DROP_ENTRY. If it is set to True, any entry matching the given regex will be dropped on the floor.

Metrics

A Prometheus exporter is available on the /metrics endpoint. The following metrics are exposed:

Name	Labels	Meaning	Type
seq_successes	none	Successful calls to Seq server	counter
seq_failures	none	Failed calls to Seq server	counter
matched.regex	regex - the pattern that matched	Number of entries that matched this regex	counter
matched.nothing	none	Number of entries that matched no regex	counter
entries.total	none	Total number of entries processed	counter
entries.calls	none	Number of times an ingestion call was made	counter
