system.alarm/check-alarm-status-netconf rule not working for SRX HA for Junos 19.X/20.X #631

Open
anaaman opened this issue May 26, 2022 · 0 comments

anaaman commented May 26, 2022

The rule does not detect cluster HA issues because HA monitoring is not done via the get-system-alarm-information RPC.
HA status is reported by the get-chassis-cluster-information RPC instead.
That RPC returns all HA information, including any failures.

Example output for a cluster with issues on node1:

root@test-clust-0> show chassis cluster information | display xml

<multi-routing-engine-results>

    <multi-routing-engine-item>

        <re-name>node0</re-name>

        <chassis-cluster-information>
            <chassis-cluster-redundancy-group-information>
                <redundancy-group-list>
                    <redundancy-group-id>0</redundancy-group-id>
                    <redundancy-group-status>primary</redundancy-group-status>
                    <redundancy-group-weight>255</redundancy-group-weight>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 19 14:46:35</transition-time>
                        <from-state>hold</from-state>
                        <to-state>secondary</to-state>
                        <transition-reason>Hold timer expired</transition-reason>
                    </redundancy-group-state-transition-record>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 19 14:46:55</transition-time>
                        <from-state>secondary</from-state>
                        <to-state>primary</to-state>
                        <transition-reason>Only node present</transition-reason>
                    </redundancy-group-state-transition-record>
                </redundancy-group-list>
                <redundancy-group-list>
                    <redundancy-group-id>1</redundancy-group-id>
                    <redundancy-group-status>primary</redundancy-group-status>
                    <redundancy-group-weight>255</redundancy-group-weight>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 19 14:46:35</transition-time>
                        <from-state>hold</from-state>
                        <to-state>secondary</to-state>
                        <transition-reason>Hold timer expired</transition-reason>
                    </redundancy-group-state-transition-record>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 19 14:46:55</transition-time>
                        <from-state>secondary</from-state>
                        <to-state>primary</to-state>
                        <transition-reason>Only node present</transition-reason>
                    </redundancy-group-state-transition-record>
                </redundancy-group-list>
            </chassis-cluster-redundancy-group-information>
            <chassis-cluster-led-information>
                <current-led-color>Green</current-led-color>
                <last-change-reason>No failures</last-change-reason>
            </chassis-cluster-led-information>
            <control-link-vlan-information>
                <control-link-vlan-status>Disabled</control-link-vlan-status>
            </control-link-vlan-information>
            <chassis-cluster-monitoring-failure-information>
                <monitoring-failure-title>Failure Information:</monitoring-failure-title>
                <fabric-link-failure-information>
                    <fabric-link-interface-status>
                        <fabric-link-interface-index>0</fabric-link-interface-index>
                        <fabric-link-child-interface-status>
                            <fabric-link-child-interface-name>ge-0/0/2</fabric-link-child-interface-name>
                            <fabric-link-child-interface-physical-status>Down</fabric-link-child-interface-physical-status>
                            <fabric-link-child-interface-monitored-status>Down</fabric-link-child-interface-monitored-status>
                        </fabric-link-child-interface-status>
                    </fabric-link-interface-status>
                </fabric-link-failure-information>
            </chassis-cluster-monitoring-failure-information>
        </chassis-cluster-information>
    </multi-routing-engine-item>

    <multi-routing-engine-item>

        <re-name>node1</re-name>

        <chassis-cluster-information>
            <chassis-cluster-redundancy-group-information>
                <redundancy-group-list>
                    <redundancy-group-id>0</redundancy-group-id>
                    <redundancy-group-status>secondary</redundancy-group-status>
                    <redundancy-group-weight>0</redundancy-group-weight>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 26 08:27:29</transition-time>
                        <from-state>hold</from-state>
                        <to-state>secondary</to-state>
                        <transition-reason>Hold timer expired</transition-reason>
                    </redundancy-group-state-transition-record>
                </redundancy-group-list>
                <redundancy-group-list>
                    <redundancy-group-id>1</redundancy-group-id>
                    <redundancy-group-status>secondary</redundancy-group-status>
                    <redundancy-group-weight>-655</redundancy-group-weight>
                    <redundancy-group-state-transition-record>
                        <transition-time>May 26 08:27:29</transition-time>
                        <from-state>hold</from-state>
                        <to-state>secondary</to-state>
                        <transition-reason>Hold timer expired</transition-reason>
                    </redundancy-group-state-transition-record>
                </redundancy-group-list>
            </chassis-cluster-redundancy-group-information>
            <chassis-cluster-led-information>
                <current-led-color>Amber</current-led-color>
                <last-change-reason>Monitored objects are down</last-change-reason>
            </chassis-cluster-led-information>
            <control-link-vlan-information>
                <control-link-vlan-status>Disabled</control-link-vlan-status>
            </control-link-vlan-information>
            <chassis-cluster-monitoring-failure-information>
                <monitoring-failure-title>Failure Information:</monitoring-failure-title>
                <coldsync-monitoring-failure-information>
                    <coldsync-monitoring-statistics>
                        <coldsync-total-spu>1</coldsync-total-spu>
                        <coldsync-completed-spu>0</coldsync-completed-spu>
                        <coldsync-not-complete-spu>1</coldsync-not-complete-spu>
                    </coldsync-monitoring-statistics>
                    <coldsync-monitoring-spu-list></coldsync-monitoring-spu-list>
                </coldsync-monitoring-failure-information>
                <interface-monitoring-failure-information>
                    <redundancy-group-id>1</redundancy-group-id>
                    <interface-monitoring-status>Failed</interface-monitoring-status>
                    <monitoring-interface-name>ge-3/0/4</monitoring-interface-name>
                    <monitoring-interface-status>     Down   / Down</monitoring-interface-status>
                    <monitoring-interface-name>ge-3/0/3</monitoring-interface-name>
                    <monitoring-interface-status>     Down   / Down</monitoring-interface-status>
                </interface-monitoring-failure-information>
            </chassis-cluster-monitoring-failure-information>
        </chassis-cluster-information>
    </multi-routing-engine-item>

</multi-routing-engine-results>
<cli>
    <banner>{primary:node0}</banner>
</cli>

{primary:node0}
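
To illustrate what a check based on this output could look at, here is a minimal Python sketch that parses a trimmed copy of the XML above and reports, per node, the LED color, the failed monitored interfaces and any fabric child links that are down. This is not the HealthBot rule itself. The names `find_cluster_issues` and `SAMPLE` are hypothetical, and treating a non-Green LED or a `Down` fabric child link as an issue is an assumption based only on the output shown here. It also assumes the reply carries no XML namespaces, as in the pasted output.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the output above (hypothetical test fixture, not live data).
SAMPLE = """<multi-routing-engine-results>
  <multi-routing-engine-item>
    <re-name>node0</re-name>
    <chassis-cluster-information>
      <chassis-cluster-led-information>
        <current-led-color>Green</current-led-color>
        <last-change-reason>No failures</last-change-reason>
      </chassis-cluster-led-information>
      <chassis-cluster-monitoring-failure-information>
        <fabric-link-failure-information>
          <fabric-link-interface-status>
            <fabric-link-child-interface-status>
              <fabric-link-child-interface-name>ge-0/0/2</fabric-link-child-interface-name>
              <fabric-link-child-interface-physical-status>Down</fabric-link-child-interface-physical-status>
            </fabric-link-child-interface-status>
          </fabric-link-interface-status>
        </fabric-link-failure-information>
      </chassis-cluster-monitoring-failure-information>
    </chassis-cluster-information>
  </multi-routing-engine-item>
  <multi-routing-engine-item>
    <re-name>node1</re-name>
    <chassis-cluster-information>
      <chassis-cluster-led-information>
        <current-led-color>Amber</current-led-color>
        <last-change-reason>Monitored objects are down</last-change-reason>
      </chassis-cluster-led-information>
      <chassis-cluster-monitoring-failure-information>
        <interface-monitoring-failure-information>
          <redundancy-group-id>1</redundancy-group-id>
          <interface-monitoring-status>Failed</interface-monitoring-status>
          <monitoring-interface-name>ge-3/0/4</monitoring-interface-name>
          <monitoring-interface-status>     Down   / Down</monitoring-interface-status>
          <monitoring-interface-name>ge-3/0/3</monitoring-interface-name>
          <monitoring-interface-status>     Down   / Down</monitoring-interface-status>
        </interface-monitoring-failure-information>
      </chassis-cluster-monitoring-failure-information>
    </chassis-cluster-information>
  </multi-routing-engine-item>
</multi-routing-engine-results>"""


def find_cluster_issues(xml_text):
    """Return one dict per node that shows an HA problem (assumed criteria)."""
    root = ET.fromstring(xml_text)
    issues = []
    for item in root.iter("multi-routing-engine-item"):
        node = item.findtext("re-name", default="unknown").strip()
        color = item.findtext(".//current-led-color", default="").strip()
        reason = item.findtext(".//last-change-reason", default="").strip()

        # Interfaces listed under interface-monitoring-failure-information.
        failed = [e.text.strip()
                  for e in item.iter("monitoring-interface-name") if e.text]

        # Fabric child links whose physical status is reported as Down.
        down_fabric = []
        for child in item.iter("fabric-link-child-interface-status"):
            status = child.findtext(
                "fabric-link-child-interface-physical-status", default="").strip()
            if status == "Down":
                down_fabric.append(
                    child.findtext("fabric-link-child-interface-name",
                                   default="").strip())

        # Assumption: any non-Green LED or any listed failure counts as an issue.
        if color.lower() != "green" or failed or down_fabric:
            issues.append({
                "node": node,
                "led_color": color,
                "reason": reason,
                "failed_interfaces": failed,
                "down_fabric_links": down_fabric,
            })
    return issues


if __name__ == "__main__":
    for issue in find_cluster_issues(SAMPLE):
        print(issue)
```

Note that node0 reports a Green LED even though its fabric child link is down, so a check that looks only at `current-led-color` would miss that condition.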
