<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to monitor ES (Elastic Search) health from controller logs and CLI? in RUCKUS Self-Help</title>
    <link>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40060#M23</link>
    <description>&lt;P style="margin: 0;"&gt;Hi &lt;A href="https://forums.ruckuswireless.com/users/5f916a0d48339326c8ee4bab" style=""&gt;@nayanendu_mallick&lt;/A&gt; ,&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Great Reference!! Thank you for sharing this one.&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Regards,&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Abilash&lt;/P&gt;</description>
    <pubDate>Mon, 31 Jan 2022 12:32:08 GMT</pubDate>
    <dc:creator>abilashpr</dc:creator>
    <dc:date>2022-01-31T12:32:08Z</dc:date>
    <item>
      <title>How to monitor ES (Elastic Search) health from controller logs and CLI?</title>
      <link>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40057#M20</link>
      <description>&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;What is ES?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Elasticsearch is a highly scalable open-source full-text search and analytics engine. It allows you to store, search, and analyze big volumes of data quickly and in near real-time. It is generally used as the underlying engine/technology that powers applications that have complex search features and requirements." — &lt;A href="https://www.elastic.co/guide/en/elasticsearch/reference/current/getting-started.html" target="_blank" rel="noopener"&gt;https://www.elastic.co/guide/en/elasticsearch/reference/current/getting-started.html&lt;/A&gt;”&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Starting from the 3.5 version, SZ/VSZ uses ES, especially for quick response for UI, needs and we have two data sources to store data. Some of them are in Cassandra (Configuration), some of them are in ES(status/stats). We rely on an underlying sync mechanism to make sure data consistency.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;Why does the ES issue happen?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;The problem can be separated into two parts.&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;UI Not responding.&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P style="margin: 0;"&gt;There are many reasons cause the ES cluster failed. In most cases, network partition (or unstable network connection between nodes). In such cases, we have a few ways to do recovery automatically. Configurer service can do some recovery but only try to restart the ES node.&lt;/P&gt;
&lt;OL start="2"&gt;
&lt;LI&gt;&lt;STRONG&gt;Data Out of sync.&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P style="margin: 0;"&gt;Many reasons can probably fail ES, in this situation, we must check the log and deal with it in a different way. &lt;STRONG&gt;&lt;U&gt;Reindex is the final solution to solve this problem&lt;/U&gt;&lt;/STRONG&gt;. However, to confirm the problem is out of sync, usually need to manually check, &lt;STRONG&gt;&lt;U&gt;not every case can be resolved by reindexing procedure.&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;Issues that you will notice on UI:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Pop-up error messages about ES keywords (For example: "all shards failed")&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_21ddcf61f6ed0a4eb5839b9d7f715640_Allshardsfailed-f9e7da73-8889-4e73-991c-4fc1557ea26b-733799386.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2516i9E93E4B202399FDA/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_21ddcf61f6ed0a4eb5839b9d7f715640_Allshardsfailed-f9e7da73-8889-4e73-991c-4fc1557ea26b-733799386.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_21ddcf61f6ed0a4eb5839b9d7f715640_Allshardsfailed-f9e7da73-8889-4e73-991c-4fc1557ea26b-733799386.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Pop-up error messages about "alias missing"&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_c8e27c7585ecfd2da327de4561602a3a_AliasMissing-e53cbeb9-64c7-4cba-8c2e-2af5f1b0c5c4-1521986281.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2517i781AB959590004A8/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_c8e27c7585ecfd2da327de4561602a3a_AliasMissing-e53cbeb9-64c7-4cba-8c2e-2af5f1b0c5c4-1521986281.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_c8e27c7585ecfd2da327de4561602a3a_AliasMissing-e53cbeb9-64c7-4cba-8c2e-2af5f1b0c5c4-1521986281.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Show no result in WebGUI (For example: WLAN tab or AP tab show no result)&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_4a48bd0f237d5039383149f05ca61f5e_wlanmissing-e9e05333-da61-4959-a7f7-99db6b08e2bc-215179319.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2518i5CE342F7E3880EAB/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_4a48bd0f237d5039383149f05ca61f5e_wlanmissing-e9e05333-da61-4959-a7f7-99db6b08e2bc-215179319.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_4a48bd0f237d5039383149f05ca61f5e_wlanmissing-e9e05333-da61-4959-a7f7-99db6b08e2bc-215179319.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;AP Traffic column or radio (2.4 or 5GHz) column would show N/A.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_968872a55d02e47e99f2144e3341d60b_APtrafficNA-579e0fe8-d5d9-457c-bcf4-4d5fbada3294-1389817204.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2519iC6A4D34079350A9D/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_968872a55d02e47e99f2144e3341d60b_APtrafficNA-579e0fe8-d5d9-457c-bcf4-4d5fbada3294-1389817204.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_968872a55d02e47e99f2144e3341d60b_APtrafficNA-579e0fe8-d5d9-457c-bcf4-4d5fbada3294-1389817204.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;And sometimes, navigating between different tabs of the controller you would see “An Unknown error” occurred message.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;How to identify from the logs if ES has gone bad?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Download Snapshot logs from the SZ/vSZ, by navigating to Diagnostics &amp;gt; Application logs&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_1fd9d1286bf722ea1601346882f55061_Snapshotlogs-b56ac155-e5ca-4795-89a6-a6844e4515e6-847213798.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2520iFE04DB75A08AAA01/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_1fd9d1286bf722ea1601346882f55061_Snapshotlogs-b56ac155-e5ca-4795-89a6-a6844e4515e6-847213798.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_1fd9d1286bf722ea1601346882f55061_Snapshotlogs-b56ac155-e5ca-4795-89a6-a6844e4515e6-847213798.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;If Elastic Search service is offline, that could be verified by running the command “show service”&lt;/LI&gt;
&lt;LI&gt;Once you download the snapshot logs extract them using 7 zip. Then navigate to applogfiles folder, you would see all services folders of the controller.&lt;/LI&gt;
&lt;/OL&gt;
&lt;P style="margin: 0;"&gt;Configurer/configurer.log&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;Configurer[pool-4-thread-1] INFO c.r.w.c.s.ElasticClientService - Start to initial ES client.&lt;BR /&gt;Configurer[pool-4-thread-1] INFO c.r.w.c.s.ElasticClientService - Failed to init ES client&lt;BR /&gt;org.elasticsearch.client.transport.NoNodeAvailableException: None of the configured nodes are available: []&lt;BR /&gt;at org.elasticsearch.client.transport.TransportClientNodesService.ensureNodesAreAvailable(TransportClientNodesService.java:305) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.client.transport.TransportClientNodesService.execute(TransportClientNodesService.java:200) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.client.transport.support.InternalTransportClusterAdminClient.execute(InternalTransportClusterAdminClient.java:86) ~[elasticsearch-1.7.2.jar:na]&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Core/core.log&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;Caused by: org.elasticsearch.indices.IndexMissingException: [alias_apmeshstatus_all] missing&lt;BR /&gt;at org.elasticsearch.cluster.metadata.MetaData.convertFromWildcards(MetaData.java:884) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.cluster.metadata.MetaData.concreteIndices(MetaData.java:692) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.action.search.type.TransportSearchTypeAction$BaseAsyncAction.&amp;lt;init&amp;gt;(TransportSearchTypeAction.java:118) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.action.search.type.TransportSearchDfsQueryThenFetchAction$AsyncAction.&amp;lt;init&amp;gt;(TransportSearchDfsQueryThenFetchAction.java:76) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;&lt;BR /&gt;&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;Web/web.log&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;Web[localhost-startStop-2] ERROR c.r.s.d.s.r.ReIndexServiceImpl - init reindex [com.ruckuswireless.scg.domain.service.reindex.APReIndexCommand@47dda78] failed&lt;BR /&gt;org.elasticsearch.indices.IndexMissingException: [alias_apmeshstatus_all] missing&lt;BR /&gt;at org.elasticsearch.cluster.metadata.MetaData.convertFromWildcards(MetaData.java:884) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.cluster.metadata.MetaData.concreteIndices(MetaData.java:692) ~[elasticsearch-1.7.2.jar:na]&lt;BR /&gt;at org.elasticsearch.action.search.type.TransportSearchTypeAction$BaseAsyncAction.&amp;lt;init&amp;gt;(TransportSearchTypeAction.java:118) ~[elasticsearch-1.7.2.jar:na]&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Configurer/es-monitor.log (search by “health” keyword), check if the health shows red or yellow.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Image_ images_messages_61f798e72c844f7ca99369ad_db6be6cff2192a1f6757765460d8920e_redyellow-f107512b-130f-47df-a7cf-b7756702040b-1185417880.JPG"&gt;&lt;img src="https://community.ruckuswireless.com/t5/image/serverpage/image-id/2521iD9051841392E4273/image-size/large?v=v2&amp;amp;px=999" role="button" title="Image_ images_messages_61f798e72c844f7ca99369ad_db6be6cff2192a1f6757765460d8920e_redyellow-f107512b-130f-47df-a7cf-b7756702040b-1185417880.JPG" alt="Image_ images_messages_61f798e72c844f7ca99369ad_db6be6cff2192a1f6757765460d8920e_redyellow-f107512b-130f-47df-a7cf-b7756702040b-1185417880.JPG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;In the same “es-monitor.log” (search by “&lt;STRONG&gt;UNASSIGNED&lt;/STRONG&gt;” keyword)&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;index&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; shard prirep state&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; docs&amp;nbsp;&amp;nbsp; store ip&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; node&lt;BR /&gt;apcliscripthistory_20220127&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; p&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; STARTED&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp; 130b x.x.x.x 614c6b97-baa4-488e-8ace-e62b2f086855&lt;BR /&gt;apcliscripthistory_20220127&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; r&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; UNASSIGNED&lt;BR /&gt;apcliscripthistory_20220127&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; p&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; STARTED&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp; 130b x.x.x.x 614c6b97-baa4-488e-8ace-e62b2f086855&lt;BR /&gt;apcliscripthistory_20220127&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; r&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; UNASSIGNED&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;From version, 5.2 onwards, we can check the ES health status from the controller CLI:&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;vSZ-H-179#debug&lt;BR /&gt;&lt;BR /&gt;vSZ-H-179(debug)# debug-tools&lt;BR /&gt;&lt;BR /&gt;[Change to system]&lt;BR /&gt;&lt;BR /&gt;Welcome to Debug CLI Framework!&lt;BR /&gt;&lt;BR /&gt;(debug tool-set) system $ use sz&lt;BR /&gt;&lt;BR /&gt;[Change to sz]&lt;BR /&gt;&lt;BR /&gt;(debug tool-set) sz $ ?&lt;BR /&gt;&lt;BR /&gt;Debug Tools (sz):&lt;BR /&gt;&lt;BR /&gt;Command&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Help&lt;BR /&gt;&lt;BR /&gt;================ ================================&lt;BR /&gt;show-es-cat-aliases show ES cat aliases info&lt;BR /&gt;show-es-cat-health show ES cat health info&lt;BR /&gt;show-es-cat-indices show ES cat indices info&lt;BR /&gt;show-es-cat-master show ES cat master info&lt;BR /&gt;show-es-cat-nodes show ES cat nodes info&lt;BR /&gt;show-es-cat-shards show ES cat shards info&lt;BR /&gt;show-es-cluster-settings show ES cluster settings&lt;BR /&gt;show-es-folder-info show ES folder info&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;For example:&lt;/STRONG&gt; If you want to check the ES health, you could run the below command. If it shows “green” and 100% then ES health is fine. If it shows “red” or “yellow”, then ES has gone bad.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;(debug tool-set) sz $ show-es-cat-health&lt;BR /&gt;epoch&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; timestamp cluster&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent&lt;BR /&gt;1643352441 06:47:21&amp;nbsp; vSZ-H-179_54 green&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 2&amp;nbsp;&amp;nbsp;&amp;nbsp; 472 236&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 100.0%&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;To check if the Indexes are in an &lt;STRONG&gt;UNASSIGNED&lt;/STRONG&gt; state or not, if it shows like below then ES is OK.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;(debug tool-set) sz $ show-es-cat-shards&lt;BR /&gt;*** Unknown syntax: show-es-cat-shards&lt;BR /&gt;index&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; shard prirep state&amp;nbsp;&amp;nbsp; docs&amp;nbsp; store ip&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;node&lt;BR /&gt;hccdclientconnection_20220128&amp;nbsp;&amp;nbsp; 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; r&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; STARTED&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp; 130b x.x.x.x bbf54faf-508b-44a3-8804-76a1b5da4b2d&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;If it shows like below then ES is not OK:&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;index &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; shard prirep state &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;docs &amp;nbsp; store ip &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; node&lt;BR /&gt;apcliscripthistory_20220127 &amp;nbsp; &amp;nbsp; 1 &amp;nbsp; &amp;nbsp; p &amp;nbsp; &amp;nbsp; &amp;nbsp;STARTED &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0 &amp;nbsp; &amp;nbsp;130b x.x.x.x 614c6b97-baa4-488e-8ace-e62b2f086855&lt;BR /&gt;apcliscripthistory_20220127 &amp;nbsp; &amp;nbsp; 1 &amp;nbsp; &amp;nbsp; r &amp;nbsp; &amp;nbsp; &amp;nbsp;UNASSIGNED&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;A brief note on INDEX and SHARDs:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;Index&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;An INDEX is a collection of documents that have somewhat similar characteristics. It is identified by a name and this name is used to refer to the index when performing indexing, search, update, and delete operations against the documents in it. In a single cluster, you can define as many indexes as you want.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;Shards&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;An index can potentially store a large amount of data that can exceed the hardware limits of a single node. To solve this problem, Elasticsearch provides the ability to subdivide your index into multiple pieces called SHARDS. Each shard is in itself a fully functional and independent "index" that can be hosted on any node in the cluster.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;Troubleshooting:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;If Elastic Search service is offline, then perform below: (plan a downtime)&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Follow the below steps when the Controller is running on 3.5.1.x firmware&lt;/P&gt;
&lt;P style="margin: 0;"&gt;- Execute the below command in the debug mode to recover the Elastic Search service on the vSZ-E/SZ-100.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;SZ-100(debug)#force-recover-escluster&lt;BR /&gt;SZ-100(debug)#reload now&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Follow the below commands for controller running from 3.6.x and above&lt;/P&gt;
&lt;P style="margin: 0;"&gt;You don't have to go to debug mode in here.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;SZ-100# force-recover-escluster&lt;BR /&gt;SZ-100# reload now&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Once the Controller is online, check if all the data is available or not. If there is any data missing, execute the following command to recover the data.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;SZ-100(debug)#reindex-elasticsearch-all&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&lt;STRONG&gt;If Elastic Search service is not offline, and still you see some of the Data missing on UI or the errors as shown in the screenshot earlier, then only perform&lt;/STRONG&gt;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE style="margin: 0;"&gt;SZ-100(debug)#reindex-elasticsearch-all&lt;/PRE&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;This command needs to be performed on individual nodes in the cluster one after another and does not require downtime. If the above-mentioned Troubleshooting steps still do not help in recovering the ES or the Data on UI, then reach out to &lt;STRONG&gt;TAC&lt;/STRONG&gt;. There are certain commands that need to be run from the shell of the controller to recover the ES.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Hope the above article helps you in identifying the Elastic Search-related issues and resolving them.&lt;/P&gt;
&lt;P style="margin: 0;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Cheers!!&lt;/P&gt;
&lt;P style="margin: 0;"&gt;Happy Learning!!&lt;/P&gt;</description>
      <pubDate>Wed, 21 Sep 2022 11:28:05 GMT</pubDate>
      <guid>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40057#M20</guid>
      <dc:creator>Nayanendu</dc:creator>
      <dc:date>2022-09-21T11:28:05Z</dc:date>
    </item>
    <item>
      <title>Re: How to monitor ES (Elastic Search) health from controller logs and CLI?</title>
      <link>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40058#M21</link>
      <description>&lt;P style="margin: 0;"&gt;Very useful article!&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Of cause, the best is to maintain good network connections and all nodes in good health. But it is very important to know what to do when something out of our control happens and data are out of synch...&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;I had a few situation when ES re-indexing was really necessary and recovered the system data. TAC helped me with that, so I learned this commands and can now do similar proceeding myself.&amp;nbsp;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;In one case after nasty power outage data were messed so far, that AP were missing from proper zones, instead were placed in wrong zones, and re-indexing fixed everything. Re-indexing spared me effort of restoring backups...&amp;nbsp;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;But you still must have proper backups, as sometimes data can be damaged as well, not only indexes.&amp;nbsp;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jan 2022 09:00:47 GMT</pubDate>
      <guid>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40058#M21</guid>
      <dc:creator>eizens_putnins</dc:creator>
      <dc:date>2022-01-31T09:00:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to monitor ES (Elastic Search) health from controller logs and CLI?</title>
      <link>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40059#M22</link>
      <description>&lt;P&gt;&lt;A href="https://forums.ruckuswireless.com/users/5f9169ec49d3ca752488a750"&gt;@eizens_putnins&lt;/A&gt;​ You are absolutely correct!! Proper backups especially cluster backups should be there for the worst-case scenarios to recover. This article will help in doing initial triage.&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jan 2022 10:34:20 GMT</pubDate>
      <guid>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40059#M22</guid>
      <dc:creator>Nayanendu</dc:creator>
      <dc:date>2022-01-31T10:34:20Z</dc:date>
    </item>
    <item>
      <title>Re: How to monitor ES (Elastic Search) health from controller logs and CLI?</title>
      <link>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40060#M23</link>
      <description>&lt;P style="margin: 0;"&gt;Hi &lt;A href="https://forums.ruckuswireless.com/users/5f916a0d48339326c8ee4bab" style=""&gt;@nayanendu_mallick&lt;/A&gt; ,&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Great Reference!! Thank you for sharing this one.&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Regards,&lt;/P&gt; 
&lt;P style="margin: 0;"&gt;Abilash&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jan 2022 12:32:08 GMT</pubDate>
      <guid>https://community.ruckuswireless.com/t5/RUCKUS-Self-Help/How-to-monitor-ES-Elastic-Search-health-from-controller-logs-and/m-p/40060#M23</guid>
      <dc:creator>abilashpr</dc:creator>
      <dc:date>2022-01-31T12:32:08Z</dc:date>
    </item>
  </channel>
</rss>

