How to send alert when Azure Service fabric health goes bad
up vote
0
down vote
favorite
Recently the our Azure service fabric health went bad after a deployment. The deployment was successful but service fabric health went bad due to some code issue and it was not rolling back. Only on looking into the service fabric explorer did we know that the cluster went bad
Is there a way to get an email alert when the service fabric health goes bad.
Scenarios where service fabric failed
Whole cluster so what happened was 1 service went bad(showed in red) and was consuming a lot of memory and that in turn caused other services to go bad. after which the whole cluster I had to log into the scaleset to see which services was taking most of the memory.
In another case we added another reliable collection to existing reliable collection to statefull service. This caused failure.
in each of the cases i need to look at the servifabric explorer and then go to each scale set to see the actual error message.
azure-service-fabric
add a comment |
up vote
0
down vote
favorite
Recently the our Azure service fabric health went bad after a deployment. The deployment was successful but service fabric health went bad due to some code issue and it was not rolling back. Only on looking into the service fabric explorer did we know that the cluster went bad
Is there a way to get an email alert when the service fabric health goes bad.
Scenarios where service fabric failed
Whole cluster so what happened was 1 service went bad(showed in red) and was consuming a lot of memory and that in turn caused other services to go bad. after which the whole cluster I had to log into the scaleset to see which services was taking most of the memory.
In another case we added another reliable collection to existing reliable collection to statefull service. This caused failure.
in each of the cases i need to look at the servifabric explorer and then go to each scale set to see the actual error message.
azure-service-fabric
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00
add a comment |
up vote
0
down vote
favorite
up vote
0
down vote
favorite
Recently the our Azure service fabric health went bad after a deployment. The deployment was successful but service fabric health went bad due to some code issue and it was not rolling back. Only on looking into the service fabric explorer did we know that the cluster went bad
Is there a way to get an email alert when the service fabric health goes bad.
Scenarios where service fabric failed
Whole cluster so what happened was 1 service went bad(showed in red) and was consuming a lot of memory and that in turn caused other services to go bad. after which the whole cluster I had to log into the scaleset to see which services was taking most of the memory.
In another case we added another reliable collection to existing reliable collection to statefull service. This caused failure.
in each of the cases i need to look at the servifabric explorer and then go to each scale set to see the actual error message.
azure-service-fabric
Recently the our Azure service fabric health went bad after a deployment. The deployment was successful but service fabric health went bad due to some code issue and it was not rolling back. Only on looking into the service fabric explorer did we know that the cluster went bad
Is there a way to get an email alert when the service fabric health goes bad.
Scenarios where service fabric failed
Whole cluster so what happened was 1 service went bad(showed in red) and was consuming a lot of memory and that in turn caused other services to go bad. after which the whole cluster I had to log into the scaleset to see which services was taking most of the memory.
In another case we added another reliable collection to existing reliable collection to statefull service. This caused failure.
in each of the cases i need to look at the servifabric explorer and then go to each scale set to see the actual error message.
azure-service-fabric
azure-service-fabric
edited Nov 11 at 3:00
asked Nov 10 at 4:42
kumar
1,46612548
1,46612548
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00
add a comment |
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53236045%2fhow-to-send-alert-when-azure-service-fabric-health-goes-bad%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
What exactly went has, a single service or the whole cluster.
– Peter Bons
Nov 10 at 16:37
updated above scenarios but it would be both where a single service goes bad and that in turn the whole cluster over all health.
– kumar
Nov 11 at 3:01
Maybe the watchdog service is something that you are looking for: blogs.msdn.microsoft.com/softwaresimian/2017/09/18/…
– Thieme
Nov 12 at 15:00