#SiteReliability

2024-11-18

Hannaford's recent weeklong outage has me wondering: Do companies truly understand the cost of cutting corners on engineering talent?
These unacceptably long outages which are more frequently occurring at major retailers highlights a common problem I'm seeing in tech: undervaluing highly experienced & knowledgeable engineers. It's way past time for companies to rethink their hiring priorities... stop cheaping out on your Ops and Sec talent, it's going to cost you far more in the end!
I'm exceptionally good at building reliable & resilient systems & teams, so it's super frustrating to be unemployed while witnessing preventable outages for which I could have made a difference. Yes, it's true, 30+ years of engineering experience doesn't come cheap, but I'm damn sure my price is far less than the loss in revenue from a weeklong eComm outage at a major business!
Anyway, if yer looking for a decent engineer/leader, please reach out...
#open_to_work #engineering #siteReliability #Technology

mainepublic.org/business-and-e

2024-09-23

No, I did not want to have a system-wide outage this morning, thankyouverymuch 😰

(but we recovered, although not without some sweating. Aren't new and different failure modes fun?)

(no, I'm not an SRE but we're a small shop)

#onCall #siteReliability #SRE

Dotan Horovits ✈️Devoxx Polandhorovits@fosstodon.org
2024-05-29

"What should I monitor? Am I tracking the right metrics?" 📈📊
Common industry metrics frameworks provide useful monitoring guidance for #DevOps and #SRE.
Here's a good overview for the different methods:
logz.io/blog/evops-sre-metrics
#monitoring #observability #sitereliability

2024-02-13

No one ever complains about #steam going down or being slow, despite tens of millions of concurrent users at all times. I'd like to know more about how Valve manages that. The service itself is practically transparent. #sitereliability #devops #cloud #CloudComputing #videogames

Dotan Horovits ✈️Devoxx Polandhorovits@fosstodon.org
2024-02-07

Life of a SRE. I love this pic by
@attachmentgenie @cfgmgmtcamp .
It only shows how unsustainable this screen gazing approach is, with today's #microservices #cloudnative systems.
Time to revisit your #siteReliability practices
medium.com/@horovits/sre-revis
#CfgMgmtCamp #SRE #DevOps

Mohammed S. Al SahafMohammedSahaf@hachyderm.io
2023-04-12

Here are the steps to enable #http3/#quic in #caddy:
....

It takes 0, zero, nil lines to enable and configure #http3/#quic in #CaddyServer! You don't need to do anything special to keep up with the industry standard and progress. Caddy takes care of keeping your services up-to-date.

#systemadministration #sysadmin #devops #sre #web #linux #unix #windows #sitereliability

On-Call Me Maybe Podcastoncallmemaybe
2023-02-28

Catch OCMM co-hosts @adrianamvillela and @anamedina at SLOconf this year!

2022-12-17

#Introduction 👋 Hello World!

I’m a proud #dogMom that loves to overshare photos of my #rescue #dog (Cassie).

Bringing #diversityEquitiyInclusion to #tech motivates me.

Professionally, I’ve had a long career in #softwareEngineering, but am now on a journey in the world of #siteReliability #engineering.

Sometimes I’ll also post things about #food, #coffee, #whiskey / #whisky, #wine, #travel, #nba #basketball, and #snowboarding.

#introductions #dei #womenwhocode #sre #developer #dogs

dog dressed as a taco
2022-12-16

Questions to Engineering Managers:
• How long were you an individual contributing engineer before making the switch to management?
• What was your motivation to change your path?
• If you switched back to an IC role, what made you leave management?
• What advice would you give to your younger self now to put them at ease about making the switch?

#engineeringmanagement #engineeringleadership #softwareengineering #techlead #developer #management #sre #sitereliability #tech #techcareer #career

Dotan Horovits ✈️Devoxx Polandhorovits@fosstodon.org
2022-12-14

useful #reliability anti-patterns shared by Ayelet Sachto #SRE
at #Google as part of #DevOpsDays Tel Aviv.
#DevOps #sitereliability

Dotan Horovits ✈️Devoxx Polandhorovits@fosstodon.org
2022-12-11

I keep seeing these blind spots when using #metrics, so I decided to put together a simple explainer.

I hope you find it useful.
Let me know if you have more useful tips.

horovits.medium.com/phantom-me

#devops #monitoring #sre #sitereliability #observability #analytics #timeseries

Dotan Horovits ✈️Devoxx Polandhorovits@fosstodon.org
2022-12-06
2022-11-17

Aaand issue fixed before user came.

Love the early alerting system and anomaly detection! Making sure the system is reliable and let us know before report happens.

Hope this can make us having few steps closer to proper SRE practices~

#SRE #SiteReliability #FioTechRant

Orrery :autism: :peacock:orrery@weirder.earth
2018-08-16

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst