chaosmonkey
SimianArmy
Our great sponsors
chaosmonkey | SimianArmy | |
---|---|---|
22 | 6 | |
14,490 | 7,755 | |
1.2% | - | |
2.0 | 0.0 | |
4 months ago | over 5 years ago | |
Go | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chaosmonkey
-
Zero Downtime Postgres Upgrades
Never saw this communicated by Google, but Netflix is the company I have in mind for doing that: https://github.com/Netflix/chaosmonkey
-
Introduction to Chaos Engineering
In 2010 Netflix developed a tool called "Chaos Monkey", whose goal was to randomly take down compute services (such as virtual machines or containers), part of the Netflix production environment, and test the impact on the overall Netflix service experience. In 2011 Netflix released a toolset called "The Simian Army", which added more capabilities to the Chaos Monkey, from reliability, security, and resiliency (i.e., Chaos Kong which simulates an entire AWS region going down). In 2012, Chaos Monkey became an open-source project (under Apache 2.0 license). In 2016, a company called Gremlin released the first "Failure-as-a-Service" platform. In 2017, the LitmusChaos project was announced, which provides chaos jobs in Kubernetes. In 2019, Alibaba Cloud announced ChaosBlade, an open-source Chaos Engineering tool. In 2020, Chaos Mesh 1.0 was announced as generally available, an open-source cloud-native chaos engineering platform. In 2021, AWS announced the general availability of AWS Fault Injection Simulator, a fully managed service to run controlled experiments.
- Pour one out for the Netflix admins right now
- [URGENT] Netflix Engineer Needs Help Scaling Kubernetes Deployment or I'm Toast!
-
Weird topic: Looking for ideas to make the sys admin job more competitive and inject some adrenaline
Have a look at it https://github.com/Netflix/chaosmonkey
- It Took Just Four Days From Elon Gleefully Admitting He’d Unplugged A Server Rack For Twitter To Have A Major Outage
-
Manager does a little code cleanup...
It’s kinda like Chaos Monkey except Elon is the chaos
- Discussion Thread
-
This perfect plan is based on a rather childish assumption that no one will fight back
they really did lmao
-
What's the dumbest thing you have done since working in IT?
Thanks, that was an interesting read. Chaos Monkey
SimianArmy
-
Introduction to Chaos Engineering
In 2010 Netflix developed a tool called "Chaos Monkey", whose goal was to randomly take down compute services (such as virtual machines or containers), part of the Netflix production environment, and test the impact on the overall Netflix service experience. In 2011 Netflix released a toolset called "The Simian Army", which added more capabilities to the Chaos Monkey, from reliability, security, and resiliency (i.e., Chaos Kong which simulates an entire AWS region going down). In 2012, Chaos Monkey became an open-source project (under Apache 2.0 license). In 2016, a company called Gremlin released the first "Failure-as-a-Service" platform. In 2017, the LitmusChaos project was announced, which provides chaos jobs in Kubernetes. In 2019, Alibaba Cloud announced ChaosBlade, an open-source Chaos Engineering tool. In 2020, Chaos Mesh 1.0 was announced as generally available, an open-source cloud-native chaos engineering platform. In 2021, AWS announced the general availability of AWS Fault Injection Simulator, a fully managed service to run controlled experiments.
-
What is the best OS for homelab (Mini PCs)
Advanced: set up Hadoop cluster and Latency Monkey https://github.com/Netflix/SimianArmy/wiki/The-Chaos-Monkey-Army to learn about distributed systems resilience
- AWS Config and orphaned resources
- Teacher pulls the shower head for the first time since she’s worked at the school
-
On Being Indispensable at Work
Maybe companies need some kind of "Chaos Monkey" system [0] in place for regular employees. Not in "terminating" random employee's contracts, but a culture were regular, maybe even some kind of random transferals onto other projects, or onto internal work regularly happens.
Everybody knows this and everybody should be prepared to a situation that tomorrow they are not working on the same problem they work on today. How would they structure work? How would they share knowledge? What can the organization to to ensure there always are fallbacks to everybody? At least fallbacks that if not perform at 100% but still on 80 - 90%.
Sadly in my org this would not work of the get go, as we often have personal access tokens to our clients' systems. This is sometimes even a contractual obligation as for specific clients we need to be vetted. But even in these cases we could be reassigned/reshuffled towards a slightly different proposition or at least be reassigned to an internal topic for a few days - just like we would not be able to work if we fell ill.
[0]: https://github.com/Netflix/SimianArmy/wiki/Chaos-Monkey
-
DTCC planning liquidity risk testing on 26th April 21 (4 months early)
Maybe. Could also just be a chaos monkey expanding its cage.
What are some alternatives?
litmus - Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd.io/a4Zu_sH4TZGeih-xCimi3Q
chaos-mesh - A Chaos Engineering Platform for Kubernetes.
WLED - Control WS2812B and many more types of digital RGB LEDs with an ESP8266 or ESP32 over WiFi!
litmus - A fast python HTTP server inspired by japronto written in rust.
room-assistant - Presence tracking and more for automation on the room-level
hyperion.ng - The successor to Hyperion aka Hyperion Next Generation
kurt - A Kubernetes plugin that gives context to what is restarting in your Kubernetes cluster
Cloudbox - Ansible-based solution for rapidly deploying a Docker containerized cloud media server.
materials - Bonus materials, exercises, and example projects for our Python tutorials
ChaosEngineeringPublicStories - This Repository holds a list of public Chaos Engineering stories from major institutions around the world
pg_easy_replicate - Easily setup logical replication and switchover to new database with minimal downtime
nitter - Alternative Twitter front-end