Rawfeed

Blog

Stories curated by the team and community.

Explore essays, travel notes, culture briefs, and tech updates. Admins highlight the best stories and publish submissions from signed-in contributors.

Top picks

The Hidden Costs of Over-Reliance on AI in Production Systems

Tech

The Hidden Costs of Over-Reliance on AI in Production Systems

Over-reliance on AI can obscure operational realities and introduce risks that experienced engineers must address.

Tags: AI, Production Systems, Operational Risks, Automation Bias

Read story →
Silent Failures

Tech

Silent Failures

Silent failures in production systems can be more damaging than crashes. This article explores how to detect and manage these subtle yet critical issues.

Tags: production, failure detection, system reliability

Read story →
Platforms Don’t Own Your System. You Do

Tech

Platforms Don’t Own Your System. You Do

Modern platforms make it easier than ever to ship software, but they don’t remove responsibility. Understanding the difference between a platform and a system is now a core production skill.

Tags: systems thinking, platforms, software architecture, reliability engineering, cloud infrastructure

Read story →
How Enterprise Buyers Think About Software Vendors

Tech

How Enterprise Buyers Think About Software Vendors

Enterprise buyers are increasingly prioritizing risk reduction and operational maturity over flashy features. Understanding this shift can enhance your sales strategy.

Tags: enterprise sales, risk reduction, operational maturity

Read story →

Promoted

Silent Failures

Silent failures in production systems can be more damaging than crashes. This article explores how to detect and manage these subtle yet critical issues.

Read story →

Platforms Don’t Own Your System. You Do

Modern platforms make it easier than ever to ship software, but they don’t remove responsibility. Understanding the difference between a platform and a system is now a core production skill.

Read story →

The Cost of Unobservability: Real-World Implications for Production Systems

This article explores the critical importance of observability in production systems, detailing how lack of visibility can lead to significant operational risks and inefficiencies. It outlines specific patterns and anti-patterns observed in industry practices.

Read story →

The Burden of On-Call: Balancing Reliability and Team Well-Being

On-call responsibilities are essential for reliability but can lead to burnout; a balance is critical for sustainable operations.

Read story →

How Enterprise Buyers Think About Software Vendors

Enterprise buyers are increasingly prioritizing risk reduction and operational maturity over flashy features. Understanding this shift can enhance your sales strategy.

Read story →

All stories

4 published
Navigating Operational Maturity in Distributed Systems

Tech

Navigating Operational Maturity in Distributed Systems

Operational maturity is crucial for the resilience of distributed systems, influencing incident response and overall reliability.

Tags: Operational Maturity, Distributed Systems, Incident Response, Reliability Engineering

Read story →
The Impact of Backpressure in Event-Driven Architectures

Tech

The Impact of Backpressure in Event-Driven Architectures

Understanding backpressure mechanisms is crucial for maintaining performance in event-driven systems.

Tags: backpressure, event-driven, distributed systems, reliability engineering, Kafka, RabbitMQ

Read story →

Tech

The Tradeoffs of Retry Logic in Distributed Systems

Retry logic is essential in distributed systems but can introduce complexity and performance tradeoffs that need careful consideration.

Tags: Distributed Systems, Reliability Engineering, Operational Maturity

Read story →

Tech

Understanding Failure Modes in Distributed Systems: A Practical Approach

This article examines the various failure modes that can occur in distributed systems, providing a structured framework for recognizing and mitigating these issues in production environments.

Tags: Distributed Systems, Reliability Engineering, Incident Response, Failure Modes

Read story →

Submit an article

Sign in to submit an article for admin review.