Skip to content

Latest commit

 

History

History
357 lines (310 loc) · 20.3 KB

notes-design.md

File metadata and controls

357 lines (310 loc) · 20.3 KB

Ashwin Jayaprakash CC-BY-SA-4.0

Table of Contents

Created by gh-md-toc

About

This is an opinionated collection of reading material that I've collected over the years to help me become a better Engineer. It is not an exhaustive list. I share some more on a somewhat regular basis on my Tech Blog. I also do a lot of general reading on leadership, culture, org building etc but I haven't found the time to share those.

Problem solving

Programming

Computer fundamentals

IDEs

Best practices

Code Katas

Code reviews

System design basics

(Also see 12 Factor, FURPS / Functional Usability Reliability Performance Supportability)

  • Requirements
    • Goals
    • Non goals
    • Scale
    • Constraints
    • Success measures
  • Design - Basics
    • Customer journeys (Visit -> Search -> Add -> Sign in/up -> Checkout -> Order -> Invoice -> Payment -> Fulfillment -> Shipment -> Confirmation/Return/Cancel/Refund/Status/Notify)
    • APIs and Entities - CRUD
    • Services - Foreground and background
    • Stores
    • Reliable (consistent, available), scalable (latency, goodput), observable, maintainable, secure
  • Design - Principles
    • Customer first
      • MVP
      • Necessary (support), sufficient (resistance), nice to have
    • Critical thinking
      • ROI
      • COGS
      • KPIs
      • Data and experiment driven
    • Tradeoffs
      • Simple, easy to reason and scale
      • Mechanical sympathy
      • Reduce toil and undifferentited heavy lifting
  • Scope
    • CDN / Edge network / POP, Region / availability zone / cell
  • System - Loadbalancer
    • Healthcheck
    • Routing
    • DNS updates
    • TLS and Certs
  • System - API Gateway
    • Protocols
    • Firewall
    • Rate limiting
    • Bounds check
    • Authz and authn
    • Routing
    • Logging
  • Concept - Resilience
    • Timeout
    • Retry
    • Jitter
    • Bulkhead
    • Circuit break
    • Backpressure
    • Memoize
    • Fault domain
  • Concept - Reads, writes and operations
    • Space and compute complexity
      • Arrival time, wait time, service time
      • Read and write ratio
    • Consistency, availability, durability
      • Atomic, transactional, lease, lock
      • Eventually or strongly consistent
      • Durable
      • Synchronous, asynchronous
        • Commands (need to be worked upon, ideally by a single worker)
        • Events (already happened)
        • Query (deterministic queries that don't have random numbers etc produce repeatable results)
        • Checkpoint, write after/before logs, replay-idempotence, compensation-rollback
    • Critical path
      • Crosstalk, coordination, contention
      • Partition, replication, affinity, anti-affinity
      • Schema on read or write, filter and aggregate on read or write
      • Deduplication, pre-fetching, pre-aggregating
      • Batching, compression, retention, tiering
      • Online vs offline - Analytics, notifications, repairs, reconciliation
      • Caching, invalidation, summary or probabilistic structures
      • Scoring, sorting, constraint satisfaction
  • Concept - Algorithms
    • Loadbalancing
      • Least used
      • Power of two random choices
      • Round robin
      • ...
    • Hashing
      • Consistent hash
      • Modulo hash
      • Rendezvous hash
      • ...
  • Concept - Security
    • Authenticate and authorize
    • Encryption at rest and motion
    • Sign and verify
    • Governance, risk and compliance
    • Audit trail
    • Principles
      • SSO, RBAC ...
      • Least privilege
      • Short lived tokens
      • Key rotation
      • ...
  • Concept - Operations
    • Measure, monitor, alert - Detect, escalate, respond, prevent
      • Cost
      • Performance
      • Errors
      • Capacity
      • Disruption and error budgets
      • Test - chaos and synthetic
      • SLO
    • Deploy safely
      • Feature flags and canary/blue green rollouts
      • Reduce toil
        • Auto repair
        • Auto scale up and back to zero

Performance and deep dives

Concurrency

Distributed systems

SQL

Cloud

Testing

Documentation and information management

Engineering excellence

Operational excellence

Project maintenance and contributions

Learning and leadership

"Awesome" lists