Links

Wednesday 2025-01-15 Assorted Links
Assorted Links links
Published: 2025-01-15
Wednesday 2025-01-15 Assorted Links

Assorted links for Wednesday, January 15:

  1. Cloud PUE: Comparing AWS, Azure and GCP Global Regions

    New data reveals how efficiently the major cloud providers run and cool their data centers – from AWS’s and Azure’s tropical struggles to Google’s industry-leading performance.

  2. The No-Order File System

    In this paper, we introduce the No-Order File System (NoFS), a simple, lightweight file system that employs a novel technique called backpointer based consistency to provide crash consistency without ordering writes as they go to disk.

  3. Whose Code is it Anyway?

    In order to measure the engineering effectiveness of Yelp, we need to measure the effectiveness of its organizations and the teams that make up those organizations. But how do we know what a team is responsible for? We needed a way to assign an owner to something (let’s call this an entity) that we want to measure. Once an entity has an owner, we can collect metrics on that entity and derive the health score (i.e., effectiveness) for that owner. These metrics can then be aggregated by team, organization, or even the entire Engineering division, so that we can identify areas that we can collectively improve. And this is how the Ownership microservice was born.

  4. How we ported Linux to the M1
  5. Nix + Bazel = fully reproducible, incremental builds
Tuesday 2025-01-14 Assorted Links
Assorted Links links
Published: 2025-01-14
Tuesday 2025-01-14 Assorted Links

Assorted links for Tuesday, January 14:

  1. αcτµαlly pδrταblε εxεcµταblε

    One day, while studying old code, I found out that it’s possible to encode Windows Portable Executable files as a UNIX Sixth Edition shell script, due to the fact that the Thompson Shell didn’t use a shebang line. Once I realized it’s possible to create a synthesis of the binary formats being used by Unix, Windows, and MacOS, I couldn’t resist the temptation of making it a reality, since it means that high-performance native code can be almost as pain-free as web apps.

  2. I spent 4 hours learning how Netflix operates Apache Iceberg at scale
  3. How to secure your GitHub Actions workflows with CodeQL

    To help prevent the introduction of vulnerabilities, identify them in existing workflows, and even fix them using GitHub Copilot Autofix, CodeQL support has been added for GitHub Actions.

  4. How the UK was connected to the Internet for the first time
  5. Highlights from Git 2.48

    The open source Git project just released Git 2.48. Here is GitHub’s look at some of the most interesting features and changes introduced since last time.

Monday 2025-01-13 Assorted Links
Assorted Links links
Published: 2025-01-13
Monday 2025-01-13 Assorted Links

Assorted links for Monday, January 13:

  1. Simple defer, ready to use

    With this post I will concentrate on the here and now: how to use C’s future lifesaving defer feature with existing tools and compilers.

  2. Cleanup Attribute in C

    In this blog post I explore __attribute__((cleanup(...))). I discuss what it does, how it does it, why use it, performance considerations, and finish by saying it’s absolutely fantastic.

  3. Debanking (and Debunking?): An entertaining and fairly deep explanation of what debanking is and why it occurs. And I learned a useful new word: sontaku.

    Japanese has a beautiful word, sontaku, for the attitude and actions a diligent subordinate would take without his superior’s explicit instruction, believing them to anticipate his boss’ desires. Sontaku is a core skill in the American professional class.

  4. Seeing like a Bank
  5. The Bond villain compliance strategy
Friday 2025-01-10 Assorted Links
Assorted Links links
Published: 2025-01-10
Friday 2025-01-10 Assorted Links

Assorted links for Friday, January 10:

  1. Colliding with the SHA prefix of Linux’s initial Git commit

    There was a recent discussion about how Linux’s “Fixes” tag, which traditionally uses the 12 character commit SHA prefix, has an ever increasing chance of collisions. There are already 11-character collisions, and Geert wanted to raise the minimum short id to 16 characters.

    Tools like linux-next’s “Fixes tag checker”, the Linux CNA’s commit parser, and my own CVE lifetime analysis scripts do programmatic analysis of the “Fixes” tag and had no support for collisions (even shorter existing collisions).

    So, in an effort to fix these tools, I broke them with commit 1da177e4c3f4 (“docs: git SHA prefixes are for humans”):

  2. FreeBSD Considers Making Use Of Rust Within Its Base System
  3. The Architect’s Guide to Open Table Formats and Object Storage
  4. Alerts Are Fundamentally Messy
  5. Databases in 2024: A Year in Review
Wednesday 2025-01-08 Assorted Links
Assorted Links links
Published: 2025-01-08
Wednesday 2025-01-08 Assorted Links

Assorted links for Wednesday, January 8:

  1. Amex’s FaaS Uses WebAssembly Instead of Containers

    A key reason behind Amex’s adoption of WebAssembly is that WebAssembly demonstrated superior performance metrics compared to containers.

  2. Enhance build security and reach SLSA Level 3 with GitHub Artifact Attestations

    The Supply-chain Levels for Software Artifacts (SLSA) framework … provides a comprehensive, step-by-step methodology for building integrity and provenance guarantees into your software supply chain.

  3. Introducing Configurable Metaflow

    Standing on the shoulders of our extensive cloud infrastructure, Metaflow facilitates easy access to data, compute, and production-grade workflow orchestration, as well as built-in best practices for common concerns such as collaboration, versioning, dependency management, and observability, which teams use to setup ML/AI experiments and systems that work for them. As a result, Metaflow users at Netflix have been able to run millions of experiments over the past few years without wasting time on low-level concerns.

  4. The Feds Push WebAssembly for Cloud Native Security

    According to a National Institute of Standards and Technology (NIST) paper, “A Data Protection Approach for Cloud-Native Applications,” released earlier this year, WebAssembly could and should be integrated across the cloud native service mesh sphere in particular to enhance security.

  5. Self-Designing Software

    Exploring ways to include a software system as an active member of its own design team, able to reason about its own design and to synthesize better variants of its own building blocks as it encounters different deployment conditions.

Tuesday 2025-01-07 Assorted Links
Assorted Links links
Published: 2025-01-07
Tuesday 2025-01-07 Assorted Links

Assorted links for Tuesday, January 7:

  1. What is Inference Parallelism and how it works

    Inference parallelism aims to distribute the computational workload of AI models, particularly deep learning models, across multiple processing units such as GPUs.

  2. Open Source Innovation Comes to Time-Series Data Compression

    NetApp Instaclustr collaborated with the University of Canberra through the OpenSI initiative to develop the Advanced Time Series Compressor (ATSC) — an open source innovation that fundamentally reimagines high-volume time-series data compression.

    ATSC implements a sophisticated lossy compression approach. Rather than storing complete data sets, it generates mathematical functions that closely approximate the original data patterns, storing only the essential parameters of these functions. This approach is paired with granular configurability — users can precisely tune their desired level of accuracy, balancing storage efficiency with data fidelity based on their specific use cases.

  3. What Do You Lose When You Abandon the Cloud?

    High-profile moves from 37signals (the company behind Basecamp and HEY) and GEICO have sparked a renewed interest in cloud repatriation.

    One sometimes overlooked advantage of moving to the cloud is that it allows you to pay for resources when they are needed, for example, as new customers come online. Spending moves from upfront CAPEX (buying new machines in anticipation of success) to OPEX (paying for additional servers on demand).

    Another thing to weigh up is pace of innovation — both from the cloud provider and from the consumer.

    The Zynga example [of moving from the cloud to on-prem, then back to the cloud] highlights several other trade-offs. One to consider is that if you are running your own data centers, you need to be able to hire the right people and retain them.

    There is another set of trade-offs around security. Keeping servers up to date, and guarding against intrusions, is time-consuming work that big cloud providers are very experienced in.

  4. Why All the Major Cloud Platforms Are the Same

    Each provider brought unique strengths and strategic priorities to the table, creating differentiation initially, but eventually converging on a consistent baseline of functionality.

  5. Indexing code at scale with Glean

    How is Glean different?

    • Glean doesn’t decide for you what data you can store.
    • Glean’s query language is very general.
Monday 2025-01-06 Assorted Links
Assorted Links links
Published: 2025-01-06
Monday 2025-01-06 Assorted Links

Assorted links for Monday, January 6:

  1. Managing large-scale Redis clusters on Kubernetes with an operator – Kuaishou’s approach
  2. Supercharge Your RAG App With Agentic Hybrid Search

    By using structured metadata and letting an LLM choose the best retrieval method for each query, you can turn your RAG app into a better assistant.

  3. Cloud Efficiency at Netflix
  4. Part 1: A Survey of Analytics Engineering Work at Netflix
  5. How we think about Threads’ iOS performance: Key metrics: %FIRE (Frustrating image-render experience), TTNC (Time-to-network content), cPSR (Creation-publish success rate)
Friday 2025-01-03 Assorted Links
Assorted Links links
Published: 2025-01-03
Friday 2025-01-03 Assorted Links

Assorted links for Friday, January 3:

  1. Apache Pinot Brings Real Time Analysis to Columnar Data

    Pinot was born to solve the problem of “running analytical queries for hundreds of millions of users at scale, in a low-cost manner,” explained Chinmay Soman, head of product for StarTree, which offers a fully managed cloud native version of Pinot.

    Pinot brings “simplification in the data stack,” Soman said in an interview with TNS. “The problem is not new. It’s been solved by many legacy technologies. What Pinot brings is the simplification and the scale for these problems.”

  2. GenAI is Quickly Reinventing IT Operations, Leaving Many Behind

    GenAI can significantly transform IT Operations Management by proactively providing context-rich insights, accurate predictions, and actionable recommendations for managing the IT landscape.

  3. How to Create and Use an AI Git Agent
  4. Platform Engineering needs Observability: here’s why

    Observability offers real-time insights into system behavior, allowing teams to proactively identify and address issues before they affect users. By adopting observability, platform engineering teams can improve system resilience, sustain uninterrupted user experiences during peak events, and uphold operational stability.

  5. Announcing systemd v257
Thursday 2025-01-02 Assorted Links
Assorted Links links
Published: 2025-01-02
Thursday 2025-01-02 Assorted Links

Assorted links for Thursday, January 2:

  1. Design Token-Based UI Architecture

    Design tokens are design decisions as data and serve as a single source of truth for design and engineering. Utilizing deployment pipelines, they enable automated code generation across platforms, allowing for faster updates and improved consistency in design.

  2. Retrofitting spatial safety to hundreds of millions of lines of C++

    [W]e’re working towards bringing spatial memory safety into as many of our C++ codebases as possible, including Chrome and the monolithic codebase powering our services.

    Building on the successful deployment of hardened libc++ in Chrome in 2022, we’ve now made it default across our server-side production systems.

  3. Reconsidering Kubernetes deployments: when operators are overkill

    TL:DR: Kubernetes Operators are powerful but can be overkill for simple deployments. Explore alternatives like Helm, ArgoCD, and Devtron to streamline your Kubernetes deployments without sacrificing efficiency.

  4. Securely Deploy and Run Multiple Tenants on Kubernetes

    Three fundamental options:

    1. Namespace-Based Isolation With Network Policies, RBAC and Security Controls
    2. Cluster-Level Isolation
    3. Virtual Clusters
  5. AI-Driven Code Review: Enhancing Developer Productivity and Code Quality

    AI-driven code review tools are changing traditional development and helping teams build better software.