SRE / DevOps / Kubernetes Weekly Reportまとめ#62(2021/4/4~4/9) - 運び屋 (A carrier(forwarder) changed his career to an engineer)

この記事は2021/4/4~2021/4/9発行の下記3つのWeekly Reportを読み、備忘録兼リンク集として残しているものです。
なるべく情報を早く届けたい/共有したいので、ブログのリンクを確認次第、先行公開しています。自身のコメントは随時追加しています。
The English Version of this blog is here.
DEVOPS WEEKLY ISSUE #536 April 4th, 2021
- News
- Tools
  - The first sigstore tool I’ve come across. Cosign allows for signing a container image and storing the signature in the registry, and finding and verifying signatures for a container image.
SRE Weekly Issue #264 April 4th, 2021
- Articles
- Outages
KubeWeekly #258 April 9th, 2021

この記事は2021/4/4~2021/4/9発行の下記3つのWeekly Reportを読み、備忘録兼リンク集として残しているものです。

なるべく情報を早く届けたい/共有したいので、ブログのリンクを確認次第、先行公開しています。自身のコメントは随時追加しています。

誰かの情報源や検索工数削減などになれば幸いです。

DEVOPS WEEKLY ISSUE #536 April 4th, 2021

SRE Weekly Issue #264 April 4th, 2021

KubeWeekly #258 April 9th, 2021

The English Version of this blog is here.

この記事を読んで疑問点や不明点があれば、URLから本文をご確認の上、ご指摘頂ければ幸いです。
理解が浅いジャンルも、とにかくコメントする様にしていますので、私の勘違いや説明不足による誤解も多々あろうかと思います。
情報量が多いので文字とリンクだけに絞っております。
各レポートで取り上げられている記事には2020年以前のものもあり、必ずしも最新のものという訳ではない様です。

DEVOPS WEEKLY ISSUE #536 April 4th, 2021

News

A pitch for a Distributed Operating System Interface (DOSi) and that operating systems should be reimagined to support higher level workloads that are managed in a distributed environment.

タイトルは「The Distributed Operating System Void」。
既存のインターフェース(CNI、CRI、CSI、OCI)を補完する、kuberneteslandとuserlandの間の明確なインターフェースDOSi)(Distributed Operating System Interface) を定義、解説している。

How do you fulfill the promise of continuous deployment? A presentation on the importance of high performance teams and how to build and measure progress.

タイトルは「It is time to fulfill the promise of CI/CD」。
メッセージやポイントが明確で良い。
こういう視覚的に分かりやすく、小難しく感じない資料を作るスキルが欲しい。

A nice explanation of how to trigger a GitHub Action from a webhook, using the repository dispatch configuration and API.

タイトルは「GitHub Actions Trigger Via Webhooks」。
GitHub Actionワークフローを手動でトリガーするWebhookを構築する方法を解説している。

A post on what you should be logging, at least from a security point of view. A good overview of the Adversarial Tactics, Techniques, and Common Knowledge (ATT&CK) taxonomy from MITRE.

タイトルは「What exactly should we be logging?」。
筆者がセキュリティーアーキテクト及びテクニカルリーダーとしてログに関するQ＆Aを行う中の知恵を引き出してまとめている記事。システムについて適切な質問をするためのツールと知識が得られる。

A look at how one security team started building security into the development process by integrating various code scanning tools.

タイトルは「Software Security at Rocketship Pace」。
筆者達がコードスキャンプラットフォーム「Intersect」を設計するときに採用したアプローチと、その過程で学んだ教訓について概説している。
SAST(静的アプリケーションセキュリティーテスト)およびSCA(ソフトウェア構成分析)の分野で、筆者達のすべてのニーズに対応する単独のツールが市場に存在しなかった。求めているカバレッジを達成するため、複数のツールを使用し、すべてのツールが連携するようにオーケストレーションレイヤーを構築している。

How to use a multilayer cache to improve cache hit rate on long tail content.

タイトルは「CDN for long-tail content? Fight the cache miss with multilayer caching!」。
以下の3つのポイントでタイトルの内容を簡潔に解説している。
- Ideal content structure
- Long-tail content
- Multilayer cache

What’s the argument for adopting a service mesh? This post explores the question, and some of the advantages and challenges.

タイトルは「When Is Service Mesh Worth It?」。
筆者がTetrateの創設エンジニアでありGoogleの元々のIstioビルダーの1人であるZach Butcher氏から学んだ内容を以下の3つのポイントで解説している。
- Service Mesh For The Rest of Us
- Usability Improvements to Ease Adoption
- When Service Mesh is Worth It

What's New in Salt 3003 Aluminium: Beacons, Cloud, Development, Salt Extensions, Performance and caching, Juniper minion, FIPS mode and more.

タイトルは「What's New in Salt 3003 Aluminium Release」。
タイトル通り、Salt Aluminiumリリースをまとめ、解説している。

Tools

The first sigstore tool I’ve come across. Cosign allows for signing a container image and storing the signature in the registry, and finding and verifying signatures for a container image.

OCIレジストリでのコンテナの署名、検証、および保存を行うツール「cosign」のGitHubページ。KubeWeekly#255で取り上げたので割愛。

SRE Weekly Issue #264 April 4th, 2021

Articles

Balancing act: the current limits of AWS network load balancers

This well-researched article caught me by surprise. It’s shocking that Ably received advice from AWS to stay under 400,000 simultaneous connections, despite Amazon’s own documentation stating support for “millions of connections per second”.

Paddy Byers — Ably

タイトルに関する内容を以下4つの項目で解説している。
- The ask: practically infinite scalability
- The application: millions of realtime subscriptions
- Limit 1: maximum target group size
- Limit 2: Connection stability

A Journey Into SRE

This blog is about how a group of hard-working individuals, with unique skills and working methods, managed to create a successful SRE team.

There’s a lot of detail about what their SREs do and how they communicate, with 3 projects as case studies.

Sergio Galvan — Algolia

Algolia社が独自のスキルと作業方法を備えた勤勉な個人のグループが、どのようにして良いSREチームを作り上げたかを以下の項目で解説している。
- What SREs do at Algolia
- How we work as a team
- Pairing creates a team
- Three Projects
- The journey continues ..

March 2 incident update

This is an incident followup from an incident at Deno earlier this year. Their CDN saw their heavy use of .ts files (TypeScript, a JavaScript variant) and mistakenly assumed they were MPEG transport segments, a violation of the CDN’s ToS. Oops.

Luca Casonato — Deno

上記の通り、Deno社の障害のフォローアップ記事。CDNとして利用しているCloudflare社と再発防止を確認している。

Kubernetes Supports Nine Pillars of SRE

Wait, there are 9 now?

Marc Hornbeek — Container Journal

タイトル通り、以下9つの柱を解説している。
1. Leadership and Culture
2. Work Sharing
3. Measurement
4. SLOs and SLIs, Error Budgets
5. Toil Reduction
6. Deployments
7. Performance Management
8. Incident Management
9. Anti-Fragility

Frequently Asked Questions on Deviations

There’s a nice little discussion of why “human error” is not a good enough answer for why a deviation (from standard operating procedure) happened.

Susan J. Schniepp and Steven J. Lynn — Pharmaceutical Technolog

上記の通り、FAQ形式で解説されている。Qは以下の通り。
- What is a deviation and do all deviations need to be investigated?
- What is a planned deviation?
- What’s the best process for investigating deviations?
- Why is human error not an acceptable finding for deviations?
- How much time should I allow for a deviation to be investigated?
- Are out-of-specification (OOS) results considered deviations?

How To Get Fooled By Metrics

They deployed an optimization that skipped sending some requests to the backend… and the backend metrics got worse. Why? Hint: aggregate metrics.

Dominik Sandjaja — Trivago

上記の通り、メトリクスが想定外の動きをしたので調査を行い、原因を突き止めて結果的にシステムの改善を確認した話。

Outages

Twitter
National Weather Service (US)
reddit
Squarespace.com
Squarespace.com itself, but not user sites.
Microsoft 365

上記各社の障害情報

KubeWeekly #258 April 9th, 2021

The Headlines

Editor’s pick of the highlights from the past week.

CNCF joins Google Summer of Code 2021 – Calling all student applications by April 13!

We are excited to announce that Cloud Native Computing Foundation is participating in GSoC 2021, one of the most popular programs for new contributors in the world of open source development.

For those who are not familiar, GSoC is a global program focused on introducing student developers to the world of open source software development. Through the program, students work with participating open source organizations like CNCF on a 10-week programming project during their break from school. Read the blog post to learn more.

Google Summer of Code 2021(GSoC 2021)にCNCFが参加する。登録はこちらから。April 13, 2021 14:00まで。

ICYMI: CNCF online programs this week

A weekly summary of CNCF online programs from this week.

What is continuous improvement?

Pini Reznik, Container Solutions

スピーカーが以下2つの質問に回答し、聞き手が「なぜクラウドネイティブへのトランスフォーメーションが非常に多く失敗するのか」を理解し、効果的なテクノロジーの採用と組織の変革を成功させる勝ち筋の戦略を選択できることを目指している。
1. Why did you need to change in the first place?” and
2. What is wrong with your traditional approach to building software?
「Cloud Native is more than Tech」はCNCFの定義からも自明だと思っていたけれども、要素の具体化は大事。

A Deep Dive into Kubestr - A new way to explore your Kubernetes options

Michael Cade & Sirish Bathina, Kasten by Veeam

先週のこの「 ICYMI: CNCF online programs this week」で取り上げているため、割愛。

Every time it comes up, the @CloudNativeFdn special interest group for contributor strategy knocks my socks off. The level of effort this group is putting into making the CNCF a place that everyone can engage is so impressive. @fuzzychef @ParisInBmore

Respect.
— Cornelia Davis (@cdavisafc) 2021年4月6日

The Technical

Tutorials, tools, and more that take you on a deep dive into the code.

Windows containers on Windows 10 without Docker (using Containerd)

James Sturtevant

Kubernetesを使用してWindowsでコンテナを適切に機能させることに取り組んできた筆者。containerdでローカル開発を行う必要があったため、ローカルマシンの構成を行ったが、包括的な資料が見当たらなかったので、自身のステップを書き留めて共有している。

Oxidizing the Kubernetes operator

Pavel Pscheidl

冒頭でKubernetes OperatorとRustを解説し、その組み合わせでの環境構築手順を以下の項目で解説している。
- Implementing an operator
- Project setup
- Creating a CustomResourceDefinition
- How Kubernetes and Operator work together
- Creating a custom Controller
- Implementing the operator logic
- Finalizers
- Creating and deleting deployments
- Running the operator
- Additional resources

Site Reliability Engineering (SRE) best practices

Rayan Das, Infracloud

タイトル通り以下7つのSREのベストプラクティスを解説している。
1. Error Budgets
2. Define SLOs Like a User
3. Monitoring Errors and Availability
4. Efficiently Planning Capacity
5. Paying Attention to Change Management
6. Blameless Postmortem
7. Toil Management

Essential tips to manage your gRPC services with Kong like a pro

Guilherme Salazar, Kong

gRPCサービスをプロキシーするようにKongをセットアップする方法のステップバイステップのチュートリアル。2つの可能なシナリオを解説している。

Implementing zero downtime deployments on Kubernetes — the plan

Matthew Flatt

タイトル通り、ダウンタイム無しでKubernetes上で複数のデプロイを実施するプランを検討している。「Kubernetes rolling updates」「Blue/green deployments」「Rainbow deployments」の対比がわかりやすかった。

Bringing your VMs to Kubernetes with KubeVirt

Irina Lindt, Kubermatic

VMのワークロードをKubernetesで管理できるオープンソースプロジェクト「KubeVirt.io」を紹介し、利用方法を解説している記事。
次の記事では、Kubermatic KubernetesPlatformでKubeVirtを使用方法を解説する模様。

A new era of Kubernetes integrations on GitLab.com

Viktor Nagy, GitLab

GitLab Kubernetes Agentの紹介。GitLabインスタンスとKubernetesクラスター間の安全な接続を提供し、プルベースのデプロイがネットワークポリシーに基づいてアラートを受信できる。

The distributed operating system void

Kris Nova, Twilio

上記のDEVOPS WEEKLY ISSUE #536で取り上げているので割愛。

Generating Kubernetes network policies by sniffing network traffic

Murat Celep, VMware

Kubernetesクラスターで実行されているアプリからキャプチャされた実際のネットワークトラフィックに基づいて、Kubernetesネットワークポリシーの作成を自動化する実験を解説している。
この記事内で参照されているコードのリンクはこちら。

I helped bring this to life, and we've been using it extensively at MLB. Check it out! https://t.co/pTzQvrpkOe
— Michael Goodness (@opsgoodness) 2021年4月8日

The Editorial

Articles, announcements, and morethatgive you a high-level overview of challenges and features.

Kubernetes 1.21: Power to the community

Kubernetes Release Team

Kubernetes 1.21 Release Teamによるリリースノート。キャッチアップせねば。
リリースロゴは以下。

Weaveworks (part 2), with Alexis Richardson

Craig Box, Kubernetes Podcast from Google

Google社社員によるKubernetes Podcast。今回のHostはCraig Box氏とGuest HostはJustin Garrison(senior systems engineer in the media)氏。以前の登場回は以下。
- Episode 20, with Justin Garrison
先週のPart 1に引き続きWeaveworks社のco-founder及びCEOであるAlexis Richardson氏をゲストとして迎えている。
News of the weekで気になったトピックは以下の通り。

Why I run Django on Kubernetes as a one-man SaaS

Anthony Simon

筆者がタイトルの内容を以下の項目で解説している。
- An elephant in the room
- There’s no holy grail
- Why I use Kubernetes
- Why I use Django
- Standing on the shoulders of giants
- What to make of this

DevOps and Kubernetes: a perfect match?

Gilad David Mayaan, Container Journal

以下の文は疑問。パブリッククラウドへの移行を手助けするものとして完全に適している？
- Kubernetes is perfectly suited to help transition infrastructure to public clouds like Azure or AWS.
以下は同意。
- In short, DevOps and Kubernetes are not a perfect match, but Kubernetes can certainly be a powerful tool when properly configured. Just make sure you are not getting in too deep, and understand that K8s is not an all-encompassing solution.

PODCAST: How to manage a successful CNCF project with William Morgan of Linkerd

Justin Dorfman & Richard Littauer

Buoyant社のCEOであるWilliam Morgan氏をゲストに迎えて、彼のTwitter社からLinkerdに至る経歴やLinkerdに焦点をあてて話をしている。

Kubernetes jobs market (Q1 2021)

Derek Newman

Kubernetesの仕事を探す際の期待値を数値で解説している。
筆者からの数字を見る際の注意点は以下。 The job descriptions that we collected are slightly skewed:
1. At Kube careers we only focus on Kubernetes jobs.
2. If a job doesn't have a clear salary range we discard it. Many job offers don't indicate a salary range and we think this is not good for engineers looking for work.
3. We discarded job offers from recruitment agencies.
4. We analysed listings on platforms used by European and American audiences.
5. The dataset is small — only 86 job descriptions from January, February and March 2021.

Upcoming CNCF Online Programs

Cloud Native Live

4/14/21: Enforce configuration and security checks for your YAML Files and Helm Charts with KubeLinter, by Viswajith Venugopal, StackRox - RSVP

On-demand

4/15/21: What's new in Argo Workflows 3.0, by Alex Collins, Intuit - RSVP

YouTube playlist submissions

Looking for more great curated content? Visit our Online Programs playlist on YouTube.

Learn more about CNCF Online Programs

いかがでしたか？気になる記事や情報はありましたか？

私もまだ内容を咀嚼出来ていないものが多々ありますので、この備忘録兼リンク集を活用しながら理解を深めていきたいと思います。

では、また。

Bye now!!

Yoshiki Fujiwara