2019-07-04

Varnish probe

Your backend is responding with a 301 to the health check; only 200 is an expected "healthy" response by default.
.probe = {
    .url = "/";
    .timeout  = 1s;
    .interval = 10s;
    .window    = 10;
    .threshold = 8;
    .expected_response = 301;
}
https://serverfault.com/questions/451840/varnish-going-sick

ELBなどでhealth checkをする際に使える便利なempty_gif（Nginx） - Qiita

ヘルスチェック用のURLをnginxで用意する場合

    location = /healthcheck.html {
         empty_gif;
         access_log off;
         break;
    }

バックエンド確認 (varnish4)

varnishadm backend.list
(詳細)
varnishadm backend.list -p

2019-06-26

ceph journal SSD

Journal 用 SSD を用いた Ceph 構成の構築 - ジェダイさんのブログ

Ceph: maintenance mode, use case and common operations | Sébastien Han

2019-06-12

postgres パラレルクエリパラレル実行

https://pgecons-sec-tech.github.io/tech-report/presentation/PGECons_20181018_parallel.pdf

https://www.postgresql.jp/document/11/html/runtime-config-resource.html#GUC-MAX-PARALLEL-WORKERS

https://www.depesz.com/2018/02/12/waiting-for-postgresql-11-support-parallel-btree-index-builds/

https://www.cybertec-postgresql.com/en/postgresql-parallel-create-index-for-better-performance/

2019-05-28

dell raid card

dell_raid_hdd_ssd - Google スプレッドシート

PowerEdge RAIDコントローラーに関する一般情報 | Dell 日本

SAS 6iR 2TB

unexpected sense

https://www.dell.com/community/PowerEdge-HDD-SCSI-RAID/Dell-R720-with-PERC-H710P-Mini-Repeating-Unexpected-Sense-for/td-p/5058290

https://twitter.com/kashima8086/status/1077928309315366912

Non-RAID

No, the H710 does not support non-RAID. The H310, H200, and SAS 5/6 are the only controllers that support non-RAID.
Non-RAID Supported? - Dell Community

2019-05-26

smart smartmontools

SSD smart

https://unix.stackexchange.com/questions/106678/how-to-check-the-life-left-in-ssd-or-the-mediums-wear-level

plextor

PX-G128M6e
241 (0xF1) Total_LBAs_Written -> 32MiB書込みごとに +1インクリメント
http://lab.mitty.jp/trac/lab/wiki/TipAndDoc/Hardware/Storage

samsung

177 Wear_Leveling_Count

Wear Leveling Count はの減り方は書き込み量と反比例で安定しており急激に死ぬことはない
Sumsung SSD 840 120GBを使い潰す(後編) - 人生シングルスレッド

crucial

202 Percentage Lifetime Used (202 Percent_Lifetime_Remain)

https://www.crucial.com/support/articles-faq-ssd/ssds-and-smart-data

INTEL

232 Available_Reservd_Space

232 Available_Reservd_Space 0x0033   100   100   010    Pre-fail  Always       -       0
233 Media_Wearout_Indicator 0x0032   099   099   000    Old_age   Always       -       0
E8 Available Reserved Space Normalized 予約領域の残っている数を表す。正規化した (Normalized) 値は、100 パーセントを表す 100 から始まる。正常の範囲は 10 パーセントまで。
E9 Media Wearout Indicator Normalized 記録メディアであるフラッシュメモリーの使い込んだ程度を表す。平均消去回数が増えるにつれ、正規化した (Normalized) 値が 100 から 1 へ減少していく。
http://www.nkjmkzk.net/?p=1380

In your first example, what I think you are referring to is the "Media Wearout Indicator" on Intel drives, which is attribute 233. Yes, it has a range of 0-100, with 100 being a brand new, unused drive, and 0 being completely worn out. According to your ouptut, this field doesn't seem to exist.
In your second example, please read the official docs about SSD_Life_Left. Per that page:
The RAW value of this attribute is always 0 and has no meaning. Check the normalized VALUE instead. It starts at 100 and indicates the approximate percentage of SDD life left. It typically decreases when Flash blocks are marked as bad, see the RAW value of Retired_Block_Count
hard disk - How to check the life left in SSD or the medium's wear level? - Unix & Linux Stack Exchange

WD

232 Available_Reservd_Space

https://github.com/FrauBSD/ssd_report

WD未対応

perc smart

PERC H710p SMART data - Server Fault

FreeBSD-10.2 on Dell T430 w PERC H730 card | The FreeBSD Forums

mrsasでsmartctlで値が取れない

smartctl not working with mrsas(4) driver on Dell PERC H730

デル製サーバーでsmartmontoolsを利用してSMART情報を表示 – スペース・アイ株式会社

drivedb.h

update-smart-drivedb コマンドで更新

/var/lib/smartmontools/drivedb/drivedb.h

smart情報不良セクタ

SMARTの属性には様々なものがありますが、ここでは不良セクタに関するもののみ取り上げます。
05（5）代替処理済のセクタ数
C4（196）セクタ代替処理発生回数
C5（197）代替処理保留中のセクタ数
C6（198）回復不可能セクタ数
※かっこ内の数値は、16進値を10進表記したもの。
SMART情報と不良セクタ - PCと解

交換に至ったディスクでは、S.M.A.R.T.値の「5 Reallocated_Sector_Ct（代替処理済みの不良セクタ）」の数値に変動がみられました。
【記事】高耐久HDDの運用・S.M.A.R.T.値の監視レポート | 研究開発者向け情報発信メディア TEGAKARI

2019-05-23

sftp umask

ForceCommand internal-sftp -u 002

コイケアキヨシ blog: sftpの権限だけのユーザを作成する（ディレクトリの制限も）

umask はパーミッションを落とす方向に作用しますので、元のファイルが 666 ならアップロードしたファイルは 664 になりますが、元が 644 なら 644 のままです。 なので、SFTPクライアント側であらかじめ g+w としておくか、SFTPクライアントの設定でアップロード後に chmod する必要があります。 

nginx - SFTPで複数のアカウントによる所有権とアップロードの問題｜teratail

2019-05-13

megacli centos7

インストール

Broadcom サポートサイトからMegaCli ユーティリティをダウンロードします。
http://docs.avagotech.com/docs/12351587

https://www.dell.com/support/article/jp/ja/jpbsd1/SLN292235

$ unzip 8-07-14_MegaCLI.zip 
$ sudo yum install Linux/MegaCli-8.07.14-1.noarch.rpm

使用方法

RAID状態を確認

/opt/MegaRAID/MegaCli/MegaCli64 -LDInfo -Lall -aALL
/opt/MegaRAID/MegaCli/MegaCli64 -PDList -aALL

nrpe

epel (centos6)

sudo yum install nrpe

sudo systemctl start nrpe
sudo systemctl enable nrpe
(centos6)
sudo service nrpe start
sudo chkconfig nrpe on

curl -OL https://github.com/glensc/nagios-plugin-check_raid/releases/download/4.0.8/check_raid.pl
chmod +x check_raid.pl
sudo chown root:root check_raid.pl   
sudo mv check_raid.pl  /usr/lib64/nagios/plugins/
sudo /usr/lib64/nagios/plugins/check_raid.pl -S

sudo vi /etc/sudoers.d/check_raid
(nrpeを追加)
User_Alias CHECK_RAID=nagios, icinga, nrpe
(動作確認)
sudo -u nagios /usr/lib64/nagios/plugins/check_raid.pl --plugin megacli
sudo vi /etc/nagios/nrpe.cfg 
(check_raidを追加)
command[check_raid]=/usr/lib64/nagios/plugins/check_raid.pl --plugin megacli

sudo systemctl restart nrpe
(centos6)
sudo service nrpe restart