changelog Updates
https://hpc-internal.carnegiescience.edu
Copyright © changelog
Tue, 07 Dec 2021 18:26:14 GMT

Memex Login Hung on 11/17/21
Wed, 17 Nov 2021 21:32:57 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/memex-login-hung-on-11-17-21

Initial incident and probable cause:

A broken pipe on the login node was caused by a file update on that node. The master node and the login node were both rebooted, and the “wwsh file sync” commands were automated in memex_routecheck.sh (run from crontab and cron.hourly). After the reboot, the routing table and the maintenance motd were both updated sooner than before. A ping check was also added to determine whether the network needs restarting (after the routing table is updated).
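
As a rough illustration of the ping check described above, here is a minimal sketch. It is not the actual memex_routecheck.sh: the function names, the `PING_CMD` variable, and the ping options (Linux iputils `-c` and `-W`) are all assumptions made for this example, and the sketch only reports a decision rather than restarting anything.

```shell
#!/bin/sh
# Hypothetical sketch only; this is not the real memex_routecheck.sh.

# Command used to probe a host. Assumes Linux iputils ping
# (-c 1: send one packet, -W 2: wait up to 2 seconds for a reply).
# Can be overridden, for example when testing without a network.
PING_CMD="${PING_CMD:-ping -c 1 -W 2}"

# Succeeds (exit status 0) when the target does NOT answer,
# meaning a network restart may be warranted.
needs_network_restart() {
    target="$1"
    if $PING_CMD "$target" >/dev/null 2>&1; then
        return 1
    fi
    return 0
}

# Prints the decision instead of acting on it, so the sketch is safe to run.
report_network_state() {
    if needs_network_restart "$1"; then
        echo "restart-needed"
    else
        echo "ok"
    fi
}
```

A cron-driven script could call `report_network_state` with a gateway address after the routing table has been refreshed, and restart networking only when it prints `restart-needed`. Which host to probe and how to restart the network are site-specific and are left out here.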

Categories: New, System Failure, Maintenance, Announcement

SLURM's Default Memory Per CPU Increased (1GB --> 2GB)
Thu, 07 May 2020 18:12:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/slurm-s-default-memory-per-cpu-increased-1-gb-2-gb
Categories: Announcement, Improvement

SLURM Priority Adjustment
Thu, 07 May 2020 15:15:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/slurm-priority-adjustment
Categories: Announcement, Maintenance, Improvement

Did You Know ... Slack Edition
Thu, 13 Feb 2020 20:19:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/did-you-know-slack-edition
Categories: Tips, Announcement, Welcome Guide

Did You Know ... Python Edition
Thu, 23 Jan 2020 16:56:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/did-you-know-python-edition
Categories: Announcement, Tips, Welcome Guide

Did You Know ... Storage Edition
Wed, 15 Jan 2020 15:57:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/did-you-know-storage-edition
Categories: Tips, Announcement, Welcome Guide

Did You Know ... SLURM Edition
Mon, 13 Jan 2020 21:58:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/did-you-know-slurm-edition
Categories: Tips, Announcement, Welcome Guide

User reported that rsync/cp/scp too slow on /memexnfs/apps,
Thu, 19 Dec 2019 17:35:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/user-reported-that-rsync-cp-scp-too-slow-on-memexnfs-apps
Categories: System Failure, Announcement, Fix

Login Node Slowness (module command hanging on memex.carnegiescience.edu)
Mon, 02 Dec 2019 19:57:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/login-slowness-module-command-hanging
Categories: Announcement, System Failure, Fix

System Update & Failed disk in SureStoreHD, memexnfs ZFS pool degraded
Mon, 14 Oct 2019 15:40:00 GMT, Floyd Fayton
https://noticeable.news/3f43ej0latlbxv21efel/publications/failed-disk-in-sure-store-hd-memexnfs-zfs-pool-degraded
Categories: Announcement, System Failure