- Roll call: who's there and emergencies
- What has everyone been up to
- What we're up to next
- Other discussions
- Next meeting
- Metrics of the month
Roll call: who's there and emergencies
anarcat, hiro, and qbi present. ln5 and weasel couldn't make it but still sent updates.
What has everyone been up to
anarcat
- blog service damage control (#32090)
- new caching service (#32239)
- try to kick cymru back into life (#29397)
- jabber service shutdown (#31700)
- prometheus/ipsec reliability issues (#31916)
- bumped prometheus retention to 5m/365d, bumped back to 1m/365d after I realized it broke the graphs (#31244)
- LDAP sudo transition (#6367)
- finished director replacement (#31786)
- archived public SVN (#15948)
- shut down internal SVN (#15949)
- fix "ping on new VMs" bug on ganeti hosts (#31781)
- review Fastly contracts and contacts
- became a blog maintainer (#23007)
- clarified hardware donation policy in FAQ (#32044)
- tracking major upgrades progress (fancy graphs!), visible at https://gitlab.torproject.org/anarcat/wikitest/-/wikis/howto/upgrades/ - current est: april 2020
- joined a call with giant rabbit about finances, security and cost; hiro also talked with them about upgrading their CiviCRM, with some downtimes to be announced soon-ish
- massive (~20%) trac ticket cleanup in the "trac" component
- worked on sysadmin onboarding process docs (#29395)
- drafted a template for service documentation in https://gitlab.torproject.org/anarcat/wikitest/-/wikis/service/template/
- daily grind: email aliases, pgp key updates, full disks, security upgrades, reboots, performance problems
hiro
- website maintenance and eoy campaign
- retire getulum
- make a new machine for gettor
- crm stuff with giant rabbit
- some security updates and service documentation. Testing out ansible for scripts. Happy with the current setup used for gettor with everything else in puppet.
- some gettor updates and maintenance
- started creating the dev website
- survey update
- nagios gettor status check
- dip updates and maintenance
weasel
- moving onionoo forward to new VMs (#31659 and linked)
- moved more things off metal we want to get rid of
- includes preparing a new IRC host (#32281); the old one is not yet gone
qbi
- created tor-moderators@
- updated some machines (apt upgrade)
ln5
- followed up with nextcloud launch
What we're up to next
anarcat
New:
- caching server launch and followup, missing stats (#32239)
Continued/stalled:
- followup on SVN shutdown, only corp missing (#17202)
- upstreaming ganeti installer fix and audit of the others (#31781)
- followup with email services improvements (#30608)
- followup on SVN decommissioning (#17202)
- send root@ emails to RT (#31242)
- continue prometheus module merges
hiro
- Lektor package upgrade
- More website maintenance
- nagios bridgedb status check
- investigating occasional websites build failures
- move translations / majus out of moly
- finish prometheus tasks w/ anticensorship-team
- investigate why gitlab gives an error when creating an MR from a forked repository
ln5
- nextcloud migration
qbi
- Upgrade some hosts (<5) to buster
Other discussions
No planned discussion.
Next meeting
qbi can't make it on December 2nd and we missed two people this time, so it makes sense to do it a week earlier...
November 25th, 1500 UTC, which is 1600 CET and 1000 EST
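The conversion above can be double-checked with a minimal sketch like the one below. It assumes fixed offsets (CET as UTC+1, EST as UTC-5), which hold in late November because neither zone is on daylight saving time. The year 2019 is an assumption, since the minutes only give the day and month, and `local_times` is a hypothetical helper name.

```python
from datetime import datetime, timedelta, timezone

# Fixed offsets, valid in late November when neither zone observes DST.
CET = timezone(timedelta(hours=1), "CET")
EST = timezone(timedelta(hours=-5), "EST")

# Assumption: the year is not stated in the minutes; 2019 is a guess.
meeting_utc = datetime(2019, 11, 25, 15, 0, tzinfo=timezone.utc)


def local_times(moment):
    """Return the meeting time as HHMM strings keyed by zone name."""
    zones = (timezone.utc, CET, EST)
    return {tz.tzname(None): moment.astimezone(tz).strftime("%H%M") for tz in zones}


times = local_times(meeting_utc)
print(times)  # {'UTC': '1500', 'CET': '1600', 'EST': '1000'}
```

Fixed offsets keep the sketch free of any dependency on a system time zone database. For dates near a daylight saving change, a proper time zone library would be the safer choice.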
Metrics of the month
Access and transfer rates are an average over the last 30 days.
- hosts in Puppet: 75, LDAP: 79, Prometheus exporters: 120
- number of apache servers monitored: 32, hits per second: 203
- number of self-hosted nameservers: 5, mail servers: 10
- pending upgrades: 5, reboots: 0
- average load: 0.94, memory available: 303.76 GiB/946.18 GiB, running processes: 387
- bytes sent: 200.05 MB/s, received: 132.90 MB/s
Now also available as the main Grafana dashboard. Head to https://grafana.torproject.org/, change the time period to 30 days, and wait a while for results to render.