92ab982 | Bryan Davis | 24 May 2017, 18:02:38 UTC | WMCS: add access for bryan davis - wmcs-roots to labstore* - wmcs-admin (new) to labsdb* replicas Bug: T166310 Change-Id: I59c44ea0abdd9ad02ad65cb9080410c6f759ff3b | 07 June 2017, 18:05:50 UTC |
4f08cce | andrewbogott | 07 June 2017, 00:36:02 UTC | Glance: Update our keystone_authtoken section There was a lot of cruft here from previous implementations; also we were using the authtoken when we should use login/password instead. Some day we may want to have a service project and a glance_admin user but for now this patch continues the tradition of just using novaadmin every damn place. Bug: T165211 Change-Id: I652d1e65d1196a10301f0b4384936b20faf33df2 | 07 June 2017, 00:48:21 UTC |
1e6524f | andrewbogott | 07 June 2017, 00:44:08 UTC | Glance: Remove a glance config file I'm tired of maintaining this and it's a duplicate of the liberty version anyway. Change-Id: I5f117a41996cb4cdc3657a60543c4b8cf8be0948 | 07 June 2017, 00:45:06 UTC |
4f9145b | elukey | 07 June 2017, 17:38:03 UTC | Fix cassandra's jmx_prometheus_javaagent config path for deployment-prep Bug: T167222 Change-Id: I280dc541aa8bce3810ad7098101435c92286aa21 | 07 June 2017, 17:39:54 UTC |
019ef36 | Alexandros Kosiaris | 07 June 2017, 14:28:50 UTC | puppet-compiler: Do the rsync using sudo We don't need to create the temporary catalogs directory using sudo, but we do want to use it for the rsync process otherwise it fails Change-Id: I929808106a048c5cae3d363c745a310a010ae15d | 07 June 2017, 17:19:58 UTC |
b58f12d | elukey | 07 June 2017, 16:44:17 UTC | Fix cassandra's jmx_prometheus_javaagent jar path for deployment-prep Bug: T167222 Change-Id: I31a7d04c74c5599e3f25866a6e0073067d567f66 | 07 June 2017, 16:44:17 UTC |
e2865b5 | Faidon Liambotis | 07 June 2017, 15:01:24 UTC | labs_bootstrapvz: don't install HW RAID tools These aren't needed in VMs, as they don't have any hardware RAID. These packages used to be installed by Puppet, but they have not been for over a year now. Change-Id: Icfb4d134c38b94af9ce65aade7814a284e567247 | 07 June 2017, 15:09:07 UTC |
650a064 | Faidon Liambotis | 07 June 2017, 14:13:29 UTC | labs bootstrapvz/vmbuilder: avoid hardcoding eth0 grep domain-name against /var/lib/dhcp/dhclient.*.leases instead of dhclient.eth0.leases, in order to avoid hardcoding eth0. Since they don't have another network interface (and even less so one with dhclient running), this should be a no-op. Change-Id: If7219cdb6c9cc24fb856a2f37ad4dbd2c61e318f | 07 June 2017, 14:20:57 UTC |
39de552 | Faidon Liambotis | 07 June 2017, 13:47:48 UTC | autoinstall: avoid hardcoding eth0 Extract the primary interface by looking at the default gateway instead, and use that in the single place we were hardcoding eth0. We may soon not use ethN interfaces, and this was already broken in a handful of cases over the years where the primary one was e.g eth2. The only downside of this is that errors like plugging in the wrong NIC of a server won't become installation failures and thus may never be caught. Bug: T164444 Change-Id: If377da20cbe7a0ea35c2e2a16659847154a6a2ae | 07 June 2017, 14:07:34 UTC |
5fce2d6 | Filippo Giunchedi | 07 June 2017, 13:57:32 UTC | grafana: unhardcode eth0 from server-board Consider all network interfaces found by default. Change-Id: I87faf29937aa5f28053270d7695de8ed76372eb3 | 07 June 2017, 14:01:36 UTC |
f766d82 | andrewbogott | 06 June 2017, 20:28:16 UTC | designate.conf: Update the keystone_authtoken section This wasn't used previously, but I want to use it for a wmf_sink thing. Might as well get accurate values in here. Change-Id: Ie3c849b085ae79ce3ba4ba9917fe04912326891c | 07 June 2017, 13:48:26 UTC |
6d2205f | Alexandros Kosiaris | 07 June 2017, 13:34:39 UTC | Fix 2 issues in compiler-update-facts Fix the for loop to refer to the $factsdir correctly Don't use set -x Change-Id: Id435546dddba98cfc46474a5c9db546c4d24d011 | 07 June 2017, 13:35:32 UTC |
84f0177 | Emanuele Rocca | 07 June 2017, 07:12:34 UTC | VCL: basic support to return HTTP 429, apply it to UA:wikiScrape/0.0.0 Add basic VCL support to return HTTP 429 in cache_text VCL. Return 429 for "User-Agent: wikiScrape/0.0.0" Change-Id: I9edc318528d5557d58533546b2dcd4b77e1d96c0 | 07 June 2017, 13:21:44 UTC |
df64720 | Alexandros Kosiaris | 02 June 2017, 11:13:39 UTC | Refactor facts exporting to better cleanup facts Change the way the facts are exported on every puppetmaster by ignoring any yaml fact file older than 7 days. While at it, stop pushing the node objects as they are not really useful, it's just the facts we care for, not the node objects. Gather all the facts from all the puppetmasters in a single place and then copy them to the destination. Drop the call to sudo in rsync and chown and force the entire script to run under root, since this script requires root privileges to run anyway for multiple reasons. Fix some indentation issues as well. Change-Id: Ie95b96345294330fa7b8fb4926e26e7088c38877 | 07 June 2017, 13:15:38 UTC |
c471d88 | Emanuele Rocca | 07 June 2017, 12:43:51 UTC | base::kernel: create /etc/modules-load.d on Trusty systems Trusty systems do not create /etc/modules-load.d/ but the kmod upstart job parses it. Ensure the directory exists on Trusty. Change-Id: I8dd4478de35c0eb222ea2459236fddfba51be5d2 | 07 June 2017, 12:59:49 UTC |
efca51c | Jaime Crespo | 07 June 2017, 12:52:09 UTC | mariadb: Remove old codfw db hosts from candidates for reimage db2041,38,37,35 have already been reimaged; prevent them from being reimaged again. Change-Id: Ibb708ef328ab1a97175552f4f0c7fedc8b5d17f1 | 07 June 2017, 12:52:09 UTC |
8b6fa85 | elukey | 07 June 2017, 12:24:30 UTC | Test new zookeeper version on conf2002 The zookeeper client role needs to pick up a zookeeper_version value to avoid conflicts/overrides with the Cloudera repos. This commit is only temporary to allow proper testing on conf2002. Change-Id: Ibb0f246c08d3f95e5ad2540d8fdd37e6a2178e72 | 07 June 2017, 12:24:30 UTC |
d0adc4c | Arzhel Younsi | 07 June 2017, 11:16:17 UTC | LibreNMS: enable 2FA Bug: T164911 Change-Id: I781ffcb429119bb51da270ac43d402899f337a7a | 07 June 2017, 11:16:17 UTC |
526714c | Filippo Giunchedi | 07 June 2017, 09:59:49 UTC | install_server: s/ubuntu/debian/ for stretch ms-be2* machines Change-Id: I79fdbd449912c8b1abde78f3ca5b2c09eeb1562c | 07 June 2017, 10:01:19 UTC |
141e4e1 | Filippo Giunchedi | 07 June 2017, 09:07:21 UTC | install_server: move ms-be2* trusty hosts to stretch Bug: T162609 Change-Id: I186aa29e5f000a6e223d8da5e82aca5a465efe20 | 07 June 2017, 09:16:39 UTC |
e36dda0 | Guillaume Lederrey | 07 June 2017, 08:51:22 UTC | Upgrade kibana to v5.3.3 Bug: T167266 Change-Id: Id104c54cc3df5a1babcc15a764589f635544f740 | 07 June 2017, 09:12:42 UTC |
1920cb0 | Filippo Giunchedi | 07 June 2017, 08:56:07 UTC | swift: mask object reconstructor on >= jessie Bug: T162609 Change-Id: I596f3ce0047539d56e97cf0a8a0e4a64b4a78a85 | 07 June 2017, 08:56:07 UTC |
8841966 | elukey | 06 June 2017, 16:42:44 UTC | Delete unused role/common/analytics/hadoop configs Change-Id: I7a6f44e9bb9936fc563dd5439a93cd09f102be18 | 07 June 2017, 06:58:09 UTC |
489fe75 | Bryan Davis | 30 May 2017, 19:07:56 UTC | flake8: upgrade to 3.3.0 Upgrade to the latest stable version of flake8. This version has some new checks that have been fixed in earlier patches. It also supports multiline 'ignore' and 'exclude' statements in the tox.ini file which are now used to connect comments with exclusions. Change-Id: Ifff8ab1cedf5b0f4362f7c3efe697489bcc67aa3 | 06 June 2017, 21:46:07 UTC |
cf6042a | Federico Leva | 05 June 2017, 20:39:14 UTC | [Planet Wikimedia] Add blog.wikimedia.gr to Greek Planet Change-Id: Id8d9f6084c970e3f18f4b443ce2aec5a8fe83687 | 06 June 2017, 21:30:06 UTC |
8e6aa4d | Paladox | 02 May 2017, 20:40:56 UTC | Jenkins: Add nocanon to jenkins proxy site Per docs at https://wiki.jenkins-ci.org/display/JENKINS/Running+Jenkins+behind+Apache https://phabricator.wikimedia.org/T155840#3229233 I've used nocanon on my test jenkins instance. Change-Id: I92980d658c196dd2d24264cfe3fa449c4d6c9b62 | 06 June 2017, 21:12:17 UTC |
b3560ac | Antoine Musso | 23 May 2017, 07:53:27 UTC | contint: skip hhvm experimental pin on Trusty Apt::Pin['hhvm-from-experimental'] is only defined on Jessie, hence on Trusty that would end up failing. More realistically, we have to remove HHVM entirely from Trusty nodes. Bug: T165462 Change-Id: Iaae6dc5a1a4d4218017169f3069a40a79807533e | 06 June 2017, 21:06:42 UTC |
cbf058b | Antoine Musso | 06 June 2017, 12:35:06 UTC | contint: remove HHVM from Trusty permanent instances HHVM is still installed on the Trusty permanent slaves. It is outdated and all jobs requiring HHVM are using Jessie nodepool instances. Stop installing HHVM on Trusty permanent slaves. Change-Id: I7886c3767ac24f21f7b588e13edd7407c9fe6580 | 06 June 2017, 20:59:54 UTC |
d16f94d | Paladox | 30 May 2017, 19:51:24 UTC | contint: Only install java 7 on trusty and jessie stretch does not have the package java 7. Bug: T166611 Change-Id: If58499a2a9ab88a614232cfbde4f35256c749407 | 06 June 2017, 20:45:28 UTC |
a1a0edd | andrewbogott | 06 June 2017, 17:54:42 UTC | diskspace.py: Add one more special-case flavor size. Change-Id: I46effb8deb895ddddb03a4ec9985ec31d2e46c91 | 06 June 2017, 17:56:04 UTC |
534c556 | Moritz Muehlenhoff | 06 June 2017, 16:42:41 UTC | Extend account expiry date for pnorman Also change contact address, now managed by Reading. Change-Id: I3d41ac51ab53d0e87e72798b344e6a0b5838401d | 06 June 2017, 16:42:41 UTC |
e9fe011 | elukey | 06 June 2017, 15:29:01 UTC | Set profile::base::check_raid_policy to 'WriteBack' for hadoop workers Bug: T166140 Change-Id: Id47f7c50eec1b7bd9de178d54d74a5bb76c9c1c7 | 06 June 2017, 16:21:58 UTC |
47bed2f | Giuseppe Lavagetto | 06 June 2017, 15:57:05 UTC | role::graphite::alerts: add transformNull to some alerts All the mediawiki session/centralauth alerts only expect data points when there's a problem, so a null datapoint is actually good news. Use transformNull(metric, 0) to account for this. Change-Id: I0e678b676ff87936c972af5603674c583a502c17 | 06 June 2017, 16:00:30 UTC |
9364ad8 | Guillaume Lederrey | 06 June 2017, 11:46:59 UTC | elasticsearch - raise logging of actions to INFO There is no reason to have a logger in DEBUG in production (unless we are actively trying to debug a problem). Bug: T167091 Change-Id: Ic704ab09066841c326a222e604b48fcf08b65f67 | 06 June 2017, 15:51:42 UTC |
94388ef | Francisco Dans | 01 June 2017, 16:33:23 UTC | Add exception for events tagged as coming from MW Will let events tagged as `is_mediawiki` go through to mysql Bug: T67508 Change-Id: I11778eac51e60fcca5ab39ebe5af0e9be14a524e | 06 June 2017, 15:32:10 UTC |
0a606b0 | Filippo Giunchedi | 06 June 2017, 14:44:00 UTC | swift: create swift user home In production the swift user already exists post-provisioning because of T123918, also create/chown its home Bug: T162609 Change-Id: I8eb1344db2e32386741d71a4f3cf0f38fbbeaebc | 06 June 2017, 15:00:00 UTC |
593b6ea | andrewbogott | 06 June 2017, 14:16:13 UTC | diskspace.py: Catch stray instances that nova and filesystem disagree about Change-Id: I2a49a3617b15751c351cb34117d68f4cc8e6e42a | 06 June 2017, 14:33:37 UTC |
b0bddc2 | Filippo Giunchedi | 06 June 2017, 12:56:49 UTC | hieradata: fix missing yaml extension Change-Id: I6d3c0200c232536a902716e894686e770ee35ca7 | 06 June 2017, 13:51:22 UTC |
f928aa5 | andrewbogott | 02 June 2017, 19:25:55 UTC | novastats: Add 'diskspace.py' script This script checks du and df on all labvirt hosts, and then calculates our maximum space commitment by querying nova. Bug: T163796 Change-Id: Iabac89f5cda3d344ac351f51bef8e9acb84d9c4f | 06 June 2017, 13:49:24 UTC |
440ce5b | Filippo Giunchedi | 06 June 2017, 12:56:49 UTC | install_server: ms-be2013 / 16 / 17 to stretch Hosts in row A Bug: T162609 Change-Id: Id488037c2616016dc3861dc47a06aac3375a8d08 | 06 June 2017, 13:28:26 UTC |
c2d1641 | Tyler Cipriani | 05 June 2017, 19:22:48 UTC | Scap: Bump version to 3.5.8-1 Bug: T127762 Change-Id: I1ea0bc432909167c65a2a04dd87b7bc71a51829b | 06 June 2017, 12:43:58 UTC |
6dfa495 | elukey | 06 June 2017, 12:27:12 UTC | Disable role::analytics_cluster::refinery::job::guard The script is currently not working properly and generating periodical spam. Bug: T166937 Change-Id: Ib795eeba0b69d61bdbb3560087652b19fa6b3fd4 | 06 June 2017, 12:28:13 UTC |
75a7bf8 | Alexandros Kosiaris | 06 June 2017, 11:11:03 UTC | servermon: Deploy with scap3 Deploy servermon using scap3 reusing librenms' scap3 configuration Bug: T129152 Change-Id: Id7f46ac5ee9793283add0245a541c6e8dccefbb5 | 06 June 2017, 11:25:52 UTC |
c2ab014 | Giuseppe Lavagetto | 06 June 2017, 11:07:15 UTC | role::mediawiki::scaler: use more sensible intervals for checks In order to reduce the number of false positives, it is more advisable to perform more checks before alarming than having a huge retry interval. So do the following: - raise the check interval to 5 minutes; we really don't need more granularity than that on this alarm - set the retry interval to 5 minutes too. - set the check to go on 10 times before we actually raise an alert in hard state This should help reduce the number of false positives and spare us head-scratching moments where we check a machine that recovered 20 minutes ago. Change-Id: Ib5b86e0b5a8ebadbb2f9fac7b87a2289af981524 | 06 June 2017, 11:07:15 UTC |
603a5b6 | Emanuele Rocca | 06 June 2017, 10:27:44 UTC | check_ipmi_temp: turn off sel checking Call check_ipmi_sensor with --nosel to turn off system event log checking. We do not want to get criticals on old events. Bug: T125205 Change-Id: Ia97fa0bf112a1d64e56ed7778108007500f4683f | 06 June 2017, 10:27:45 UTC |
0c95f53 | Tyler Cipriani | 18 May 2017, 08:32:05 UTC | Scap3: deploy jobrunner with scap3 Bug: T129148 Change-Id: I9b60db40222172da63f4e29cc64bbbf49567a960 | 06 June 2017, 09:03:33 UTC |
c795a00 | Alexandros Kosiaris | 08 March 2017, 13:46:17 UTC | network: Add kubernetes pod/service IPs We will probably want various services to be accessed by kubernetes pods. Hence add the IP ranges in network::subnets data for eqiad, codfw production as well as the staging cluster. Also add the kubernetes service IPs. These are actually mostly informational as no traffic having a destination or source kubernetes service IP is ever expected to be on the physical wire, but it's probably useful to have those there as well. Change-Id: I4b6c1b2f5f06f10a1adac76221c5790061e438ef | 06 June 2017, 08:22:57 UTC |
8f16c4b | Nuria Ruiz | 05 June 2017, 21:23:27 UTC | Correct pageview_hourly loading scheme on pivot home Bug: T167068 Change-Id: I4fd6f60344c27bda086f937b85d793036e300e62 | 06 June 2017, 08:14:49 UTC |
d0ea43b | Faidon Liambotis | 17 May 2017, 11:16:09 UTC | Do not confine LLDP fact to physical/non-VMs It's actually useful to establish the VM->host relationship and there is no reason why it should be confined anyway. However, only set the (automatic) monitoring parent for the physical hosts (resulting in no changed behavior for those). The rationale behind this is that we want to alert for each individual VM when the Ganeti hosts running them die, as: a) just a host DOWN alert for the VM node is too inconspicuous, b) it's usually the case that VMs can be relocated to other nodes. Change-Id: I8203558efc05e5fb88bff619f3b893d16de6db62 | 06 June 2017, 08:04:07 UTC |
f8c7814 | Timo Tijhof | 25 May 2017, 20:13:09 UTC | dynamicproxy: Centralise error page template and use it This was originally based on the Varnish errorpage, but I'm starting with dynamicproxy first because it's a simple case where the file is just on disk. Later patches will re-use some of this inside the VCL use cases. This change also affects toollabs proxy, since it uses dynamicproxy. Defining 'mediawiki::errorpage' to abstract default parameters and simplify usage in multiple places. Bug: T113114 Change-Id: Id8576df7ca03823256ce824f31dd99e3466ae226 | 05 June 2017, 21:18:30 UTC |
4ad1c8d | Andrew Bogott | 05 June 2017, 20:39:55 UTC | Revert "dynamicproxy: Centralise error page template and use it" This reverts commit f3a1e04bb5a1cebf80ac93676afcde84c8cfb2df. Change-Id: I7a1e92fe019d22f38b50f4a56a39db1ee42ff19a | 05 June 2017, 20:43:52 UTC |
6640d21 | Andrew Bogott | 05 June 2017, 20:39:40 UTC | Revert "dynamic proxy errorpage: s/title/pagetitle/" This reverts commit 581fbf6981d537c31935e8da6c336bff385a4433. Change-Id: I95a3dcdc2f875784a53436d0e71013c3b331ab94 | 05 June 2017, 20:39:40 UTC |
581fbf6 | andrewbogott | 05 June 2017, 20:26:23 UTC | dynamic proxy errorpage: s/title/pagetitle/ Change-Id: I9540d64503e9723d54227b7447bb2a22fba84689 | 05 June 2017, 20:29:53 UTC |
f3a1e04 | Timo Tijhof | 25 May 2017, 20:13:09 UTC | dynamicproxy: Centralise error page template and use it This was originally based on the Varnish errorpage, but I'm starting with dynamicproxy first because it's a simple case where the file is just on disk. Later patches will re-use some of this inside the VCL use cases. This change also affects toollabs proxy, since it uses dynamicproxy. Defining 'mediawiki::errorpage' to abstract default parameters and simplify usage in multiple places. Bug: T113114 Change-Id: I764d00c7b40ad0931590f04ed2f76ecbd84b33ba | 05 June 2017, 20:01:12 UTC |
2246a55 | Bryan Davis | 30 May 2017, 19:03:15 UTC | flake8 fixes for E305 Fix "E305 expected 2 blank lines after class or function definition" warnings in preparation for flake8 upgrade in tox tests. Change-Id: I75425fc791f8745e02f5663aa0c73a6f057ecf2e | 05 June 2017, 19:48:29 UTC |
50cdac9 | Daniel Zahn | 03 June 2017, 00:21:09 UTC | planet: remove "ja" and "ca" (empty), add link to new "el" Change-Id: Idda7f0213e0778ecbbc7af494a698b34c13d2d98 | 05 June 2017, 19:10:28 UTC |
3c593da | Paladox | 30 May 2017, 19:54:57 UTC | jenkins: Install java 8 on stretch and greater If on stretch or greater, set Java version to 8, otherwise keep installing version 7 as before. Bug: T166611 Bug: T162828 Change-Id: If6a134bbaa3bb879a11921b6c667932c198da9a2 | 05 June 2017, 17:49:19 UTC |
0b66168 | Paladox | 04 June 2017, 12:58:03 UTC | Phabricator: Fix colour for Unbreak Now tasks It seems that prod is not affected; most likely someone edited it through the webui. But this can be seen on https://phab-01.wmflabs.org/T145 . Changes indigo to pink, which is the colour used on prod. Change-Id: I15bfbeca0a3664d4e179c8c392a650395c6d1c7f | 05 June 2017, 17:06:05 UTC |
1e5b3e6 | Bryan Davis | 04 June 2017, 03:14:31 UTC | planet: add Wikimedia Scoring Platform blog feed Change-Id: Ia87168968f89dd67eb3a920219390c4e97c9e539 | 05 June 2017, 17:03:15 UTC |
ff7a8d5 | Filippo Giunchedi | 05 June 2017, 16:19:55 UTC | hieradata: turn off nginx proxy_request_buffering Bug: T166806 Change-Id: I5a8d365263c35e33b4d703cff3f9baa19d27708b | 05 June 2017, 16:19:56 UTC |
369f35d | Filippo Giunchedi | 05 June 2017, 14:42:04 UTC | tlsproxy: selectively disable request buffering This is needed to completely turn off request buffering to disk. Bug: T166806 Change-Id: I8334714809112f5959fbc250d44d5ef1f9136e7d | 05 June 2017, 16:14:42 UTC |
2ea1a97 | Riccardo Coccioli | 05 June 2017, 15:27:53 UTC | Icinga: skip another NRPE error in raid handler Bug: T166962 Change-Id: I3a75502dfa40357dc1633a34af7353bc95892de6 | 05 June 2017, 15:27:53 UTC |
4b4bda0 | Filippo Giunchedi | 05 June 2017, 13:52:42 UTC | hieradata: set nginx client_max_body_size 0 for swift Bug: T166806 Change-Id: I1557982ebdb1db280bf510b69d473731e27baa60 | 05 June 2017, 13:53:00 UTC |
d3996d3 | Giuseppe Lavagetto | 05 June 2017, 12:18:49 UTC | etcd: big old roles/auth cleanup We decided not to use the builtin auth system anymore for etcd version 2, and most of what we did here wouldn't apply to etcd v3 anyways, so remove the old cruft as well as our managing tools and classes as they're mostly useless now. When we move to etcd 3, we might re-evaluate our stance, but all commands and outputs will be changing as well. So we will need to re-do the work anyways. For now, let's remove some files from the puppet repository! Change-Id: I1013c7ae2a8643472df28fc8b51527181520bce7 | 05 June 2017, 12:18:49 UTC |
8b3ffd0 | joal | 05 June 2017, 08:35:39 UTC | Add webrequest dataset to pivot configuration Sampled webrequest is now loaded in Druid. This patch updates pivot configuration to show the dataset with a proper name and comments. Bug: T166967 Change-Id: I8d1df2de39e56c93695414171bb24f4d6dec45b8 | 05 June 2017, 08:35:39 UTC |
9528f42 | Emanuele Rocca | 03 June 2017, 11:02:47 UTC | check_ipmi_temp: set check timeout to 60 seconds In a few cases checks are still timing out. Increase timeout from 30 to 60 seconds. Bug: T125205 Change-Id: I0b2edf162477de25a887bbfb7f18ab3900617555 | 04 June 2017, 10:20:20 UTC |
cbd09cd | Bryan Davis | 02 June 2017, 15:10:26 UTC | ircecho: notify service on config change Fix the need for a manual restart of the ircecho service when the configuration is changed by making the config file notify the Service resource that is created by Base::Service_unit. Also change the require for Base::Service_unit to wait on provisioning of the actual python script. There is no need to require the config anymore because of the notify relationship. Change-Id: I2621d741f44278edd9ff428fd2d864b4d4ebc292 | 03 June 2017, 15:20:30 UTC |
a9cc2c0 | Bartosz Dziewoński | 31 May 2017, 20:32:37 UTC | Fix indentation of Gerrit downtime page Failing in style. Change-Id: I01e56c390cab94b4684db1e03c5ee9641602aee6 | 03 June 2017, 00:15:10 UTC |
1497603 | Bartosz Dziewoński | 31 May 2017, 20:30:00 UTC | Fix typo on Gerrit downtime page This message sure looks like it was written in panic. Change-Id: I1fd52be400f28df568bba5aaaf77b3ea0d04fb52 | 03 June 2017, 00:13:27 UTC |
2480598 | Paladox | 02 June 2017, 22:30:22 UTC | Gerrit: Fix wrong syntax in ~/.gitconfig I by mistake did gc.<config>=<value> instead of [gc] <config> = <value> Change-Id: I825a7c9f0d7f708ba3a838bf4a59aa0bc7b8ba63 | 02 June 2017, 22:40:11 UTC |
b3b647e | Paladox | 02 June 2017, 12:20:09 UTC | Gerrit: Set gc.auto and gc.autopacklimit to 0 in ~/.gitconfig This is for gerrit 2.14 and won't affect gerrit 2.13. Apparently the newer jgit release shipped with gerrit 2.14 added support for gc.auto; we have gc switched off, but that doesn't seem to affect gc.auto. So we have to switch it off globally in a gitconfig file. See discussion at https://groups.google.com/forum/#!topic/repo-discuss/lVR37Pm4G3c Bug: T151676 Bug: T156120 Change-Id: Icbcee9c080a3ee618104a5bf2c1b7c579ca33b5f | 02 June 2017, 22:03:37 UTC |
dc715db | Paladox | 01 June 2017, 13:54:57 UTC | Gerrit: Increase packedGitOpenFiles to 6000 Matches changes done in cf3fd766857ee5bf57b23840efb1bca4a10e2c90 <paladox> "Increasing core.packedGitOpenFiles to 6000 so that it is over the average and not significantly under it. It also matches the systemd version where we set the value to 6000" chad says "we currently only use about 4800-4900 on average" Change-Id: I693bf289fb00634604d1485dba27e70c8e3c8ec3 | 02 June 2017, 22:01:53 UTC |
76440f6 | Bryan Davis | 02 June 2017, 16:01:46 UTC | planet: add Wikimedia Performance Team blog feed Change-Id: I7677c3510437884ad739277af7e865c563b337b2 | 02 June 2017, 18:57:29 UTC |
6171036 | Daniel Zahn | 31 May 2017, 18:06:34 UTC | add admin group releasers-mediawiki to mwreleases1001 This should be a temporary step to give existing MW releasers shell access on the new mwreleases1001 host before a new puppet role has been written for it. Once we have a new role and mwreleases1001 has it applied, this should move to role/common. Bug: T164030 Change-Id: I1a6c72982816bfc8e956c6daa098ab65ad450a65 | 02 June 2017, 18:48:00 UTC |
880715c | Federico Leva | 02 June 2017, 10:57:26 UTC | [Planet Wikimedia] Add some hackathon-related blogs, add Greek planet Covered in the Wikimedia blog already, but some of them missing: https://blog.wikimedia.org/2017/05/31/vienna-hackathon-learnings/ Also add the Greek planet, with translations from translatewiki.net users. Change-Id: I88c74bb402ad8878d7d9717e3badbec7149b824f | 02 June 2017, 17:30:30 UTC |
9cd3333 | Chad Horohoe | 01 June 2017, 22:23:37 UTC | Drop gerrit2001.yaml only includes temp admin permissions Already handled by role now that it's all setup Change-Id: I9216351e12bed3e470dc3bc0a126fe3e2dab2572 | 02 June 2017, 17:19:22 UTC |
d474b99 | Faidon Liambotis | 29 May 2017, 12:16:22 UTC | raid: switch from stringified fact to array Now that we have non-stringified facts, we can drop the whole join with comma (on the fact side) and split with comma (on the puppet side) dance and just pass a regular array instead. This is partially a revert of Ia421806c4fbedf2da4a02ba804fef990e647b38c. Bug: T166372 Change-Id: I0a8b5a07aad58e01405ceaf8dd2e8dc2d9ebd190 | 02 June 2017, 14:45:19 UTC |
fb95ed1 | Faidon Liambotis | 29 May 2017, 11:52:33 UTC | Remove str2bool from is_virtual facts Now that we have disabled stringified facts and that we run Facter >= 2 everywhere, $facts['is_virtual'] is guaranteed to be a boolean and doesn't need to be wrapped by str2bool(). This is partially a revert of Ie82b739b5927f43b08826ce6adb33a8b91ae81eb. Bug: T166372 Change-Id: I0cbbb46b1d598217075920b07b457c076b783d79 | 02 June 2017, 14:39:40 UTC |
1927a94 | Brandon Black | 02 June 2017, 13:28:17 UTC | LVS refactor: service IPs and sparing out lvs101[12] Bug: T150256 Bug: T165765 Change-Id: Ifd085f4e2869ad3703fe9080335bea4576c46329 | 02 June 2017, 13:52:18 UTC |
25ef545 | Faidon Liambotis | 01 June 2017, 18:17:42 UTC | puppet: disable stringified facts in Labs as well Now that they've been tested in prod and that facter was upgraded in Labs, it's time to remove this realm guard and make the two environments consistent again. Change-Id: I3075ed559af083c1ac3e0a44156abcc887cecfec | 02 June 2017, 13:12:21 UTC |
34cb8c9 | Brandon Black | 01 June 2017, 15:01:26 UTC | LVS: new redundancy layout for new eqiad+ulsfo hosts This sets up the "new" eqiad LVSes in the new style, with 4x machines providing N+1 for 3x classes. Ditto the upcoming new ulsfo LVSes with 3x machines providing N+1 for 2x classes. The regexes cover the expected future similar change in esams as well because it was simpler, but esams/codfw node lists at the top are not updated, as those machines aren't actually purchased yet. Bug: T150256 Bug: T164327 Bug: T165765 Change-Id: I81f70d801770707b526c183efc6f881b8bc0ba3f | 02 June 2017, 12:54:19 UTC |
79b5c5f | Emanuele Rocca | 02 June 2017, 10:20:16 UTC | admin: add ema's Yubikey Change-Id: Id47e4b084cd8604fa8a73e9bd3e319fdaf0b2199 | 02 June 2017, 12:05:19 UTC |
610d1cd | Emanuele Rocca | 01 June 2017, 10:52:15 UTC | Re-enable temperature monitoring via NRPE The ACPI log lines flood mentioned in f707810565 has been fixed by blacklisting the acpi_power_meter module (597b9b1af8). Re-enable temperature monitoring. This reverts commit f707810565cae62677ebaa6044d31f2ae221e4c3. Bug: T125205 Change-Id: I95c5a1b00f2b876ea4d12db973b2b346cc78a7cb | 02 June 2017, 12:00:14 UTC |
10b076f | Filippo Giunchedi | 02 June 2017, 10:27:42 UTC | swift: make swift-dispersion-stats policy-aware Required for swift 2.10 upgrade, swift-dispersion defaults to Policy-0 Bug: T151648 Change-Id: Idf7cb2a01eb1df8956fda5eb0c5b787084cb7f42 | 02 June 2017, 10:29:10 UTC |
f750ffe | Guillaume Lederrey | 02 June 2017, 08:20:17 UTC | logstash - curator connects only to localhost Since elasticsearch on the logstash cluster only exposes its API to localhost curator should only use localhost as well. Change-Id: I2421dc967087f27f9d312c716021b0194783530d | 02 June 2017, 10:17:53 UTC |
23a4ebe | Jaime Crespo | 31 May 2017, 14:04:17 UTC | mariadb: Allow full reimage of db2041,38,37,35 (still on trusty) Add the other pending trusty hosts on codfw that are not yet in jessie, all at once, to avoid useless git puppet spam. Change-Id: I738f7806843111db660d7ab4542327de61abed3f | 02 June 2017, 09:29:30 UTC |
dcf02b4 | Guillaume Lederrey | 29 May 2017, 16:18:51 UTC | logstash - cleanup dead code Cleanup related to https://gerrit.wikimedia.org/r/#/c/356063/ Bug: T166154 Change-Id: I904c24bea5161f3f4ad413423a02848e3f0f19de | 02 June 2017, 09:15:30 UTC |
4be76bf | Peter Hedenskog | 31 May 2017, 19:43:45 UTC | Add Save Timing alerts to Icinga Bug: T153170 Change-Id: If5d7d14cbd1a01317da5fa825df2bfe8a4b4ff60 | 02 June 2017, 09:02:38 UTC |
4045358 | Gilles Dubuc | 31 May 2017, 13:31:13 UTC | Add Navigation Timing alerts to Icinga Bug: T153169 Change-Id: I94d8a243db7af6f9f393b54d1bb4e9abbd2b723f | 02 June 2017, 08:58:51 UTC |
69dc16c | Alexandros Kosiaris | 02 June 2017, 07:21:48 UTC | calico: Supply a calicoctl.cfg file Supply a calicoctl.cfg file so we don't have to use ENV variables to configure calicoctl Change-Id: I0229ea89d1a39453f11e320906312ed238b58dab | 02 June 2017, 07:39:43 UTC |
f3b8ee0 | Bryan Davis | 01 June 2017, 21:00:41 UTC | labs: Direct people to #wikimedia-cloud for support Bug: T166420 Change-Id: I70aa070d7655e4caa2d3903e00471e8bbfadafc2 | 01 June 2017, 21:00:41 UTC |
69e5336 | Bryan Davis | 01 June 2017, 21:00:09 UTC | shinken, icinga: direct bots to #wikimedia-cloud Bug: T166420 Change-Id: I51cd498d8dcaf68300e7e5ec491f8c4d3b6e2ec0 | 01 June 2017, 21:00:09 UTC |
6415ad0 | andrewbogott | 01 June 2017, 17:53:50 UTC | Labvirt2003: Switch to xfs All the other labvirts use xfs, it's easier to re-image this box than make the puppet code handle the difference. Change-Id: I8b8d24e797035aca2db303aa9b3cf4be8fabb050 | 01 June 2017, 17:55:10 UTC |
ccfd2a7 | andrewbogott | 01 June 2017, 17:40:23 UTC | Add hiera file for labtestvirt2003 In particular, specify where the /var/lib/nova/instances partition is. Change-Id: Ic156efbe9db1a16c32164c94e065f0dcdeccad62 | 01 June 2017, 17:40:23 UTC |
ccdc21b | andrewbogott | 01 June 2017, 16:55:17 UTC | Labtestvirt2003: Add to site.pp Change-Id: Id3cd04ca79e2917b9b479c90adeb6ee35d7a6d12 | 01 June 2017, 16:55:17 UTC |
1a31e3c | Faidon Liambotis | 01 June 2017, 17:04:21 UTC | varnish: don't use $name as a parameter name $name is reserved and special in puppet and using it as parameter name causes a puppet 4 parser validate error. Change-Id: I3606c7c921e5b889d5c627d38514a1a383e127ed | 01 June 2017, 17:38:47 UTC |
0ea2d64 | Faidon Liambotis | 01 June 2017, 17:12:56 UTC | phabricator: don't assign a new hash key Hashes are immutable in puppet, so trying to assign a value to a key is invalid code that works right now because of legacy bugs. These are fixed in puppet 4 and the new parser errors out. Do this in a different way. Change-Id: I2a3c31acc558b876cc14ca06409033d4c7f4ade3 | 01 June 2017, 17:38:47 UTC |
cf1e831 | Faidon Liambotis | 01 June 2017, 17:05:25 UTC | restbase: don't define parameter $hosts twice The value of the two definitions was the same, so fairly obvious. Errors with Puppet 4's parser validate. Change-Id: I5c55899a155b7968f3a0789aa2a2ccd88d0187d3 | 01 June 2017, 17:22:46 UTC |
43f5233 | RobH | 01 June 2017, 16:43:41 UTC | Revert "setting labtestvirt2003 into site.pp" This is showing a catalog error for some of the labs specific items, removing the site.pp entry to get a basic system puppet run done (and user keys on system for debugging). This reverts commit f68c4a380b4e134462b721bc9622f66445c77d6b. Change-Id: I785b8eeeb5bc09038baa08007e807d57ed161806 | 01 June 2017, 16:43:41 UTC |
f68c4a3 | RobH | 01 June 2017, 16:31:30 UTC | setting labtestvirt2003 into site.pp just expanding the stanza for labtestvirt200 in codfw Bug: T166237 Change-Id: Iaadfe5fadcf1cca46231cd7a9468236adaf7acfe | 01 June 2017, 16:31:30 UTC |