https://github.com/wikimedia/operations-puppet

Revision Message Commit Date
a0dc8e4 Add kafka_version parameter, s/java_packaage/java_home/ in confluent::kafka::client Also add profile::kafka::simple::broker for testing in labs. This allows for configuration of which Java is used, and which Kafka version will be installed. Change-Id: I577ed3f0fd9c95ba7ba5df1b49ce1e1680754dfb 25 May 2017, 14:43:46 UTC
6da3083 role::aqs: use profile::cassandra Also add a switch to profile::cassandra to allow opening connections to the analytics network in case of need, and fix handling of the case we have no TLS encryption. Change-Id: I727779606ff6c83f95e2dd308df3fa34f14e3f77 25 May 2017, 12:00:05 UTC
39b64f2 role::kubernetes::worker: upgrade calico on one host Let's first test the upgrade on one host only. Bug: T165024 Change-Id: I9866a3148a298f5d0e6501b1781fc5aa0b8816df 25 May 2017, 10:26:25 UTC
6d160c6 admins: add kaldari to analytics-privatedata-users Access to the Hadoop Cluster ("Data Lake") via host stat1002, as requested on ticket. Bug: T166165 Change-Id: I373087f36e00128306af6821cddd94ffe790262c 25 May 2017, 09:54:58 UTC
6a5a14e calico: add new version 2.2.0 Bug: T165024 Change-Id: I667d376f151c33e86dfab3de546a9ee502778e66 25 May 2017, 07:07:43 UTC
24b5e2d phab: comment out include of exim4::ganglia This caused LOTS of cron spam since today. The chain of events was like: < paladox> !log phabricator upgrading mariadb to 10.2 -> /usr/lib/x86_64-linux-gnu/libmysqlclient.so.18: symbolic link to libmariadb.so.3 -> /usr/sbin/exim: /usr/lib/x86_64-linux-gnu/libmysqlclient.so.18: no version information available (required by /usr/sbin/exim) -> "/usr/local/bin/exim-to-gmetric" fails. But this cron runs every minute and then tries to send mail to root. /etc/aliases says that root should be root@wikimedia.org, and when it _tries_ to mail that, sender verification fails: "550-Verification failed for <root@phabricator.phabricator.eqiad.wmflabs>", which then causes "mail delivery failed" messages that get delivered to all of ops. Since Ganglia is deprecated and I wanted to find a quick stopgap before talking about this some more, I am just commenting out that class right now. Change-Id: Iff1ce9cdd059a4239309ab93d367e1bfa790b91d 25 May 2017, 01:06:20 UTC
2760d4c authdns::server: move 'include standard' to role Change-Id: Ic63b1f5fdffd81a1df0153d9c6054f0f7dd6a591 25 May 2017, 00:13:44 UTC
a6a2ca2 dnsrecursor: move 'include standard' to role Change-Id: I6fa652760aeafc846e6ab8541449f41f42b621a4 25 May 2017, 00:02:07 UTC
82af211 confd: use logrotate::conf for logrotate Change-Id: I772ffaa22542ca42b4bcd763895e562de7954e30 24 May 2017, 23:44:44 UTC
a79acf1 Gerrit: Remove "" around T\\d+ in gerrit.config When I did a Gerrit upgrade and ran java -jar gerrit.war init -d review_site, it removed the quotes around match = "T\\d+". But the puppet repo has quotes around it, so the next puppet run added them back (- match = T\\d+, + match = "T\\d+"). Change-Id: If2c073e5f6bad030672f5eba88156844cdee9b6a 24 May 2017, 21:00:08 UTC
89b277c DHCP/partman: Add DHCP and partman entries for ores200[1-9] Bug:T165170 Change-Id: I2bca295a6baf0fbbe588be6e8fc00d5ea36ebbf1 24 May 2017, 20:42:08 UTC
837db31 Use is_not_bot filter function for eventlogging mysql consumer Bug: T67508 Change-Id: Ib5156dcf582c9829e336e4c58fe7184cc4e590d2 24 May 2017, 19:13:37 UTC
269f8dd Revert "Changes needed for upgrading to Druid 0.10" We have to roll back to 0.9. Hadoop indexing jobs don't work due to java 8 vs java 7 issues. This reverts commit 6fbadbad2658c964e46203dd447707676016f83e. Change-Id: Ide4e009ca1b4534649462f44aa823e356bf89a6a 24 May 2017, 17:42:31 UTC
1fefeeb r::c::perf - FQ outbound flow rate cap @ 1Gbps This should help clamp down brief rate spikes in local intercache and cache:app flows (aka TCP conns) to no more than 10% of interface bandwidth, and throttle some of the higher-traffic remote proxies (e.g. opera, zscaler, google, etc) to more-reasonable fractions of our available outbound host and transit bandwidth as well. Bug: T147569 Change-Id: If26405f59d6646842e7d6d459eb1a12a046c179b 24 May 2017, 17:07:35 UTC
9403053 novastats: Update some reports to use more up-to-date code. The old 'novastats.py' library is no longer needed since mwopenstackclients does most of the querying we want. This patch replaces some old scripts with a few newer example scripts that use mwopenstackclients. Change-Id: I8e054c973dcdaa731688788b5087043308779f96 24 May 2017, 16:12:23 UTC
fef24b7 openstackclients: add an optional project arg to allinstances() Change-Id: I5b18939213dc9a1d4879c1309ff14664e22755a5 24 May 2017, 16:07:39 UTC
fd9a126 Extend expiry dates for two accounts Change-Id: I66a8ffd574fa5cfdf3482438f9cb1b52b6920d7a 24 May 2017, 14:41:50 UTC
aedd882 tools: have maintain-kubeusers chown $HOME/.kube Chown the $HOME/.kube directory to the tool account. Also protect the generate credentials with chattr. Bug: T165875 Change-Id: I37bc5517ab4bd8b646e0c63d9140c1cc2633e9ea 24 May 2017, 14:20:38 UTC
7b4e9f8 Puppet: more reliable run-puppet-agent - when using the --failed-only option, consider last run failed if the summary yaml file is not valid. Change-Id: I886646b4a250b5e7ca30804837d4d73d6fa7dda2 24 May 2017, 14:02:30 UTC
ed6a7e5 Tidy up tools node motd Change-Id: I795ceb01adf3d0e6b0d839b88530ca7b75f840b2 24 May 2017, 13:58:32 UTC
6fbadba Changes needed for upgrading to Druid 0.10 Druid 0.10 depends on Java 8. The package should install it as a dependency when installing, but it will not be the default Java. Setting JAVA_HOME explicitly in daemon env.sh files. Bug: T164008 Change-Id: I83a36bdfbcbe13f1c06471446a9bc5a0aa3e0941 24 May 2017, 13:43:37 UTC
9d0bfe7 logrotate: Fix uwsgi postrotate script Using invoke-rc.d on jessie systems is plain wrong. For now, use the service command wrapper and reevaluate in the future whether we should switch to systemctl. Change-Id: I584532cf9f8c25d858b3035b78de2a3163fdc25e 24 May 2017, 11:39:14 UTC
aa116a7 Specify the correct host for wikidata icinga config Since we changed it to www.wikidata.org, we also need to change the services referencing it Change-Id: I5633a0010ffef232d991de52126c1f3486767d56 24 May 2017, 11:24:12 UTC
2c7089d Decouple wikidata monitoring from the IP address Using the IP address of text-lb.esams was plain wrong as it would alert even when esams was depooled. Change the definition to use the FQDN, qualify the definition correctly, and stop using -I on the command, relying instead on the correct DNS resolution done by -H of check_http. Change-Id: I2efa53ea6dc8677cdc3b4769dfdb26a81b09cd33 24 May 2017, 10:53:19 UTC
01bff75 Revert "Use gdb from jessie-backports on jessie" This reverts commit 026c81a83071fca6a16939e3a405d0487e00b8c1. Change-Id: Ia159081fed2e7e6e3e438a85c930d8af386ae710 24 May 2017, 10:44:24 UTC
026c81a Use gdb from jessie-backports on jessie Fixes access to TLS variables in threaded programs (in particular HHVM) (among other changes in four major releases). It's a low level debugging tool, so shouldn't have any impact on existing setups. Change-Id: I7c7540660368846bd9486c83208f0032e4974b49 24 May 2017, 10:40:30 UTC
ab02b19 prometheus: enable qdisc collector on cache hosts Enable queueing discipline (qdisc) metric collection on cache hosts. Traffic control statistics are useful, among other things, to monitor BBR's behavior. Bug: T147569 Change-Id: I490f72235fd873903fdf1baf322cda602d21ae0a 24 May 2017, 10:25:53 UTC
7a3cb90 beta: set profile::etcd::tlsproxy::read_only=false 79cfdefd50 added a new setting so we can switch etcd readonly whenever doing a switchover (T159687). In production the setting is applied via the hieradata role hierarchy: hieradata/role/common/configcluster.yaml hieradata/role/eqiad/configcluster.yaml However on deployment-prep the role hierarchy is not looked up causing on deployment-etcd-01: Could not find data item profile::etcd::tlsproxy::read_only in any Hiera data file... Set the value to false, assuming on beta we want etcd to be writable. Change-Id: I99f0bf11112de2a81bbbf131ece01eaea7871227 24 May 2017, 10:04:31 UTC
776b1d5 raid: Implement the option to check write cache policies Enable all databases to enforce WriteBack policy, throwing an error (not a warning) if they go down to WriteThrough: that will probably cause performance problems and indicates something anomalous, like a BBU problem. Ignore those alerts in the handle-raid scripts to avoid creating failed disk tasks. This currently only works on megacli systems, although the plan is to deploy it to all systems where the feature makes sense (megacli systems are the older ones that cause daily issues). Bug: T166108 Change-Id: I36fc6fb115c2d9e9d88391ea5a9230d6389f781c 24 May 2017, 09:17:05 UTC
b182709 Enable memcache-based Thumbor broken thumbnail throttling Bug: T151065 Makes memcache run on the thumbor machines, as well as nutcracker to pool connections going to both. This colocation makes sense because the memcache use is minimal and thumbor's throttling feature is tolerant of memcache being down. Change-Id: Idc5f10324e1c0877901393949acccc618b340dde 24 May 2017, 08:34:16 UTC
ef3e76b Puppet: run-puppet-agent improvements - fix bug when --failed-only is set - do not imply --quiet with --failed-only Change-Id: Iaf441764d3893f560cbcc330a182d2eb2ad264ce 23 May 2017, 22:11:30 UTC
92b5ba2 r::c::perf - raise fq flow_limit to 300 Still seeing some rare flow_plimit drops, generally on a single queue and only affecting ~20% of cache boxes. Hopefully this will make them truly rare (there will probably always be bad edge cases where these drops are the most appropriate course of action). Bug: T147569 Change-Id: Ib0b19da2cdd53f8b8928165d770c49f16a6126bc 23 May 2017, 21:26:41 UTC
6b2bd28 Puppet: run-puppet-agent, add --failed-only option - Add the --failed-only option to the run-puppet-agent script to run puppet only if it's enabled and failed in the last run. This should make it quick and easy to recover from deploys that cause a puppet failure on a large number of hosts. After fixing the underlying issue, it should be enough to run the following command from one of the cumin masters: sudo cumin -b 25 -s 1 -p 95 '*' 'run-puppet-agent --failed-only' to force a rolling puppet run only on the hosts where it failed. Change-Id: Ide1246b256a4fd1b48ed660b3e87f77e91c1b5b4 23 May 2017, 20:36:10 UTC
80d305c caches: enable BBR + tuned mq+fq qdiscs Note I've moved it all to role::cache::perf for now instead of using the BBR stuff we've been testing with over in base::sysctl, so that the dependency issue is cleared up for this specific case. We'll have to look later at how to generically plug the runtime dependency of "turn on fq before bbr" for all cases (using the RPS script or a separate simple command to replace a root queue) to make a working shared setup for both cache and non-cache cases... Bug: T147569 Change-Id: I5fcff1105f83e4a6b5884e76730acb747eaecbaa 23 May 2017, 20:03:18 UTC
1ff9351 raid-check: optionally return critical when not in a write policy Failing to a different write policy happens silently in megacli checks (for example, if BBU is flat, damaged, too hot, etc.). In some hosts (databases), a policy change means horrible performance, so bad that it can cause an outage due to the heavy IO now being as slow as a spinning disk. This script now detects the current active policy, and returns a critical alert (not a warning) if the write policy specified does not match the one requested. By default, it conserves the previous behaviour (not caring about that), but it can be enabled for hosts that want, for example, a WriteBack or a WriteThrough policy. It will also catch a manual misconfiguration (BBU is ok, but has been configured incorrectly). Bug: T166108 Change-Id: I6f1c74ec2e4f2982a0cf83b52b566c78f3858133 23 May 2017, 16:55:48 UTC
8f0cd7d interface-rps: add mq subqueue qdisc setup Change-Id: I21780730f6755aabdc90b0e09b5aac6055b428b3 23 May 2017, 16:11:57 UTC
31bf233 RPS cleanup 5/5: use new config file in puppet This allows future puppet usage of interface::rps to use $rss_pattern again if warranted, via the config file, and also sets up the class to automatically re-execute interface-rps on relevant config and/or code changes. Change-Id: Iad4a8522b74e05762164064c555200451fd5c2a8 23 May 2017, 16:11:57 UTC
2dd5353 RPS cleanup 4/5: Add config file to script, use for rss_pattern Change-Id: I6b4e996d994810d859d2c7d09795907d80f3945c 23 May 2017, 16:11:57 UTC
fad92c3 RPS cleanup 3/5: pattern not necc for LVS The patterns being manually specified are now auto-detected, and the specification of a pattern is not a good way to indicate whether interface-rps should be run at all. Defaulting it to on for the LVS case now. This also gets us to a state where nothing is puppetized to use a custom RSS pattern, making further changes easier. Note that because of the way the related interface::up_command works, this will create duplicate /e/n/i commands on the LVS hosts and need manual fixup there afterwards. Change-Id: I1bcf39b3ff26992161e26e96d49d8fe7795766e8 23 May 2017, 16:11:57 UTC
e3fba27 RPS cleanup 2/5: remove irqbalance module This was only used by interface::rps, and all interface::rps consumers are confirmed to no longer have the package installed. Change-Id: I8fe8bf5d176d3dc6221579907a882a4e3acef015 23 May 2017, 16:11:57 UTC
1741ae8 RPS Cleanup 1/5: remove unused upstart file Change-Id: Ia2fac8faca2a040014a4d2ac0e4973b81529e460 23 May 2017, 16:11:57 UTC
47e41d0 elasticsearch - deploy elasticsearch-curator along with elasticsearch elasticsearch-curator is a tool to simplify administration of elasticsearch. See https://github.com/elastic/curator for more details. Change-Id: Id5a5af52d7daca9ab93a26c71c9e0a511afab7bf 23 May 2017, 15:42:03 UTC
ef79000 Remove use of is_not_bot filter in eventlogging mysql until code is fixed and change is cleared (announced) Bug: T67508 Change-Id: Ib470e69cd391f0556bece2240d599ddcdc260f26 23 May 2017, 15:35:52 UTC
ee80c0e logstash - apifeature indices need to be cleaned up logstash is not the only index that needs to be cleaned up. Any elasticsearch output from logstash needs the same index administration. Change-Id: I0ea56a4038c12bb1eed3f8f04306d219381777a1 23 May 2017, 15:26:26 UTC
8092cab Add bot filter to mysql consumer Wraps mysql consumer's input URI with a filter that prevents events triggered by bots/spiders from reaching mysql. Bug: T67508 Change-Id: I3c82e63b498a56f503a6af22ca1278d0d1c27063 23 May 2017, 14:49:11 UTC
5c9ccfc Revert "raid-check: Return critical when not in WriteBack mode for megacli" This reverts commit 000fa95d4394915809a7e34c37a4bf51c40512d9. Change-Id: I98e8b50875ec0cee44c1ce751e19df2c0a5ed1bf 23 May 2017, 14:42:05 UTC
000fa95 raid-check: Return critical when not in WriteBack mode for megacli Failing to a different write policy happens silently in megacli checks (for example, if BBU is flat, damaged, too hot, etc.). In some hosts (databases), a policy change means horrible performance, so bad that it can cause an outage due to the heavy IO now being as slow as a spinning disk. This script now detects the current active policy, and returns a critical alert (not a warning) if the write policy is incorrect. It assumes all megacli hosts will want a writeback policy, so it is hardcoded, but on the plus side, it will also catch a manual misconfiguration. Bug: T166108 Change-Id: I1f54222bfb12360a15e53dff9362fa5eca7b6f6d 23 May 2017, 14:09:27 UTC
a199ae9 hieradata: move webrequest 5xx to logstash.svc Bug: T149451 Change-Id: I097a4cd91d8f4ff5f851f02e5ab2d02a9a346323 23 May 2017, 13:27:20 UTC
2a75f51 role: don't install kafkacat for statistics::private The only machine with this role applied (stat1002) has kafkatee installed but no input/output configured and kafkatee is not running. This leaves oxygen (jessie) with kafkatee installed and thus deprecate kafkatee on trusty. Change-Id: Idaba968c862effa87b516cf22b3006f77a0ddb5b 23 May 2017, 10:39:06 UTC
f5f09b1 profile::calico::builder: fix branch to check out Since we've moved to use gbp for building the package, we should check out the master branch instead. Change-Id: I5f38bca73b61e5cd227b3faeec66e9577807ea2d 23 May 2017, 10:10:03 UTC
cf3256a logstash: move 'hostname' to 'host' for webrequest Bug: T149451 Change-Id: I9e81353b9d2f82ee4a9d4a10c4907e3400ec33b8 23 May 2017, 09:52:37 UTC
9b55714 logstash: build http_request from webrequest fields Bug: T149451 Change-Id: Ieb52e767f8018cbfc6bdb194ceb6cc6e42f859b1 23 May 2017, 09:50:33 UTC
b57a615 Puppet compiler: automatically sync from all masters Bug: T165583 Change-Id: I9ee0abd624d1b01ae3c06030215b9032db9a116a 23 May 2017, 09:11:03 UTC
d3d1006 graphite: move 'standard' and 'base::firewall' to role Change-Id: I66586ec0e46abc29a5e0c1016e5f22d5ba7ec1a3 22 May 2017, 23:56:20 UTC
c7e8026 Phabricator: Use mkdir -p for creating phd directory in systemd The directory sometimes already exists, which causes mkdir to fail. Use -p to ignore the error if the directory already exists. --parents "no error if existing, make parent directories as needed" Change-Id: I7179b8472edb290d5a3aadbc15f1e9889b212895 22 May 2017, 20:23:50 UTC
4201cf1 nrpe-server: mkdir for pidfile in ExecStartPre Paladox reported this fix earlier in IRC, and it proved to be necessary on acamar for me (tested there). Change-Id: I8ceaffdc59c5e021ffe0e74031ced4a715edf247 22 May 2017, 18:43:36 UTC
1b315c6 check_cpufreq: Issue a CRITICAL, not a WARNING Fix the return code of 1 which in nagios parlance equals a WARN and set it to 2 which is a CRITICAL. Change-Id: I4dac3e44776d2fc0d64520d57f6e3ed87fdd8823 22 May 2017, 17:20:29 UTC
cab6d71 nrpe: Remove user and group from systemd unit nrpe handles dropping privileges anyway when started as root and does this quite ok after writing its pid file. Use that behavior for now. Change-Id: Ic6bda187ada3412264636a71af02602382e58047 22 May 2017, 16:59:49 UTC
fb55fe2 nrpe: Don't set PrivateTmp=True We have checks (namely mysql ones) that rely on /tmp being shared between nrpe and mysql as the mysql unix socket gets used for those Bug: T148507 Change-Id: I6f8b4da7ed84004d13a841446d457dc745b54869 22 May 2017, 15:25:51 UTC
17b1e9c nrpe: Set type=forking and pass -d in systemd unit nrpe in jessie does not support running in the foreground; change the Type to forking and pass -d to indicate daemon behavior. Change-Id: I16f5fd9669624fb74e11aff94abc19162759a0cf 22 May 2017, 13:31:07 UTC
f32b99a nrpe: Ship a systemd unit file Use base::service_unit and ship a just slightly modified version of stretch's systemd unit. Our version differs in just not passing -f and $NRPE_OPTS to the nrpe server. Change-Id: Id1a2695c5bc10928cee62a6cc9fbf04268f0e817 22 May 2017, 13:18:19 UTC
0439fff Set Type=notify for etcd systemd units etcd supports notifying systemd of its successful startup. For this to work, Type=notify is required. While this is not crucial for operation, it does provide the nice benefit of systemd instantly knowing the status of the daemon and commands like systemctl behaving accordingly. Change-Id: I8cc0bd0f174f4390a352571951c115c953cfb0c2 22 May 2017, 12:49:55 UTC
0a1896c profile::nginx::tlsproxy: brown paper bag fix Change-Id: I900201b7c6705ddd8b9d028da97edfe999760ce1 22 May 2017, 11:04:51 UTC
2a1c053 profile::etcd::tlsproxy: fixup nginx configuration for read-only Change-Id: Iaef5b42ace4f10c81fe26b76e34fc90764da60f8 22 May 2017, 10:59:11 UTC
53acfc6 profile::etcd::replication: write to localhost via http Since we're using replication only on clusters that use the tlsproxy profile, and we're planning on having the replicated cluster to be read-only, we need to write directly to localhost or writes would fail. Change-Id: Ic37ea88fe7f43ea1252a2dd1b847b4947b92e9fa 22 May 2017, 10:30:08 UTC
5b71cc8 profile::etcd::tlsproxy: better read-only error reporting Use a format that etcd would use. Change-Id: Ifcfe4dcde2a155dba8015c81cbfc44dcd03b3d67 22 May 2017, 10:27:46 UTC
79cfdef profile::etcd::tlsproxy: allow read-only mode This allows us to prevent writes to the slave cluster, and also to set both clusters in read-only mode while we perform a switchover. Bug: T159687 Change-Id: I4b0e1ae2f9b527125b71e1d9355e4b3325cde991 22 May 2017, 09:42:15 UTC
a9d901c Don't replicate techconductwiki to labs Bug: T165977 Change-Id: I009b946e9f0537008f398ac1c07ad1c42101c7e5 21 May 2017, 12:36:35 UTC
66d4840 Print dbname before running update.php Change-Id: Ic0bba79284c75e6521284060194df2bc93ee9501 21 May 2017, 09:18:46 UTC
3e1a8b6 New SSH key for me (I'm at the hackathon. You too? Find me to confirm) Change-Id: Ie50bb06a2857b3896adf36d63fd11daf7eb680ce 20 May 2017, 17:04:23 UTC
f63a6fc wikistats: Require /usr/lib/wikistats/schema.sql before executing mysql command Change-Id: Icbc16577a987f13ffd5c96b216112b37aa24d976 19 May 2017, 23:11:42 UTC
287fd3c wikistats: ensure systemuser exists before backup dir Change-Id: Ic1fa990a236b8b0fc76767de7eda72058736796b 19 May 2017, 22:39:03 UTC
1c53919 wikistats: 'user' -> 'owner' parameter for /srv/wikistats Change-Id: I1d1d32eb974e609b294e381c58bd6d1cb312386b 19 May 2017, 21:05:01 UTC
d327edd wikistats: use systemuser for git cloning Change-Id: I7a4c67af2fec7a26e380be8f43296bab64656df7 19 May 2017, 20:45:12 UTC
4735432 wikistats: fix typo in db.pp "requires" -> "require" Change-Id: I470f93a12a9d6f2e2ce8770f571b7021b13e4519 19 May 2017, 20:06:53 UTC
76dc7fb wikistats: add missing .erb file extension to grants.sql Change-Id: I0b3699b40a651ae5689b64f0456e3a170ff04ae5 19 May 2017, 19:55:27 UTC
859ee79 wikistats: make db_pass a parameter, use fqdn_rand_string Make db_pass a parameter so we can use it in 2 places, a file that the deployment-script can read it from and the template for the MySQL GRANTS. Use fqdn_rand_string with a random seed to generate a string (there is nothing secret in the db anyway, btw) instead of that exec() with openssl. Puppetize giving out the GRANTs on (first) run, add missing template file to fix puppet run. Change-Id: I026de7ecbd90b0c7ec06d7dffa59152b81a5dd2d 19 May 2017, 19:34:54 UTC
791fd62 Planet: Delete sr.planet This has only had 2 blogs ever registered. One is dead. The other had exactly 1 post back in 2008. Change-Id: Iab50c874cc7ebeda21be4c6412b17f3daaf4683a 19 May 2017, 15:36:08 UTC
4a33630 dumpWikidata: Make the minimum shard size depend on the number of shards So that the number of shards can easily be changed w/o having to re-calculate these numbers. Change-Id: I2f178f1e071b4eaa414a5d50911cd4bca1749b4b 19 May 2017, 14:08:47 UTC
8665bb8 Use kill -- -$$ to kill a process group in dumpwikidata scripts We need to kill the children and the grand children here, not just the direct children. Sadly pkill only kills the direct children. I also bumped the minimum expected sizes for the various dumps based on the size of the last dump run (with 15-20% safety margin) Change-Id: I0bfd519e877e4686b69cdb92cc7632f793fa0595 19 May 2017, 13:59:13 UTC
8e26212 Revert "mariadb: allow reimage of db2048 for upgrade to jessie" This reverts commit d710b123726bb478b5c796adb8a01d7a5cfdb849. Change-Id: I47d6716a4707426995031ce2fa214036206c458d 19 May 2017, 10:34:28 UTC
9cdb112 mariadb: Test trusty install on db2049 to confirm hw issues It fails on reimage, hw issues are suspected, checking with trusty as it was the previous state. It could also be firmware? Hopefully this will give us more information. Change-Id: I9f7eb0638ec985312a2e788301e6945ab348ba87 19 May 2017, 10:21:19 UTC
d710b12 mariadb: allow reimage of db2048 for upgrade to jessie Remove db2049, already reimaged. Change-Id: I3b5d02214c6ed34e81eccd696d17e127bcb6f259 19 May 2017, 09:35:28 UTC
13c758f mariadb: allow reimage of db2049 for jessie upgrade Remove db2051, which was already reimaged. Change-Id: I12a0347fe55a5cc8519b4edb140a14b5cf3c3bf7 19 May 2017, 06:10:19 UTC
ad26de9 gerrit: rename "server" IP to "service" IP Because that's what this is, a service IP, NOT the server IP. Rename the variables to make this less confusing. Change-Id: I26d3afe502e3b0080a3e87c1d3a1c422cc858fc7 18 May 2017, 22:36:15 UTC
0e3240f wikistats: grant db permissions on first run (labs) On the first puppet run on a new instance, apply database grants for the wikistats user. Also move existing code that generates a random password from init.pp to db.pp since it's db-related. And add missing "unless" to generation of random password so it just happens once. Change-Id: I196c241b04970a4bf0d21c31f37f31d62b873db7 18 May 2017, 19:41:10 UTC
bcaf2b4 Assign roles to kubernetes200X hosts Give the kubernetes::worker role to these boxes Bug: T164851 Change-Id: Iafc9937ce6b39a79874196b573ad87819e14f45d 18 May 2017, 13:52:10 UTC
2f48eaf Remove non-ascii character from servermon.rb Remove the copyright sign. Change-Id: Ibaa7d4b783fa71e87262823842abd7b2e53d639f 18 May 2017, 11:33:28 UTC
8ec711c Update docker-host.cfg partman recipe Specify the VG name to docker, set priorities and sizes better, add a placeholder LV and amend a few typos. Finally add confirmations for LVM removals Remove some whitespace as well Change-Id: Ib7c7fbdbf508b685aeebe72cf072b8dd4f102454 18 May 2017, 11:23:35 UTC
b56c5f9 Document servermon optimization Add a comment about how we had to change the puppet tables indexes slightly to fix an inefficient query issued by servermon. Normally we would not be touching the tables created by another application, but ActiveRecord support in puppet has been deprecated for a long time now and we are the only user. Bug: T164604 Change-Id: I359e92c5a6852645b86bcf1e183e705f5952507b 18 May 2017, 11:23:16 UTC
d9f4973 mariadb: set db2051 as enabled for full reimage Remove db2052, which has already been moved to jessie. Change-Id: I9915ac63f0c505373949a2eb40a99d6d31866284 18 May 2017, 09:37:30 UTC
6198af9 cassandra: remove unused role Change-Id: Ideb535c305081990815642d8a4be861aee8f0835 18 May 2017, 09:24:59 UTC
a2ca0c9 restbase: remove legacy classes, roles Change-Id: Iaf26092f719f91b4772f6f3eb180941f1e43af89 18 May 2017, 09:23:47 UTC
75680b4 deployment-prep: additional fixes to restbase hiera Change-Id: I94146313de3ee84a20b2e66b4083f6a7aa796fe7 18 May 2017, 09:15:03 UTC
2358a3e profile::cassandra: remove useless pick() Change-Id: I7e7bdfe55b68ab43a64305d001a036ba84a3c0a4 18 May 2017, 08:36:50 UTC
87bdff7 Fix MediaWiki centralauth errors graphite alarm Small fix for the metric to monitor (sum should be sumSeries) Change-Id: I75ff7c9daf230d79af21562ef3bcf15fe1a55a24 18 May 2017, 08:16:43 UTC
a00eb50 restbase: convert deployment-prep to role/profile Also: * Add a deprecation warning to role::restbase::server * Better handle the default instances case for cassandra Change-Id: I48827fd2388d15d24935df909e70d36097aec8aa 18 May 2017, 08:12:18 UTC
3895842 mariadb: clean up duplicate GRANTs for phstats user These GRANTs for the phabricator-statistics (phstats) user are duplicate, looks like copy/paste fail. Cleaning up. Change-Id: I806034b62145a7a53de26c5781dfb851a53e5c48 17 May 2017, 19:39:36 UTC
0a1ad61 debug_proxy: move 'standard' and 'base::firewall' to role Change-Id: If8897410b9c5fd639bf8099acc6e47686c1ec20f 17 May 2017, 19:33:05 UTC
ee6062e phabricator: convert to profile/role-structure Another conversion from "role" to "profile/role"- structure in Puppet, following our newer puppet coding standards. https://wikitech.wikimedia.org/wiki/Puppet_coding#Profiles Change-Id: If2eeff9f844a223fff460b7fe713be14f3f00ccf 17 May 2017, 18:56:53 UTC
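
Two of the entries above (1b315c6 and 1ff9351) rely on the Nagios plugin exit-code convention, in which an exit status of 1 means WARNING and 2 means CRITICAL. The following is a minimal Python sketch of that convention, not the code from either commit; the function name check_write_policy and its parameters are hypothetical and chosen only for illustration.

```python
from typing import Optional, Tuple

# Nagios plugin exit codes: the process exit status selects the alert level.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_write_policy(active: str, expected: Optional[str] = None) -> Tuple[int, str]:
    """Hypothetical check comparing the active write policy to a requested one.

    With no expected policy the check does not care about the policy at all,
    mirroring the default behaviour described in 1ff9351.
    """
    if expected is None:
        return OK, "OK: write policy not checked"
    if active.lower() == expected.lower():
        return OK, f"OK: write policy is {active}"
    # A mismatch is reported as CRITICAL (2) rather than WARNING (1).
    return CRITICAL, f"CRITICAL: write policy is {active}, expected {expected}"
```

A real check would print the message and pass the returned code to sys.exit(), so that the monitoring system sees status 2 and raises a critical alert instead of a warning.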