https://github.com/wikimedia/operations-puppet

sort by:
Revision Author Date Message Commit Date
d228a83 scap: Move l10nupdate into module Bug: T87221 Change-Id: Id8f5086d78fd8952aef3bcddce987614f80e2161 24 January 2015, 20:25:16 UTC
747c4c6 scap: Clean up absent'd lint related files / packages See T70255 for discussion about why / when these were removed. I think they've been in puppet long enough to be actually purged. Change-Id: I6a1ad2cb00e3433eca9cd8db18f7bd1b05f339df 24 January 2015, 20:20:09 UTC
6ccc5ef scap: Move scap master code into own class - Create a role class for a scap master - Scap needs dsh from what I understand, so it should be in scap::master - Remove some layers of indirection that seem unnecessary. Bug: T87221 Change-Id: Ib96af2cb7a7f84c4f023479312555add37d7c22a 24 January 2015, 20:16:21 UTC
58d558a scap: Move 'common_scripts' into scripts class A mechanical move Bug: T87221 Change-Id: Icba39e8e70680b7bbb2b47646f88f54ea9c52781 24 January 2015, 20:16:14 UTC
c2b2957 admin: replace phuedx's key Change-Id: Ie5d6166d2c6dc2b0aa3c5b142f590ea6e9ff6e1c 24 January 2015, 09:18:43 UTC
d911c01 logstash: fix exception-json messages * Save exception class name as "class" attribute * Reset "type" to "mediawiki" which was overwritten by exception class Change-Id: Ie81f115ccce27abe819fdbecda0fa8604504edea 24 January 2015, 02:34:01 UTC
b43f009 set up varnish instance defaults file even under systemd Change-Id: Ia2fd7b73fe1501f854e37da3afc8a22fc85c2396 24 January 2015, 00:56:07 UTC
026177f sca: move hiera file to the correct location (d'oh) Change-Id: I7a061ea985b3bace79393096d4af4d65d4a5f660 24 January 2015, 00:36:41 UTC
5751e6a services: create sca role In the work for T86774 it seemed convenient to unify all the roles we apply to services cluster A in a single comprehensive role that we can configure properly. Change-Id: I7cd4b5d47eea62eea111c362aa70730e9c83a5af Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 24 January 2015, 00:29:30 UTC
802f594 logstash: remove support for most udp2log events Since I43cd2c3197838fbdd4837932f966e47414d27508 most log2udp traffic from fluorine to logstash has been disabled. This change removes the processing rules for the now removed events. Change-Id: Iaf5e4557e6c6b85bf55b3ee116346b670d38331a 24 January 2015, 00:15:33 UTC
51155b1 Merge "virt: use role, hiera" into production 24 January 2015, 00:13:04 UTC
792d43b logstash: Update apache2 parsing pattern Updates to apache2 parsing pattern from data captured in production logstash: * Log level may be preceded by a colon (eg "[:error]") * Capture thread id for Worker MPM messages Change-Id: I0789761b6b0759a07ab51c3e1b7d1f7f0f3b40c6 24 January 2015, 00:09:22 UTC
4d786a4 virt: use role, hiera See T86774 Change-Id: I5c3fd4f87183a417cc5d87d4e703302a4fe154dd Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 24 January 2015, 00:07:12 UTC
d25e789 Merge "Re-use Parsoid Varnishes for citoid too" into production 23 January 2015, 23:38:45 UTC
51d78c3 Merge "virt0.wikimedia.org cert expired" into production 23 January 2015, 23:38:34 UTC
86d9556 Remove include of misc::statisics::cron_blog_pageviews from site.pp Change-Id: I3805b5b38cd53fd43681a18a6118095b5c89f7d0 23 January 2015, 23:37:03 UTC
2b913f5 virt0.wikimedia.org cert expired the certificate expired before i got around to revoking it, removing the certificate file as it is no longer used (was a tampa server) Change-Id: Iec14cc802db6b51ccec3cf070c87714a47a34e1b 23 January 2015, 23:36:09 UTC
96c5aa0 Move cron_blog_pageviews to statistics module Change-Id: I122b2f7d31c1a030e3ef9dca80b33a8aff81042e 23 January 2015, 23:34:37 UTC
c03193c Move misc::statistics::limn::data* to statistics module Change-Id: If335cfa4bcc9aa772ba71ec0dcac9641cc07708f 23 January 2015, 23:27:36 UTC
7e64450 Merge "/home/tstarling/.profile" into production 23 January 2015, 23:14:23 UTC
f915612 lsearchd: reclaim search hosts Bug: T86149 Change-Id: If805f9c012fa3ba5d4fb0f14ebeca381671aee77 23 January 2015, 23:13:36 UTC
b262ed7 /home/tstarling/.profile Add .profile from /etc/skel so that .bashrc is executed on login, on servers where /home/tstarling is provisioned solely from puppet. Change-Id: Ic0079a94340e35f9ef759599d08d123f5008c9ab 23 January 2015, 23:10:57 UTC
56cb058 Fixes for a few variable names now that they are in the statistics module Change-Id: I24909bc65bacab5a62f2899ec1169f14f9427482 23 January 2015, 23:09:59 UTC
b802d86 tools: Install rlwrap in dev_environ Bug: T87368 Change-Id: I7ac26f39f4ee4bd01212aed3ede580f75e381ce9 23 January 2015, 23:08:46 UTC
8021996 Remove now unused parts of misc/statistics.pp Still TODO: - geowiki - limn data - cron_blog_pageviews Change-Id: Ied657d4958150d344f95500203ba6c719cb58c13 23 January 2015, 23:02:34 UTC
a2e0436 Fix document root for geowiki-private data in stats.wikimedia.org Change-Id: I968924e5ac9363556053fbd01f22521997393997 23 January 2015, 22:57:11 UTC
503b534 require statistics::web in statistics::sites classes Change-Id: I3ec5366046d8a1952899e2a54b3d1d04cf63a8d8 23 January 2015, 22:54:31 UTC
3f36ef7 Merge "puppetize SSH private key for jenkins VE sync" into production 23 January 2015, 22:51:40 UTC
ebe2e3b Fix dependency, fix template variables Change-Id: I1cb102d6a5220e3ce34b6be4e5cd12cdaa9298f5 23 January 2015, 22:46:55 UTC
6fb0c55 puppetize SSH private key for jenkins VE sync As described in T84731 there is a SSH keypair installed on jenkins boxes and used for "VE sync tasks", but it was installed manually. This is to puppetize the existing key. We will add it to the private puppet repository. This role is just applied on gallium. and existing situation is: -rw------- 1 jenkins jenkins 1.7K Jan 22 2014 jenkins-mwext-sync_id_rsa file /var/lib/jenkins/.ssh/jenkins-mwext-sync_id_rsa Bug: T84731 Change-Id: I568367464d8096c9d085c2657fb5b48481da4ee3 23 January 2015, 22:44:25 UTC
598d2e6 Use proper path for statistics::sites templates Change-Id: Iee382cce771fabb41ed4ff7459269f844d280275 23 January 2015, 22:39:55 UTC
f71df46 Re-use Parsoid Varnishes for citoid too In the manner of Ib4436e8b, let's use citoid so we can actually get it into production now, rather than waiting for the Services cluster to appear once procurement is done and setup is complete in a few weeks/months. Bug: T76949 Change-Id: Ib238012e09d2345ac9bedf0b5011a121477a8e08 23 January 2015, 22:34:09 UTC
e3c8a0a role::statistics::web now uses module classes This also removes the statistics::sites::reportcard class. It turns out this is not needed. Change-Id: I2979bae957aaa448304e6fc5ce9d8e85e9b39c5e 23 January 2015, 22:33:55 UTC
fe8e4b8 Use statistics::web class Change-Id: I5bd95d156318a5323cd76990234b1d83148bfde1 23 January 2015, 22:25:01 UTC
4d32b53 role::statistics::private now uses module includes Change-Id: I0874d3f79b4d7ce183d00e3a5a3625d1eb83ead2 23 January 2015, 22:19:14 UTC
1e3dccd role::statistics::cruncher now uses module classes (except for geowiki) Change-Id: I1c64dd3304d7ff7a3aa8ed0e2bd8b9a3fc48a523 23 January 2015, 22:14:38 UTC
760cef9 Fix dependency for class statistics::rsync::eventlogging Change-Id: I335646626fb8f4e03e4c53b3776775635c7452cc 23 January 2015, 22:09:18 UTC
e20ddeb Include more cruncher classes in new role classes Change-Id: I6ba2780b991e034a8fddbe7fa1ec7df919ac044c 23 January 2015, 22:05:20 UTC
db8ac59 Move custom researcher group researchdb password file definition into node definition This should only be on nodes that have the 'researchers' group included. Change-Id: If47ec117caee8ccdbe7dc6b9185de70818d78f1f 23 January 2015, 22:01:23 UTC
10c3721 Comment out includes that are now covered by the new role Change-Id: I4e1329f3fc077a05dd747ae47dc3d22b671dbac4 23 January 2015, 21:55:47 UTC
bdc4a62 Merge "granting cluster deployment access to Andrew Green" into production 23 January 2015, 21:53:01 UTC
3b30be7 granting cluster deployment access to Andrew Green approved on ticket, was awaiting confirmation of responsibilities Change-Id: I6acbb5361b5bce917a4739713c4c143aca72e686 RT: 9060 Bug: T87463 23 January 2015, 21:51:41 UTC
485a697 Comment out uses of researchdb password classes Change-Id: I3576a0989b63cc4b27af84fa32631548b576cbb2 23 January 2015, 21:50:45 UTC
4c98f40 Comment out manual include of misc::statistics::packages::python in geowiki classes I will refactor this to use require_packages when geowiki is merged into its own module Change-Id: Ia7ca1e989a8f4ab04b0f2b9f84876d99ddcf2714 23 January 2015, 21:46:44 UTC
726dbfd Explicit provider for ganglia_new::monitor::service This seems to be an emerging ugly pattern as we migrate to jessie: For services which ship init files for other systems (e.g. /etc/init.d/foo and/or /etc/init/foo) as well as a systemd unit, the OS gives precendence to the systemd unit, but puppet (in the context of the Service definition) tends to believe the non-systemd variant without an explicit "provider => systemd". Thus in this case (prior to this patch) ganglia-monitor is actually running as a systemd service and functioning correctly there, but puppet keeps failing at status checks and then failing to start it (when it's already running), because it's explicitly invoking /etc/init.d/ganglia-monitor Change-Id: I748ea8ee28e9cde7ea1c2551f98485de1fe1913a 23 January 2015, 21:44:48 UTC
057aecf Apply new statistics::compute class to stat1003 Change-Id: Icecf9ba5813ac40772ced48627bd56374c621268 23 January 2015, 21:38:40 UTC
7d0f945 mediawiki: define pcre_cache_type for canary appservers As the new HHVM package we're installing has the new PCRE engine that needs to be configure, add this configuration there. Change-Id: If4700e5f160582504b9a1a3c8e05e289fbe75447 Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 23 January 2015, 21:30:19 UTC
413ee9b Duplicate variable definition during migration to stat module Change-Id: I903a2aac6941cf0e3a9097c846999f54dc074b43 23 January 2015, 21:22:11 UTC
3522723 Temporarily use local variables in misc statistics classes to satisfy existing class references Change-Id: I130a955ca83917ef0cbf56ff20e6f76a0c56c6e5 23 January 2015, 21:17:10 UTC
5991c01 Just make misc::statistics::base depend on ::statistics Change-Id: Id1d77298021cb94f2bb0137ea9102fb5a71afe72 23 January 2015, 21:11:54 UTC
7b8ce19 Make misc::statistics::user and misc::statistics::base include module classes This is to fix duplicate resource definitions. T87450 Change-Id: I95d004f7c8fe64a9a78f2d786722c4465bfd7da1 23 January 2015, 21:07:37 UTC
f4b51bb Merge "cleaning out holmium entries" into production 23 January 2015, 21:04:08 UTC
2106b5f cleaning out holmium entries death to the old blog server T84263 Change-Id: I549ec19181d35e63bdcd90687d66ecf39273341f 23 January 2015, 21:03:11 UTC
79ff75d Globally qualify statistics module include Change-Id: I3930844e410ccdae95341b3d686582fb324012bb 23 January 2015, 21:03:01 UTC
691c03a Fix typo in class name include Change-Id: I2770fece69f84112e67ea59acb3ce40d673d97b9 23 January 2015, 21:01:19 UTC
2d1fa59 First commit in a multi-commit effort to move misc/statistics.pp into modules/ T87450 Change-Id: I49140d85ddea99f5d4d9a3c71e60cf7fa57d49b6 23 January 2015, 20:19:43 UTC
6378084 use /run/ directly for nginx pidfile this fixes "reload" under systemd w/ jessie, and also works for currnet precise/trusty because they all have /var/run -> /run anyways. Change-Id: Ia6fb7ba1da72f5cae87b825a3befd7341c37673d 23 January 2015, 19:25:16 UTC
c86c2ab snapshot: unify node declarations, use role, hiera See T86774 Change-Id: If95c91749d6290ba9dc2e44251a6344f1641c687 Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 23 January 2015, 17:41:59 UTC
020892f mediawiki: Introduce feature flag to enable/disable lvs Since labs doesn't support LVS yet Bug: T87210 Change-Id: I0b84083645ae14b4479e1b5ed094b666b76ac43d 22 January 2015, 21:26:08 UTC
06408a2 mediawiki: Load prod/beta apache configurabion based on realm Is a hack, should be replaced with something nicer for managing the different apache configuration Bug: T87210 Change-Id: Iff85982d7a3f887933e928e770b089295194a01b 22 January 2015, 21:10:12 UTC
1451dc5 Merge "let bastion hosts have base::firewall" into production 21 January 2015, 23:56:49 UTC
e7e3a75 keyholder: ensure ssh-agent-proxy transmits whole messages Periodically, deployers will see the following error on tin: "Error reading response length from authentication socket." These errors started appearing with the introduction of keyholder. I suspect it happens when the recv() call on the socket that is connected to the actual ssh-agent reads less than a full message. So instead of copying bytes from the proxy to the client as they come, only send complete messages. Bug: T86545 Change-Id: I68b5e1c2f1d80ef7585e97d8820513ee062968be 21 January 2015, 02:30:54 UTC
1d47e5e Merge "Scrap the /var/log lvm partition; make the / 20G by default." into production 21 January 2015, 00:44:05 UTC
4ae028a Scrap the /var/log lvm partition; make the / 20G by default. Bug: T87003 Change-Id: Idaeafb72aa367aea0abdcec1fb9116717e23a3e2 21 January 2015, 00:42:39 UTC
4333cf4 varnish systemd unit file: iterate on fixups #4 Change-Id: I233819b0d0fa004b39cbd19deb5fa866d950d35b 20 January 2015, 23:54:01 UTC
e21cd85 varnish systemd unit file: iterate on fixups #3 Change-Id: I0126e8d99679b467021171d641da9b8fedc774b6 20 January 2015, 23:50:39 UTC
9f97e3c varnish systemd unit file: iterate on fixups #2 Change-Id: I618088f129e1babc3ee320dfb30f98b8fd3923dd 20 January 2015, 23:27:54 UTC
a57ede4 varnish systemd unit file: iterate on fixups Change-Id: I94f13d91a008fc3c2f490f17ccefe42032ab2373 20 January 2015, 23:20:09 UTC
943bdc8 Bump alert thresholds for EventLogging's overall events/s Since EventLogging volume recently outgrew the 350 events/s and EventLogging is known to be able to handle more events/s, we bump the threshold to avoid getting unneeded Icinga warnings for EventLogging. The 450 events/s threshold is arbitrary, but EventLogging is known to currently handle 450 events/s amount of traffic, and that threshold will silence the false alarms for now. The Analytics team has to come up with more realistic thresholds (T86244). Change-Id: I2312d62fff1ad851640c2f2fced646478833b7a4 20 January 2015, 22:02:04 UTC
fd52585 varnish systemd service stuff Change-Id: I459ea096e260b49037bf934ff9d63ef7372c7120 20 January 2015, 21:24:19 UTC
24e3a61 contint: Don't include base firewall by default Applying this class by itself should not cause the inclusion of base::firewall - only the opening up of a firewall hole *if* a firewall already exists. This will remove the base firewall from most beta instances, and we can enable them on a case by case basis if required Change-Id: I94d09b49726297e451777655b5acc614527daa5f 20 January 2015, 08:11:16 UTC
62971fb Update servermon settings Update RSpecs, Rakefile and update the settings and stop managing urls.py Change-Id: I4f1b41d18c454d0226b8bd3bc70af1c1175cbc6b 20 January 2015, 02:46:12 UTC
2cd2202 Update servermon's service_name So that code deployments actually restart the service Change-Id: I5d24c9e9eb9b9f1c951dbafbadec3c74ede3d797 20 January 2015, 02:46:12 UTC
2356ee6 update role::ganglia::config $data_sources ganglia::collector::config $data_sources seems to be the same, or overidden, or... something. Change-Id: I743c5f80c455f91e3e209142dcac4af6a375865e 20 January 2015, 02:27:33 UTC
55e5777 mediawiki: Explicitly open port 80 on mediawiki webservers No-op in prod since base::firewall isn't applied Change-Id: I2df3ae0f53e2c7913539938045bfcea1854d3c0f 20 January 2015, 02:17:25 UTC
2f32217 beta: Kill beta specific mediawiki logging role Bug: T87210 Change-Id: I5aa9392c2c1f9afd6c6625bbccde70c035eb50be 20 January 2015, 02:07:25 UTC
816acdc beta: Kill beta specific jobrunner class Hiera data appropriately set! Bug: T87210 Change-Id: Ibaebde2b9206ae5e7b606bede3e004d6da70fef7 20 January 2015, 02:07:25 UTC
51ee252 beta: Kill videoscaler role Bug: T87210 Change-Id: Ie1f80509d7022c1017d1758108ddceafa14d392e 20 January 2015, 02:07:25 UTC
bb1c8e3 mediawiki: Move jobrunner config into hiera Shared across jobrunner and videoscaler machines Bug: T87210 Change-Id: I21396d68e91344d3f269a75667dc5e1cfcdc7dfe 20 January 2015, 02:03:48 UTC
75d6dcb mediawiki: Fix incredibly annnoyyying stray space Change-Id: I78bcc5bb79067a2d9ec41289f536257569bca586 20 January 2015, 01:41:07 UTC
457d585 redis: user role, hiera See T86774 Change-Id: Ice9de3907a7d2cd90d95c1bddffbdbfbed24d1cd Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 20 January 2015, 01:33:36 UTC
633df0b memcached: use role, hiera See T86774 Change-Id: I1f2aa1fb0320be4e89dc4b869b0786e8fde96dfc Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 20 January 2015, 01:11:20 UTC
a93f85c Tools: Install valgrind on bastions Bug: T87117 Change-Id: I680bdd8a04b6800ed105e27f5a05a2981c63a56a 20 January 2015, 01:03:53 UTC
6d17f31 correct mysql ganglia aggregators for eqiad T87209 Change-Id: I502667045e0ba3723d70f23dadc607b9d668ce51 20 January 2015, 00:29:55 UTC
f11eee6 deploy dbstore2001 dbstore2002 Change-Id: Ib7ff6916b58831969b65566f7a1981bd4a8ba2d8 20 January 2015, 00:20:52 UTC
e6095a2 lvs: move the hiera file to the correct location Change-Id: Ifa0f5e4ed56a90c58c3676352ed5a2b815b9c6eb 20 January 2015, 00:06:10 UTC
2681cdc lvs: use role, hiera See T86774 Change-Id: I49a77b24d6e747460ad35b547c2c00a70ed99836 19 January 2015, 23:54:15 UTC
54deab4 beta: Clean up remnants of older apache-config setup Comes from modules/mediawiki/files/apache/beta now Change-Id: Iab695a98b37a7270da97dd722e1f17e59c75b02e 19 January 2015, 23:01:10 UTC
b2a7747 beta: Kill fatal_monitor.rb script Hasn't worked for a long time (HHVM has no fatal.log) Change-Id: Ied5c098bfb5c1ceac6936c0ea4e67eee3c4b3705 19 January 2015, 22:56:09 UTC
193f8a3 restbase: use role, hiera As per T86774 Change-Id: Idff78403ac860a7ae8b8a48d9dd418fed9b4e26a Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 19 January 2015, 22:27:42 UTC
ab526aa logstash: Experimental logstash irc logging on production logstash SAL is terrible. Let's attempt to replace it! Only on deployment-prep logstash to begin with, because prod logstash is incredibly overloaded atm Change-Id: Ic2f32e5e34d438b83506b60deda428001b18330e 19 January 2015, 22:17:44 UTC
188dc37 Followup commit to 0333eab Fix a missing s typo Change-Id: I6da41d169e3535f895f354d0450d0b8dd0761f80 19 January 2015, 21:02:50 UTC
0333eab monitoring::service: mimic monitoring::host group handling We ended up having different ways of handling servicegroups and hostgroups in icinga. Be consistent and handle the $group parameter the same way Change-Id: I9e8f354ec68339433ae1efd6f5f45dd05cfc67d9 19 January 2015, 20:52:49 UTC
153985a parsoid: use hiera, role This commit cleans up site.pp and moves most class parameters to hiera, for the rationale see T86774 Change-Id: I2c4db8cad5f27e586eaa89b700618aa153cee141 Signed-off-by: Giuseppe Lavagetto <glavagetto@wikimedia.org> 19 January 2015, 20:10:54 UTC
8b5e4cc parsoid: Include base::firewall on parsoid hosts Beta has base::firewall included on all *oid hosts (see T86951) and sca* has base::firewall included for all *oid services (see I13f8e75dd0ba61671bf9f8acd075333c497b4435). This adds base::firewall to parsoid hosts as well so that everything is consistent (prod vs beta as well as *oid hosts). A ferm rule for parsoid's port has been added in Ia312a73d1ab329a22aae26ee851ed584363017b3 Bug: T87105 Change-Id: I5d32c8f3c60d4903d58e850e3507fffb959e4245 19 January 2015, 17:34:37 UTC
037b766 Do not delete the Wikidata dump we just created. Nor the recent ones. Instead delete the older ones. Change-Id: I05634acbe759427399e99d77cf2b9e47215363df 19 January 2015, 16:38:19 UTC
dc23197 shinken: Check wikitech on https rather than http Since http just redirects to https Change-Id: Icfed53d2bb5082037f3613fb2578e74eecc74d8d 19 January 2015, 10:33:25 UTC
fa29bab shinken: Add wikitech check for labs infra Just a basic check to see if main page is up Change-Id: Ie52016c85c0c3a1a887562250368ad854af6c42f 19 January 2015, 00:54:32 UTC
58f34e3 upgrade db1054 to trusty and mariadb 10 Change-Id: I54040963731614a56ae859ca73ad1607876ff0cb 18 January 2015, 20:26:09 UTC
c9e66a4 deployment: Open up redis port on deployment masters base::firewall is applied on deployment-bastion, so things can't seem to deploy in beta because the redis port isn't accessible. Adding in prod too since it is a good practice, even if prod machines don't have base::firewall yet. In a glorious future, these two roles would be unified Change-Id: I617b780a9cc7c5859e93b2b1d506909582106e62 17 January 2015, 09:52:02 UTC
back to top