Nova Resource:Tools
Contents
- 1 Documentation
- 2 Server admin log
- 2.1 July 12
- 2.2 July 11
- 2.3 July 10
- 2.4 July 9
- 2.5 July 8
- 2.6 July 6
- 2.7 July 5
- 2.8 July 4
- 2.9 July 3
- 2.10 July 2
- 2.11 July 1
- 2.12 June 30
- 2.13 June 29
- 2.14 June 28
- 2.15 June 21
- 2.16 June 20
- 2.17 June 16
- 2.18 June 15
- 2.19 June 13
- 2.20 June 10
- 2.21 June 3
- 2.22 June 2
- 2.23 May 27
- 2.24 May 25
- 2.25 May 23
- 2.26 May 22
- 2.27 May 20
- 2.28 May 16
- 2.29 May 14
- 2.30 May 13
- 2.31 May 10
- 2.32 May 9
- 2.33 May 6
- 2.34 April 28
- 2.35 April 27
- 2.36 April 24
- 2.37 April 20
- 2.38 April 13
- 2.39 April 12
- 2.40 April 11
- 2.41 April 10
- 2.42 April 8
- 2.43 April 4
- 2.44 March 30
- 2.45 March 29
- 2.46 March 28
- 2.47 March 21
- 2.48 March 20
- 2.49 March 5
- 2.50 March 4
- 2.51 March 3
- 2.52 March 1
- 2.53 February 28
- 2.54 February 27
- 2.55 February 25
- 2.56 February 23
- 2.57 February 21
- 2.58 February 20
- 2.59 February 19
- 2.60 February 18
- 2.61 February 14
- 2.62 February 13
- 2.63 February 12
- 2.64 February 11
- 2.65 February 10
- 2.66 February 9
- 2.67 February 6
- 2.68 February 4
- 2.69 January 31
- 2.70 January 30
- 2.71 January 28
- 2.72 January 25
- 2.73 January 24
- 2.74 January 23
- 2.75 January 21
- 2.76 January 20
- 2.77 January 16
- 2.78 January 15
- 2.79 January 14
- 2.80 January 10
- 2.81 January 9
- 2.82 January 8
- 2.83 January 7
- 2.84 January 6
- 2.85 January 1
- 2.86 December 27
- 2.87 December 23
- 2.88 December 21
- 2.89 December 19
- 2.90 December 17
- 2.91 December 14
- 2.92 December 4
- 2.93 December 1
- 2.94 November 25
- 2.95 November 24
- 2.96 November 14
- 2.97 November 13
- 2.98 November 3
- 2.99 November 1
- 2.100 October 23
- 2.101 October 20
- 2.102 October 15
- 2.103 October 10
- 2.104 September 23
- 2.105 September 11
- 2.106 August 24
- 2.107 August 23
- 2.108 August 22
- 2.109 August 20
- 2.110 August 19
- 2.111 August 16
- 2.112 August 15
- 2.113 August 11
- 2.114 August 10
- 2.115 August 6
- 2.116 August 5
- 2.117 August 2
- 2.118 August 1
- 2.119 July 31
- 2.120 July 30
- 2.121 July 29
- 2.122 July 25
- 2.123 July 20
- 2.124 July 19
- 2.125 July 10
- 2.126 July 5
- 2.127 July 3
- 2.128 July 2
- 2.129 July 1
- 2.130 June 30
- 2.131 June 26
- 2.132 June 25
- 2.133 June 24
- 2.134 June 19
- 2.135 June 17
- 2.136 June 16
- 2.137 June 15
- 2.138 June 14
- 2.139 June 13
- 2.140 June 11
- 2.141 June 10
- 2.142 June 9
- 2.143 June 8
- 2.144 June 7
- 2.145 June 5
- 2.146 June 4
- 2.147 June 3
- 2.148 June 2
- 2.149 June 1
- 2.150 May 31
- 2.151 May 30
- 2.152 May 29
- 2.153 May 28
- 2.154 May 27
- 2.155 May 24
- 2.156 May 23
- 2.157 May 22
- 2.158 May 21
- 2.159 May 19
- 2.160 May 14
- 2.161 May 10
- 2.162 May 9
- 2.163 May 6
- 2.164 May 4
- 2.165 May 2
- 2.166 May 1
- 2.167 April 27
- 2.168 April 26
- 2.169 April 25
- 2.170 April 24
- 2.171 April 23
- 2.172 April 19
- 2.173 April 15
- 2.174 April 11
- 3 Instances for this project
Documentation
Description
The Tools project is one of two projects in the Tool Labs environment (the other being Toolsbeta).
Tool Labs is a reliable, scalable hosting environment for community developers working on tools and bots that help users maintain and use wikis. The cloud-based infrastructure was developed by the Wikimedia Foundation and is supported by a dedicated group of Wikimedia Foundation staff and volunteers.
Tool Labs is a part of the Labs project, which is designed to make it easier for developers and system administrators to try out improvements to Wikimedia infrastructure, including MediaWiki, and to do analytics and bot work.
Tip: Confused about the terms labs, tool labs etc? Read Wikimedia Labs vs Tool Labs.
The Tool Labs environment provides:
- Support for Web services, continuous bots, and scheduled tasks.
- Access to replicated production databases.
- Easily shared management of tool accounts, where tools and bots are stored.
- A grid engine for dispatching jobs.
- Support for mosh, SSH, SFTP without complicated proxy setup.
- A shared pywikibot installation.
- Time-travel backups for short-term data recovery.
- Version control via Gerrit and Git.
- Support for Redis.
In general, every tool maintainer should work primarily on the Tools project (not Toolsbeta, which is for experiments to the Tool Labs environment itself).
Getting access
After filling in the form, your request will then show up in the queue below, and will be processed shortly by one of the Tool Labs administrators.
Current queue [ link ]:
(No outstanding requests)
Help-page
Tools Resources Overview
Useful links
- Getting started (in-depth)
- Getting started (for toolserver users)
- Bugzilla (product "Wikimedia Labs", component "tools"; report a bug/feature request)
- Service group (tool) interface - create service groups (tools) and add users to them
SSH Fingerprints
tools-login: Help:SSH Fingerprints/tools-login.wmflabs.org
Server admin log
July 12
- 17:57 scfc_de: tools-exec-11: Stopping apache2 service; no clue how it got there
- 17:53 scfc_de: tools-exec-11: Moved log files around, rebooted, restored iptables and reenabled queue ("qmod -e {continuous,task}@tools-exec-11...")
- 13:00 scfc_de: tools-exec-11, tools-exec-13: qmod -r continuous@tools-exec-1[13].eqiad.wmflabs in preparation of reboot
- 12:58 scfc_de: tools-exec-11, tools-exec-13: Disabled queues in preparation of reboot
- 11:58 scfc_de: tools-exec-11, tools-exec-12, tools-exec-13: mkdir -m 2750 /var/log/exim4 && chown Debian-exim:adm /var/log/exim4; I'll file a bug why the directory wasn't created later
July 11
- 11:59 scfc_de: tools-exec-11, tools-exec-12, tools-exec-13: cp -f /data/project/.system/hosts /etc/hosts
July 10
- 20:35 scfc_de: tools-exec-11, tools-exec-12, tools-exec-13: iptables-restore /data/project/.system/iptables.conf
- 16:00 YuviPanda: manually removed mariadb remote repo from tools-exec-12 instance, won't be added to new instances (puppet patch was merged)
- 01:33 YuviPanda|zzz: tools-exec-11 and tools-exec-13 have been added to the @general hostgroup
July 9
- 23:14 YuviPanda: applied execnode, hba and biglogs to tools-exec-11 and tools-exec-13
- 23:09 YuviPanda: created tools-exec-13 with precise
- 23:08 YuviPanda: created tools-exec-12 as trusty by accident, will keep on standby for testing
- 23:07 YuviPanda: created tools-exec-12
- 23:06 YuviPanda: created tools-exec-11
- 19:23 scfc_de: tools-webproxy: "iptables -A INPUT -p tcp \! --source 127/8 --dport 6379 -j REJECT" to block connections from other Tools instances to Redis again
- 14:12 scfc_de: tools-exec-cyberbot: Reran Puppet successfully and hotfixed the Peachy temporary file issue; will mail labs-l later
- 13:33 scfc_de: tools-exec-cyberbot: Freed 402398 inodes ...
- 12:50 scfc_de: tools-exec-cyberbot: "find /tmp -maxdepth 1 -type f -name \*cyberbotpeachy.cookies\* -mtime +30 -delete" as a first step
- 12:40 scfc_de: tools-exec-cyberbot: Root partition has run out of inodes
- 12:34 scfc_de: tools-exec-gift: Forgot to log yesterday: The problems were due to overload (load >> 150); SGE shouldn't have allowed that
- 12:28 YuviPanda: cleaned out old diamond archive logs on tools-master
- 12:28 YuviPanda: cleaned out old diamond archive logs on tools-webgrid-04
- 12:25 YuviPanda: cleaned out old diamond archive logs from tools-exec-08
July 8
- 20:57 scfc_de: tools-exec-gift: Puppet hangs due to "apt-get update" not finishing in time; manual runs of the latter take forever
- 19:52 scfc_de: tools-exec-wmt, tools-shadow: Removed stale Puppet lock files and reran manually (handy: "sudo find /var/lib/puppet/state -maxdepth 1 -type f -name agent_catalog_run.lock -ls -ok rm -f \{\} \; -exec sudo puppet agent apply -tv \;")
- 18:09 scfc_de: tools-webgrid-03, tools-webgrid-04: killall -TERM gmond (bug #64216)
- 17:57 scfc_de: tools-exec-08, tools-exec-09, tools-webgrid-02, tools-webgrid-03: Removed stale Puppet lock files and reran manually
- 17:26 scfc_de: tools-tcl-test: Rebooted because system said so
- 17:04 YuviPanda: webservice start on tools.meetbot since it seemed down
- 14:55 YuviPanda: cleaned out old diamond archive logs on tools-webproxy
- 13:39 scfc_de: tools-login: rm -f /var/log/exim4/paniclog ("daemon: fork of queue-runner process failed: Cannot allocate memory")
July 6
- 12:09 scfc_de: tools-mail: rm -f /var/log/exim4/paniclog after I20afa5fb2be7d8b9cf5c3bf4018377d0e847daef got merged
July 5
- 22:36 YuviPanda: cleared diamond archive logs on a bunch of machines, submitted patch to get rid of archive logs
- 22:17 YuviPanda: changed grid scheduling config, set weight_priority to 0.1 from 0.0 for https://bugzilla.wikimedia.org/show_bug.cgi?id=67555
July 4
- 08:51 scfc_de: tools-exec-08 (some hours ago): rm -f /var/log/diamond/* && restart diamond
- 00:02 scfc_de: tools-master: rm -f /var/log/diamond/* && restart diamond
July 3
- 16:59 Betacommand: Coren: It may take a while though; what the catscan queries was blocking is a DDL query changing the schema and that pauses replication.
- 16:58 Betacommand: Coren: transactions over 30ks killed; the DB should start catching up soon.
- 14:37 Betacommand: replication for enwiki is halted current lag is at 9876
July 2
- 00:21 YuviPanda: restarted diamond on almost all nodes to stop sending nfs stats, some still need to be flushed
- 00:21 YuviPanda: restarted diamond on all exec nodes to stop sending nfs stats
July 1
- 23:09 legoktm: tools-pywikibot started the webservice, don't know why it wasn't running
- 21:08 scfc_de: Reset queues in error state again
- 17:51 YuviPanda: tools-exec-04 removed stale pid file and force puppet run
- 16:07 YuviPanda: applied biglogs to tools-exec-02 and rejigged things
- 15:54 YuviPanda: tools-exec-02 removed stale puppet pid file, forcing run
- 15:51 Coren: adjusted resource limits for -exec-07 to match the smaller instance size.
- 15:50 Coren: created logfile disk for -exec-07 by hand (smaller instance)
- 01:53 YuviPanda: tools-exec-10 applied biglogs, moved logs around, killed some old diamond logs
- 01:41 YuviPanda: tools-exec-03 restarted diamond, atop, exim4, ssh to pick up new log partition
- 01:40 YuviPanda: tools-exec-03 applied biglogs, moved logs around, killed some old diamond logs
- 01:34 scfc_de: tools-exec-03, tools-exec-10: Removed /var/log/diamond/diamond.log, restarted diamond and bzip2'ed /var/log/diamond/*.log.2014*
June 30
- 22:10 YuviPanda: ran webservice start for enwp10
- 22:06 YuviPanda: stale lockfile in tools-login as well, removing and forcing puppet run
- 22:01 YuviPanda: removed stale lockfile for puppet, forcing run
- 19:58 YuviPanda|food: added tools-webgrid-04 to webgrid queue, had to start portgranter manually
- 17:43 YuviPanda: created tools-webgrid-04, applying webnode role and running puppet
- 17:27 YuviPanda: created tools-webgrid-03 and added it to the queue
June 29
- 19:45 scfc_de: magnustools: "webservice start"
- 18:24 YuviPanda: rebooted tools-webgrid-02. Could not ssh, was dead
June 28
- 21:07 YuviPanda: removed alias for tools-webproxy and tools.wmflabs.org from /etc/hosts on tools-webproxy
June 21
- 20:09 scfc_de: Created tool mediawiki-mirror (yuvipanda + Nemo_bis) and chown'ed & chmod o-w /shared/mediawiki
June 20
- 21:01 scfc_de: tools-webgrid-tomcat: Added to submit host list with "qconf -as" for bug #66882
- 14:47 scfc_de: Restarted webservice for mono; cf. bug #64219
June 16
- 23:50 scfc_de: Shut down diamond services and removed log files on all hosts
June 15
- 17:12 YuviPanda: deleted tools-mongo. MongoDB pre-allocates db files, and so allocating one db to every tool fills up the disk *really* quickly, even with 0 data. Their non preallocating version is 'not meant for production', so putting on hold for now
- 16:50 scfc_de: qmod -cq [email protected]
- 16:48 scfc_de: tools-exec-cyberbot: rm -f /var/log/diamond/diamond.log && restart diamond
- 16:48 scfc_de: tools-exec-cyberbot: No DNS entry (again)
June 13
- 22:59 YuviPanda: "sudo -u ineditable -s" to force creation of homedir, since the user was unable to login before. /var/log/auth.log had no record of their attempts, but now seems to work. straange
June 10
- 21:51 scfc_de: Restarted diamond service on all Tools hosts to actually free the disk space :-)
- 21:36 scfc_de: Deleted /var/log/diamond/diamond.log on all Tools hosts to free up space on /var
June 3
- 17:50 Betacommand: Brief network outage. source: It's not clearly determined yet; we aborted the investigation to rollback and restore service. As far as we can tell, there is something subtly wrong with the switch configuration of LACP.
June 2
- 20:15 YuviPanda: create instance tools-trusty-test to test nginx proxy on trusty
- 19:00 scfc_de: zoomviewer: Set TMPDIR to /data/project/zoomviewer/var/tmp and ./webwatcher.sh; cannot see *any* temporary files being created anywhere, though. iipsrv.fcgi however has TMPDIR set as planned.
May 27
- 18:49 wm-bot: petrb: temporarily hardcoding tools-exec-cyberbot to /etc/hosts so that host resolution works
- 10:36 scfc_de: tools-webgrid-01: removed all files of tools.zoomviewer in /tmp
- 10:22 scfc_de: tools-webgrid-01: /tmp was full, removed files of tools.zoomviewer older than five days
- 07:52 wm-bot: petrb: restarted webservice of tool admin in order to purge that huge access.log
May 25
- 14:27 scfc_de: tools-mail: "rm -f /var/log/exim4/paniclog" to leave only relay_domains errors
May 23
- 14:14 andrewbogott: rebooting tools-webproxy so that services start logging again
- 14:10 andrewbogott: applying role::labs::lvm::biglogs on tools-webproxy because /var/log was full and causing errors
May 22
- 02:45 scfc_de: tools-mail: Enabled role::labs::lvm::biglogs, moved data around & rebooted.
- 02:36 scfc_de: tools-mail: Removed all jsub notifications from hazard-bot from queue.
- 01:46 scfc_de: hazard-bot: Disabled minutely cron job github-updater
- 01:36 scfc_de: tools-mail: Freezing all messages to Yahoo!: "421 4.7.1 [TS03] All messages from 208.80.155.162 will be permanently deferred; Retrying will NOT succeed. See http://postmaster.yahoo.com/421-ts03.html"
- 01:12 scfc_de: tools-mail: /var is full
May 20
- 18:34 YuviPanda: back to homerolled nginx 1.5 on proxy, newer versions causing too many issues
May 16
- 17:01 scfc_de: tools-webgrid-02: rm -f /tmp/core (tools.misc2svg, May 13 06:10, 3861106688)
May 14
- 16:31 scfc_de: tools-webproxy: "iptables -A INPUT -p tcp \! --source 127/8 --dport 6379 -j REJECT" to block connections from other Tools instances to Redis
- 00:23 Betacommand: 503's related to bug 65179
May 13
- 20:36 YuviPanda: restarting redis on tools-webproxy fixed 503s
- 20:36 valhallasw: redis failed, causing tools-webproxy to thow 503's
- 19:09 marktraceur: Restarted grrrit because it had a stupid nick
May 10
- 14:50 YuviPanda: upgraded nginx to 1.7.0 on tools-webproxy to get SPDY/3.1
May 9
- 13:16 scfc_de: Cleared error state of queues {continuous,mailq,task}@tools-exec-06 and webgrid-lighttpd; no obvious or persistent causes
May 6
- 19:31 scfc_de: replagstats fixed; Ganglia graphs are now under the virtual host "tools-replags"
- 17:53 scfc_de: Don't think replagstats is really working ...
- 16:40 scfc_de: Moved ~scfc/bin/replagstats to ~tools.admin/bin/ and enabled as a continuous job (cf. also bug #48694).
April 28
- 11:51 YuviPanda: pywikibugs Deployed bf1be7b
April 27
- 13:34 scfc_de: Restarted webservice for geohack and moved {access,error}.log to {access,error}.log.1
April 24
- 23:39 YuviPanda: restarted grrrit-wm, not greg-g. greg-g does not survive restarts and hence care must be taken to make sure he is not.
- 23:38 YuviPanda: restarted greg-g after cherry-picking aec09a6 for auth of IRC bot
- 23:33 legoktm: restarting grrrit-wm https://gerrit.wikimedia.org/r/129610
- 13:07 scfc_de: tools-mail: rm -f /var/log/exim4/paniclog (relay_domains bug)
April 20
- 14:27 scfc_de: tools-redis: Set role::labs::lvm::mnt and $lvm_mount_point=/var/lib, moved the data around and rebooted
- 14:08 scfc_de: tools-redis: /var is full
- 08:59 legoktm: grrrit-wm: 2014-04-20T08:28:15.889Z - error: Caught error in redisClient.brpop: Redis connection to tools-redis:6379 failed - connect ECONNREFUSED
- 08:48 legoktm: Your job 438884 ("lolrrit-wm") has been submitted
- 08:47 legoktm: [01:28:28] * grrrit-wm has quit (Remote host closed the connection)
April 13
- 14:20 scfc_de: Restarted webservice for wikihistory to see if the change to PHP_FCGI_MAX_REQUESTS increases reliability
- 14:17 scfc_de: tools-webgrid-01, tools-webgrid-02: Set PHP_FCGI_MAX_REQUESTS to 500 in /usr/local/bin/lighttpd-starter per http://redmine.lighttpd.net/projects/1/wiki/docs_performancefastcgi#Why-is-my-PHP-application-returning-an-error-500-from-time-to-time
April 12
- 23:51 scfc_de: tools-mail: rm -f /var/log/exim4/paniclog ("unknown named domain list "+relay_domains"")
April 11
- 16:21 scfc_de: tools-login: Killed -HUP process consuming 2.6 GByte; cf. wikitech:User talk:Ralgis#Welcome to Tool Labs
April 10
- 18:20 scfc_de: tools-webgrid-01, tools-webgrid-02: "kill -HUP" all php-cgis that are not (grand-)children of lighttpd processes
April 8
- 05:06 Ryan_Lane: restart nginx on tools-proxy-test
- 05:03 Ryan_Lane: upgraded libssl on all nodes
April 4
- 15:48 Coren: Moar powar!!1!one: added two exec nodes (-09 -10) and one webgrid node (-02)
- 11:11 scfc_de: Set /data/project/.system/config/wikihistory.workers to 20 on apper's request
March 30
- 18:16 scfc_de: Removed empty directories /data/project/{d930913,sudo-test{,-2},testbug{,2,3}}: Corresponding service groups don't exist (anymore)
- 18:13 scfc_de: Removed /data/project/backup: Only empty dynamic-proxy backup files of January 3rd and earlier
March 29
- 10:14 wm-bot: petrb: disabled 1 job in cron in -login of user tools.tools-info which was killing login server
March 28
- 11:53 wm-bot: petrb: did the same on -mail server (removed /var/log/exim4/paniclog) so that we don't get spam every day
- 11:51 wm-bot: petrb: removed content of /var/log/exim4/paniclog
- 11:49 wm-bot: petrb: disabled default vimrc which everybody hates on -login
March 21
- 16:50 scfc_de: tools-login: pkill -u tools.bene (OOM)
- 16:13 scfc_de: rmdir /home/icinga (totally empty, "drwxr-xr-x 2 nemobis 50383 4096 Mär 17 16:42", perhaps artifact of mass migration?)
- 15:49 scfc_de: sudo cp -R /etc/skel /home/csroychan && sudo chown -R csroychan.wikidev /home/csroychan; that should close [[bugzilla:62132]]
- 15:15 scfc_de: sudo cp -R /etc/skel /home/annabel && sudo chown -R annabel.wikidev /home/annabel
- 15:14 scfc_de: sudo chown -R torin8.wikidev /home/torin8
March 20
- 18:36 scfc_de: Pointed tools-dev.wmflabs.org at tools-dev.eqiad.wmflabs; cf. [[Bugzilla:62883]]
March 5
- 13:57 wm-bot: petrb: test
March 4
- 22:35 wm-bot: petrb: uninstalling it from -login too
- 22:32 wm-bot: petrb: uninstalling apache2 from tools-dev it has nothing to do there
March 3
- 19:20 wm-bot: petrb: shutting down almost all services on webserver-02 in order to make system useable and finish upgrade
- 19:17 wm-bot: petrb: upgrading all packages on webserver-02
- 19:15 petan: rebooting webserver-01 which is totally dead
- 19:07 wm-bot: petrb: restarting apache on webserver-02 it complains about OOM but the server has more than 1.5g memory free
- 19:03 wm-bot: petrb: switched local-svg-map-maker to webserver-02 because 01 is not accessible to me, hence I can't debug that
- 16:44 scfc_de: tools-webserver-03: Apache was swamped by request for /guc. "webservice start" for that, and pkill -HUP -u local-guc.
- 12:54 scfc_de: tools-webserver-02: Rebooted, apache2/error.log told of OOM, though more than 1G free memory.
- 12:50 scfc_de: tools-webserver-03: Rebooted, scripts were timing out
- 12:42 scfc_de: tools-webproxy: Rebooted; wasn't accessible by ssh.
March 1
- 03:42 Coren: disabled puppet in pmtpa tool labs\
February 28
- 14:46 wm-bot: petrb: extending /usr on tools-dev by 800mb
- 00:26 scfc_de: tools-webserver-02: Rebooted; inaccessible via ssh, http said "500 Internal Server Error"
February 27
- 15:28 scfc_de: chmod g-w ~fsainsbu/.forward
February 25
- 22:48 rdwrer: Lol, so, something happened with grrrit-wm earlier and nobody logged any of it. It was yoyoing, Yuvi killed it, then aude did something and now it's back.
February 23
- 20:46 scfc_de: morebots: labs HUPped to reconnect to IRC
February 21
- 17:32 scfc_de: tools-dev: mount -t nfs -o nfsvers=3,ro labstore1.pmtpa.wmnet:/publicdata-project /public/datasets; automount seems to have been stuck
- 15:24 scfc_de: tools-webserver-03: Rebooted, wasn't accessible by ssh and apparently no access to /public/datasets either
February 20
- 21:23 scfc_de: tools-login: Disabled crontab for local-rezabot and left a message at User talk:Reza#Running bots on tools-login, etc. (fa:بحث_کاربر:Reza1615 is write-protected)
- 20:15 scfc_de: tools-login: Disabled crontab for local-chobot and left a message at ko:사용자토론:ChongDae#Running bots on tools-login, etc.
- 10:42 scfc_de: tools-mail: rm -f /var/log/exim4/paniclog ("User 0 set for local_delivery transport is on the never_users list", cf. [[bugzilla:61583]])
- 10:30 scfc_de: tools-login: rm -f /var/log/exim4/paniclog (OOM)
- 10:28 scfc_de: Reset error status of task@tools-exec-09 ("can't get password entry for user 'local-voxelbot'"); "getent passwd local-voxelbot" works on tools-exec-09, possibly a glitch
February 19
- 20:21 scfc_de: morebots: Set "enable_twitter=False" in confs/labs-logbot.py and restarted labs-morebots
- 19:14 scfc_de: tools-login: Disabled crontab and pkill -HUP -u fatemi127
February 18
- 11:42 scfc_de: tools-mail: Rerouted queued mail (@tools-login.pmtpa.wmflabs => @tools.wmflabs.org)
- 11:34 scfc_de: tools-exec-08: Rebooted due to not responding on ssh and SGE
- 10:39 scfc_de: tools-mail: rm -f /var/log/exim4/paniclog ("User 0 set for local_delivery transport is on the never_users list" => probably artifacts from Coren's LDAP changes)
- 10:37 scfc_de: tools-login: rm -f /var/log/exim4/paniclog (OOM)
February 14
- 23:54 legoktm: restarting grrrit-wm since it disappeared
- 08:19 scfc_de: tools-login: rm -f /var/log/exim4/paniclog (OOM)
February 13
- 13:11 scfc_de: Deleted old job of user veblenbot stuck in error state
- 13:08 scfc_de: Deleted old jobs of user v2 stuck in error state
- 10:49 scfc_de: tools-login: Commented out local-shuaib-bot's crontab with a pointer to Tools/Help
February 12
- 07:51 wm-bot: petrb: removed /data/project/james/adminstats/wikitools per request from james on irc
February 11
- 15:47 scfc_de: Restarted webservice for geohack
- 13:02 scfc_de: tools-login: rm -f /var/log/exim4/paniclog (OOM)
- 13:00 scfc_de: Killed -HUP local-hawk-eye-bot's jobs; one was hanging with a stale NFS handle on tools-exec-05
February 10
- 23:16 Coren: rebooting webproxy (braindead autofs)
February 9
- 18:14 legoktm: restarting grrrit-wm, it keeps joining and quitting
- 04:27 legoktm: rebooting grrrit-wm - https://gerrit.wikimedia.org/r/#/c/112308
February 6
- 22:50 legoktm: restarting grrrit-wm https://gerrit.wikimedia.org/r/111889
February 4
- 20:38 legoktm: restarting grrrit-wm: 'Send mediawiki/extension/Thanks to -corefeatures' https://gerrit.wikimedia.org/r/111257
January 31
- 03:43 scfc_de: Cleaned up all exim queues
- 01:26 scfc_de: chmod g-w ~{bgwhite,daniel,euku,fale,henna,hydriz,lfaraone}/.forward (test: sudo find /home -mindepth 2 -maxdepth 2 -type f -name .forward -perm /g=w -ls)
January 30
- 21:48 scfc_de: chmod g-w ~fluff/.forward
- 21:40 scfc_de: local-betabot: Added "-M" option to crontab's qsub call and rerouted queued mail (freeze, exim -Mar, exim -Mmd, thaw)
- 18:33 scfc_de: tools-exec-04: puppetd --enable (apparently disabled sometime around 2014-01-16?!)
- 17:25 scfc_de: tools-exec-06: mv -f /etc/init.d/nagios-nrpe-server{.dpkg-dist,} (nagios-nrpe-server didn't start because start-up script tried to "chown icinga" instead of "chown nagios")
January 28
- 04:27 scfc_de: tools-webproxy: Blocked Phonifier
January 25
- 05:37 scfc_de: tools-webserver-02: rm -f /var/log/exim4/paniclog (OOM)
January 24
- 01:07 scfc_de: tools-db: Removed /var/lib/mysql2, set expire_logs_days to 1 day
- 00:11 scfc_de: tools-db: and restarted mysqld
- 00:11 scfc_de: tools-db: Moved 4.2 GBytes of the oldest binlogs to /var/lib/mysql2/
January 23
- 19:24 legoktm: restarting grrrit-wm now https://gerrit.wikimedia.org/r/#/c/109116/
- 19:23 legoktm: ^ was for grrrit-wm
- 19:23 legoktm: re-committed password to local repo, not sure why that wasn't committed already
January 21
- 17:41 scfc_de: tools-exec-09: iptables-restore /data/project/.system/iptables.conf
January 20
- 07:02 andrewbogott: merged a lint patch to the gridengine module. Should be a noop
January 16
- 17:11 scfc_de: tools-exec-09: "iptables-restore /data/project/.system/iptables.conf" after reboot
January 15
- 13:36 scfc_de: After reboot of tools-exec-09, all continuous jobs were successfully restarted ("Rr"); task jobs (1974113, 2188472) failed ("19 : before writing exit_status")
- 13:27 scfc_de: tools-login: rm -f /var/log/exim4/paniclog (OOM)
- 08:54 andrewbogott: rebooted tools-exec-09
- 08:32 andrewbogott: rebooted tools-db
January 14
- 15:10 scfc_de: tools-login: pkill -u local-mlwikisource: Freed 1 GByte of memory
- 14:58 scfc_de: tools-login: Disabled local-mlwikisource's crontab with explanation
- 13:57 scfc_de: tools-webserver-02: rm -f /var/log/exim4/paniclog (out of memory errors on 2014-01-10)
January 10
- 10:41 legoktm: grrrit-wm: restarting https://gerrit.wikimedia.org/r/106670
- 09:00 legoktm: grrrit-wm: setting up #mediawiki-feed, https://gerrit.wikimedia.org/r/106555
January 9
- 18:26 legoktm: rebased grrrit-wm on origin/master since fetching gerrit was failing
- 18:21 legoktm: restarting grrrit-wm https://gerrit.wikimedia.org/r/#/c/106501/
January 8
- 13:44 scfc_de: Cleared error states of continuous@tools-exec-05, task@tools-exec-05, task@tools-exec-09
January 7
- 18:59 scfc_de: tools-login, tools-mail: rm -f /var/log/exim4/paniclog (apparently some artifacts of the LDAP failure)
January 6
- 14:06 YuviPanda: deleted instance tools-mc, didn't know it had come back from the dead
January 1
- 13:24 scfc_de: tools-exec-02, tools-master, tools-shadow, tools-webserver-01: Commented out duplicate MariaDB entries in /etc/apt/sources.list and re-ran apt-get update
- 11:27 scfc_de: tools-webserver-01, tools-webserver-01: rm -f /var/log/exim4/paniclog; out of memory errors
- 11:18 scfc_de: Emptied /{data/project,home}/.snaplist as the snapshots themselves are not available
December 27
- 07:39 legoktm: grrrit-wm restart didn't really work.
- 07:38 legoktm: restarting grrit-wm, for some reason it reconnected and lost its cloak
December 23
- 18:30 marktraceur: restart grrrit-wm for subbu
December 21
- 06:50 scfc_de: tools-exec-01: Commented out duplicate MariaDB entries in /etc/apt/sources.list and re-ran apt-get update
December 19
- 17:22 marktraceur: deploying grrrit config change
December 17
- 23:19 legoktm: rebooted grrrit-wm with new config stuffs
December 14
- 18:13 marktraceur: restarting grrrit-wm to fix its nickname
- 13:17 scfc_de: tools-exec-08: Purged packages libapache2-mod-suphp and suphp-common (probably remnants from when the host was misconfigured as a webserver)
- 13:09 scfc_de: tools-dev, tools-login, tools-mail, tools-webserver-01, tools-webserver-02: rm /var/log/exim4/paniclog (mostly out of memory errors)
December 4
- 22:15 Coren: tools-exec-01 rebooted to fix the autofs issue; will return to rotation shortly.
- 16:33 Coren: rebooting webproxy with new kernel settings to help against the DDOS
December 1
- 14:05 Coren: underlying virtualization hardware rebooted; tools-master and friends coming back up.
November 25
- 21:03 YuviPanda: created tools-proxy-test instance to play around with the dynamicproxy
- 12:16 wm-bot: petrb: deswapping -login (swapoff -a && swapon -a)
November 24
- 07:19 paravoid: disabled crontab for user avocato on tools-login, see above
- 07:17 paravoid: pkill -u avocato on tools-login, multiple /home/avocato/pywikipedia/redirect.py DoSing the bastion
November 14
- 09:12 ori-l: Added aude to lolrrit-wm maintainers group
November 13
- 22:36 andrewbogott: removed 'imagescaler' class from tools-login because that class hasn't existed for a year. Which, a year ago is before that instance even existed so what the heck?
November 3
- 16:49 ori-l: grrrit-wm stopped receiving events. restarted it; didn't help. then restarted gerrit-to-redis, which seems to have fixed it.
November 1
- 16:11 wm-bot: petrb: restarted terminator daemon on -login to sort out memory issues caused by heavy mysql client by elbransco
October 23
- 15:19 Coren: deleted tools-tyrant and tools-exec-cyberbot (cleanup of obsoleted instances)
October 20
- 18:52 wm-bot: petrb: everything looks better
- 18:51 wm-bot: petrb: restarting apache server on tools-webproxy
- 18:49 wm-bot: petrb: installed links on -dev and going to investigate what is wrong with apaches, documentation, Coren, please update it
October 15
- 21:03 Coren: labs-login rebooted to fix the ownership/take issue with success.
October 10
- 09:49 addshore: tools-webserver-01is getting a 500 Internal Server Error again
September 23
- 06:44 YuviPanda: remove unpuppetized install of openjdk-6 packages causing problems in -dev (for bug: 54444)
- 06:44 YuviPanda: remove unpuppetized install of openjdk-6 packages causing problems in -dev (for bug: 54444)
- 05:15 legoktm: logging a log to test the log logging
- 05:13 legoktm: logging a log to test the log logging
September 11
- 09:39 wm-bot: petrb: started toolwatcher
August 24
- 18:00 wm-bot: petrb: freed 1600mb of ram by killing yasbot processes on -login
- 17:59 wm-bot: petrb: killing all python processes of yasbot on -login, this bot needs to run on grid, -login is constantly getting OOM because of this bot
August 23
- 12:17 wm-bot: petrb: test
- 12:15 wm-bot: petrb: making pv from /dev/vdb on new nodes
- 11:49 wm-bot: petrb: syncing packages of -login with exec nodes
- 11:48 petan: someone installed firefox on exec nodes, should investigate / remove
August 22
- 01:24 scfc_de: tools-webserver-03: Installed python-oursql
August 20
- 23:00 scfc_de: Opened port 3000 for intra-Labs traffic in execnode security group for YuviPanda's proxy experiments
August 19
- 09:52 wm-bot: petrb: deleting fatestwiki tool, requested by creator
August 16
- 00:16 scfc_de: tools-exec-01 doesn't come up again even after repeat reboots
August 15
- 15:14 scfc_de: tools-webserver-01: Simplified /usr/local/bin/php-wrapper
- 14:31 scfc_de: tools-webserver-01: "dpkg --configure -a" on apt-get's advice
- 14:24 scfc_de: chmod 644 ~magnus/.forward
- 03:07 scfc_de: tools-webproxy: Temporarily serving 403s to AhrefsBot/bingbot/Googlebot/PaperLiBot/TweetmemeBot/YandexBot until they reread robots.txt
- 02:02 scfc_de: robots.txt: "Disallow: /"
August 11
- 03:14 scfc_de: tools-mc: Purged memcached
August 10
- 02:36 scfc_de: Disabled terminatord on tools-login and tools-dev
- 02:24 scfc_de: chmod g-w ~whym/.forward
August 6
- 19:26 scfc_de: Set up basic robots.txt to exclude Geohack to see how that affects traffic
- 02:09 scfc_de: tools-mail: Enabled rudimentary Ganglia monitoring in root's crontab
August 5
- 20:32 scfc_de: chmod g-w ~ladsgroup/.forward
August 2
- 23:45 scfc_de: tools-dev: Installed dialog for testing
August 1
- 19:57 scfc_de: Created new instance tools-redis with redis_maxmemory = "7GB"
- 19:56 scfc_de: Added redis_maxmemory to wikitech Puppet variables
July 31
- 10:50 HenriqueCrang: ptwikis added graph with mobile edits
July 30
- 19:08 scfc_de: tools-webproxy: Purged popularity-contest and ubuntu-standard
- 07:32 wm-bot: petrb: deleted local-addbot jobs
- 02:01 scfc_de: tools-webserver-01: Symlinked /usr/local/bin/{job,jstart,jstop,jsub} to /usr/bin; were obsolete versions.
July 29
- 15:15 scfc_de: tools-webserver-01: rm /var/log/exim4/paniclog
- 15:10 scfc_de: Purged popularity-contest from tools-webserver-01.
- 02:40 scfc_de: Restarted toolwatcher on tools-login.
- 02:11 scfc_de: Reboot tools-login, was not responsive
July 25
- 23:37 Ryan_Lane: added myself to lolrrit-wm tool
- 12:06 wm-bot: petrb: test
- 07:11 wm-bot: petrb: created /var/log/glusterfs/bricks/ to stop rotatelogs from complaining about it being missing
July 20
- 15:19 petan: rebooting tools-redis
July 19
- 07:06 petan: instances were rebooted for unknown reasons
- 00:42 helderwiki: it works! :-)
- 00:41 legoktm: test
July 10
- 18:04 wm-bot: petrb: installing mysqltcl on grid
- 18:01 wm-bot: petrb: installing tclodbc on grid
July 5
- 19:38 AzaToth: test
- 19:36 AzaToth: test for example
- 18:23 Coren: brief outage of webproxy complete (back to business!)
- 18:13 Coren: brief outage of webproxy (rollback 2.4 upgrade)
July 3
- 13:44 scfc_de: Set "HostbasedAuthentication yes" and "EnableSSHKeysign yes" in tools-dev's /etc/ssh/ssh_config
- 12:58 petan: rebooting -mc it's aparently OOM dying
July 2
- 16:24 wm-bot: petrb: installed maria to all nodes so we can connect to db even from sge
- 12:19 wm-bot: petrb: installing packages -- libmediawiki-api-perl libdatetime-format-strptime-perl libbot-basicbot-perl libdatetime-format-duration-perl
July 1
- 18:39 wm-bot: petrb: started toolwatcher on - login
- 14:22 wm-bot: petrb: installing following packages on grid: libdata-dumper-simple-perl libhtml-html5-entities-perl libirc-utils-perl libtask-weaken-perl libobject-pluggable-perl libpoe-component-syndicator-perl libpoe-filter-ircd-perl libsocket-getaddrinfo-perl libpoe-component-irc-perl libxml-simple-perl
- 12:05 wm-bot: petrb: starting toolwatcher
- 11:40 wm-bot: petrb: tools is back o/
- 09:42 wm-bot: petrb: installing python -zmg -matplotlib @ dev
- 03:33 scfc_de: Rebooted tools-login apparently out of memory and not responding to ssh
June 30
- 17:58 scfc_de: Set ssh_hba to yes on tools-exec-06
- 17:13 scfc_de: Installed python-matplotlib and python-zmq on tools-login for YuviPanda
June 26
- 21:16 Coren: +Tim Landscheidt to project admins, local-admin
- 14:23 wm-bot: petrb: updating several packages on -login
- 13:43 wm-bot: petrb: killing old instance of redis: Jun15 ? 00:06:49 /usr/bin/redis-server /etc/redis/redis.conf
- 13:42 wm-bot: petrb: restarting redis
- 13:28 wm-bot: petrb: running puppet on -mc
- 13:27 wm-bot: petrb: adding ::redis role to tools-mc - if anything will break, YuviPanda did it :P
- 09:35 wm-bot: petrb: updated status.php to version which display free vmem as well
June 25
- 12:34 wm-bot: petrb: installing php5-mcrypt on exec and web
June 24
- 15:45 wm-bot: petrb: changed colors of root prompt productions vs testing
- 07:57 wm-bot: petrb: 50527 4186 22830 1 Jun23 pts/41 00:08:54 python fill2.py eats 48% of ram on -login
June 19
- 12:17 wm-bot: petrb: increasing limit on mysql connections
June 17
- 17:34 wm-bot: petrb: /var/spool/cron/crontabs/ has -rw------- 1 8006 crontab 1176 Apr 11 14:07 local-voxelbot fixing
June 16
- 21:23 Coren: 1.0.3 deployed (jobutils, misctools)
June 15
- 21:40 wm-bot: petrb: there is no lvm on -db which we need as hell - therefore no swap either nor storage for binary logs :( I got a feeling that mysql will die oom soonish
- 21:39 wm-bot: petrb: db has 5% free RAM eeeek
- 18:36 wm-bot: root: removed lot of ?audit? logs from exec-04 they were eating too much storage
- 18:23 wm-bot: petrb: temporarily disabling /tmp on exec-04 in order to set up lvm
- 18:23 wm-bot: petrb: exec-04 96% / usage, creating a new volume
- 12:33 wm-bot: petrb: installing redis on tools-mc
June 14
- 12:35 wm-bot: petrb: updating logsplitter to new version
June 13
- 21:59 wm-bot: petrb: replaced logsplitter on both apache servers with far more powerfull c++ version thus saving a lot of resources on both servers
- 12:43 wm-bot: petrb: tools-webserver-01 is running quite expensive python job (currently eating almost 1gb of ram) it may need to be fixed or moved to separate webserver, adding swap to prevent machine die OOM
- 12:22 wm-bot: petrb: killing process 31187 sort -T./enwiki/target -t of user local-enwp10 for same reason as previous one
- 12:21 wm-bot: petrb: killing process 31190 sort -T./enwiki/target of user local-enwp10 for same reason as previous one
- 12:17 wm-bot: petrb: killing process 31186 31185 69 Jun11 pts/32 1-13:14:41 /usr/bin/perl ./bin/catpagelinks.pl ./enwiki/target/main_pages_sort_by_ids.lst ./enwiki/target/pagelinks_main_sort_by_ids.lst because it seems to be a bot running on login server eating too many resources
June 11
- 07:36 wm-bot: petrb: installed libdigest-crc-perl
June 10
- 13:05 wm-bot: petrb: installing libcrypt-gcrypt-perl
- 08:45 wm-bot: petrb: updated /usr/local/bin/logsplitter on webserver-01 in order to fix !b 49383
- 08:45 wm-bot: petrb: updated /usr/local/bin/logsplitter on webserver-01 in order to fix become afcbot 49383
- 08:44 wm-bot: petrb: updated /usr/local/bin/logsplitter on webserver-01 in order to fix become afcbot 49383
- 08:25 wm-bot: petrb: fixing missing packages on exec nodes
June 9
- 20:44 wm-bot: petrb: moved logs on -login to separate storage
June 8
- 21:24 wm-bot: petrb: installing python-imaging-tk on grid
- 21:20 wm-bot: petrb: installing python-tk
- 21:16 wm-bot: petrb: installing python-flickrapi on grid
- 21:16 wm-bot: petrb: installing
- 16:49 wm-bot: petrb: turned off wmf style of vi on tools-dev feel free to slap me :o or do cat /etc/vim/vimrc.local >> .vimrc if you love it
- 15:33 wm-bot: petrb: grid is overloaded, needs to be either enlarged or jobs calmed down :o
- 09:55 wm-bot: petrb: backporting tcl 8.6 from debian
- 09:38 wm-bot: petrb: update python requests to version 1.2.3.1
June 7
- 15:29 Coren: Deleted no-longer-needed tools-exec-cg node (spun off to its own project)
June 5
- 09:52 wm-bot: petrb: on -dev
- 09:52 wm-bot: petrb: moving /usr to separate volume expect problems :o
- 09:41 wm-bot: petrb: moved /var/log to separate volume on -dev
- 09:31 wm-bot: petrb: houston we have problem, / on dev is 94%
- 09:28 wm-bot: petrb: installed openjdk7 on -dev
- 09:00 wm-bot: petrb: removing wd-terminator service
- 08:39 wm-bot: petrb: started toolwatcher
- 07:04 wm-bot: petrb: installing maven on -dev
June 4
- 14:49 wm-bot: petrb: installing sbt in order to fix b48859
- 13:28 wm-bot: petrb: installing csh on cluster
- 08:37 wm-bot: petrb: installing python-memcache on exec nodes
June 3
- 21:40 Coren: Rebooting -login; it's trashing. Will keep an eye on it.
- 14:15 wm-bot: petrb: removing popularity contest
- 14:11 wm-bot: petrb: removing /etc/logrotate.d/glusterlogs on all servers to fix logrotate daemon
- 09:43 wm-bot: petrb: syncing packages on exec nodes to avoid troubles with missing libs on some etc
June 2
- 08:39 wm-bot: petrb: installing ack-grep everywhere per yuvipanda and irc
June 1
- 20:57 wm-bot: petrb: installed this to exec nodes because it was on some and not on others cpp-4.4 cpp-4.5 cython dbus dosfstools ed emacs23 ftp gcc-4.4-base iptables iputils-tracepath ksh lsof ltrace lshw mariadb-client-5.5 nano python-dbus python-egenix-mxdatetime python-egenix-mxtools python-gevent python-greenlet strace telnet time -y
- 20:42 wm-bot: petrb: installing wikitools cluster wide
- 20:40 wm-bot: petrb: installing oursql cluster wide
- 10:46 wm-bot: petrb: created new instance for experiments with sasl memcache tools-mc
May 31
- 19:17 petan: deleting xtools project (requested by Cyberpower678)
- 17:24 wm-bot: petrb: removing old kernels from -dev because / is almost full
- 17:17 wm-bot: petrb: installed lsof to -dev
- 15:55 wm-bot: petrb: installed subversion to exec nodes 4 legoktm
- 15:47 wm-bot: petrb: replacing mysql with maria on exec nodes
- 15:46 wm-bot: petrb: replacing mysql with maria on exec nodes
- 15:14 wm-bot: petrb: installing default-jre in order to satisfy its dependencies
- 15:13 wm-bot: petrb: installing /data/project/.system/deb/all/sbt.deb to -dev in order to test it
- 13:04 wm-bot: petrb: installing bashdb on tools and -dev
- 12:27 wm-bot: petrb: removing project local-jimmyxu - per request on irc
- 10:54 wm-bot: petrb: killing process 3060 on -login (mahdiz 3060 1964 88 May30 ? 21:32:51 /bin/nano /tmp/crontab.Ht3bSO/crontab) it takes max cpu and doesn't seem to be attached
May 30
- 12:24 wm-bot: petrb: deleted job 1862 from queue (error state)
- 08:26 wm-bot: petrb: updated sql command
May 29
- 21:05 wm-bot: petrb: running sudo apt-get install php5-gd
May 28
- 20:00 wm-bot: petrb: installing p7zip-full to -dev and -login
May 27
- 08:46 wm-bot: petrb: changed config of mysql to use /mnt as path to save binary logs, this however requires server to be restarted
May 24
- 08:44 petan: setting up lvm on new exec nodes because it is more flexible and allows us to change the size of volumes on the fly
- 08:28 petan: created 2 more exec nodes, setting up now...
May 23
- 09:20 wm-bot: petrb: process 27618 on -login is constantly eating 100% of cpu, changing priority to 20
May 22
- 20:54 wm-bot: petrb: changing ownership of /data/project/bracketbot/ to local-bracketbot
- 14:28 labs-logs-bottie: petrb: installed netcat as well
- 14:28 labs-logs-bottie: petrb: installed telnet to -dev
- 14:02 Coren: tools-webserver-02 now live; / and /cluebot/ moved there
May 21
- 20:27 labs-logs-bottie: petrb: uploaded hosts to -dev
May 19
- 13:40 labs-logs-bottie: petrb: killing that nano process seems to be some hang and unattached anyway
- 12:59 labs-logs-bottie: petrb: changed priority of nano process to 19
- 12:55 labs-logs-bottie: petrb: local-hawk-eye-bot /bin/nano /tmp/crontab.d4JhUj/crontab eat too much cpu
- 12:50 petan: nvm previous line
- 12:50 labs-logs-bottie: petrb: vul alias viewuserlang
May 14
- 21:22 labs-logs-bottie: petrb: created a separate volume for /tmp on login so that temp files do not fragment root fs and it does not get filled up by them, it also makes it easier to track filesystem usage
- 13:16 Coren: reboot -dev, need to test kernel upgrade
May 10
- 15:08 Coren: create tools-webserver-02 for Apache 2.4 experimentation
May 9
- 04:12 Coren: added -exec-03 and -exec-04. Moar power!!1!
May 6
- 19:59 Coren: made tools-dev.wmflabs.org public
- 08:04 labs-logs-bottie: petrb: created a small swap on -login so that users can not bring it to OOM so easily and so that unused memory blocks can be swapined in order to use the remaining memory more effectively
- 08:00 labs-logs-bottie: petrb: making lvm from unused disk from /mnt on -login so that we can eventually use it somewhere if needed
May 4
- 17:50 labs-logs-bottie: petrb: foobar as well
- 17:47 labs-logs-bottie: petrb: removing project flask-stub using rmtool
- 15:33 labs-logs-bottie: petrb: fixing missing db user for local-stub
- 12:51 labs-logs-bottie: petrb: creating mysql accounts by hand for alchimista and fubar
May 2
- 20:49 labs-logs-bottie: petrb: uploaded motd to exec-N as well, with information which server users connected to
May 1
- 16:59 labs-logs-bottie: petrb: fixed invalid permissions on /home
April 27
- 18:54 labs-logs-bottie: petrb: installing pymysql using pip on whole grid because it is needed for greenrosseta (for some reason it is better than python-mysql package)
April 26
- 23:55 Coren: reboot to finish security updates
- 08:00 labs-logs-bottie: petrb: patching qtop
- 07:57 labs-logs-bottie: petrb: added tools-dev to admin host list so that qtop works and fixing the bug of qtop
- 07:28 labs-logs-bottie: petrb: installing GE tools to -dev so that we can develop new j|q* stuff there
April 25
- 19:00 Coren: Maintenance over; systems restarted and should be working.
- 18:18 labs-logs-bottie: petrb: we are getting in troubles with memory on tools-db there is only less than 20% free memory
- 18:01 Coren: Begin maintenance (login disabled)
- 13:21 petan: removing local-wikidatastats from ldap
April 24
- 13:17 labs-logs-bottie: petrb: sudo chown local-peachy PeachyFrameworkLogo.png
- 11:37 labs-logs-bottie: petrb: created new project stats and cloned acl from wikidatastats, which is supposed to be deleted
- 11:32 legoktm: wikidatastats attempting to install limn
- 11:15 labs-logs-bottie: petrb: installing npm to -login instance
- 07:34 petan: creating project wikidatastats for legoktm addshore and yuvipandianablah :P
April 23
- 13:32 labs-logs-bottie: petrb: changing permissions of cyberbot and peachy to 775 so that it is easier to use them
- 12:14 labs-logs-bottie: petrb: qtop on -dev
- 12:12 labs-logs-bottie: petrb: removed part of motd from login server that got there in a mysterious way
April 19
- 22:38 Coren: reboot -login, all done with the NFS config. yeay.
- 17:13 Coren: (final?) reboot of -login with the new autofs configuration
- 16:24 Coren: (rebooted -login)
- 16:24 Coren: autofs + gluster = fail
- 14:45 Coren: reboot -login (NFS mount woes)
April 15
- 22:29 Coren: also a test; note how said bot knows its place. :-)
- 22:14 andrewbogott: this is a test of labs-morebots.
- 21:49 andrewbogott: this is a test
- 15:41 labs-logs-bottie: petrb: installing p7zip everywhere
- 08:00 labs-logs-bottie: petrb: installing dev packages needed for YuviPanda on login box
April 11
- 22:39 Coren: rebooted tools-puppet-test (no end-user impact): hung filesystem prevents login
- 07:42 labs-logs-bottie: petrb: removed reboot information from motd
Instances for this project
Instance Name | Instance Type | Project | Image Id | FQDN | Public IP | Launch Time | Puppet Class | Modification dateThis property is a special property in this wiki. | Number of CPUs | RAM Size | Amount of Storage | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
I-0000047a.eqiad.wmflabs | tools-exec-12 | m1.xlarge | tools | ubuntu-14.04-trusty | i-0000047a.eqiad.wmflabs | 9 July 2014 23:07:32 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
9 July 2014 23:16:41 | 8 | 16,384 | 160 | |
I-0000047b.eqiad.wmflabs | tools-exec-13 | m1.xlarge | tools | ubuntu-12.04-precise | i-0000047b.eqiad.wmflabs | 9 July 2014 23:09:19 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
9 July 2014 23:14:00 | 8 | 16,384 | 160 | |
I-00000479.eqiad.wmflabs | tools-exec-11 | m1.xlarge | tools | ubuntu-12.04-precise | i-00000479.eqiad.wmflabs | 9 July 2014 23:06:36 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
9 July 2014 23:11:42 | 8 | 16,384 | 160 | |
I-00000451.eqiad.wmflabs | tools-webgrid-04 | m1.xlarge | tools | ubuntu-12.04-precise | i-00000451.eqiad.wmflabs | 30 June 2014 17:30:43 | base role::labs::instance sudo::labs_project role::labs::tools::webnode |
4 July 2014 09:57:24 | 8 | 16,384 | 160 | |
I-00000450.eqiad.wmflabs | tools-webgrid-03 | m1.xlarge | tools | ubuntu-12.04-precise | i-00000450.eqiad.wmflabs | 30 June 2014 15:52:34 | base role::labs::instance sudo::labs_project role::labs::tools::webnode |
4 July 2014 09:57:16 | 8 | 16,384 | 160 | |
I-000000d8.eqiad.wmflabs | tools-exec-04 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d8.eqiad.wmflabs | 28 February 2014 04:36:59 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
1 July 2014 17:50:59 | 4 | 8,192 | 80 | |
I-000000d5.eqiad.wmflabs | tools-exec-02 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d5.eqiad.wmflabs | 28 February 2014 04:35:47 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
1 July 2014 15:53:40 | 4 | 8,192 | 80 | |
I-000000db.eqiad.wmflabs | tools-exec-07 | m1.medium | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000db.eqiad.wmflabs | 28 February 2014 04:38:11 | base role::labs::instance sudo::labs_project role::labs::tools::execnode |
1 July 2014 12:12:03 | 2 | 4,096 | 40 | |
I-000000d9.eqiad.wmflabs | tools-exec-05 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d9.eqiad.wmflabs | 28 February 2014 04:37:24 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
1 July 2014 11:45:16 | 4 | 8,192 | 80 | |
I-000002f3.eqiad.wmflabs | tools-exec-10 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000002f3.eqiad.wmflabs | 4 April 2014 12:56:55 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
1 July 2014 01:43:13 | 4 | 8,192 | 80 | |
I-000000d6.eqiad.wmflabs | tools-exec-03 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d6.eqiad.wmflabs | 28 February 2014 04:35:43 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
1 July 2014 01:37:47 | 4 | 8,192 | 80 | |
I-000000d4.eqiad.wmflabs | tools-exec-01 | m1.large | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d4.eqiad.wmflabs | 28 February 2014 04:35:02 | base role::labs::instance sudo::labs_project role::labs::tools::execnode role::labs::lvm::biglogs |
30 June 2014 21:13:04 | 4 | 8,192 | 80 | |
I-000002f1.eqiad.wmflabs | tools-webgrid-02 | m1.xlarge | tools | ubuntu-12.04-precise | i-000002f1.eqiad.wmflabs | 4 April 2014 12:56:04 | base role::labs::instance exim::simple-mail-sender sudo::labs_project role::labs::tools::webnode |
29 June 2014 18:26:09 | 8 | 16,384 | 160 | |
I-000002e5.eqiad.wmflabs | tools-proxy-test | m1.medium | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000002e5.eqiad.wmflabs | 1 April 2014 20:48:56 | base role::labs::instance sudo::labs_project role::labs::tools::proxy role::puppet::self |
27 June 2014 19:51:05 | 2 | 4,096 | 40 | |
I-000003d3.eqiad.wmflabs | tools-trusty-test | m1.small | tools | ubuntu-14.04-trusty (deprecated 2014-06-10) | i-000003d3.eqiad.wmflabs | 2 June 2014 20:14:22 | base role::labs::instance sudo::labs_project role::labsnfs::client role::labs::tools::proxy role::labs::lvm::srv |
25 June 2014 08:46:20 | 1 | 2,048 | 20 | |
I-0000043e.eqiad.wmflabs | tools-tcl-test | m1.small | tools | ubuntu-12.04-precise | i-0000043e.eqiad.wmflabs | 23 June 2014 14:42:31 | base role::labs::instance sudo::labs_project |
23 June 2014 14:43:03 | 1 | 2,048 | 20 | |
I-00000274.eqiad.wmflabs | tools-submit | m1.small | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-00000274.eqiad.wmflabs | 22 March 2014 13:54:22 | base role::labs::instance sudo::labs_project role::labs::tools::submit role::labs::lvm::biglogs |
18 June 2014 09:27:08 | 1 | 2,048 | 20 | |
I-000000e6.eqiad.wmflabs | tools-webproxy | m1.medium | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000e6.eqiad.wmflabs | 3 March 2014 15:33:26 | base role::labs::instance exim::simple-mail-sender sudo::labs_project role::labs::tools::proxy role::labs::lvm::biglogs |
23 May 2014 14:09:24 | 2 | 4,096 | 40 | |
I-000000d1.eqiad.wmflabs | tools-mail | m1.small | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d1.eqiad.wmflabs | 28 February 2014 04:32:37 | base role::labs::instance exim::simple-mail-sender sudo::labs_project role::labs::tools::mailrelay role::labs::lvm::biglogs |
22 May 2014 01:17:41 | 1 | 2,048 | 20 | |
I-000000d3.eqiad.wmflabs | tools-webgrid-tomcat | m1.xlarge | tools | ubuntu-12.04-precise (deprecated 2014-04-17) | i-000000d3.eqiad.wmflabs | 28 February 2014 04:34:30 | base role::labs::instance exim::simple-mail-sender sudo::labs_project role::labs::tools::tomcatnode |
29 April 2014 18:13:40 | 8 | 16,384 | 160 | |
… further results |