Betfair
Performance and Automation Engineer, December 2007 - Present
Betfair is the world's largest Internet betting exchange
- One of a small cross-department group most often involved in understanding and resolving exceptional emergency situations, last-line support etc.
- Delivery of infrastructure automation tools to improve throughput in all departments.
- Development of Command and Control tools to manage large (thousands of devices) and global estates.
- Evaluation and selection of third-party tools for automation and testing, dealing directly with suppliers and external companies.
- Working within the Performance and Automation Services departments.
- Automation of performance testing, system building, hypervisor (Xen and KVM) and site management.
- Requirements gathering and refinement across the whole of the Engineering division.
Trutap
Lead Infrastructure Architect (contract) January 2007 - December 2007
Trutap was a mobile social-networking startup which launched in 2007 at the TechCrunch 40 event in the US.
- Delivery of a self-contained datacentre presence on time and within budget, with a documented scaling plan up to one million concurrent user sessions.
- Delivery of a Disaster Recovery plan tested to provide a fully working replicated copy of the main datacentre within 12 hours of acquisition of replacement hardware (4 hours if a backup SAN survives).
- All negotiation with hardware suppliers and datacentres.
- Evaluating, sourcing and building the datacentre presence from scratch.
- All hardware and software evaluation, sourcing and configuration.
- Configuration of a new VMware ESX environment using rapid deployment and implicit disaster recovery.
- Design and coding for a company-wide multi-site configuration and service framework.
- Performance tuning and database optimisation for the system once the code was written.
The Search Works
Lead Infrastructure Architect (contract), August 2006 - December 2006
TSW (now part of TradeDoubler) were the leading search engine ad-placement provider, dealing with Google, MSN, Yahoo and others.
- Delivery of a brand-new datacentre in Tokyo running a re-engineered TSW platform (twice the capacity of the existing system on one quarter the hardware).
- Redesign and rearchitect the TSW platform for deployment in Japan as a fully available load balanced system.
- Develop a new packaging, installation and release methodology.
- Liase with Development and Operations to manage testing and deployment.
- Build, test and deploy the new server platform in Tokyo.
Inspired Broadcast Networks
Senior Systems Administrator, 2004 - 2006
IBN is a leading provider of out-of-home pay-to-play entertainment including the Itbox pub gaming system.
- Designing and implementing the Red Hat Satellite Server structure and policy for all Linux-based systems within the company. Creation of standardised tools used throughout the company to package software for server installation. Training of the Systems and Networks team to use the Satellite Server and packaging tools.
- Engineering the move of the Music (20 Terabytes of MP3 music) from home-grown storage servers to a managed NAS.
- Creation of new policies for packaging, DNS, etc. Liaison with the rest of the business to implement policies at all levels.
- Implementation of Alteon Load balancers in the department and training coworkers to use them.
- General Systems Administration tasks - automation of common jobs, checking servers, securing systems, patching, troubleshooting etc.
- On-call support and out-of-hours administration.
- Training in all of the above.
London Marketing
Head of Infrastructure, 2003 - 2004
London Marketing provides the
London Town web portal for booking hotels, theatre tickets, finding what's on and so on around London.
- Design, implementation and management of a high volume clustered web site with very high customer visibility (Accessible free through BT internet phones throughout London).
- Systems Administration - ultimate authority over the hosting platform, networking infrastructure, development systems and office servers.
- Disaster recovery planning and testing.
- Security management for the company as a whole (Policies, Firewalls, etc).
- Spam and virus protection for a company that has been posting several dozen valid email addresses on its very high traffic website for several years
- Training staff in Unix systems tasks, best practices and methods.
- On call support of all systems.
- I'm reliably informed that, as of mid 2008, they were still running on the almost-unmodified systems I built.
Global Communications
Systems Administrator, 2001 - 2003
Global Communications runs
NetBanx, an online merchant system (credit/debit card payment for websites).
- Systems administration, including maintenance, design and implementation of new servers (http, smtp, pop3, imap, irc, real and proprietary), troubleshooting, security work and occasional resurrection.
- Implementation of monitoring and reporting systems for high availability systems storing several million credit card numbers. This is the core fail-over, redundancy and monitoring suite for NetBanx.
- Implementation of mail servers using Exim, Cyrus and Sophos for SMTP, IMAP/POP3 and virus scanning respectively.
- Web servers using Apache and Apache Stronghold (with hardware SSL acceleration).
- Benchmarking and stress testing of servers.
Highway One
Systems Administrator, 1999 - 2000
- Systems Admin, postmaster, RIPE Administrator
- Training of second-line support personnel in DNS, SMTP, NNTP and other important protocols.
- Third-line support for customer issues.