A list of MMOs, MOBAs, and FPS games that have been load-testing using bots.
TERA
Koo, 2010, How to support an action-heavy MMORPG, slides
Their system to load-test a shard is called Sisyphus, and runs 1500 clients per machine. The behavior of the bots is based on the behaviors from real players (probably from alpha). A WAN simulator is used between bots and realm servers; the average latency is set to 200ms. A high-performance machine and a "dedicated line" were needed.
EVE
Press, 2011, Orchestrator: A post-mortem on an automated MMO testing framework, slides
EVE started to make a thin client from its game client in 2010. Orchestrator is the tool they use to load-test and integration-test their architecture and code. Orchestrator does not send one script to each client, proxy, and server. Instead, it runs a single master script, and tells them, as they progress in the test, what the next operation is. The test can be stopped right when a client reports a bug, not until everything scheduled has been sent.
OpenSim
Lake, 2010, Distributed scene graph to enable thousands of interacting users in a virtual environment, paper
The limitations we have encountered with avatar scaling
during these experiments have been in getting enough
hardware to generate the load of over 1000 clients and the
limited physics simulation capabilities of a single thread on
the scene server.
League of Legends (allocating games to servers)
Delap, 2010, League of Legends: Scaling to millions of ninjas, yordles, and wizards, video + slides
- Load-testing in a realistic setup: more than 50 machines with the same spec as those in production
- EC2 is a good tool, but the network is not reliable, so careful not trying to fix problems that only happen in the test setting.
- With thousands of clients, logs may not be the best way to gather test results.
League of Legends (chat)
McArthur, 2011, Building the chat service for League of Legends, slides
Dozen of EC2 machines, each running 5-9k bots. Each bot is an XMPP chat client (they used the Smack API).
Load-testing is useless without proper modeling
.
Crysis 2
Hall, 2011, A Programmer's Post-mortem Crysis 2 Multiplayer, slides
They wanted to check the frame rate with lots of moving entities, and detect bugs or gameplay issues. They used automatic testing. "Lots" of bots are run for 10 minutes per level to stress-test the builds. The bots do random actions like walking, jumping, or shooting.
Gears of War 3
Weilbacher, 2012, Dedicated Servers in Gears of War 3 Scaling to Millions of Players, slides
Their bots are clients without renderer and user input. They run automated bot matches to check the performance of their server platform. For Gears 2, they used to run 2.5 games per core in 2009. For Gears 3, they run 7 games per core in 2011.
Guild Wars 2
Patrick Wyatt, a lead programmer on Guild Wars and Guild Wars 2, discourages using bots: bot's behavior differs too much from actual users. Instead, he recommends recording live play data, and replay it on the server to fix bugs or check the load.