Evaluating AMD’s TrueAudio and Mantle Technologies with Thief – AnandTech
Scheduled for release today is the 1.3/AMD patch for Thief, Square Enix’s recently released stealth action game. Following last month’s Battlefield 4 patch, Thief is the second big push for AMD’s recent Radeon technology initiative, becoming the second game to support Mantle and the first game to support TrueAudio Technology.
Thief has been something of a miss from a Metacritic perspective, but from a technology perspective it’s still a very big deal for AMD and Radeon owners. As a Mantle enabled title it’s the second game to support Mantle and the first single-player game to support it. Furthermore for AMD it showcases that they have Mantle support from more developers than just EA and other Frostbite 3 users, with Square Enix joining the fray. Finally it’s the first Unreal Engine based game to support Mantle, which can be particularly important since Unreal Engine 3 is so widely used and we expect much the same for the forthcoming Unreal Engine 4.
But more excitingly the release of this patch heralds the public release of AMD’s TrueAudio technology. Where Battlefield 4 was the launch title for Mantle, Thief is the launch title for TrueAudio, being the first game to receive TrueAudio support. At the same time it also marks the start of AMD enabling TrueAudio in their drivers, and the start of their TrueAudio promotional campaign. So along with Thief, AMD is also going to be distributing demos to showcase the capabilities of TrueAudio, but more on that later.
Diving right into matters then, at the end of last week AMD was able to get the press access to today’s patch, giving us a short opportunity to look at Thief from both a Mantle perspective and a TrueAudio perspective. As this week coincides with GDC 2014 we haven’t had a ton of time to spend with Thief, so this overview is going to be relatively brief, but it has given us enough time to play with both of AMD’s technologies.
As this is a brief overview we’re going to skip recapping the technical details behind Mantle and TrueAudio. But if you haven’t read our previous works on those subjects, you can find more details on TrueAudio and Mantle in their respective articles.
Finally, launching alongside today’s Thief patch will be the latest rendition of AMD’s Catalyst drivers, Catalyst 14.3 Beta 1 (build 13.1350.1005). We don’t have a change log for these drivers at this point – expect one to be posted alongside the drivers today – but the important point is that these are the drivers intended to be used alongside the newly patched Thief and AMD’s TrueAudio demos.
First and foremost, let’s talk about Mantle. Whereas Battlefield 4 was primarily a multiplayer game, Thief is the first single player game to gain Mantle support. So although Thief isn’t the first Mantle game, by virtue of being a single player game it presents gamers and Mantle with a very different and much more tightly structured workload to work off of. Perhaps more importantly, since it is a single player game it has a much more consistent performance profile than Battlefield 4, and better still it even has a built in benchmark to go with it.
On the whole, Thief is a better than average game from a graphics technology perspective. It is a multi-platform title based on Unreal Engine 3, and at higher quality settings includes a number of graphical features such as tessellation, contact hardening shadows, and even supersample anti-aliasing (achieved through internally rendering at a higher resolution). However even with those effects, unlike Battlefield 4, Thief is much easier to CPU bottleneck. On our fastest video cards it tends to be SSAA (or very high resolutions) that leads to Thief being bottlenecked, allowing it to otherwise become CPU bottlenecked at 1080p without SSAA.
When it comes to being CPU limited, Thief’s preferences are clear: 4 cores with as much performance per thread as you can throw at it. This leads to Thief strongly favoring Intel CPUs – first the quads and then the dual cores – with AMD’s CPUs and APUs falling into place after that. As a result of these CPU bottlenecks Thief can trend very close to being a best case scenario for Mantle, so long as it’s not outright GPU bottlenecked.
With that in mind we quickly took a look at Thief’s Mantle performance on an R9 290X (Uber mode to rule out throttling) and an R7 260X to cover both a high-end GPU and a mainstream GPU. Furthermore we tested both of those configurations with a variant of the game’s Very High settings – dropping SSAA down to Low in exchange for 16x AF, alongside the game’s Low settings. Finally we ran the above against both a high-end CPU configuration of 6 cores/12 threads at 4.2GHz, and a low-end configuration of 2 cores/4 threads at 3.3GHz.
On a quick side note, AMD included the following notes with their instructions for testing Thief. In short, Mantle is up and running for all compatible AMD cards, but multi-GPU is not yet working, and memory management is in need of further optimization.
- Mantle performance for GPUs with 2GB framebuffers will receive additional optimization in a future application path for Thief™. Currently, these products may see limited gains in scenarios requiring large amount of video memory (e.g. maximum detail settings with SSAA enabled).
- Multi-GPU support under the Mantle codepath will be added to Thief in a future application patch
- As with other first-person titles, relatively smaller gains will be observed in GPU-bound scenarios
Looking first at the R9 290X, we can see that even at our modified Very High settings, there are still some small performance gains to be had from enabling Mantle. Switching out Direct3D for Mantle gets us another 3.6fps, or a 5% boost in performance. As we would expect however, a far more significant gain can be found when using Low settings. He we can see the 290X top out at 86.8fps with D3D – indicating that our earlier Very High settings weren’t all that far from being CPU bottlenecked – while Mantle boosts that up to 117.7fps, for a gain of 30.9fps or 36%.
From a practical perspective we would expect most 290X owners to be playing at settings similar to Very High, so the performance gains, though appreciated, aren’t especially influential in the long run. But it does give us some idea of what to expect.
Meanwhile if we start slowing down the CPU to just 2 cores at 3.3GHz, we can see the Mantle performance advantage grow. In this CPU bottlenecked scenario the performance gains from enabling Mantle are anywhere between 33% for Very High settings to a rather sizable 49% when using Low settings. This scenario, though contrived, makes for a good reminder of how significantly the current Direct3D rendering pipeline can bottleneck a GPU in the wrong (right?) circumstances.
Moving on to the 260X, to no surprise we’re completely and utterly GPU bottlenecked with our Very High settings. The performance gains with Mantle are inconsequential at best, indicating that Mantle isn’t being used to significantly alter the rendering process on the GPU itself.
Shifting over to Low settings still leaves our setup GPU bottlenecked when testing against the 6 core setup, however we do see a very distinct performance gain on the 2 core setup. In this scenario enabling Mantle is worth an 11.1fps boost, or 25%, pushing the framerate up to 55.4fps.
Despite this being an artificial test on our GPU testbed, we would consider this to be a very real scenario overall given the price of the 260X. At $119 (MSRP) the 260X is very likely to be paired with a dual-core CPU or equivalent, so to see a meaningful performance gain in this scenario is promising. Whether any other Mantle-enabled single-player games will be this badly CPU limited remains to be seen, but if other games were to behave like Thief, then we may see similar gains on lower-end setups such as this.
Ultimately we’ve only had a limited amount of time with the Thief Mantle patch, so we’ll have to take a look at the competitive landscape another day. But as a pure Mantle analysis Thief is probably the greater beneficiary from Mantle at this time. The gains at the high end aren’t worth writing home about, but since we need the CPU to churn out a fairly high framerate regardless, there’s a much greater opportunity to benefit from Mantle on lower end Intel CPUs and AMD’s CPUs/APUs.