As I mentioned in a previous post, one of the new features for database mirroring in SQL Server 2008 is log stream compression. There’s a good explanation of what this is (along with some example workloads and performance graphs) on the SQL Customer Advisory Team blog so I’m not going to duplicate all that here. Basically, the way mirroring works is by shipping all the transaction log from the principal to the mirror, and then replaying the log records on the mirror database. Network bandwidth can be a bottleneck for mirrored databases where the rate of transaction log generation is high enough to saturate the network. On synchronously mirrored databases (where a transaction cannot commit on the principal until all the transaction log for it has been hardened in the log of the mirror database), this can even lead to workload throughput degradation. Log compression is a way to alleviate this problem. SS2008 Books Online states that compression rates of at least 12.5% are achieved – obviously the compression ratio could be much higher than that and is dependant on what’s being compressed – i.e. the data that is being processed by the application. One of the tests they did for the post above shows a 5x compression ratio – that’s pretty good. The downside of this is that compression is not free – extra CPU must be used. The obvious CPU load increases comes from having to compress the transaction log on the principal and then decompress it again on the mirror before hardening it in the log. The non-obvious increase will come because the principal and mirror should end up being able to process more transactions/second, which means more CPU is needed. This extra CPU could be a problem for some systems that are already pegged in terms of CPU load, and so they may actually see a drop in performance when log stream compression is enabled. Now, it’s on by default when you upgrade to SS2008 but there is a way to turn it off. The blog post above divulges that there is a trace flag, 1462, that turns off log stream compression and effectively reverts the behavior back to SS2005. Apart from the obvious benefits of this feature for systems that can afford the extra CPU, I’m excited because it may allow more mirrored databases to perform effective database maintenance – in the form of fragmentation control and index maintenance. When database mirroring is enabled, the database has to be running in the full recovery mode. This means that operations such as index rebuilds are fully logged – that’s a lot of extra log being generated if you’re used to switching to bulk-logged recovery mode to perform index operations. Of course, this isn’t an issue for reorganizing an index (with ALTER INDEX … REORGANIZE) as this has been fully logged (and has to be that way because of how it works) since I wrote the old DBCC INDEXDEFRAG for SS2000. Anyway, for some customers the fact that these two operations are fully logged means that running them produces so much additional log that, in conjunction with the log from the workload itself, the network becomes a bottleneck for mirroring and it slows down both the maintenance and reduces the workload throughput. The addition of log stream compression means that for these systems, the log may be compressed enough so that the network is no longer a bottleneck and more regular fragmentation control can take place. This in turn will increase workload throughput as well – a double benefit!
2024: the year in books
Back in 2009 I started posting a summary at the end of the year of what I read during the year and people have been
4 thoughts on “SQL Server 2008: Performance boost for Database Mirroring”
Hi Paul,
In SQL Server 2005 mirroring, does principal server send 1 log record at a time to mirror db or does it send bunch of log records at a time to mirror db?
It sends entire log blocks.
Hello Paul, we noticed high CPU usage after we enabled AG replication. I know this article refers to DB mirroring in SQL 2008, but the underlying functionality of replication remains the same and the TF 1462 still does the same job in AG replication as well.
Did you ever notice high CPU usage issues after AG replication is enabled in later versions of SQL such as SQL 2019?
One other interesting observation we made is that there is one core that is grabbing a lot of CPU. Is it possible that SQL DB engine uses only one core for the compression of data in the background? Please share any information you have regarding this.
There are a lot of things that could cause high CPU so without actually looking at the system, we wouldn’t be able to tell.
For the single core – do you have Buffer Pool Extensions enabled? That will usually peg a single core with high usage.