DLinq (Linq to SQL) Performance (Part 4)

아티클
07/05/2007

Well it's high time I gave you some numbers for the new stuff.

In the original benchmark the Linq version was running at 13.62% of the original time. And while I'm discussing that result, Sekiya Sato pointed out an error in my original benchmark (see the comments of the above posting) in which I had one of my ISDBNull() checks backwards. That error made the "nolinq" version actually run 3.6% faster than it should have. So the number I reported, 13.62% should have actually been 14.09% -- let me restate that result for clarity, in May 2006, DLinq was running at 14.09% of the underlying provider speed in this (harsh) test case on my hardware and not 13.62% as previously reported.

I have in my hands a nice fresh build, which is similar to what you're going to get when you adopt Beta 2. The results below include my original test plus an some quick insert and update tests I added -- I'll describe those in the next installment. What we want to talk about right now is the select cases. The regular select is as orignially described. The syntax for the compiled select (and this really builds) is this:

var fq =CompiledQuery.Compile(
    (Northwinds nw) =>
        from o in nw.Orders
        select new OrderDetail
        {
                OrderID = o.OrderID,
                CustomerID = o.CustomerID,
                EmployeeID = o.EmployeeID,
                ShippedDate = o.ShippedDate
        }
);

Note that with the nice type inferencing you never have to see the generic types in your code but it's still strongly typed. To use this query you simply

foreach (var detail in fq(nw))
{
sum += detail.OrderID;
count++;
}

Now let's have a look at the numbers:

	no linq			with linq
	select	update	insert	originalselect	compiled select	update	compiled update	insert
run 1	915.25	4.87	4.29	497.81	858.07	20.65	21.05	20.25
run 2	916.25	5.02	4.76	491.59	864.60	20.34	20.62	12.02
run 3	942.86	4.87	4.66	496.57	859.11	21.03	20.47	16.08
average	924.79	4.92	4.57	495.32	860.59	20.67	20.71	16.12

The units for all of the above are test iterations per second so bigger is better.

	dlinq	nolinq	ratio
select	495.32	924.79	53.56%
compiled select	860.59	924.79	93.06%
update	20.67	4.92	420.19%	(DLinq is faster)
compiled update	20.71	4.92	421.00%	(DLinq is faster)
insert	16.12	4.57	352.66%	(DLinq is faster)

Wow that's pretty good. If you do nothing to your code, just raw internal improvements go from 14.09% of the underlying provider to 53.56% -- that's a 3.8x improvement. But look at what you can do with compiled queries: if you compile the select statement I got 93.06% of the underlying providers raw speed -- that's 6.6x faster than what I got back in May of 2006. This is a truly great result because, as I've mentioned before, this is a harsh test. With the normal overheads associated with actual business logic and data transfer this result basically means that you may not even be able to measure any throughput degradation at all if you use compiled DLinq queries in your program.

I think I'll let Matt talk about the details of how we did this because he did the work but I can give you the high level points if you haven't already guessed them from the previous postings

create custom methods that bind the data perfectly using lightweight code generation
create reusable SQL with parameters to avoid generating the SQL query again
provide read-only contexts to avoid any unnecessary entity management (this not needed anyway in my case because I new up an OrderDetail object with only part of the data)

When I modelled this on paper last summer it looked like we could get about 95% of the underlying provider speed plus or minus measurement error and we seem to have landed at 93%.

Now you may ask, why is DLinq doing better at updates than my code that writes directly to the underlying provider? I'll talk about this a bit more next time but the short answer is this: the code I wrote to do the updates looks like pretty typical SQL sent to the database in batches. However I didn't go to the trouble of creating prepared statements for update and insert cases, DLinq gives you this automatically. So despite my more complicated logic the savings DLinq got from superior SQL trumped my techinque.

And lastly, as always, this doesn't necessarily translate to any specific numbers for your application but it sure bodes well. I'm very pleased indeed.

Comments

Anonymous
July 05, 2007
Rico has finally posted some numbers that show you the performance he's seeing with Beta 2 bits.
Anonymous
July 05, 2007
When wouldn't I want to compile the query? If it's so much faster, why isn't it the default? Why would I have to write that magic code to make it perform well?
Anonymous
July 05, 2007
Compiling the query makes it durable. There is no need for this, nor is there any desire, unless you intend to run that same query many times. SQL provides regular select statements, prepared select statements, and stored procedures for a reason. Linq now has analogs.
Anonymous
July 05, 2007
Hi Rico, Have to admit that I've been waiting for your post to arrive in the previous days. The wait was well worth it :-) Even 53% is a great number for non-compiled queries. Can't wait to beta 2 ..
Anonymous
July 06, 2007
Some quick links about LINQ: Articles about extension methods by the Visual Basic team Third-party LINQ
Anonymous
July 06, 2007
Some quick links about LINQ: Articles about extension methods by the Visual Basic team Third-party LINQ
Anonymous
July 06, 2007
Rico Mariani has been posting about LINQ to SQL perfomance and has finally posted the performance numbers
Anonymous
July 06, 2007
Rico Mariani has been posting about LINQ to SQL perfomance and has finally posted the performance numbers
Anonymous
July 06, 2007
linq to sql 的动态条件查询方法
Anonymous
July 09, 2007
Rico Mariani did a very good job analyzing performance implications of LINQ to SQL queries. He is currently
Anonymous
July 12, 2007
LINQ to SQL performance
Anonymous
July 16, 2007
Also been catching up on Rico Mariani's notes on improvements to LINQ to SQL performance between the...
Anonymous
August 01, 2007
Rico, THere's been some talk about having a DataBase as an entire layer in a solution. Using LINQ/DLINQ to build all the queries etc. It seems that it can be done, but one thing that strikes me is the performance of this: You call one StoredProcedure with an Update, that stored procedure then calls 4 other stored procedures, and on this update you will get 8 result sets back. Using DLINQ I would assume you'd have to do multiple DLINQ statements to do the same functionality. Is that correct? Thanks :)
Anonymous
August 12, 2007
There are several good new blogs from members of the community team. Nevertheless, the most important
Anonymous
August 24, 2007
I've been meaning to dig into LINQ performance for some time (actually since it came up during one of
Anonymous
August 24, 2007
I've been meaning to dig into LINQ performance for some time (actually since it came up during one of
Anonymous
August 24, 2007
I've been meaning to dig into LINQ performance for some time (actually since it came up during one of
Anonymous
August 24, 2007
I've been meaning to dig into LINQ performance for some time (actually since it came up during one of
Anonymous
August 26, 2007
Risorse su Linq to SQL
Anonymous
November 01, 2007
Some of the best blogs on LINQ to SQL I found are available for great learning, Scott Guthrie The Famous
Anonymous
January 31, 2008
re: Linq はすごい？その７
Anonymous
July 16, 2008
The comment has been removed
Anonymous
November 10, 2008
Heythere,backagain.InmyfirstpostaboutLINQItriedtoprovideabrief(okay,bitdetailed)in...

다음을 통해 공유

DLinq (Linq to SQL) Performance (Part 4)

Comments

추가 리소스