Reference Bits: C#

Showing posts with label C#. Show all posts

Tuesday, January 19, 2010

Agile ADO.Net Persistence Layer: Part 3 Service Class Single<DTO> Data Access Method

When I say data access methods, I’m talking about the methods my UI is going to call whenever it needs to get data. When my Posts controller needs a list of BlogPosts to display, it’s going to call a BAL method like GetAllBlogPosts() or GetAllBlogPostsForCategory(). Last time I mentioned (over and over) that I like to keep things simple. When I need to get or save data, I don’t want to have to search through 3 different classes just to find the one with the method I need. Instead, I’m putting all my persistence logic for a given aggregate in just one place, a service class. This is not a web service. I’m using service in the Domain Driven Design sense here. That means that I have a BlogService class that is my one stop shop for all persistence that has to to with Blogs, BlogPosts, SubmittedBlogUrls, and anything else that falls within the Blog aggregate. Here is what my BlogService class looks like. You can see that it’s mostly “Get” data access methods.

What’s an Aggregate?

I keep using the word aggregate. If you’re not familiar with the term, it just means a group of entities that all share the same persistence class (whether that be a repository, a service, or something else). This is a key concept in Domain Driven Design. If you want to know more I would recommend picking up Eric Evans’ book or Jimmy Nilsson’s book on DDD. For now, all you need to know is that a BlogPost can never exist without a Blog, so there’s no point in BlogPost having it’s own persistence class. In fact we find that if we do break BlogPost out into it’s own persistence class, it will lead to problems down the road due to BlogPost’s dependency on Blog. What’s the solution? We put data access methods for both Blog and BlogPost in the same persistence class and call it an aggregate. That is why BlogService has methods for both Blog and BlogPost entities.

What type of data will data access methods return?

We covered this last post, but to recap all data will be returned as a Data Transfer Object (DTO). The DTOs are all defined in our Common assembly in the DataShapes folder. Our BAL will return data in one of the following 4 formats.

a single DTO
a List<DTO>
a DataPage<DTO>
a string value

For more see last week’s post Agile ADO.Net Persistence Layer: Part 2 Use DTOs.

A simple Single<DTO> data access method

Let’s look at the simplest possible data access method. GetBlogPost() takes a postGuid for a parameter, defines the query to find the BlogPost entity for that postGuid, and then returns the result as a single BlogPost DTO. Here’s the complete method.

public BlogPost GetBlogPost(Guid postGuid)
{
    string query = @"SELECT p.*, s.Score 
                    FROM [dbo].[BlogPost] p
                    LEFT JOIN [dbo].[BlogPostReputationScore] s on s.PostGuid = p.PostGuid
                    WHERE PostGuid = @PostGuid";
    SqlDao dao = new SqlDao();
    SqlCommand command = dao.GetSqlCommand(query);
    command.Parameters.Add(dao.CreateParameter("@PostGuid", postGuid));
    return dao.GetSingle<BlogPost>(command);
}

The first thing you’ll notice is that this isn’t a lot of code. All we’re really doing here is defining a parameterized TSQL query, wrapping that query up in a SqlCommand, and then passing the command and our desired return type off to a Data Access Object (DAO) that automagically executes the command and maps the results to our desired type. It may seem counter intuitive to write code like this when we haven’t even written the DAO yet, but that’s exactly how I did it when I wrote this code for the very first time. I decided that my data access methods should be very simple. I would start with the query and the DTO type that I wanted it to return, then I would pass them both to some type of helper class that would handle the details of running the query and figuring out how to map the query results to the properties of my DTO. By using this top down approach, I gave myself a very clear picture of how I needed my DAO to behave.

What’s a DAO (Data Access Object)?

By looking at the query logic above, you can see that I have this thing called a DAO or Data Access Object. This is a class that encapsulates the helper logic for working with my database. The DAO handles things like creating parameters, getting a connection, and most importantly it implements methods to return my four main data formats, GetSingle<DTO>, GetList<DTO>, GetDataPage<DTO>, and GetStringValue(). The DAO and it’s associated DataMappers are where you’ll find the special sauce that makes this architecture work. We’ll get into their implementation later on.

A BAL that embraces change

It’s easy to look at the simple code above and miss something that I think is very important. In fact that thing is the whole reason that I wrote this framework. That simple data access method above is the blueprint for a flexible persistence layer that makes changing your entities and associated persistence code easy and almost painless. It sets up a simple 3 step process for all data access in your application.

Define a DTO in the exact data shape that you’re looking for. That means create a DTO property for each data field that you need out of the database.
Define a query that gets the data. It can be as simple or as complex as you like. You can develop it in Sql Server Management Studio. You can easily optimize it. Use whatever process or tools work for you. When you’re done just paste the query into your data access method.
Pass both the query and your DTO to the DAO and it will automatically handle field mappings and pass the results back in the data shape you requested.

This is a very powerful way to work. I can’t count the number of times that I’ve worked with and architecture where I dreaded any changes because I knew that any data fields added would require me to modify a sproc, a DAL method, a BAL method, parsing logic, an entity class, it all adds up to a lot of friction that resists any change. This BAL design embraces change. It’s written with the attitude that we know change is going to happen so we’re going to give you as few things as possible to modify, and make sure we don’t have any cross cutting dependencies, so that you can make changes easily.

Next time, more on the service classes.

Next Post: Agile ADO.Net Persistence Layer Part 4: Writing data access for new data shapes

Saturday, January 9, 2010

Agile Ado.Net Persistence Layer Part 1: Design Overview

Last year I did a blog post series on how to design a High Performance DAL using Ado.Net. Judging by the response I’ve gotten from that series, there must be a lot of developers out there who believe that even with the availability of LINQ, Entity Framework, and a host of other ORM technologies, Ado.Net is still your best option when designing a persistence layer. BTW, I’m one of them.

After that series I started digging into Entity Framework and LINQ, and I was impressed by how effortless those technologies made certain parts of application development. Once the EF or LINQ mappings were in place, I found myself writing much less code and focusing more on the business logic of my application. I also found myself driven right back to ADO.Net as I found myself struggling with how do something that I already knew how to do in T-SQL, or fighting errors resulting from an attached data context object that I really didn’t want to start with.

So, I found myself back with ADO.Net, but I didn’t want to give up the ease of development and coding efficiencies that I got from the ORMs. I decided to take a fresh look at how to design an ADO.Net persistence layer. I started at the top (the application layer) and thought about how I want my app code to consume business logic, and then I worked down from there. I incorporated many of the best practices that I’ve used over the years, but I also looked critically at each one and whenever I found that something was slowing me down or leading to duplicate code, I threw it out. The resulting architecture is quick to develop on,testable, easily maintainable, and can be easily optimized for performance. This series of posts will detail the entire design, from application code to database.

A peek at the final design

I always find it’s easier to follow along if I have some idea where I’m going, so this is a quick look at where we’re headed. We’re going to cover the entire architecture for a simple blog aggregator called RelevantAssertions.com. RelevantAssertions is an Asp.Net MVC application that uses our new Agile Ado.Net Persistence Layer. We have 4 projects in the RA solution, WebUI, Tests, Common, and BAL Here’s a quick look.

WebUI

WebUI is our Asp.Net MVC application, that’s our application layer. This contains all UI and presentation logic, but it contains absolutely no business logic.

Common

Common contains classes that we need at all layers of our code. The DataShapes folder is where we define all of our DTO classes.

Tests

This project contains all of our automated tests for both the BAL and the UI.

BAL

I know it’s probably more correct to say BLL, but I like the term BAL. It just sounds better. This is the big class where everything interesting happens. The main workhorses of the BAL are the service classes. These are not web services. They are service classes in the DDD sense. The service classes are going to be the one stop shop where our application code goes to do whatever it needs to do. The service classes will also contain query logic, that’s right I said query logic. Behind the scenes the service classes will use DAOs (Data Access Objects), Data Mappers, and Persisters to do their thing in an efficient object oriented way, but the only classes our application code will use directly are the services.

What, no DAL??

You’ll notice that there is no DAL. It seems a little strange to have an architecture that focuses on ADO.Net but doesn’t have a DAL, but there’s a reason. Usually, the DAL is where I’ll put my query logic, mappings to query results, and any other database specific code. The DAL would allow me to keep all of my TSQL and ADO.Net code separated from the rest of my application, and this separation provided me with some important benefits like:

1) Separation is it’s own virtue, that’s just the right way to do it.
2) I wouldn’t have leakage of db or query logic into my business logic.
3) I could easily swap out SQL Server with another database if needed.
4) It encapsulates code that would otherwise be repeated.
5) We need to hide TSQL from programmers, it scares them.
6) It’s fun to make changes to 3 layers of code every time I add a new data member to an entity class.

At least that’s what I was always taught. But after working with more ORM oriented architectures and the Domain Driven Design way of doing things, I started to look at things differently. Let’s look at some of these benefits (at least the ones that aren’t sarcastic).

I’ve never met anyone who’s ever switched out their database

YAGNI means You Ain’t Gonna Need It. The idea is that we spend a lot of time building stuff that we don’t really need. We build it because it seems like the architecturally correct way to do it, or we think we’ll need the feature one day, or maybe we’re just used to doing it that way. Whatever the reason, the result is that we spend a lot of time coding features that are never used, and that’s not good. After doing this for 14 years or so, I’ve never, ever, run into a single project where they’ve decided “hey, let’s trash the years of investment we’ve made in SQL Server and switch over to MySQL, or any other database. Now I am aware that a db switch is likely if you’re writing a product that clients install onsite and it has to work with whatever their environment is, but for 99% of .Net developers this is just never going to happen. I call YAGNI on this one.

Query logic IS business logic

One of the biggest gripes I had when I started investigating LINQ, EF, and Hibernate (yes I was looking at Java code) architectures is that they had query logic in their repository classes. Now the query logic was written in LINQ, or EntitySQL, or some other abstracted query language, but it was still query logic. Blasphemy!! You can’t put query logic in a BAL class! That stuff has to be abstracted away in the DAL or it will contaminate the rest of the application architecture! Our layered architecture is being violated! Worlds are colliding! It’ll be chaos!! Then I started to notice something, it’s really easy to develop business logic when you include queries in the BAL. In the past I would put my queries in a sproc, then I would write a DAL wrapper for the sproc, and a BAL wrapper for the DAL method. Then, if the query changed, or if I needed an identical query but with a slightly different parameter list I would write a new sproc, then write a new DAL wrapper, then write yet another BAL wrapper method for the DAL wrapper method. By the time all was said and done I would have this crazy duplication of methods across all layers of my application including my database! And don’t even get me started on the crazy designs that I implemented to try and pass query criteria structures (basically the stuff that goes in the where clause) between my BAL an my DAL. I came up with these crazy layers of abstraction that basically existed so that I wouldn’t have to create a simple TSQL WHERE clause in my BAL. Then there’s the problem of handling sorting and data paging, that required even more DAL methods, and each of these DAL methods had corresponding wrapper methods in the BAL that did nothing but pass the call through to the DAL! Why?? I was doing the right thing by separating my business logic from my query logic, why was it so painful? The answer I finally arrived at is simply that query logic is business logic. I’d been putting a separation where no separation belonged.

All real programmers know TSQL

I’ve heard the argument that TSQL is too hard for programmers so we’re going to create something much easier for programmers to use like LINQ or EF. The problem is that these tools require almost exactly the same syntax as TSQL but they put an extra layer of stuff in there that can break and a data context (or session for you nHibernate folks) that throws errors whenever you try to save a complex object graph. How did this attitude that TSQL is a problem for programmers gain any traction? Have you ever met a real programmer who can’t write TSQL? And if you did meet such a person, would you let them touch your business layer code? Why would we ever want to abstract TSQL away? It’s the perfect DSL for accessing SQL Server data and every programmer in the world is already familiar with it.

Using good object oriented design and encapsulating data access code is a good thing

I fully believe this one, but once we decide that query logic is business logic and that we don’t need to hide TSQL from programmers, there’s no reason to put our well designed object oriented data access code in a separate project and call it a DAL. I decided to just put it in a Persistence folder in my BAL and now I have one less DLL to worry about.

So, that’s some of what I was thinking when I made the decisions that I did. It made sense to me. I’m sure it won’t make sense to everyone, but I do think that it resulted in a very usable architecture. Before I wrap up for today, I want to look at one more thing.

The target application code experience. What will it be like to use?

When I’m writing code in my application layer, consuming the logic that is provided through my BAL service classes, what does that code look like. Well I know a couple of examples of code I don’t want it to look like.

I’ve been in a few environments where there were huge libraries of BAL classes, any of which could contain the logic I want. I would often have to resort to a solution wide text search looking for sproc names or keywords that might exist in the method that I needed. I don’t want that. I want everything I need to be in one easy to find place.

I’ve also seen a practice that’s common in the DDD (Domain Driven Design) crowd where you’ll need to go to a factory class to create a new entity, you need to go to a repository class to get an entity from the database, if you have complex logic that involves more than one entity you need to go to a service class, and saving entities is a toss up between using either the repository or a separate service class. There may be a good reason to use that kind of class design inside of the BAL, but when I’m writing code in my application layer, I don’t want to have to worry about which of 4 different classes I’m going to use. So again, I’m a simple guy, when I need to get, save, or validate a BlogPost entity, I want a single service class that I can go to for everything. My app code should look something like this.

// instantiate service classes 
BlogService blogService = new BlogService();
CategoryService categoryService = new CategoryService();
// Get data shaped as lists and pages of our DTOs
DataPage<BlogPost> page = blogService.GetPageOfBlogPosts(pageSize.Value, pageIndex.Value, sortBy);
List<Category> categoryList = categoryService.GetTopCategoryList(30);
// create and save a new BlogPost
BlogPost newPost = new BlogPost();
newPost.BlogGuid = blog.BlogGuid;
newPost.PostTitle = item.Title.Text;
newPost.PostUrl = item.Links[0].Uri.AbsoluteUri;
newPost.PostSummary = item.Summary.Text;
newPost.Score = 0;
blogService.Save(newPost);

Next time we’ll focus less on discussion and more on code. We’ll look at DTO classes and the 4 main data shapes that will go into and come out of our BAL: DTO, List<DTO>, DataPage<DTO>, and String.

Next Post: Agile ADO.Net Persistence Layer Part 2: Use DTOs

Sunday, August 2, 2009

More on Fluent Interface Pattern for Composing Entity Framework Queries

Recap of Last Week

In last week's post I went over an Entity Framework pattern that I’ve been using that’s saved me quite a bit of time in my Repository classes. The basic idea is that most Entity Framework queries can be broken down into 3 pieces as shown in the diagram below.

The interesting part emerges when we realize that an EF query is just an object of type ObjectQuery, and that we can create methods that take an ObjectQuery as a parameter, do their processing, and then pass back a return value of type ObjectQuery. This means that we can chain the methods together, fluent interface style, to create queries that look like this:

List<BlogPost> postSet = GetBlogPostSet().SortBy(“Date”).GetPage(pageIndex, pageSize);

List<BlogPost> postSet = GetBlogPostSet().SortBy(“ID”).GetPage(pageIndex, pageSize);

List<BlogPost> postSet = GetBlogPostSet().SortBy(“ID”).GetAll();

Shaping Data With Projection Methods

This week I want to take a closer look at the Projection Method and how we can use it to shape our return data. So let’s start out by looking at a pretty standard query that returns a list of BlogPost entities.

List<BlogPost> posts = GetAllBlogPosts().SortBy(sortOption).GetList();

This query contains each of the three components described above, a Filter method (GetAllBlogPosts), a Sort method (SortBy), and a Projection method (GetList). The code for each of the methods is listed below. Note that our Projection Method, GetList(), is just running toList() on the query and returning the resulting List<BlogPost>.

// GetBlogPostSetByDate
private ObjectQuery<BlogPost> GetAllBlogPosts()
{
    var query = from p in Context.BlogPostSet.Include("Categories")
                select p;
    return (ObjectQuery<BlogPost>)query;
}
// SortBy
internal static ObjectQuery<BlogPost> SortBy( this ObjectQuery<BlogPost> query, Enums.BlogPostSortOption sortOption)
{
    switch (sortOption)
    {
        case Enums.BlogPostSortOption.ByDate:
            return (ObjectQuery<BlogPost>)query.OrderByDescending(p => p.PostedUtc);
        case Enums.BlogPostSortOption.BySite:
            return (ObjectQuery<BlogPost>)query.OrderBy(p => p.BlogProfile.BlogName);
        case Enums.BlogPostSortOption.ByVote:
            return query;
        default:
            return (ObjectQuery<BlogPost>)query.OrderByDescending(p => p.PostedUtc);
    }
}
// GetList
internal static List<BlogPost> GetList(this ObjectQuery<BlogPost> query)
{
    return query.ToList<BlogPost>();
}

Wonderful, we can mix an match methods like this and they work great as long as we want to return BlogPost entities. But what if we just want a list of the BlogNames? How do we handle that? It seems like a waste to pull a list of full BlogPost entity objects when all we really need is the BlogName. In this case it would be nice to shape my return data into a simple list of BlogNames.

I typically handle this in my Projection Methods. I already have a Filter Method that’s applying the appropriate filter, I have a Sort method that’s applying the appropriate sort, now I need a Projection Method that projects my results onto the appropriate data shape. So, I need to create a new projection method to do this called GetBlogNames(). My new data access code and the new GetBlogNames() method look like this.

List<String> blogNames = GetAllBlogPosts().SortBy(sortOption).GetBlogNames();
 
// GetBlogNames
internal static List<String> GetBlogNames(this ObjectQuery<BlogPost> query)
{
    var newQuery = from p in query
                   select p.BlogProfile.BlogName;   
    return newQuery.Distinct().ToList<String>();
}

Look closely and you’ll see that the GetBlogNames() method does something kind of interesting. It does a second select on my query. We did the first select back in our Filter Method GetAllBlogPosts(). That select returned full BlogPost entities. Now in my projection method I decide that I only need the BlogNames property so I do a second select that returns only the BlogNames.

At first glance it may seem that this is still doing something really inefficient. We have an initial query that pulls full entity objects, then we do a second select against that result set and it get’s the BlogNames. We’re still pulling full entities in the first query, aren’t we?

No we’re not. Thanks to delayed execution our query never does pull full entities. This goes back to the big realization that the query is now an object. I’m a little fuzzy on some of the details here, but it goes something like this. When we create an Entity Framework query, we ‘re creating an ObjectQuery that contains a Command Tree. That’s important. The ObjectQuery contains a Command Tree that represents each of the commands (like select, order by, from, etc.) in our query. When we need the results of the ObjectQuery, like when we call ToList() or when we iterate over the results, our Command Tree gets translated into TSQL (or whatever is used by the provider we’re querying) and the query is executed.

So, right up to the point that I call ToList(), my query is just a data structure containing all of my commands, and adding another select is just adding another command to the Command Tree. At query time, all of my commands will be translated into SQL and executed as a single query that returns just BlogNames.

A Better Data Shaping Example

We’ve seen how we can use a select in our projection method to shape data. Now let’s apply that principle to a more practical, real world, example, drop down list choices. How many times have you seen code that pulled down a list of full entity objects, or a fully populated DataTable, just to populate the choices of a drop down list. This scenario comes up all the time so I’ve added a ListItemDTO Data Transfer Object to my BLL (for more on DTOs see What is the difference between a DTO and a POCO). My ListItemDTO class is just a simple data container for a list item.

public class ListItemDTO
{
    public String Text { get; set; }
    public String Value { get; set; }
}

Now, for this example let’s assume that in my UI, I have a page that displays a grid of all blog posts, for all blogs, that were posted in a given week. Now I want to add a DropDownList to the top of this page that allows me to pick a blog and then the grid will display only the posts from that blog. Simple enough scenario. I just need to create a DropDownList that contains ListItems with Text equal to BlogName, and values equal to BlogId. To get the data to populate this ddl, I need a method in my Repository that gets the list of BlogPosts for the given week, but then shapes the return value as a distinct list of ListItemDTOs where the Text property is BlogName and the Value property is BlogId.

For us this is no problem. We already have the Filter and Sort methods needed to get the data, we just need a Projection method that will shape it as a List<ListItemDTO>.

// GetBlogNameListItemDTOs
internal static List<ListItemDTO> GetBlogNameListItemDTOs(this ObjectQuery<BlogPost> query)
{
    List<ListItemDTO> items = new List<ListItemDTO>();
    var newQuery = from p in query
                   select new {p.BlogProfile.BlogName, p.BlogProfile.BlogId};
    foreach (var item in newQuery.Distinct())
    { 
        items.Add( new ListItemDTO{Text=item.BlogName, Value=item.BlogId.ToString()});
    }
    return items;
}

In this Projection method, we added a new select that modified our query to return only the BlogName and the BlogId. We then used a foreach loop to iterate over the results of the query, create a new ListItemDTO for each item, and then return the List<ListItemDTO>. There are two details to make note of. First we modified our query to return it’s data as an an anonymous type with two properties, BlogName and BlogId. That’s the reason for the “new {p.BlogProfile.BlogName, p.BlogProfile.BlogId}” object syntax. Second, when we iterate over the query in the foreach loop, we call the Distinct() method which, wait for it, makes sure our results are distinct.

So, that was pretty painless. We can now write a data access method that leverages our existing Filter and Sort methods, and uses our new GetBlogNameListItemDTOs() to project the results onto a List<ListItemDTO>. The code is given below.

// BlogNameListItemDtosForWeek
public List<ListItemDTO> BlogNameListItemDTOsForWeek(DateTime startDate)
{
  return GetBlogPostSetForWeek(startDate)
         .SortBy(Enums.BlogPostSortOption.BySite)
         .GetBlogNameListItemDTOs();
}

Conclusion

Hopefully this illustrates a little more of the potential of this pattern. Now that I’ve been using it for a while I can’t bring myself to structure my Entity Framework Repositories any other way. There are always situations where the model doesn’t quite work for me, and when that happens I just go back to writing Entity Framework queries from scratch. But, so far I’ve been pleasantly surprised by how often my data access methods fit neatly into this little time saving model. If you have suggestions on how to improve the pattern, or other data access patterns that have saved you time, please leave a comment or drop me an email.

For anyone who wants to see the Filter/Sort/Projection model in the context of a repository class, I’m including a partial listing of my BlogPostRepository below. You’ll see everything we’ve covered over the last two posts, plus there’s some additional code that handles things like caching.

public class BlogPostRepository : RepositoryBase
{
 
    // Constructors
    public BlogPostRepository() { }
    public BlogPostRepository(RAEntities context)
    {
        this.Context = context;
    }
 
 
    //*********************************************************************************
    // Data Access Methods
    //******************************************************************************** 
 
    // BlogPostSetForWeek_GetPage
    public List<BlogPost> BlogPostSetForWeek_GetPage(int pageIndex, int pageSize, DateTime startDate, Enums.BlogPostSortOption sortOption, CacheOptions cacheOptions)
    {
        startDate = ToSunday(startDate);
        CacheHelper cache = new CacheHelper();
        string cacheKey = String.Format("BlogPostSetByWeek_GetPage_{0}_{1}_{2}_{3}", pageIndex.ToString(), pageSize.ToString(), startDate.ToShortDateString(), sortOption.ToString());
        // If the cache contains the data, and the user wants us to use
        // cache, then return the cached data
        if (cache.Contains(cacheKey) && cacheOptions.UseCache)
        {
            return (List<BlogPost>)cache.Get(cacheKey);
        }
        // If we fell through to this point then we need to pull data.
        List<BlogPost> postSet = GetBlogPostSetForWeek(startDate)
                                 .SortBy(sortOption)
                                 .GetPage(pageIndex, pageSize);
        if (cacheOptions.UseCache)
        {
            cache.Insert(postSet, cacheKey);
        }
        return postSet;
    }
 
 
    // BlogNamesForWeek
    public List<String> BlogNamesForWeek(DateTime startDate)
    {
        return GetBlogPostSetForWeek(startDate)
               .SortBy(Enums.BlogPostSortOption.BySite)
               .GetBlogNames();
    }
 
 
    // BlogNameListItemDtosForWeek
    public List<ListItemDTO> BlogNameListItemDTOsForWeek(DateTime startDate)
    {
        return GetBlogPostSetForWeek(startDate)
               .SortBy(Enums.BlogPostSortOption.BySite)
               .GetBlogNameListItemDTOs();
    }
 
 
    //*********************************************************************************
    // Filter Methods
    //*********************************************************************************    
    // GetBlogPostSetByDate
    private ObjectQuery<BlogPost> GetAllBlogPosts()
    {
        var query = from p in Context.BlogPostSet.Include("Categories").Include("BlogProfile")
                    select p;
        return (ObjectQuery<BlogPost>)query;
    }
 
    // GetBlogPostSetForWeek
    private ObjectQuery<BlogPost> GetBlogPostSetForWeek(DateTime startDate)
    {
        startDate = ToSunday(startDate);
        DateTime startUtc = startDate.Date;
        DateTime endUtc = startDate.AddDays(7).Date;
        var query = from p in Context.BlogPostSet.Include("Categories").Include("BlogProfile")
                    where p.PostedUtc > startUtc & p.PostedUtc < endUtc
                    select p;
        return (ObjectQuery<BlogPost>)query;
    }
 
 
 
 
    //*********************************************************************************
    // Utility Methods
    //*********************************************************************************
 
    public static DateTime ToSunday(DateTime date)
    {
        double offset = date.DayOfWeek == DayOfWeek.Sunday ? 0 : Convert.ToDouble(date.DayOfWeek);
        return date.AddDays(-offset);
    }
}
 
 
internal static class BlogPostRepositoryExtensionMethods
{
    //*********************************************************************************
    // Sort Methods
    //*********************************************************************************
 
    // SortBy
    internal static ObjectQuery<BlogPost> SortBy( this ObjectQuery<BlogPost> query, Enums.BlogPostSortOption sortOption)
    {
        switch (sortOption)
        {
            case Enums.BlogPostSortOption.ByDate:
                return (ObjectQuery<BlogPost>)query.OrderByDescending(p => p.PostedUtc);
            case Enums.BlogPostSortOption.BySite:
                return (ObjectQuery<BlogPost>)query.OrderBy(p => p.BlogProfile.BlogName);
            case Enums.BlogPostSortOption.ByVote:
                return query;
            default:
                return (ObjectQuery<BlogPost>)query.OrderByDescending(p => p.PostedUtc);
        }
    }
 
 
    //*********************************************************************************
    // Projection Methods
    //*********************************************************************************
 
    // GetList
    internal static List<BlogPost> GetList(this ObjectQuery<BlogPost> query)
    {
        return query.ToList<BlogPost>();
    }
 
    // GetPage
    internal static List<BlogPost> GetPage(this ObjectQuery<BlogPost> query, int pageIndex, int pageSize)
    {
        int skipCount = pageSize * pageIndex;
        return query.Skip(skipCount).Take(pageSize).ToList<BlogPost>();
    }
 
    // GetBlogNames
    internal static List<String> GetBlogNames(this ObjectQuery<BlogPost> query)
    {
        var newQuery = from p in query
                       select p.BlogProfile.BlogName;   
        return newQuery.Distinct().ToList<String>();
    }
 
    // GetBlogNameListItemDTOs
    internal static List<ListItemDTO> GetBlogNameListItemDTOs(this ObjectQuery<BlogPost> query)
    {
        List<ListItemDTO> items = new List<ListItemDTO>();
        var newQuery = from p in query
                       select new {p.BlogProfile.BlogName, p.BlogProfile.BlogId};
        foreach (var item in newQuery.Distinct())
        { 
         items.Add( new ListItemDTO{Text=item.BlogName, Value=item.BlogId.ToString()});
        }
        return items;
    }
 
}

Saturday, July 11, 2009

Chaining the C# Null Coalescing Operator

This is something that came up in the comments last week. I was refactoring some code and wound up with the accessor method below that performs multiple null checks while trying to assign a value to _currentUser.

// CurrentUser

private WebUser _currentUser;

public WebUser CurrentUser

{

   get

   {

     if (_currentUser == null) _currentUser = GetWebUserFromSession();

     if (_currentUser == null) _currentUser = GetWebUserFromTrackingCookie();

     if (_currentUser == null) _currentUser = CreateNewWebUser();

     return _currentUser;

   }

}

Now this code is pretty clear, but a reader named Brian pointed out in a comment that we could shorten the code a bit by using the Null Coalescing operator “??”. If you haven’t used the Null Coalescing operator yet, check it out. It’s great for data access code where a method may return data or a null value. The basic syntax is illustrated in this block of code from msdn:

   // y = x, unless x is null, in which case y = -1.
   int y = x ?? -1;

   // Assign i to return value of method, unless
   // return value is null, in which case assign
   // default value of int to i.
   int i = GetNullableInt() ?? default(int);

So, this is really cool, but there’s more. What Brian pointed out was that you can chain the ?? operator to do multiple null checks. So, my block of code above can be rewritten like this:

// CurrentUser

private WebUser _currentUser;

public WebUser CurrentUser

{

   get

   {

     _currentUser = _currentUser ??

                    GetWebUserFromSession() ??

                    GetWebUserFromTrackingCookie() ??

                    CreateNewWebUser();

     return _currentUser;

   }

}

Note that normally the _currentUser assignment would fit all on one line but due to the width limitation of my blog I broke it up into multiple lines. So, C# Null Coalescing operator chaining. I like it. I’ll be adding it to my toolbox.

Addendum:

Hey, you guys pretty much hated this change to the code. I got no blog comments but I received quite a few emails about this technique and the prevailing opinion seems to be that the original syntax with the multiple if blocks was clearer. In fact, not a single person who emailed me liked the ?? method. So I took another look and I think I agree. The original code does seem a little clearer to me and I don’t really think I was able to make my code any shorter or simpler by using ??. So this may not have been the best example, but I do still think this is a very cool technique for the right situation. As always, I just need to make sure that I’m refactoring because the changes make the code cleaner, not just because they’re clever.

Friday, July 3, 2009

SOLID C# Code: Smaller Methods == Clean Code?

I’m a big fan of Robert Martin. His book on Agile Patterns in C# is still one of the three most important programming books I’ve ever read. If you listen to Uncle Bob for any amount of time, it won’t be long before you start hearing terms like SOLID Principles and Clean Code. These are concepts that are widely known, but still a bit tough to define. What exactly is clean code? What does SOLID code look like?

We’ll I’m not smart enough to provide a definition that’s any better than what’s already out there, but I can point at an example and say “that smells pretty SOLID”. So, below is an example of some code that I wrote for an ASP.Net MVC application. CurrentUser is a property that wraps an instance of my WebUser class, which represents, you guessed it, the currently logged in user for my web app. When the CurrentUser property returns a user, there are four possible places that it might have to check to find that user:

There may already be a WebUser instance in the private member _currentUser,
There may be a WebUser instance stored in the Session,
There may be a TrackingCookie in the HTTP Request that can be used to get an existing WebUser,
We may have none of the above, in which case we have to create a new WebUser.

So, with that said, let’s take a look at my first version of this code.

// CurrentUser
private WebUser _currentUser;
public WebUser CurrentUser
{
    get
    { 
        // Do we already have the CurrentUser?
        if (_currentUser == null)
        {
            // Try to get the user from Session
            Object obj  = HttpContext.Current.Session["__currentUser"];
            if (obj != null)
            {
                _currentUser = (WebUser)obj;
                return _currentUser;
            }
            // Try to get the user from a TrackingCookie
            SecurityHelper secHelper = new SecurityHelper();
            WebUserRepository rep = new WebUserRepository(); 
            if (secHelper.TrackingGuid != Guid.Empty)
            {                       
                _currentUser = rep.GetWebUserByTrackingGuid(secHelper.TrackingGuid);
                if (_currentUser != null) return _currentUser;
            }
            // If we still don't have a user then we need to create a new
            // WebUser with a new TrackingGuid.
            WebUserFactory factory = new WebUserFactory();
            _currentUser = factory.CreateWebUser();
        }
        return _currentUser;
    }
}
 
 
Hmmmmmm. Not too horrible, not too long and ungainly, but it definitely has some code smell (the bad kind).  First, I’m not thrilled about the multiple returns nested in conditional blocks. Also, the code is doing a number of different things that I felt needed comments to explain.  Now I love comments, and I’m a firm believer that when in doubt, comment.  But I’ve also come to realize that it is possible to design code in such a way that it can be just as clear, and just as understandable, without the need for comments.  
So how do we reach this promised land of understandability? The main technique that I’ve been using is simply to take my big blocks of code that do several different things, and break them up into several separate, clearly named methods, and then just call them from my main block. Here’s how I applied this technique to my CurrentUser property.
 
 
// CurrentUser
 private WebUser _currentUser;
 public WebUser CurrentUser
{
   get
   {
     if (_currentUser == null) _currentUser = GetWebUserFromSession();
     if (_currentUser == null) _currentUser = GetWebUserFromTrackingCookie(); 
     if (_currentUser == null) _currentUser = CreateNewWebUser(); 
     return _currentUser;
   }
}
 
 
// GetWebUserFromSession
private WebUser GetWebUserFromSession()
{
     Object obj = HttpContext.Current.Session["__currentUser"];
     return obj == null ? null : (WebUser)obj;
}
 
// GetWebUserFromTrackingCookie
private WebUser GetWebUserFromTrackingCookie()
{
     SecurityHelper secHelper = new SecurityHelper();
     WebUserRepository rep = new WebUserRepository();
     if (secHelper.TrackingGuid == Guid.Empty)
         return null;
     else
         return rep.GetWebUserByTrackingGuid(secHelper.TrackingGuid);
}
 
// CreateNewWebUser
private WebUser CreateNewWebUser()
{
     WebUserFactory factory = new WebUserFactory();
     return factory.CreateWebUser();
}
 

Now I realize that in this second version I wrote more code than the first, and I do a couple of extra null checks in my main property code, but look at how easily understandable everything is. I mean really. There’s not a single comment, but can you imagine anybody not understanding exactly what’s going on in this block?

get

{

if (_currentUser == null) _currentUser = GetWebUserFromSession();

if (_currentUser == null) _currentUser = GetWebUserFromTrackingCookie();

if (_currentUser == null) _currentUser = CreateNewWebUser();

return _currentUser;

}

Plus, because I factored out methods like GetWebUserFromSession() and GetWebUserFromTrackingCookie() I can now use those methods in other parts of my class without having to rewrite the functionality. So overall, I think this version smells much more SOLID. What do you think? If anyone has ideas or favorite techniques for how to get more SOLID, please leave a comment.

Reference Bits