The End of Monkeypatching

By Xavier Shay / March 29, 2011

Monkey-patching is so 2010. We're in the future, and with Github and Bundler there is now rarely a need to monkey-patch Ruby code in your applications.

Monkey-patching is the dangerous-yet-frequently-useful technique of re-opening existing classes to change or add to their behavior. For example, if I have always felt that Array should implement the sum method, I can add it in my codebase:

class Array
  def sum
    inject {|sum, x| sum + x }
  end
end

That is a monkey-patch. Of course, when I require activesupport it also adds a sum method to Array though its version has an arity of one and takes a block. This conflict can cause hard to track down errors and is why monkey-patching is to be used with caution.

Thankfully, the spread of this abuse is minimal and most developers understand the risks. More frequently, monkey-patching is used to quickly fix bugs in existing libraries by reopening a class and replacing an existing method with an implementation that works correctly. This is often a fragile solution, relying on on sometimes complex techniques to override exactly the right bit of code, and also on the underlying library not be refactored.

In the dark ages when it was troublesome to own or release your own versions of gems, this was the only cheap solution available. Nowadays though, a modern Ruby application has access to a far easier and more robust solution: fork the offending code, fix it at the source, and set up your application dependencies to use your new code directly. All without having to package a new gem!

The first few steps of this process are mostly self-explanatory, but I have documented them below anyway. If you are already old hat at this stuff, feel free to skip directly to step 4.

Step 1: Fork the Library

It is rare to find a Ruby gem or library that isn’t on Github these days. After locating the library, always check the network graph to try and find other popular forks. Often the problem you are trying to solve has already been fixed by another developer. In that case, you can skip straight to step 3. Otherwise, fork the code base to your own GitHub account.

Step 2: Make Your Changes

Clone your fork and make whatever changes you need. If you are feeling generous, add an appropriate test to the code base as well so it can be contributed back to the original fork, but as long as you have a test somewhere (such as in your main app) for the desired behavior you will be fine.

Step 3: Change Your Gemfile

Point your Gemfile at the new code:

# Gemfile
# From this
gem 'rails'

# To this
gem 'rails', :git => 'git://github.com/xaviershay/rails', :branch => 'important-fix'

And reinstall your gems by running bundle.

Step 4: Document

This step is important. There is no excuse for skipping it. You need documentation in three places:

A note at the top of the README in your fork, documenting the changes. Any developer can stumble across a public fork, and there is nothing more frustrating than trying to figure out whether a fork already solves your problem. At the very least, a “here be dragons” note will be appreciated.
The place in your code base that depends on the fork. You can expect other developers to be familiar with rails and the standard gems. They won’t be familiar with the behavior of your changes.
The Gemfile. Make a note above your gem line as to why a fork is required. Provide enough information that a future developer will know when or if it would be appropriate to upgrade or switch back to the main gem. Here are some real examples from some of my projects:

# An experimental fix for memory bloat issues in development, if it works
# I will be patching to core.
gem ...

# 1.1 requires rubygems > 1.4, so won't install on heroku. This fork removes
# that dependency, since it is actually only required for the development
# dependencies.
gem ...

# Need e86f5f23f5ed15d2e9f2 in master and us to upgrade to dm-core 1.1
# before we switch back. Should be in 1.1.1 release.
gem ...

Bonus Step: Upgrading

Six months down the track, how will you know whether your monkey-patched fixes have been solved elsewhere? Sure, your tests should cover it, but it is nice to have some more confidence. We can use some git tricks to get some intel. Add the master fork as a remote to your project, and you can get a log of the differences between then. Here is an example of a fork of dm-rails I have:

$ git clone git://github.com/xaviershay/dm-rails
$ cd dm-rails
$ git remote add datamapper git://github.com/datamapper/dm-rails
$ git fetch datamapper
$ git log --format=oneline --graph v1.1.0..origin/dev-fix-3.0.x
* e9a2b623aea6c87675247230acce81b031163719 Need to .dup this array because otherwise deleting from it causes undefined iteration
* 0736617a1a97862ab249e6388a3c87df4d9d3231 Remove duplicate dependencies from gemspec now that Jeweler reads the Gemfile
* 0265016cdf4528a922e1db32ae922924465f095f Revert "Merge branch 'master' into mine"
* f054c803baf41fabe0ac443bc8d205f867100a9c Merge branch 'master' into mine
* a969fd1ac2066e4b4bc785a0e9a7d904309ca64f Regenerate gemspec
* 373073444acae97b2a9ad9e511e16f44a46a73ed Clear out descendants on preloadmodels to prevent memory bloat in development

Ignoring the merge and gemspec commits, you can see that my commits did not make it into the 1.1.0 release. This does not mean I should not upgrade - it is quite possible that my problem was solved in another way - but it lets me know what I am looking for.

Parting Words

For a project using Bundler, there is now rarely ever a need to monkey patch anything. Any bugs or enhancements can be fixed properly at the source, resulting in happier code and happier developers.

[yay] Want to jump on the Rails 3 train? Michael Hartl's Ruby on Rails 3 Tutorial series is the way to go. There's a free, online book but if you want to go further, pick up the 15+ hours of screencasts giving you an 'over the shoulder' view of a Rails professional building Rails 3 apps in real time.

Comments

Mark Wilden says:
March 29, 2011 at 2:57 am
The latter part of the post, about fixing through forking, is very good. The first part, about adding Array#sum, however, doesn't make a good argument for its case: "can cause hard to track down errors." These errors would only be hard to track down if you didn't write tests. The fact that Rails itself monkeypatches in this method is indicative of its validity.

Adding methods to a class is one of the hallmarks of OOP. Monkeypatching simply makes it less cumbersome. And as with all tools, it is not appropriate for every use.
Matthew Rudy Jacobs says:
March 29, 2011 at 3:42 am
Generally I think this is a good post,
but on one point I can't decide on;

- "A note at the top of the README in your fork"

This would be fine, except;

1. it means you have to make changes to MASTER
2. it makes it a bit more annoying to be pulled upstream

I'm into the idea of just changing the github description.
Dominik Honnef says:
March 29, 2011 at 3:59 am
When I read the title of this article, I thought "well, another article about refinements, that's not too bad". But instead I was presented with an article that propagates forking a project for every minor problem, and even worse, using your forks in production, leaving system administrators with maybe 10 slightly different versions of library X, all maintained and updated in a unique, usually unsatisfactory fashion. And the real fun begins when dependency A and B need each their own, patched library C, with possibly conflicting changes...

Now that could still be excused, but the worst part about this article is that pull requests are not mentioned a single time. If you fork project X to fix a bug you should *actively* try to get that patch integrated in the main project. Expecting a project maintainer to check the fork queue and filtering through hundreds of usually meaningless forks is not the way contributing to open source works. Instead, you add a fifth step, which actually is the important one here, that says: "Get in contact with the project maintainers".

All in all this article describes something most people already know, and those who didn't, get misled.
Tim Linquist says:
March 29, 2011 at 6:06 am
I like what @Dominik had to say. I would change one thing though. Make his 5th step Step #1. You should always get in touch with the maintainer(s) first to see where the bug/enhancement falls in line with their current plans. A solution may already be in the works before you fork the repo.
Hartog de Mik says:
March 29, 2011 at 7:30 am
+1 for Dominik Honnef
Chris Eppstein says:
March 29, 2011 at 8:20 am
You've omitted one of the best parts of using git to manage private forks: Rebasing. When you need to get the latest code from the original developer you just rebase against their branch, resolve conflicts, and make sure tests are still green. You're changes always sit on top, ready to be pulled into the main line at a moments notice. It also makes it easier for others to see what's been changed from the main line.
Jean says:
March 29, 2011 at 8:22 am
I completely agree with Dominik Honnef. Monkey patching is fragile at best but depending on forked libraries on github is not a valid solution.
Klaus Gundermann says:
March 29, 2011 at 8:35 am
"add a fifth step, ... that says: "Get in contact with the project maintainers"
No, that should be definitely the First step.
The second step should be: Create a test and a patch for the original gem and send it to the maintainer
Meanwhile you may use a monkey patch until your patch gets integrated in the main tree.
Only as last resort create a new fork.

The problem with offensive forking is: a new user will find a dozens fork of the same gem which are only slightly diffent, so with which one should he start with?
Mike says:
March 29, 2011 at 10:58 am
I agree with the comments of Wilden and Honnef above, but would go further to say that this article appears to be a perfect example of the torture of the English language in an attempt to justify the use of an inflammatory title in order to get more hits.

Extending the Ruby language to fit your needs, which is what some call monkey patching, and what David Heinemeier Hansson only half-jokingly suggests we call freedom patching, is not abuse. Like any coding technique, even commenting, it can be abused, but it is such an extremely powerful technique that it is used extensively not only in Ruby, and available in Smalltalk, JavaScript, Objective-C, Perl, Python, and Groovy.
damon says:
March 29, 2011 at 12:28 pm
Solving a non-problem with a royal pain, congrats.
Vito Botta says:
March 29, 2011 at 12:33 pm
I liked the post, but I also tend to agree with Dominik. Is it really a good idea to depend on a forked copy of a dependency for every small change?

An example that comes to mind. I had a problem with Liquid and unicode characters, once I found the problem all I had to do to fix it quickly so I could move on was this:

Liquid.instance_eval do remove_const :VariableSegment remove_const :VariableParser
VariableSegment = /[\w\-]/u VariableParser = /\[[^\]]+\]|#{VariableSegment}+\??/u end

Then I added a comment to let them know about the issue - although I think it hasn't been fixed yet.

So I only had to change those regular expressions. I agree with you that perhaps it feels cleaner somehow to avoid monkey patching, but I don't think I would prefer forking Liquid just for this minor change. Dominik makes also a very good point on contributing and pull requests.

Nice post though, thanks.
Sam says:
March 29, 2011 at 12:38 pm
This was a great article, I'm still a newbie, but certainly getting more comfortable with git/rails ninjarie.
Wyatt Carss says:
March 29, 2011 at 1:08 pm
I haven't worked with a large enough ruby codebase yet to need to manage odd gem issues and decide what course to take, but I probably will someday. I was aware of monkeypatching as a way to work through those sorts of issues, and while I was also aware that most gems are visible on github, I had *not* thought about using the open source model as a path around the monkeypatch.

Good food for thought - it's another useful tool in the belt. Thanks, Xavier, and thanks to the commenters as well - you've provided pretty much the perfect grain of salt :)
Anonym says:
March 29, 2011 at 1:12 pm
I read your entire post, went back on top, saw the monkey patch, and exclaimed "cool, this solves it faster!". I'm gonna be burnt in the fire of a thousand suns!
hakunin says:
March 29, 2011 at 2:36 pm
The article has a point. Look at it the other way around.

1) You have a problem with a gem.
2) You searched/asked around — it's a bug.
3) You figured out how to monkeypatch it.
4) Oh cool, I can make this 5 minutes of my time useful to others! Fork, commit, pull request. Now what?
5) Hmm, since I need to do something before it's pulled, and now that I have a fork already in place, adding :git => my_fork to Gemfile with a #TODO comment linking to pull request is just as easy as adding a monkeypatch to initializers with the same #TODO comment. Still, nobody forces me to link Gemfile to my fork, I may stick with monkeypatch...

The point is that monkeypatching is one step away from contrib in most cases, so why not take the extra step.
Kyle Drake says:
March 29, 2011 at 2:58 pm
I noticed this amazing functionality and tried to use it at work, but then learned it was worthless for us, because bundler has no support for vendoring git-sourced gems yet. In other words bundle package won't work! I'm not comfortable deploying a lot of apps without bundled dependencies, and Heroku doesn't want to pull from our private gem repository. I'm sure there's a method to add a key for it, but again, a little risky for my standards.

I have started on a patch that may or may not fix this (I'm not a git expert and may be lacking knowledge of something important needed). Take a look:
https://github.com/benhamill/bundler/pull/1
https://github.com/kyledrake/bundler/commit/c02b43b5495daf93f36337fef24344bdafee4c81
Joel Parker Henderson says:
March 29, 2011 at 4:34 pm
Monkeypatches are able to test for the existence of a method *before* the patch applies, thus alerting you to the error immediately.
Dominik Honnef says:
March 29, 2011 at 8:23 pm
I agree with Tim Linquist here, my 5th step should have been the first one. Looks like I was trying too hard to fix the author's "workflow" by extending it, instead of fixing the flawed basis.
Andrew Grimm says:
March 29, 2011 at 11:00 pm
@Dominik: I think you're making a straw man with "And the real fun begins when dependency A and B need each their own, patched library C, with possibly conflicting changes..." - there's a difference between a private rails project having a non-standard dependency and a public gem having a non-standard dependency.

One issue I came across recently was that a gem was published, but the source code repository wasn't updated. http://rubygems.org/gems/win32-process version 0.6.5 was published in December, but the author ( https://github.com/djberg96 ) only pushed the changes when I told him they hadn't been pushed (a few days ago).
Xavier Shay says:
March 30, 2011 at 9:14 am
There is a big difference between a quick hack and a well tested patch worthy of a pull request. In a perfect world everyone would have the time and knowledge to contribute back every change they need, but I accept that this is not often the case. This technique deliberately has a low barrier to entry - almost as low as monkey patching - to encourage a better way of patching code, changing it at the source rather than overriding it, that naturally leads into proper patches and contributions. It is the "bare minimum", if you like.

Also note that step #4 applies to monkey patching to ... documentation is important.

using your forks in production, leaving system administrators with maybe 10 slightly different versions of library X, all maintained and updated in a unique, usually unsatisfactory fashion. And the real fun begins when dependency A and B need each their own, patched library C, with possibly conflicting changes...

The whole point of bundler is that it does all the dependency resolution for you and isolates the versions that you use from anything else on the system. The issue you describe simply does not exist in this scenario - it is essentially invisible to system administrators.

Even if there were problems with conflicting code, this is also an issue with monkey patching. With forking though, you are been explicit about where and which code has changed, and provide *more* tools (such as git log) for upgrading or resolving conflicts.

The first part, about adding Array#sum, however, doesn't make a good argument for its case

Apologies this was rather thin, I was in a rush to get on with the main thrust of the article :) The problems with monkey patching have been extensively documented elsewhere.

The problem with offensive forking is: a new user will find a dozens fork of the same gem which are only slightly diffent, so with which one should he start with?

Non-issue. Use the main one, unless a significant fork is documented or obvious from the network tree. Rails has ~1,400 forks and no one has any problem knowing which one to use.

I'm into the idea of just changing the github description.

This is good too but it doesn't matter too much. Any doc changes should be in separate commits. As a maintainer of many projects, I don't find these particularly troublesome.

You've omitted one of the best parts of using git to manage private forks: Rebasing.

Yes this is awesome, I could have been more explicit in the upgrading section.
Matthew Schinckel says:
March 30, 2011 at 9:20 am
You are missing one key aspect of MonkeyPatching (or DuckPunching, as I prefer to call it). When you are patching a standard library, that contains a bug or missing feature. I'm mainly a python programmer, and it pains me that timedelta is unable to be divided by another timedelta, which is a perfectly reasonable operation. Because python builtin classes are unable to be duck-punched, I have to have a function that does this operation, instead of being able to have it just happen.

Thus:

percent = dt_1 / dt_2

Becomes:

percent = timedelta_divide(dt_1, dt_2)

I know which one is easier to read.

Responsible duck-punching (testing the existence of a method, overriding so that current behaviour outside of the desired change is unaffected) can be a clean way to write easier to read code.

Granted, there is danger whenever doing this, but we are responsible adults.
Matthew Schinckel says:
March 30, 2011 at 9:21 am
Oh, and don't even get me started on trying to reconcile the various forks of a package/gem. The official one isn't getting any love? There are 60 forks, each of which has some of the same changes implemented in different ways?
max says:
March 30, 2011 at 10:26 am
well monkey patching is just how the whole rails thing works.. your title is misleading
Mike Bethany says:
April 4, 2011 at 9:26 pm
I've monkey patched just as a protest against a poorly done Ruby library. For instance I wrote this because FileUtils' directory copy is very, very dumb.

require 'fileutils'

module FileUtils
# Monkeypatch FileUtils really annoying lack of proper directory copying
def copy_dir(source, dest)
files = Dir.glob("#{source}/**")
mkdir_p dest
cp_r files, dest
end
module_function :copy_dir
end
John says:
April 6, 2011 at 7:48 pm
bundler is awesome and solves a lot of problems. Unfortunately, it does not handle optional dependencies and using different gems to fulfill a requirement. Until that is fixed (whether upstream in rubygems or in bundler) I avoid it in most of my projects.
Ibrahim Abdullah says:
April 7, 2011 at 10:52 am
As for your monkeypatch to Array class, Ruby has a nice shortcut for inject. You can pass in a symbol that corresponds to a method:

class Array

def sum
inject(:+)
end
end

Or if you are a ruby sugar freak like myself, just use it on the array itself. No need for monkeypatching stuff:

[1, 2, 3, 4].inject(:+)
Pingback: Monkey Patching in Ruby | Mr.Webmaster Blog
jim says:
May 12, 2011 at 5:56 pm
I just precede monkeypatched methods with "my_", e.g. "my_sum", "my_multiply", so it's obvious when I come back to the code a year later they are monkeypatched and will probably not conflict with anyone else's monkeypatched code.