How DRY should I be?
17 minutes read ā¢
The Problem
I love writing abstractions.
It appeases my āducks in a rowā desires. It makes my code tidy and I feel like a pro calling my nifty functions. The problem is, it doesnāt take too long before I get backed into a corner and the tidiness unravels into spaghetti.
Weāve all done this.
It feels great to make magic, but magic gets messy when you need to fix it. If youāre like me, youāll claim youāre not doing it for the magic by hiding behind the intelligent cause of staying DRY.
For the unfamiliar, DRY stands for Donāt Repeat Yourself. Itās a software development principle designed to reduce redudancy and maintainability issues.
In a nutshell:
- Productivity: Writing logic once and reusing it reduces typing.
- Readability: Less code means less to read. And we read a lot more code than we write.
- Maintainability: Less code to read means less to fix when things go wrong.
At least thatās how I understand it.
The term was first introduced in the book The Pragmatic Programmer, advocating:
āEvery piece of knowledge must have a single, unambiguous, authoritative representation within a system.ā
I donāt know how, but maybe itās just because its so simpleā¦ This is the one thing I just innately overdo. When I see those capital letters online, I feel like Iām around my people. They get me. It resonates with the same immature hype that the minimalist movement had. Clearly important elements of truth that people have either drastically under or over-actioned with no clear markers in place to measure effectiveness.
Iām going to be honest. I have not read the bookā¦ maybe my understanding of DRY is incomplete or immature, but Iām going to hazard a guess that most people who āpracticeā DRY havenāt read it either, probably not even you.
I imagine if there is ātrueā nuanced DRY, most developers arenāt familiar with it.
So hereās my hot take.
I donāt think programmers need any encouragement to be more DRY.
Us programmers donāt like repeating ourselves. We work out pretty early that itās easier to use functions and variables than it is to copy and paste code all over the place like 5 year old scrap-bookers.
We want elegance and flow; we like watching code dance for us. The problem is, if we donāt control that obsession, we end up dancing for the code.
Examples
As soon as something becomes repetitive, we have the choice of leaving it in or reaching for an abstraction.
Abstraction #1: The Humble Variable.
# Not very DRY
>> 21
# Variables!
= 9 + 10
>> 21
Nice. It works. But eww. If we know we want to print and log for all sorts of code throughout the project, thatās gonna get repetitive. And itās not very future proof if we want to send that output to more āsinksā in the future. Say I have 26 prints
to stdout throughout my code. What if I now want to add the feature to send a fax every time I print? (A perfectly reasonable goal).
I would have to update the code 26 times. DRY says no!
I guess we could use a function?
"""
Notify on all the services!
"""
>> 21
{{% videogif src=āwuphf.webmā %}}
Much better! Now my action is a single line function call that wraps five statements. I just saved myself 121 lines of code! Let the minimalist addiction kick in. Concise and maintainable.
But what if I need a little more flexibility? Maybe I donāt want to fax or tweet every time. Just most of the time. I donāt want to have to create another function called wuphf_without_fax_and_tweet
. Thatās clearly dumb. Letās make our existing function more flexible.
"""
Now customisable!
"""
>> 21
Now Iām really solving problems. Look at all the work Iāve moved into the function ready for repeat use! My function only needs one mandatory argument. Thatāll keep my function calls concise.
But ahem as āusefulā as our new function is, itās a bit wordy itself. If we write a lot of functions like this, our file might be getting pretty chunky. I donāt want to see these utility functions shadowing out the rest of the business logic, now do I??
We need to get organised.
Letās put it in another file out of the way somewhere and import it. Weāre bound to add additional functions of a similar nature, they can just live in that module too. We may as well start moving any other wordy functions to their own modules and just have a tidy-up while weāre at it.
>> 21
Done! Easy. Works great. Rinse-and-repeat as you please and your main script is going to look so tidy š
# Prompt user
=
Itās a Trap!
This is obviously a silly example, but it illustrates a trap I honestly fall into:
- I abstract for repeat use
- I add more functionality to the abstraction (uh, oh, function is no longer single responsibility)
- I move similar functions into a module
- I need more functionality but have to dance around to implement it.
- I think Iāve fixed it, but one fix somewhere causes a break somewhere else since there are so many calls to my now overcomplicated, fragile function.
{{% videogif src=āleak.webmā%}}
If I knew I needed the flexibility of notifying to different services, I wouldnāt have done the abstraction like this to begin with. Functionality that should exist as statements in the main script now exists two layers deep.
Not that Iād wish Notepad on someone as an IDE, butā¦ good luck navigating without Intellisense. We havenāt even touched classes.
Stuck with the Cheque
Potential downsides of abstraction:
Complexity. Abstraction can turn simple code into a navigation nightmare. A code block becomes a function, then a file, then a module. Code functionality can become hidden deep in the codebase.
Brittle. Abstractions can create unnecessary hoops to jump through in order to anticipate future needs that may never materialise. These hoops constrain the way we write code that interfaces with it. If the code that needs it canāt change, we end up having to change the abstraction and potentially any other code thatās already used it.
Maintainability. Code thatās complex and brittle is already hard to work with. What about anyone in the future who has to work on that code? Can they learn it quickly? What about yourself in a weeks time?
Premature abstraction is basically blind refactoring. We donāt want to refactor later when we have the facts, letās refactor now without them!
Mitigation
Alright, so too much DRY can lead to brittle abstractions. How do we find a good balance? Well, being aware is a good first step. Hopefully, you can analyse your own dev journey and see where you lie on that spectrum. You might be doing great and this is all for someone else.
But if this is for you, read on. Hereās a few things that have helped me.
Consider Abstraction Usefulness
Be honest with yourself. Ask yourself these questions before you get your hands dirty:
How likely are you to need this later?
Is there something more generic that does the job?
- If so, exhaust those options before rolling your own!
How nested is your code?
What do you gain in making a code block a function?
What do you gain in moving a set of functions into a class?
What do you gain in moving a set of functions/classes to a separate file (module, script, etc.)?
File Length
Is your file getting too long? Are you tired of scrolling up and down so much?
To be honest, it probably isnāt too long.
Are you using code folding shortcuts effectively? You can collapse by different levels of indentation. E.g. If you have a set of test functions defined at one indentation level in, you can fold all the function definitions into their titles with
ctrl/cmd + k, ctrl/cmd 1
. Each indentation level deeper is a higher number.Are you using āPeekā? VS Code has an option to jump to function definitions from references to a function elsewhere (Go to Definition). It also allows the inverse, listing references from the definition itself (Go to References). Use
ctrl/cmd + left click
orshift + F12
andoption + F12
respectively. Using Peek to perform these navigations gives you a little window to work with, so you can update a reference (or several) and not have the mental overhead of finding you way back to deal with.Are you using cursor history navigation? VS Code has a āGo forwardā and āGo backā functionality for the cursor. If the Peek window isnāt enough and you just need more visual context, or hindsight is 20/20 and you need to go back somewhere that wasnāt a linked function definition/reference, you can navigate to where your cursor was last and back again.
Are you using āRename Symbolā? This oneās a stretch, but if you find the possibility of similar names causing rename conflict has you wanting to split your file, consider using Rename Symbol. If you need to change the name of a function or variable everywhere you might not want to change every instance of the word too. Say you have a function named, āfileā, but you realise a little late that
file
is actually a built-in variable in your language that shouldnāt be overriden. If you run āfind and replaceā, every instance of that word will change, including instances of that built-in variable and any time you used the word āfileā in inline comments or as part of a name in any other variables, function calls or methods, etc. Using Rename Symbol ensures you donāt mess with that other stuff.
Learning how to navigate in your IDE efficiently will help prevent you from segregating your code out of navigation frustration. When you think youāve maxed out your ability to grasp navigation tools, if it feels like thereās too much going on in a file, then consider moving stuff.
If after some time you can see sections that have discrete responsibilities that you have not edited in a while, you can try moving them, but consider what you will lose and what you will gain carefully!
Thereās plenty more to consider. Use the IDEās outline, breadcrumbs, etc.
Code Complexity
Cyclomatic complexity is a metric that measures the complexity of a program by counting the number of linearly independent paths through a programās source code.
Essentially, itāll help you keep yourself honest with stats that tell you when youāre a bad boy. Too many nested if statements in a single function, that sort of thing. It wonāt help you with Separation of Concerns (if thatās a principle you follow), but it will help you with keeping complexity low. Unnecessary complexity is what makes a brittle abstraction.
If youāre using Python, consider using Ruff as your linter (and formatter). It enables McCabeās complexity rule for linting by default (c901). Itās written in Rust, is very fast and includes support for a lot of linting rules.
Snippets
Snippets are blocks of text that you can inject at your cursor. If weāre talking code, snippets are a great way to inject common code patterns. They offer a template for a common pattern, like looping or conditional statements.
Iāve never much used snippets until recently.
Iāve had to push through some mental friction in using them. Part of me feels if I use snippets, Iāll forget how to write the code patterns they inject. Itās like trading memory-space; if I need to remember the name of a snippet and a keyboard shortcut for it, Iām going to forget the code pattern.
Fortunately, this isnāt true. How you get your code into the IDE matters a lot less than understanding what it does. Sometimes, I find myself staring at the flashing cursor with 10 different for-loop implementations fighting for my attention. Snippets help take this load off a bit.
Visual Studo Code has a built-in snippet feature that you can use to create snippets. There are also a heap of snippets published on the VSCode marketplace as extensions. When you insert a snippet in VSCode, you can tab through the required variables to quickly fill them out.
Other IDEs have similar implementations.
if you find youself needing snippets across IDEs, code editors in browsers or quickly in a terminal, consider using RayCast if youāre on a Mac ā for a bunch of reasons, snippets being one. Raycast allows snippet injection globally, anywhere you can type. It also includes support for dynamic placeholders (variables). It goes beyond code. Itās handy for templating emails, Zendesk tickets, whatever you can think of.
Iāve also used Espanso, KeyboardMaestro, and AutoHotkey on Windows. They all have their own unique features, but support snippets at a minimum.
AI Assistance
AI Autocomplete is different. It often completes what Iām going to type and of course, I get to accept or deny the suggestion. This is one less cognitive hurdle compared to snippets, Iām not trading any mental real-estate to remember a trigger. It comes with the huge added benefit of being intelligent:
- You donāt need to add any snippets manually
- Completions are fully dynamic, using the context of your file, repo or even whole codebase
- Cycle through multiple suggestions if the first one doesnāt work
Iāve been using Sourcegraphās Cody as an alternative to the non-free Github Copilot. I did some work on a frontend with react-form-json-schema and PowerShell as a backend (WILD stuff ā see the amazing Powershell Universal platform). Because the LLM was familiar with the larger react-form-json-schema project, it had no trouble translating that into my niche PowerShell codebase. All the form validation is in native PowerShell with very consistent Test
functions for each field:
After writing a couple of these validation functions, Cody pretty much did the rest. All my fields were defined in the schema. Cody pre-empted each line I needed. Even the comments. It even replicated my weird Write-Host
stuff with āERRORā and āINFOā prefixes (PU sends Write-Host
to the browser console, but not Write-Debug/Info/Error
).
If your code is predictable enough, you may find AI autocomplete does a fine job of anticipating your structure. This in a way is a good marker of your consistency as a programmer. If you find it anticipating your code well, you might find that the mental strain of yourself anticipating your code is also lowered.
AI isnāt always going to have the completions you want, but it is a step up from snippets for cutting through the tedium. Just be aware that when you donāt want a suggestion it can be a bit of a nuisance and suggest a completiion if you pause for too long. In my opinion, this is more annoying than youād think. If you struggle with maintaining a train-of-thought, unwanted completions will derail you quicker.
Regarding DRY itself, if you find a block of code you think could be more concise, ask Cody to rewrite it. See what it suggests.
AI prompting
Weāre all familiar with ChatGPT by now.
The most recent model GPT-4O has really impressed me. Iāve received some great working code examples from it that just worked out-of-the-box. The Powershell Universal project above, itās all in one file, 400 lines of code. I pasted that into ChatGPT and asked for a few changes and it sent it right back, changes applied. It actually became more convenient to ask for changes I knew how to make directly within ChatGPT itself. By the time I could make the necessary modifications to the FormSchema
, UISchema
and āTestā functions myself, ChatGPT wouldāve finished writing the whole script out again from scratch.
Iām excited to see more convenient coding workflows pop up.
Consider Cody, ChatGPT, GitHub Copilot, Groq, Ollama and Mods to name a few AI toolsā¦ and Cursor, for an AI first IDE. This seems to be getting better by the day.
Environments
Consider your production and development environments. How will it deal with your abstractions? Is it going to play with a modular codebase nicely? Environments can be the difference between a project building successfully or not. If manual changes to the code are required to deploy in common environments, merging upstream changes from git gets messy. This is not a fun experience for your users.
Portainer has a great GitOps style integration for its Stacks feature. It works great with a
docker-compose.yaml
and.env
, but it wonāt easily handle sidecar yaml configs or adocker-compose.override.yaml
. Does the configuration need to be abstracted into multiple files?Powershell Universal has an internal repo that is not designed for manual updates because of the way file-watching works to reload downstream services. Creating and importing modules can get a bit prickly without proper planning.
Application hosting services like Fly.io expect a certain file structure. Keeping your code in line with recommendations might be important to prevent refactoring later.
Can your planned abstractions be used later in this codebase? How about in future projects?
Can you release code in a way to make abstractions more compatible?
Frameworks, Libraries and other Dependencies
This one takes good judgement. Both of the follwing statements are true:
- The best kind of abstractions to work with are ones you didnāt write
- The worst kind of abstractions to work with are ones you didnāt write
Provided you are careful about the dependencies you choose, you can find yourself with a featureful set of abstractions that are well scoped, well planned, maintained, secure, rarely include breaking changes and have a healthy community happy to support.
Nothing lasts forever, but good dependencies should outlast your own projects. When choosing dependencies, check:
It doesnāt do too much.
- Depedencies that do too much become a greater risk to your code if they become unmaintained.
- Poor scoping may also be indicative of poor planning and more difficult maintenance.
Maintainer engagement.
- Are issues responded to in a timely manner?
- Are pull requests reviewed?
- Are new features added?
It has a healthy community.
- Are users asking questions and getting answers from the community?
- Is there a dedicated place for discussion?
- Is the community engaged with the direction of the project?
Itās mature.
- Does it use semantic versioning?
- Are they in Alpha/Beta or Release Candidate stages?
- Have they released a 1.0.0 version yet?
- Is the project prone to breaking changes?
- Linters? Formatters? CI/CD workflows?
- What sort of tooling is used for development?
Documentation.
- Is the code well documented?
- Is the code well commented?
- Is there lag between the main branch and docs?
Source Code.
- Is the code well structured?
- Do you understand it?
- Could you confidently make contributions to it if you needed to?
- How much code is there?
- How abstract is it? š
Rolling-your-own has maintainability implications, but at the end of the day, itās code you wrote and understand. Vulnerabilities in that code wonāt go away unless you fix them, but youāll be safe from supply chain attacks.
Dependencies come with whatever their maintainers decide to put in their code. A healthy open-source community may detect and mitigate security vulnerabilities, fix bugs, add docs and maybe even introduce new features. An unhealthy community can lead to stagnant, vulnerable, out-of-date code that may even stop working with other dependencies or language versions.
Choose wisely!