Andrew Nesbitt

From ZeroVer to SemVer: A Comprehensive List of Versioning Schemes in Open Source

2024-06-24T15:52:00+00:00

You might think that semantic versioning (SemVer) is the only versioning scheme in open source software, but there’s actually an array of alternatives out there. In this post, I’ve compiled a comprehensive list of the various versioning schemes floating around in the internet, from ZeroVer to SemVer, and everything in between.

SemVer

Semantic Versioning (SemVer) is probably the most widely used and often assumed default versioning scheme in open source software. It follows the format MAJOR.MINOR.PATCH, where:

MAJOR version increments indicate incompatible API changes,
MINOR version increments add functionality in a backward-compatible manner,
PATCH version increments make backward-compatible bug fixes.

This scheme provides a clear and predictable way to manage and communicate changes in software projects.

Originally proposed by Tom Preston-Werner in 2010, SemVer has since become the de facto standard for versioning in the open source community, used by many package managers and repositories, including npm, RubyGems, and Cargo.

CalVer

Calendar Versioning (CalVer) is a versioning scheme that uses a date-based version number, typically in the format YYYY.MM.DD. This scheme is based on the idea that version numbers should be human-readable and easy to understand, providing a clear indication of when a release was made.

CalVer offers multiple calendar-based formats to suit different project needs:

YYYY.MM.DD: Full date (e.g., 2021.03.22) indicating the exact release day.
YYYY.MM: Year and month (e.g., 2021.03) for monthly releases.
YYYY.0M.DD: Zero-padded month and day (e.g., 2021.03.05) for consistency in sorting.
YY.0M.DD: Short year, zero-padded month, and day (e.g., 21.03.05) for compactness.
YYYY.WW: Year and week number (e.g., 2021.12) for weekly releases.
YY.0W: Short year and zero-padded week (e.g., 21.05) for a simpler weekly format.

CalVer was proposed by Mahmoud Hashemi in March 2016.

ZeroVer

ZeroVer is a satirical versioning scheme where the version number always starts with 0. This convention humorously suggests that software is perpetually in its initial development stages, indicating that it is not yet stable and that APIs are subject to change. Versions might look like 0.1.0, 0.2.0, and so on.

While it’s meant as a joke, many projects unintentionally fall into ZeroVer compatibility simply because they have never made the jump to 1.0.0. This can also serve to set expectations about the software’s maturity and stability during early development.

Version 0.0.1 of ZeroVer was published by Mahmoud Hashemi on 2018-04-01.

EffVer

Intended Effort Versioning (EffVer) is a versioning scheme that, instead of quantifying the orthogonality of a change, tries to quantify the intended work required to adopt the change. EffVer doesn’t make a distinction between bug fixes, enhancements, and features, as releases often include all of these elements.

Instead, it focuses solely on the effort required for existing users to adopt new versions. The version number reflects the effort needed, with higher numbers indicating more significant changes and potentially greater effort to adopt. This approach helps users better plan for updates by understanding the impact on their workflow.

EffVer was proposed by Jacob Tomlinson in January 2024 as an alternative to SemVer and CalVer, aiming to provide a more user-centric approach to versioning.

RomVer

Romantic Versioning (RomVer) is a versioning scheme based on Semantic Versioning (SemVer) but attempts to extract the “versioning spec” from real-world usage of software versions, such as those in Node, Rails, PHP, jQuery, NPM, and the Linux Kernel. RomVer follows the format HUMAN.MAJOR.MINOR, where:

HUMAN is a memorable, human-readable name or word that encapsulates the essence or milestone of the release,
MAJOR indicates significant, potentially backward-incompatible changes,
MINOR denotes smaller, backward-compatible updates and bug fixes.

RomVer seeks to enforce some rules to make software versioning predictable and straightforward, helping users better understand the impact of updates. By using human-readable names, it provides a more intuitive way to identify and recall different versions, aligning closely with real-world practices observed in various software projects.

RomVer was proposed by Daniel V from the Legacy Blog crew in 2015.

The term as also used by Jeremy Ashkenas in 2012 in the context of Backbone.js.

Sentimental Versioning

Sentimental Versioning takes a whimsical and highly personal approach to versioning. Unlike SemVer, which has a formal specification, Sentimental Versioning provides a guide with some playful suggestions. Key aspects include:

Creativity and Originality: You should not copy another sentimental versioning system. Instead, create something unique and original.
Personal Meaning: The version number must be meaningful to you, the author.
Flexibility: You may explain the system you create if the beauty is enhanced by understanding it, or you may improvise new numbers based on your mood on that day.

Sentimental Versioning encourages a deeply personal and creative expression through version numbers, making each version a reflection of the author’s sentiments and individuality.

Sentimental Versioning was proposed by Dominic Tarr in August 2014.

HashVer

Hash Versioning (HashVer) is perfect if you are publishing very frequently. The HashVer format consists of 3 or 4 values separated by a period:

Full year (printf(“%Y”))
Zero padded month (printf(“%m”))
[Optional] Zero padded day (printf(“%d”))
10+ characters of the current source control commit’s hash

Examples:

2020.01.67092445a1abc
2019.07.21.3731a8be0f1a8

HashVer allows for frequent and precise versioning by incorporating both the release date and a unique identifier from the source control system. This makes it easy to track specific builds and changes over time.

Because commit hashes are random, additional granularity like days or a CI build index ensures proper ordering when releasing frequently. This approach allows for precise and traceable versioning by combining the release date with a unique commit identifier.

HashVer was proposed by miniscruff in 2020.

GitDate

GitDate Versioning is similar to HashVer, addressing issues with Semantic Versioning by using the commit date from a Git repository. This provides clear and precise tracking of changes.

Format: year.month.day.git-short-code

Examples:
- 2021.03.22.d31d336
- 2021.03.31.44cf59b1
- 2022.02.14.2c52a964

GitDate indicates the release date of each version for easy identification of when a version was created. It allows quick lookup of changes using git compare and provides visibility into the last release for infrastructure teams. Even without git tags, specific versions can be cloned using the format.

However, multiple builds on the same day may require git repository access to determine the order. Additionally, it allows customers to see when the software was last updated.

GitDate was proposed by Taylor Brazelton in June 2022.

PragVer

Pragmatic Versioning optimizes for communicating changes to package consumers while retaining simple semantics for package maintainers. The format is BIGRELEASE.ANNOUNCE.INCREMENT, where:

BIGRELEASE indicates major updates or significant milestones controlled by the package author.
ANNOUNCE communicates notable announcements or changes.
INCREMENT is a smaller, incremental update for every contribution.

Pragmatic Versioning addresses the challenges faced by package authors, enabling them to maintain old versions and release new ones efficiently. It helps communicate what will be maintained, how to get the most appropriate version, and allows for automatic releases without explicitly labeling each change. This approach ensures clear expectations and effective communication with the user community.

Pragmatic Versioning was proposed by Severin Ibarluzea in December 2023.

WendtVer

WendtVer is a tongue-in-cheek versioning system designed to minimize the thought required for version increments. Starting at 0.0.0, every commit increments the version number following these rules:

PATCH version increments on every commit.
MINOR version increments when the next PATCH would be 10, rolling PATCH over to 0.
MAJOR version increments when the next MINOR would be 10, rolling MINOR over to 0.

Additional labels for pre-release and build metadata are not available as extensions to the MAJOR.MINOR.PATCH format.

This system, while creating chaotic and meaningless version numbers, makes it easy to predict the next version. WendtVer serves as a humorous alternative to Semantic Versioning, emphasizing simplicity over rational versioning practices.

WendtVer was proposed by Brian Wendt in August 2018

SoloVer

SoloVer is a simple and expressive versioning specification that uses a single number with an optional postfix.

Format: []

version_number: A single integer, starting at 0.
postfix: Optional, matching regex [+-][A-Za-z0-9_]+.

Rules:

Increment the version number for each new release.
Postfixes can be added for extra context.

Precedence:

Higher numbers follow lower numbers.
+ postfixes come after no postfix (for hotfixes).
- postfixes come before no postfix (for pre-releases).
Postfixes are sorted alphanumerically; longer ones come last.

SoloVer does not communicate “backward compatibility” and focuses on simplicity and expressiveness, leaving compatibility documentation and testing to the provider and user.

SoloVer was proposed by beza1e1 in March 2024.

BreakVer

Break Versioning (BreakVer) aims to fix several issues identified with Semantic Versioning (SemVer). BreakVer addresses the complexity of SemVer’s lengthy specification and its lack of distinction between major and minor breakages.

BreakVer aims to be easy to follow strictly, making it more reliable in practice. It emphasizes the maximum potential impact of version updates, ensuring clear communication about the safety of upgrades.

BreakVer Specification:

Format: ..[-]
- major: Major breaking changes or significant non-breaking changes.
- minor: Minor breaking changes or minor non-breaking changes.
- non-breaking: Strictly no breaking changes.
- optional-qualifier: Tags like -alpha1, -RC2, etc.

Version Bumps:

non-breaking: Always a safe upgrade.
minor: Check the CHANGELOG for minor breakages.
major: Check the CHANGELOG for major breakages.

BreakVer was proposed by Peter Taoussanis in August 2015 as part of the encore clojure library.

Explicit Versioning

Explicit Versioning:

Explicit Versioning is a specification for developers who care about releasing software with clear, explicit announcements of intended breaking changes. It uses an extra required identifier to handle intentional incompatible changes.

Specification Schema: Explicit Versioning uses a schema composed of four identifiers, represented as:

Disruptive.Incompatible.Compatible.Fix[-Optional_Identifiers]

Key Differences from SemVer:

Isolation of Incompatible Releases: Isolates any intentional incompatible release from other types of releases.
Disruptive Situations: The most left number is only incremented when a disruptive situation occurs in the software, not for minimal backward incompatible changes.
Clarity and Precision: Reduces ambiguity in the usage and interpretation of the versioning schema.

This approach ensures that any significant, intentional changes are clearly communicated, providing a more precise and unambiguous versioning system for developers and users.

Explicit Versioning was proposed by Paulo Renato in February 2017, more details can be found in the Announcement Post.

ZenVer

Zen Versioning (ZenVer) is a revolutionary, simple, and straightforward versioning specification. Unlike Semantic Versioning, ZenVer demands nothing from the developer and less from the end-user. The idea is clear: number go up, software go new.

Larger is greater, greater is larger. Larger number means better software.

Specification: Given a version number VERSION, increment the:

VERSION when you make incompatible API changes.
VERSION when you add functionality in a backward-compatible manner.
VERSION when you make backward-compatible bug fixes.
VERSION when you change practically anything.

No additional labels, titles, or subtitles for pre-release and build metadata should be appended to the VERSION format. Such metadata can be used outside the version string but should be avoided in package meta information.

ZenVer was proposed by NotAShelf in May 2024.

PVP

Package Versioning Policy (PVP) is a versioning scheme designed specifically for the Haskell ecosystem to manage package dependencies and ensure compatibility. It provides a formal specification for version numbers to help developers and users maintain and use Haskell packages effectively.

Version Number Format: A.B.C

A.B: Major version number.
C: Minor version number.

Rules:

Breaking Changes: If any entity is removed or altered in a way that could break compatibility, increment the major version (A.B).
Non-Breaking Changes: If only new bindings, types, or classes are added without breaking existing functionality, increment the minor version (C).
Minor Updates: For minor, non-breaking updates like documentation corrections, other components of the version may change, but A.B.C can remain the same.

Principles:

Isolation of Incompatible Releases: Major version increments isolate incompatible changes, ensuring clarity for users about potential breaking changes.
Disambiguation: Clear rules reduce ambiguity in how versions are incremented.
Consistency: Ensures consistent dependency specifications to avoid build and compatibility issues.

PVP is specifically tailored to the Haskell ecosystem, differing from Semantic Versioning (SemVer) by not using a special meaning for 0.x.y versions and not supporting version tags or build metadata.

PVP was proposed to the Haskell community by Bulat Ziganshin in 2006, three years before SemVer, on the Haskell mailing list.

FloatVer

FloatVer is a versioning scheme that uses non-negative IEEE754 32-bit floating point numbers in base-10 format. FloatVer is natively supported by the majority of programming languages, CPUs, and GPUs. Some example FloatVer version numbers are 0.7 and 290.10008.

Version Number Format: breaking.nonbreaking

breaking: incremented by 1 for backward-incompatible changes
nonbreaking: incremented for backward-compatible changes
Purely nonbreaking changes MUST NOT increment the breaking part of the number.
Breaking changes MAY change the nonbreaking part of the number to any value.
Leading and trailing zeros are ignored, except in the case of 0.0.
0.0 is the minimum version and starting point for all FloatVer projects.
340282346638528859811704183484516925440.0000000000000000 is the largest possible version supported by FloatVer.

In the context of FloatVer, “backward-compatible” means:

no semantic changes to existing functionality
an updated existing feature uses no more memory, time, or cores than its previous version
the software takes up no more than 1.5 times the space of the previous version

These rules ensure that nonbreaking versions permit safe automatic upgrades. FloatVer was proposed by Alex Shroyer in July 2024.

AsymptoVer

AsymptoVer – yes, I’ve just made up that name – is a peculiar, maybe even esoteric, versioning scheme. It was popularized by Donald Knuth (see also Errata on the book series Computers & Typesetting). Version numbers are decimal numbers that asymptotically approach a number with an infinite decimal expansion (usually an irrational number).

Examples:

Version numbers of TeX approach Archimedes’ constant. The scheme was adopted at version number 3, followed by version 3.1, then 3.14, then 3.141, and so on (see entry A000796 in the OEIS).
Version numbers of Metafont approach Euler’s number, starting at 2, followed by 2.7, then 2.71, and so on (see entry A001113 in the OEIS).

If you feel like joking: There are plenty irrational algebraic numbers and transcendental numbers available.

If you know of any other versioning schemes that should be on this list? Let me know on mastodon or submit a pull request to GitHub.

Some other versioning related links:

Spec-ulation by Rich Hickey
SemVer will not save you by Hynek Schlawack
Software Versioning on Wikipedia
Software Versioning Schemes by Krystian Safjan

Ecosyste.ms 2023 End of Year Update

2023-12-21T00:00:00+00:00

This year has been another very productive one for Ecosyste.ms, we’ve designed, built, deployed and maintained 16 significant services to support, sustain, and secure critical digital infrastructure.

Some of the stats from the biggest services:

Packages:

Registries: 59
Packages: 8,647,405
Versions: 93,009,665
Namespaces: 1,284,530
Maintainers: 1,412,065
Database size: 245GB
Background jobs processed: 688,025,502

Repos:

Hosts: 784
Repositories: 183,755,645
Owners: 10,543,833
Tags: 188,198,256
Manifests: 190,349,152
Dependencies: 14,051,505,152
Database size: 2.49 TB
Background jobs processed: 727,979,154

Timeline:

Events: 7.43 Billion
Database size: 8.69 TB

Some new services we deployed this year:

Issues - An open API service for providing issue and pull request metadata for open source projects.
Commits - An open API service providing commit metadata for open source projects.
Summary - An open API service for producing an overview of a list of open source projects.
Docker - An open API service providing dependency metadata for docker projects.
Papers - An open API service providing mapping between scientific papers and software projects that are mentioned in them.
OST - An open API service for discovering and reviewing projects for opensustain.tech.

We also helped launch ClimateTriage.com with Opensustain.tech to help people discover a meaningful way to contribute to open source projects focused on climate technology and sustainability powered by the whole range of ecosyste.ms services.

And we published some very large open data releases, check them all out on the blog: https://blog.ecosyste.ms/2023/08/24/open-data-releases.html

We experienced some growing pains over the year, increases in API usage and the pure amount of data being indexed (over 10TB of data across all services) required upgrades and changes to infrastructure to keep it running well but things are quite stable now.

We also used up the majority of our grant from Plaintext group and are exploring ways to continue funding further development of features and level up the services.

A big project we have planned starting in the new year is to work with OSC to build tools to measure the impact of funding open source software and tracking changes in metrics over time of open source software projects that are part of OSC.

We are always looking to work with more partners to help them add support for more ecosystems to their products as well as expanding the number of ecosystems and repository hosts that we index.

Making 24 Pull Requests more inclusive for 2018

2018-11-29T00:00:00+00:00

For the past 6 years, 24 Pull Requests has been encouraging developers to give back to the open source projects they use. The idea was simple: “Send 24 pull requests between December 1st and December 24th”, giving little gifts of code throughout December.

Since the initial launch of the project in 2012, open source usage has exploded with millions of people contributing to projects around the world but one area where open source continues to struggle with is in the diversity of contributors.

One problem that compounds the diversity issue is that tools and metrics that measure open source contributions often only acknowledge code contributions, ignoring all the other ways that people can contribute to open source projects.

The GitHub contribution graph is a prime example, which only recognises a handful of kinds of contribution. This can make it difficult for people to get the recognition they deserve, potentially hampering their motivation and even hurting their job prospects.

GitHub profiles don’t show the full picture of open source contribution

It took me 6 years to realise that 24 Pull Requests has been part of that problem too. In previous years, you could only participate if your contributions could be presented in the form of a pull request. This is particularly difficult for people are contributing by:

Organising an event
Answering questions on Stackoverflow
Supporting and mentoring new contributors
Speaking at a conference or user group
Managing the governance aspects of a project
Writing blog posts about how to use a project
Mailing stickers to contributors
Donating money to help pay a projects infrastructure costs
and many other ways of contributing without writing code…

Those kinds of tasks are often under taken by women, people of colour and other underrepresented groups, only making things worse as their contributions become invisible to many people.

So this year we’re making the biggest change to 24 Pull Requests since the start of the project: you’ll be able to record every kind of contribution you make to open source, not just pull requests.

There’s a new form on the site that allows you to record the contributions you’ve made each day that wouldn’t usually make sense as a pull request:

The new 24 Pull Requests contribution form

Those non-pullrequest contributions are treated exactly the same as pull requests across the site, showing up on your profile and the homepage, in other words all contributions are considered equal.

User profile page showing a mix of pull requests and non-pull request contributions

With this change, we hope to use 24 Pull Requests to encourage all kinds of contributions to open source projects, it also opens up the ability to track contributions to open source projects that aren’t hosted on GitHub.com.

In summary, here’s how to track your contributions this year:

For contributions that are pull requests, everything will work the same as previous years, just login and sync your pull requests from the dashboard.

For contributions that you’ve made to an open source project that aren’t pull requests, you can record them in the new contribution form.

See you on December 1st!

Untangle your GitHub Notifications with Octobox

2018-11-25T00:00:00+00:00

As an active contributor and maintainer of many open source projects on GitHub, I struggle to keep up with the various in-progress issues and pull requests across multiple repositories and find it hard to see an overview of everything I’m involved in. If you manage or contribute to any number projects on GitHub, I’m betting you probably have the same issue.

This is because GitHub Notifications are marked as read and disappear from the list as soon as you load the page or view the email of the notification. This makes it very hard to keep on top of which notifications you still need to follow up on.

Most open source maintainers and even some GitHub staff end up using a complex combination of filters and labels in Gmail to manage their notifications from their inbox. If, like me, you try to avoid email, then you might want to try Octobox.

Octobox is designed with exactly that problem in mind. Firstly it adds an extra “archived” state to each notification so you can mark it as “done”. If new activity happens on the thread/issue/pr, the relevant notification will pop back into your inbox. You can also star notifications that are special to you.

This puts the control of when to clear notifications back in your hands, you can work through your inbox at your own pace. It also means you can always find old notifications, which, on GitHub, disappear a few days after you’ve read them.

The other problem that GitHub power users struggle with is the sheer number of notifications they get, some users are getting 200+ notifications every single day. Octobox helps to tackle this in a number of ways:

Filters: you can filter the notifications in your inbox in pretty much any way you can imagine; by repository, organization, type, action, state, CI status and reason and keep notifications from bots alongside your regular labels, author and assignees.
Multi-select: Archive and mute a whole bunch of notifications at once, allowing you to stay on top of even the noisiest repositories with ease.
Search: Combine a wide range of powerful search filters help you get straight to the notification you’re looking for and focus on just what you need, then pin your favorite searches to the sidebar for easy access.
Keyboard shortcuts: Quickly navigate, triage and manage your notifications like a pro using Gmail-inspired keyboard shortcuts for every function, no mouse required.

Octobox started life as a side project in December 2016, since then it’s grown to help over 10,000 people manage over 4 million notifications and the project has been downloaded almost half a million times by developers hosting their own versions. It’s now one of the most popular open source tools on GitHub.

Last month Ben and I announced that we’re going to start working full time to make Octobox a truly sustainable open source project that can financially support itself and the community.

Today we’re expanding the scope of Octobox and launching on the GitHub Marketplace, offering new, paid enhancements for private repositories on Octobox.io with a two week free trial period.

You can also get the same enhancements for private repositories by donating or becoming a sponsor on Open Collective, same price but the support goes directly to the community.

We’ve also got a whole host of new features planned for the coming months, including:

Snoozing notifications until later
Gmail-style automated notification filters
Localization and internationalization
View comment threads right in the Octobox interface
Team discussions and direct user-to-user messaging
Highlighting important notifications to you
Allow replying to an issue/pull request directly from Octobox

To keep up with everything that’s going on and even contribute directly to the project, check out the open source project on GitHub.

What does a sustainable open source project look like?

2017-11-10T00:00:00+00:00

Over the past couple years, thanks in part to Nadia Eghbal’s Roads and Bridges report, the world is starting to see the need for long-term sustainability in key open source projects that are the foundation of the ecosystems that our society relies on.

Getting financial support to work on an open source project can be difficult, but it’s not impossible. Thankfully a growing number of people are coming up with ways to make working on open source projects a financially secure prospect. But many of these models involve offering something in return for money.

Advertising offers marketing exposure to sponsors. Consulting, books, and training offer insights and knowledge not easily available from the project itself. SaaS services offer a hassle-free hosting option and ‘open core’ models offer advanced add-ons under proprietary licenses.

The key thing for me is that many of these models don’t actually support the project directly. Often the time invested in open source project development is paid for with the money left over after delivering something else you sold.

Why is this the case? By being freely available for everyone, open source makes it a challenge to sell direct improvements, as only the first customer needs to fund the improvements and everyone else gets them for free.

So we end up in a sort of standoff, with everyone waiting for someone else to support the project unless there is a time sensitive issue that’s cheap to fix. This is one of the biggest challenges in funding open source work and a particularly tricky problem to work around in a capitalist economy.

So I decided to articulate what a successful, sustainable open source project looks like, including what kinds of work do people do on it, and what kind of community of users and contributors it has. Then with that as an end point, we can work backwards to figure out how to bring that world to life.

I’m going to focus on high level things that apply to as many projects as possible rather than practices that may be specific to a programming language or ecosystem.

So here’s my list of things that make up the ideal sustainable open source project:

Governance

The project has structures in place for making high-level decisions and enforcing communication standards, codes of conduct etc.
Decisions are publicly documented and communicated to all interested parties.
The maintainers of the project have taken steps to ensure it does not rely on any single person to be able to get work done.
Plans for future development direction, ideas, and goals are kept up to date in a roadmap document.
Taking steps to ensure that the project fosters a good, diverse community and is welcoming and friendly to users and contributors alike.

Documentation

The project has good-quality documentation, covering all the public APIs and interfaces. They are updated with each release.
Commit messages should describe what and why the change was made as per Chris’ great guidance.
Human-focused release notes should be published with every release, listing notable changes and deprecations.
If possible, documentation should be available in multiple languages or at least open to contributions from translators.

Code Quality

Code should have a consistent style throughout the project, ideally programmatically enforced with linters and documented with style guides where necessary.
The project should have good test coverage, with tests being run automatically on a CI environment after every commit.
There should be a documented code review process for all contributions involving both automated checks and human approval process to keep code quality levels high.

Support

Contributions and support requests and should be responded to in a timely manner even if a fix isn’t possible straight away.
Outstanding tickets should be triaged on a regular basis to ensure stale issues don’t fall through the cracks.
If there is a long-term support release available, the policies around it should be documented and future release dates included in roadmaps.
The project should document all supported runtime/language versions and major external dependency version compatibility, which should also have automated testing setup.
New release candidates should be tested against as many upstream dependency versions as realistically possible to ensure backwards compatibility or enable communication of breaking compatibility changes.
It’s also a good practise to keep an eye out for posts on Q&A sites like Stack Overflow where users often go to get support with open source projects.

Ecosystem Collaboration

Maintainers should identify and coordinate with related projects to reduce potential for conflicts on new releases and breaking changes
For projects that are heavily depended upon, automated integration testing against key downstream dependencies should be set up for early warning detection of unseen breakages and conflicts.
Projects should have a clear process for proposing and discussing large changes such as an RFC.

Security

There should be a documented process for privately reporting security issues to the project’s maintainers as well as clear guidelines for maintainers on how to handle reported security issues.
Projects should acquire a CVE for all known security vulnerabilities and document which released versions the CVEs apply to.
Commits and releases should be signed by the authors so that users can verify whether what they downloaded matches the same contents you released.
Maintainers should have 2FA and strong passwords on all related accounts (GitHub, package manager registries, email etc)
If releases include publishing compiled binaries, there should be a provenance chain for those binaries, ideally compatible with the Reproducible Builds program.
If necessary a threat model should be documented to highlight where the software is most vulnerable to attack and how to mitigate those threats.
Any reports produced whilst researching the security aspects of the project should be published within a reasonable timeframe.

Legal

The project should be made available under one of the OSI-Approved Licenses, that’s any license that fits with the Open Source Definition.
All licenses and trademarks for the project should be properly documented and ideally available in machine readable format like SPDX as well.
There should be a succession plan in place in case of the death of maintainer to allow other maintainers to legally take control of the assets of the project incase the worst happens.

Finance

As with any entity that is dealing with money and people, correct accounting and tax reporting should be done based on the laws of the countries that the maintainers reside in.
In some cases a legal organisation should be set up to to protect the liabilities of individuals involved, either as a regular business or a not-for-profit organisation.
With people being paid to work on the project, potentially from a variety of countries, policies around pay rates and expenses should be set up.
For ultimate transparency, an open ledger of all project income and outgoings could be used to show exactly how funds are being spent on the project.

Marketing

Having a recognizable brand can help the project build a strong audience of users and contributors, that includes having a logo and website of it’s own to help users understand what the project is about and also control the brand of the project outside of the GitHub repository page.
Projects should aim to keep their users and contributors up to date with what’s going on with the project, including larger announcements and highlighting interesting goings on in the development process as well as sharing useful related content via an email newsletter, blogging and twitter.
Surveys can also be a useful tool to collect quantitative and qualitative information about how and why the use the project and what else they would find useful to help inform future roadmapping decisions

Dependency Hygiene

If the project has dependencies it should ensure that each dependency is properly licensed and that license is compatible with the project.
Dependencies should also be checked for any potential security or compatibilities on a regular basis, including transitive dependencies.

One thing you might notice here is that many of the tasks involved in running a sustainable project don’t involve writing any code at all, behind every successful, sustainable open source project are many people making non-code contributions that are necessary to keep everything working.

We can also start to see which attributes different stakeholders in the project are going to care about most, from there we can connect the dots and start to get a clearer picture of which funding models fit with a project whose stakeholders care about those attributes being present in the project.

It shouldn’t be too surprising that almost all of those attributes are also attributes of a successful software business. Every sustainable open source project ends up being run like a business in some ways, even if there aren’t any stakeholders that want to profit from it directly. Infact every successful open source project is a business. It’s just there are no paying customers.

Let’s change that!

If you’d like to add something to the list, I’ve put it up on GitHub, pull requests are very welcome: https://github.com/librariesio/sustainable-oss-attributes

As always, follow me on Twitter at @teabass for more updates.

Exploring Unseen Open Source Infrastructure

2017-02-24T00:00:00+00:00

Whilst working on Libraries.io, I often stumble across libraries that appear to be used by an incredible amount of open source projects but often don’t have any of the usual signs of being a popular project on GitHub.

Take debug_inspector for example:

25 stars
21 commits
4 contributors
5 watchers
4 forks
2 open issues
Last commit over 2 years ago

At face value if the GitHub page you’d be forgiven for mistaking it as a small, project that’s barely used, when in fact it’s listed as a dependency in over 111,000 open source projects!

Libraries.io has a number of different pages for exposing interesting and unexpected lists of libraries, including ones with a low bus factor and ones that have been yanked from their package manager, so I thought I’d add one to show the most unappreciated but highly used libraries.

This afternoon I shipped the Unseen Open Source Infrastructure page: https://libraries.io/unseen-infrastructure

To paraphrase Arfon Smith on Request for Commits #3, “Stars on GitHub are a measure of attention, more akin to a Like on Facebook than a measure of quality or usage”, this page shows hundreds of projects that are depended upon by at least 1,000 other open source repositories but have less than 100 stars.

Any of these projects could be the next left-pad or Heartbleed where an underlying, critical library is highly used but has very little attention paid to it. An unnoticed security issue or abandoned project could potentially could result in hundreds of thousands of affected software applications.

You can help these projects and the communities that depend upon them by reviewing the code for these libraries, helping out with open issues, sharing them on social media and thanking the maintainers for their hard, often unrewarding work to keep things running behind the scenes.

Something that Ben and I are planning to do a lot more of over the next year and beyond is help to highlight and support the open source software that is critical to today’s technology infrastructure.

Another area that needs exploring is system level package managers like apt and yum, which contain even more important and often overlooked libraries that often aren’t hosted on a social platform like GitHub but are still critical to world of software.

If you’d like to get involved, the whole project is open source and we’d love to help you get started contributing, or if you’d like to build tools on top of all this data, check out the Libraries.io REST API

As always, follow us on Twitter at @teabass and @benjam for more updates.