Binding nanomsg with melange

For those who haven't heard about it, nanomsg is a great library from Martin Sustrik. It provides high performance messaging with a simple API similar to that of BSD sockets. It bears some similarity to what someone might do with ZeroMQ and is licensed under the liberal terms of the MIT license. The nanomsg library is currently in a pre-alpha state.

I have recently written Dylan bindings for it, which are available as nanomsg-dylan.

In Dylan, we have a couple of options when writing bindings. We have a low level direct-c-ffi and a higher level C-FFI. Using the lower level interface is fairly tedious and verbose while using C-FFI is fairly convenient. But writing a binding using either involves a lot of work and hand-translation of the C APIs into the right Dylan definitions. (Currently, the direct-c-ffi system is not documented.)

This is where the melange tool is very useful. Melange can parse C headers and automatically generate the C-FFI bindings. While doing this code generation, it also handles details like automatically translating the C names into the correct Dylan names following the Dylan conventions.

Melange now has some preliminary documentation.

Using Melange

Melange generates a Dylan file which contains C-FFI bindings. This file will need to be compiled by the dylan-compiler. As part of that, you will want to provide the usual .lid and library.dylan files. We don't discuss these additional files below, but a fully working setup can be seen in the nanomsg-dylan repository on GitHub.

Melange is not a perfect tool and in particular, it does not do a perfect job of parsing C. At times, it can be rather fiddly and frustrating. We plan to continue to improve upon it though and welcome bug reports and patches. Other aspects of it, such as those discussed below, work remarkably well and should remain largely the same in the future.

Getting Melange

Melange isn't packaged yet, so you will have to clone the melange git repository. After doing that, it should be a simple matter of running make to produce a build of the melange executable.

You'll want to add the resulting _build/bin directory to your PATH environment variable. In the future, this will be more straightforward.

The first draft

I start out with a simple interface definition, nanomsg.intr:

module: nanomsg

define interface
  #include {
    "nanomsg/nn.h",
    "nanomsg/fanin.h",
    "nanomsg/inproc.h",
    "nanomsg/pair.h",
    "nanomsg/reqrep.h",
    "nanomsg/survey.h",
    "nanomsg/fanout.h",
    "nanomsg/ipc.h",
    "nanomsg/pubsub.h",
    "nanomsg/tcp.h"
  },

  equate: {"char *" => <c-string>},

  import: all;

This tells melange to process that list of include files and import all definitions found within them. The equate: clause informs melange that it should consider char * (and also const char *) to be the Dylan type <c-string>.

Melange interface definition files use the file extension .intr by convention.

However, this will also import some definitions that are for internal use within nanomsg and that we don't need to bind. To deal with this, we introduce an exclude: clause:

module: nanomsg

define interface
  #include {
    ...
  },

  equate: {"char *" => <c-string>},

  import: all,

  exclude: {
    "NN_HAUSNUMERO",
    "NN_PAIR_ID",
    "NN_PUBSUB_ID",
    "NN_REQREP_ID",
    "NN_FANIN_ID",
    "NN_FANOUT_ID",
    "NN_SURVEY_ID"
  };

We might also notice that not everything we need was imported into Dylan. In particular, various functions can cause nn-errno to return EAGAIN, EADDRINUSE and other error codes that are defined by the OS. Those constants aren't imported because they aren't defined in the headers that we're directly including.

The best way to get these imported is to specifically import them:

import: all,

// Pick up the definitions that aren't defined by nanomsg itself.
import: {
  "EADDRINUSE",
  "EADDRNOTAVAIL",
  "EAFNOSUPPORT",
  "EAGAIN",
  "EBADF",
  "EFAULT",
  "EINTR",
  "EINVAL",
  "EMFILE",
  "ENAMETOOLONG",
  "ENODEV",
  "ENOMEM",
  "ENOPROTOOPT",
  "ENOTSUP",
  "EPROTONOSUPPORT",
  "ETIMEDOUT"
},

Input / Output Parameters

If we look at nn_version in the C headers, we'll see that it is defined as:

NN_EXPORT void nn_version (int *major, int *minor, int *patch);

This is not so convenient when using it from Dylan. We can simplify this though by adding a function clause after the interface definition. This function clause will help melange refine how the function definition is mapped into Dylan's C-FFI:

function "nn_version",
  output-argument: 1,
  output-argument: 2,
  output-argument: 3;

With this refinement in place, we can now call nn-version as follows:

let (major, minor, patch) = nn-version();

Awesome!
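
For comparison, here is roughly what the same binding could look like if written by hand against the C-FFI. This is a sketch that relies on the C-FFI's support for output parameters; it is not the exact code that melange emits, and it would replace the generated definition rather than sit alongside it.

```dylan
// Hand-written C-FFI sketch of nn_version. Each "output parameter"
// makes the C-FFI allocate space for an int, pass its address to C,
// and return the stored value as one of the function's results.
define C-function nn-version
  output parameter major :: <C-int*>;
  output parameter minor :: <C-int*>;
  output parameter patch :: <C-int*>;
  c-name: "nn_version";
end;
```

Since the C function returns void, the three output parameters become the only return values, which is why the call site can bind them with a single let.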

Improving upon error status codes

With a direct mapping of the C API into Dylan, we're left having to handle error checking in the same way as the C API. This should be enough to make us all feel a bit sad:

let res = nn-bind(sock, "inproc://test");
if (res < 0)
  let error = nn-errno();
  // Do something
end if;

Fixing this is a bit trickier.

What we want to do is say that the return type of functions like nn-bind isn't merely an integer, but a special type which has meaning when it is less than zero. In Dylan's C-FFI, we call this a C-mapped-subtype. That sounds complicated, but this code should make it more readily understandable:

define class <nn-error> (<error>)
  constant slot nn-error-status :: <integer>,
    required-init-keyword: status:;
  constant slot nn-error-message :: <string>,
    init-keyword: message:,
    init-value: "Unknown error";
end;

define C-mapped-subtype <nn-status> (<C-int>)
  import-map <integer>,
    import-function:
      method (result :: <integer>) => (checked :: <integer>)
        // nanomsg reports failure with a negative result; the actual
        // error code has to be fetched from nn-errno.
        if (result < 0)
          let errno = nn-errno();
          if (errno ~= $EAGAIN)
            error(make(<nn-error>,
                       status: errno,
                       message: nn-strerror(errno)));
          end;
        end;
        result
      end;
end;

Here we've just defined an error type, <nn-error>, as well as our C-mapped-subtype, <nn-status>. When we import a value that is an <nn-status>, the import function is called to help map the value from C to Dylan. In this case, if the result is less than 0 and nn-errno reports anything other than $EAGAIN, we signal an error.

We specifically exclude $EAGAIN as it isn't usually an error when it occurs, such as when using the $NN-DONTWAIT flag. In that case, the negative result is returned to the caller unchanged.

Note that Melange interface files can include regular Dylan code which will simply be directly copied to the generated Dylan file.

Now, we just need to add function clauses to specify when to use <nn-status> as the result type:

function "nn_bind",
  map-result: <nn-status>;

function "nn_close",
  map-result: <nn-status>;

Easy, once we know what we're doing, right? :)
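
With the result mapped this way, the earlier error check can be written as an ordinary Dylan condition handler. The following is a minimal sketch: sock is assumed to be a socket created elsewhere, the function name is made up for illustration, and format-out comes from the io library's format-out module.

```dylan
// Hypothetical helper showing how a caller might handle <nn-error>.
define function bind-or-report (sock :: <integer>) => ()
  block ()
    // Signals <nn-error> if nanomsg reports a failure.
    nn-bind(sock, "inproc://test");
  exception (e :: <nn-error>)
    format-out("nn-bind failed (%d): %s\n",
               e.nn-error-status, e.nn-error-message);
  end block;
end function;
```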

Handling I/O

Another small difficulty to resolve is actually sending and receiving data.

In C, the relevant functions look like:

NN_EXPORT int nn_send (int s, const void *buf, size_t len, int flags);
NN_EXPORT int nn_recv (int s, void *buf, size_t len, int flags);

For now, we'll set up I/O using <buffer> from the I/O library. Similar techniques can be used with <byte-vector> or <byte-string>.

First, we're going to want to write wrappers around the nn-send and nn-recv functions, but we'd still like for our wrappers to keep those names, so we'll rename the raw C-FFI functions, via a rename: clause in our interface definition:

rename: {
  "nn_recv" => %nn-recv,
  "nn_send" => %nn-send
};

Now, we can set up some wrapper functions:

define inline function nn-send
    (socket :: <integer>, data :: <buffer>,
     flags :: <integer>)
 => (res :: <integer>)
  %nn-send(...)
end;

define inline function nn-recv
    (socket :: <integer>, data :: <buffer>,
     flags :: <integer>)
 => (res :: <integer>)
  %nn-recv(...);
end;

To actually pass data through to %nn-send and get it back from %nn-recv, we need to do a little more work though. We want to get a pointer to the underlying storage within a <buffer> and pass that to the C functions.

To do that, we define a new C-mapped-subtype and a helper function, buffer-offset, which uses some low level primitives to get at the internal storage and return its address as a <machine-word>. In this code, we don't need the data-offset parameter, but it can be useful in cases where you want to work with a subset of a buffer.

define simple-C-mapped-subtype <C-buffer-offset> (<C-void*>)
  export-map <machine-word>, export-function: identity;
end;

// Function for adding the base address of the repeated slots of a <buffer>
// to an offset and returning the result as a <machine-word>.  This is
// necessary for passing <buffer> contents across the FFI.

define function buffer-offset
    (the-buffer :: <buffer>, data-offset :: <integer>)
 => (result-offset :: <machine-word>)
  u%+(data-offset,
      primitive-wrap-machine-word
        (primitive-repeated-slot-as-raw
           (the-buffer, primitive-repeated-slot-offset(the-buffer))))
end function;

We'll have to tell melange that these functions want a <C-buffer-offset>:

function "nn_recv",
  map-argument: { 2 => <C-buffer-offset> },
  map-result: <nn-status>;

function "nn_send",
  map-argument: { 2 => <C-buffer-offset> },
  map-result: <nn-status>;

And now we can provide the full definition for nn-send and nn-recv:

define inline function nn-send
    (socket :: <integer>, data :: <buffer>,
     flags :: <integer>)
 => (res :: <integer>)
  %nn-send(socket, buffer-offset(data, 0), data.size, flags)
end;

define inline function nn-recv
    (socket :: <integer>, data :: <buffer>,
     flags :: <integer>)
 => (res :: <integer>)
  %nn-recv(socket, buffer-offset(data, 0), data.size, flags);
end;
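
To show how these wrappers fit together, here is a small sketch that sends a message between two sockets in the same process. It assumes that melange has generated nn-socket, nn-connect, $AF-SP and $NN-PAIR from nn_socket, nn_connect, AF_SP and NN_PAIR in the nanomsg headers; the function name and the endpoint address are made up for illustration.

```dylan
// Hypothetical round trip over an in-process pair of sockets.
define function send-hello () => (received :: <integer>)
  let a = nn-socket($AF-SP, $NN-PAIR);
  let b = nn-socket($AF-SP, $NN-PAIR);
  nn-bind(a, "inproc://example");
  nn-connect(b, "inproc://example");

  // Copy the message into a <buffer>, one byte at a time.
  let message = "hello";
  let outgoing = make(<buffer>, size: message.size);
  for (i from 0 below message.size)
    outgoing[i] := as(<integer>, message[i]);
  end for;
  nn-send(a, outgoing, 0);

  // Receive into a larger buffer. The result is the number of
  // bytes that nanomsg reports for the received message.
  let incoming = make(<buffer>, size: 64);
  let count = nn-recv(b, incoming, 0);

  nn-close(a);
  nn-close(b);
  count
end function;
```

Note that nn-send passes data.size as the length, so the whole buffer is sent. Allocating a buffer that is exactly as large as the message keeps stray bytes off the wire.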

Further Improvements

Further improvements are possible:

Define a specialized type that we use for sockets so that they can't be confused with regular integers.
Provide custom wrappers around nn-setsockopt and nn-getsockopt to handle the data conversions involved.
Do something to improve the experience of using the zero-copy nanomsg APIs.

Some of this is already done in the nanomsg-dylan repository while other work remains. Feel free to try out the bindings and report any issues that you encounter.

In future blog posts, we'll write about using the C-FFI directly as well as using the lower level direct-c-ffi.

Hopefully you have a good idea now of what is involved in producing bindings for a C library using the melange tool and are ready to try binding a library on your own!