Sneak preview: Writing Ruby gem native extensions in Rust – Brian Kung
source link: https://briankung.dev/2022/01/31/sneak-preview-writing-ruby-gem-native-extensions-in-rust/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
Sneak preview: Writing Ruby gem native extensions in Rust
Note: anytime you see RFC, that’s a “Request For Comments” about a topic I didn’t understand or take the time to look into. Please feel free to add what you know!
If this post tickles your fancy, check out the follow-up post: Writing a Rust gem from scratch
In December 2021, Ian Ker-Seymer (@ianks) submitted a pull request to enable native extensions in Rust!
I was so excited, I had to try it out, even though it hadn’t been merged yet. A lot of maintainers are showing interest and pitching in, so I have high hopes for it being merged into main. So here are my notes on writing a Rust gem extension.
Requirements
Requirements / dependencies / utilities I used and their versions on macOS Monterey v12.1 (21C52) as of 2022-01-29:
- Bundler version 2.4.0.dev
- gem version 3.4.0.dev
- cargo 1.58.0 (f01b232bc 2022-01-19)
- rustc 1.58.1 (db9d1b20b 2022-01-20)
Using a rubygems
fork
Warning! The following is based on ianks’s development branch of Rubygems. The feature may have changed – or not exist at all – by the time you read this. I’ll modify this warning if the feature ends up being merged.
Find somewhere cozy to clone @ianks’s cargo-builder
branch of rubygems
and run the following:
$ git clone –branch cargo-builder [email protected]:ianks/rubygems.git Cloning into 'rubygems'… remote: Enumerating objects: 224785, done. remote: Counting objects: 100% (5/5), done. remote: Compressing objects: 100% (2/2), done. remote: Total 224785 (delta 3), reused 3 (delta 3), pack-reused 224780 Receiving objects: 100% (224785/224785), 191.82 MiB | 38.18 MiB/s, done. Resolving deltas: 100% (124370/124370), done.
$ cd rubygems
Aliasing your default rubygems
We need to be able to use the gem
and bundle
commands from the cargo-builder
branch of rubygems
. As per the directions in the CONTRIBUTING.md
:
To run commands like
gem install
from the repo:ruby -Ilib bin/gem install
To run commands like
bundle install
from the repo:ruby bundler/spec/support/bundle.rb install
But this is a hassle, so we’ll use aliases instead of typing all of this up every time. cd
into your rubygems
directory and alias these commands with the following:
export RUBYGEMS_PATH="$(pwd)" alias gem="ruby -I$RUBYGEMS_PATH/lib $RUBYGEMS_PATH/bin/gem" alias bundle="ruby $RUBYGEMS_PATH/bundler/spec/support/bundle.rb"
We’re going to use the RUBYGEMS_PATH
variable later on, so keep that handy! Now if you check the version numbers of your default gems, they should be as follows:
$ gem –version 3.4.0.dev
$ bundler –version Bundler version 2.4.0.dev
Note that these aliases won’t be present in a new terminal shell!
Compiling an example gem
We’re ready to test the functionality of a Rust-based gem. For starters, let’s use the rust_ruby_example
gem that I’ve extracted from @ianks’s pull request:
Let’s confirm that it does, indeed, allow us to run Rust code from Ruby.
First, we need to build the gem. We do this by pointing the gem
command at a .gemspec
file. Luckily, the repo has one of those:
$ gem build rust_ruby_example.gemspec –output rust_ruby_example.gem WARNING: licenses is empty, but is recommended. Use a license identifier from http://spdx.org/licenses or 'Nonstandard' for a nonstandard license. WARNING: no homepage specified WARNING: See https://guides.rubygems.org/specification-reference/ for help Successfully built RubyGem Name: rust_ruby_example Version: 0.1.0 File: rust_ruby_example.gem
We also explicitly name the output file, otherwise we get something like rust_ruby_example-0.1.0.gem
, which is just a tad bit more awkward.
And we’re done!
…well, not exactly. As it turns out, extensions aren’t compiled until you install the gem. It makes sense that building the gem and installing the gem are two separate steps. So next we need to install it:
$ gem install rust_ruby_example.gem Building native extensions. This could take a while… Successfully installed rust_ruby_example-0.1.0 1 gem installed
Firing up cargo
took a minute or so on my machine.
Potential errors
Speaking of cargo
, if you don’t have it installed, you may see a message that looks like this:
$ gem install rust_ruby_example.gem Building native extensions. This could take a while… ERROR: Error installing rust_ruby_example.gem: ERROR: Failed to build gem native extension.
current directory: /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/gems/rust_ruby_example-0.1.0 cargo rustc –target-dir /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/extensions/x86_64-darwin-21/3.0.0/rust_ruby_example-0.1.0 –manifest-path /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/gems/rust_ruby_example-0.1.0/Cargo.toml –lib –release –locked — -C linker\=clang -C link-arg\=-fdeclspec -L native\=/Users/brian/.rbenv/versions/3.0.0/lib -L native\=/Users/brian/.rbenv/versions/3.0.0/lib -L native\=/usr/local/opt/icu4c/lib -C link_arg\=-Wl,-undefined,dynamic_lookup -C link_arg\=-Wl,-multiply_defined,suppress -C debuginfo\=1 cargo failedNo such file or directory – cargo
Gem files will remain installed in /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/gems/rust_ruby_example-0.1.0 for inspection. Results logged to /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/extensions/x86_64-darwin-21/3.0.0/rust_ruby_example-0.1.0/gem_make.out
This means that you don’t have cargo installed, or rubygems
couldn’t find cargo
in your $PATH
. Make sure to install Rust and come back when you’re done!
You may also see an error like this:
$ gem install rust_ruby_example.gem Building native extensions. This could take a while… ERROR: Error installing rust_ruby_example.gem: ERROR: Failed to build gem native extension.
No builder for extension 'Cargo.toml'
Gem files will remain installed in /Library/Ruby/Gems/2.6.0/gems/rust_ruby_example-0.1.0 for inspection. Results logged to /Library/Ruby/Gems/2.6.0/extensions/universal-darwin-21/2.6.0/rust_ruby_example-0.1.0/gem_make.out
The key here being the message "No builder for extension 'Cargo.toml'."
If that’s the case, double check your bundler --version
and gem --version
to make sure they match the versions above. Your current version of the gem
utility is missing @ianks’s CargoBuilder
addition.
Inspecting rust_ruby_example
code
rust_ruby_example
includes some sample code in src/lib.rs
:
#[no_mangle] unsafe extern "C" fn pub_reverse(_klass: VALUE, mut input: VALUE) -> VALUE { let ruby_string = cstr_to_string(rb_string_value_cstr(&mut input)); let reversed = ruby_string.to_string().chars().rev().collect::<String>(); let reversed_cstring = CString::new(reversed).unwrap(); let size = ruby_string.len() as c_long;
rb_utf8_str_new(reversed_cstring.as_ptr(), size) }
If you’ve never seen Ruby internal code before, a few of these methods look like exactly what you’d call in C code, courtesy of a library called rb-sys
. The key here is in the name of the method – pub_reverse
reverses strings. Here’s where the reversal actually happens:
let reversed = ruby_string.to_string().chars().rev().collect::<String>();
There’s also an initialization function, Init_rust_ruby_example
, to actually define the Ruby modules and methods. Let’s piece together what it’s doing. Here are the relevant lines for declaring a Ruby module:
let name = CString::new("RustRubyExample").unwrap(); // … let klass = unsafe { rb_define_module(name.as_ptr()) };
…and the rest is all adding the reverse
method to the module:
// Name of the function let function_name = CString::new("reverse").unwrap();
// transmute the function for unknown reasons let callback = unsafe { std::mem::transmute::< unsafe extern "C" fn(VALUE, VALUE) –> VALUE, unsafe extern "C" fn() –> VALUE, >(pub_reverse) }; // …Bind the transmuted function as a module function on the RustRubyExample module unsafe { rb_define_module_function(klass, function_name.as_ptr(), Some(callback), 1) }
Note that it needs to translate everything into Matz’s Ruby compatible data structures. That includes the module, the module function, and even the string name for the function.
( RFC: what is the purpose of std::mem::transmute
here?)
Click to read more about the purpose of std::mem::transmute
Trying out rust_ruby_example
Alright, if you’ve seen this message:
$ gem install rust_ruby_example.gem Building native extensions. This could take a while… Successfully installed rust_ruby_example-0.1.0 1 gem installed
…you’re ready to go! Fire up IRB and require rust_ruby_example
to take it for a test drive:
$ irb irb(main):001:0' require 'rust_ruby_example' => true irb(main):002:0> RustRubyExample.reverse("rust_ruby_example") => "elpmaxe_ybur_tsur"
It reverses the string, as promised. It works!
…or does it? Let’s see if it’s really doing our bidding by modifying the code.
Adding a lowercase
method
Let’s add a RustRubyExample#lowercase
method. It will be exactly the same as RustRubyExample#reverse
, except it converts case-convertible text to lower case.
It should work like this:
$ irb irb(main):001:0' require 'rust_ruby_example' => true irb(main):002:0> RustRubyExample.lowercase("RustRubyExample") => "rustrubyexample"
And we can confirm that it currently does not work:
$ irb irb(main):001:0> require 'rust_ruby_example' => true irb(main):002:0> RustRubyExample.lowercase("RustRubyExample") Traceback (most recent call last): 4: from /Users/brian/.rbenv/versions/3.0.0/bin/irb:23:in `<main>' 3: from /Users/brian/.rbenv/versions/3.0.0/bin/irb:23:in `load' 2: from /Users/brian/.rbenv/versions/3.0.0/lib/ruby/gems/3.0.0/gems/irb-1.3.0/exe/irb:11:in `<top (required)>' 1: from (irb):2:in `<main>' NoMethodError (undefined method `lowercase' for RustRubyExample:Module)
So let’s add it. Once again we need #[no_mangle]
to tell the compiler not to alter the name of the function once it’s been compiled. Mangling essentially namespaces function names so there are no name collisions in the final binary. However, in our case, we want to be able to refer to it by the name we give it in C Ruby, so we don’t want our function name to be mangled
Add this block of code between the pub_reverse
and Init_rust_ruby_example
functions in src/lib.rs
:
// in src/lib.rs
#[no_mangle]
We’re also going to copy the function signature:
// in src/lib.rs
#[no_mangle] unsafe extern "C" fn pub_lowercase(_klass: VALUE, mut input: VALUE) -> VALUE { // … }
( RFC: why does this need the _klass
argument?)
Click to read more about _klass
argument
Next we take the Ruby VALUE
input and cast it to a Rust string, then lowercase it using standard Rust String
methods:
// in src/lib.rs
#[no_mangle] unsafe extern "C" fn pub_lowercase(_klass: VALUE, mut input: VALUE) -> VALUE { let ruby_string = cstr_to_string(rb_string_value_cstr(&mut input)); let lowercased = ruby_string.to_lowercase(); // … }
…and the rest is all glue code to convert it to a C string and then to a Ruby string:
// in src/lib.rs
#[no_mangle] unsafe extern "C" fn pub_lowercase(_klass: VALUE, mut input: VALUE) -> VALUE { let ruby_string = cstr_to_string(rb_string_value_cstr(&mut input)); let lowercased = ruby_string.to_lowercase(); let lowercased_cstring = CString::new(lowercased).unwrap(); let size = ruby_string.len() as c_long;
rb_utf8_str_new(lowercased_cstring.as_ptr(), size) }
We also need to add the method to the Ruby module. We can do that by duplicating the relevant code in Init_rust_ruby_example
:
// in src/lib.rs
#[allow(non_snake_case)] #[no_mangle] pub extern "C" fn Init_rust_ruby_example() { let name = CString::new("RustRubyExample").unwrap();
// …Code for defining "RustRubyExample#reverse" omitted
let function_name = CString::new("lowercase").unwrap(); let callback = unsafe { std::mem::transmute::< unsafe extern "C" fn(VALUE, VALUE) –> VALUE, unsafe extern "C" fn() –> VALUE, >(pub_lowercase) }; let klass = unsafe { rb_define_module(name.as_ptr()) };
unsafe { rb_define_module_function(klass, function_name.as_ptr(), Some(callback), 1) } }
Now to build and reinstall it:
$ gem build rust_ruby_example.gemspec –output rust_ruby_example.gem // Warnings omitted Successfully built RubyGem Name: rust_ruby_example Version: 0.1.0 File: rust_ruby_example.gem
$ gem install rust_ruby_example.gem Building native extensions. This could take a while… Successfully installed rust_ruby_example-0.1.0 1 gem installed
Finally, let’s test our new functionality:
$ irb irb(main):001:0> require 'rust_ruby_example' => true irb(main):002:1* RustRubyExample.lowercase("RustRubyExample") => "rustrubyexample"
Awesome!
Conclusion
This is all possible due to Ian’s long, hard slog to get this into rubygems
proper: https://github.com/rubygems/rubygems/pull/5175. This could be a hugely impactful addition to rubygems
that’s been stewing since 2019 and it’s mostly been Ian’s efforts to get it there. Thanks, Ian!
If you got this far, check out the follow-up post: Writing a Rust gem from scratch
Recommend
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK