Introduction to Rust generics [1/2]: Traits

Tue, May 31, 2022

Imagine that you want to add a camera to a computer that lacks one. You buy a webcam and connect it via a USB port. Now imagine that you want to add storage to the same computer. You buy an external hard drive and also connect it via a similar USB port.

This is the power of generics applied to the world of physical gadgets. A USB port is a generic port, and an accessory that connects to it is a module. You don't have device-specific ports: one port for a specific webcam vendor, another for a different vendor, yet another for a vendor of external USB drives, and so on. You can connect almost any USB device to any USB port and have it work (software driver compatibility aside). PC vendors don't have to plan for every module you may want to connect to your computer. They just have to follow the generic and universal USB specification.

The same applies to code. A function can perform a specific task against a specific type, and a generic function can perform a specific task on some (more on that later) types.

This post is an excerpt from my book Black Hat Rust

add can only add two i64 variables.

fn add(x: i64, y: i64) -> i64 {
    return x + y;
}

Here, add is supposed to add two variables of any type.

fn add<T>(x: T, y: T) -> T {
    return x + y;
}

But this code is not valid: it makes no sense to add two planes (for example). And the compiler doesn't even know how to add two planes! This is where constraints come into play.

use std::ops::Add;

// Output = T tells the compiler that adding two T values yields a T
fn add<T: Add<Output = T>>(x: T, y: T) -> T {
    return x + y;
}

Here, add can add any types that implement the Add trait. By the way, this is how we do operator overloading in Rust: by implementing traits from the std::ops module.
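To make the operator overloading remark concrete, here is a minimal sketch. The Vec2 type is hypothetical and exists only for this illustration. The generic add function spells out the Output = T bound so that the result of x + y is known to be a T.

```rust
use std::ops::Add;

// Hypothetical 2D vector type, used only for this illustration.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Vec2 {
    x: i64,
    y: i64,
}

// Implementing Add is what makes `a + b` legal for Vec2.
impl Add for Vec2 {
    type Output = Vec2;

    fn add(self, other: Vec2) -> Vec2 {
        Vec2 {
            x: self.x + other.x,
            y: self.y + other.y,
        }
    }
}

// Works for any T whose `+` operator yields a T again.
fn add<T: Add<Output = T>>(x: T, y: T) -> T {
    x + y
}

fn main() {
    let a = Vec2 { x: 1, y: 2 };
    let b = Vec2 { x: 3, y: 4 };

    println!("{:?}", a + b);
    println!("{}", add(40, 2));
    println!("{:?}", add(a, b));
}
```

The same add function now serves both the built-in integers and our own type, without knowing anything about either.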

Generics

Generic programming's goal is to improve code reusability and reduce bugs by allowing functions, structures, and traits to have their types defined later.

In practice, it means that an algorithm can be used with multiple different types, provided that they fulfill the constraints. As a result, if you find a bug in your generic algorithm, you only have to fix it once. If you had to implement the algorithm 4 times for 4 different but similar types (let's say int32, int64, float32, float64), not only would you have spent 4x more time implementing it, but you would also spend 4x more time fixing the same bug in all the implementations (assuming you didn't introduce other bugs due to fatigue).
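As a small illustration of "write once, use with many types", here is a sketch of a generic algorithm. The function name largest and its constraints (PartialOrd + Copy) are choices made for this example, not something prescribed by the text above.

```rust
// Returns the largest element of a slice, or None if the slice is empty.
// Written once, usable with any type that can be compared and copied.
fn largest<T: PartialOrd + Copy>(items: &[T]) -> Option<T> {
    let mut iter = items.iter().copied();
    let mut max = iter.next()?;
    for item in iter {
        if item > max {
            max = item;
        }
    }
    Some(max)
}

fn main() {
    println!("{:?}", largest(&[3i32, 7, 1]));
    println!("{:?}", largest(&[2.5f64, -1.0, 0.3]));
    println!("{:?}", largest::<i64>(&[]));
}
```

If the comparison logic had a bug, fixing it here would fix it for every numeric type at once.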

In Rust, functions, traits (more on that below), and data types can be generic:

use std::fmt::Display;

// a generic function, whose type parameter T is constrained
fn generic_display<T: Display>(item: T) {
    println!("{}", item);
}

// a generic struct
struct Point<T> {
    x: T,
    y: T,
}

// another generic struct
struct Point2<T>(T, T);

// a generic enum
enum Option<T> {
    Some(T),
    None
}


fn main() {
    let a: &str = "42";
    let b: i64 = 42;

    generic_display(a);
    generic_display(b);

    let (x, y) = (4i64, 2i64);

    let point: Point<i64> = Point {
        x,
        y
    };

    // generic_display(point) <- not possible. Point does not implement Display
}

Generics are what allow Rust to be so expressive. Without them, it would not be possible to have generic collections such as Vec, HashMap, or BTreeSet.

use std::collections::HashMap;

struct Contact {
    name: String,
    email: String,
}

fn main() {
    // imagine a list of imported contacts with duplicates
    let imported_contacts = vec![
        Contact {
            name: "John".to_string(),
            email: "[email protected]".to_string(),
        },
        Contact {
            name: "steve".to_string(),
            email: "[email protected]".to_string(),
        },
        Contact {
            name: "John".to_string(),
            email: "[email protected]".to_string(),
        },
        // ...
    ];

    let unique_contacts: HashMap<String, Contact> = imported_contacts
            .into_iter()
            .map(|contact| (contact.email.clone(), contact))
            .collect();
}

Thanks to the power of generics, we can reuse HashMap from the standard library and quickly deduplicate our data!

Imagine having to implement those collections for all the types in your programs?
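The deduplication trick itself can be made generic too. The sketch below assumes a hypothetical helper named dedup_by_key: it works for any item type T and any key type K that can be hashed and compared, and later items overwrite earlier ones with the same key.

```rust
use std::collections::HashMap;
use std::hash::Hash;

// Hypothetical helper: keeps one item per key.
// Later items overwrite earlier ones that share the same key.
fn dedup_by_key<T, K, F>(items: Vec<T>, key_of: F) -> HashMap<K, T>
where
    K: Hash + Eq,
    F: Fn(&T) -> K,
{
    items
        .into_iter()
        .map(|item| (key_of(&item), item))
        .collect()
}

fn main() {
    // keyed by first letter: Option<char> -> &str
    let words = vec!["apple", "avocado", "banana"];
    let by_initial = dedup_by_key(words, |w| w.chars().next());
    println!("{}", by_initial.len());

    // keyed by parity: bool -> u32
    let numbers = vec![1u32, 2, 3, 4, 5];
    let by_parity = dedup_by_key(numbers, |n| n % 2 == 0);
    println!("{:?}", by_parity.get(&true));
}
```

One HashMap implementation from the standard library covers both calls, even though the key and value types have nothing in common.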

Traits

Traits are Rust's equivalent of interfaces in other languages (with some differences).

As defining a term by its synonym is not really useful, let's see what it means in code:

pub trait Dog {
    fn bark(&self) -> String;
}

pub struct Labrador{}

impl Dog for Labrador {
    fn bark(&self) -> String {
        "wouf".to_string()
    }
}

pub struct Husky{}

impl Dog for Husky {
    fn bark(&self) -> String {
        "Wuuuuuu".to_string()
    }
}

fn main() {
    let labrador = Labrador{};
    println!("{}", labrador.bark());

    let husky = Husky{};
    println!("{}", husky.bark());
}

// Output:

// wouf
// Wuuuuuu

By defining a Dog trait, all types that implement this trait in our program will be considered as being a Dog.

This is why we say that traits (and interfaces) allow programmers to define shared behavior: behaviors that are shared by multiple types.
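Shared behavior pays off once a function is written against the trait rather than against a concrete type. Here is a minimal sketch that repeats the Dog definitions so it can run on its own. The bark_twice function is a hypothetical helper added for illustration.

```rust
pub trait Dog {
    fn bark(&self) -> String;
}

pub struct Labrador {}

impl Dog for Labrador {
    fn bark(&self) -> String {
        "wouf".to_string()
    }
}

pub struct Husky {}

impl Dog for Husky {
    fn bark(&self) -> String {
        "Wuuuuuu".to_string()
    }
}

// Hypothetical helper: accepts any type that implements Dog.
fn bark_twice<D: Dog>(dog: &D) -> String {
    format!("{} {}", dog.bark(), dog.bark())
}

fn main() {
    println!("{}", bark_twice(&Labrador {}));
    println!("{}", bark_twice(&Husky {}));
}
```

bark_twice never mentions Labrador or Husky, so a new breed only needs its own impl Dog block to be accepted.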

Default Implementations

It's possible to provide default implementations for trait methods:

pub trait Hello {
    fn hello(&self) -> String {
        String::from("World")
    }
}

pub struct Sylvain {}

impl Hello for Sylvain {
    fn hello(&self) -> String {
        String::from("Sylvain")
    }
}

pub struct Anonymous {}

impl Hello for Anonymous {}

fn main() {
    let sylvain = Sylvain{};
    let anonymous = Anonymous{};

    println!("Sylvain: {}", sylvain.hello());
    println!("Anonymous: {}", anonymous.hello());
}
// Output:

// Sylvain: Sylvain
// Anonymous: World
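A common use of default implementations is to build a default method on top of a required one, so that implementors only have to provide the minimum. The sketch below uses hypothetical names (Greeter, English, Pirate) purely for illustration.

```rust
pub trait Greeter {
    // Required: every implementor must provide a name.
    fn name(&self) -> String;

    // Default: built on top of the required method.
    fn greet(&self) -> String {
        format!("Hello, {}!", self.name())
    }
}

struct English;

impl Greeter for English {
    fn name(&self) -> String {
        String::from("World")
    }
    // greet() is inherited from the trait
}

struct Pirate;

impl Greeter for Pirate {
    fn name(&self) -> String {
        String::from("matey")
    }

    // Overrides the default implementation.
    fn greet(&self) -> String {
        format!("Ahoy, {}!", self.name())
    }
}

fn main() {
    println!("{}", English.greet());
    println!("{}", Pirate.greet());
}
```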

Traits composition

Traits can be composed to require more advanced constraints:

pub trait Module {
    fn name(&self) -> String;
    fn description(&self) -> String;
}

pub trait SubdomainModule {
    fn enumerate(&self, domain: &str) -> Result<Vec<String>, Error>;
}

fn enumerate_subdomains<M: Module + SubdomainModule>(module: M, target: &str) -> Vec<String> {
    // ...
}
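The snippet above leaves the function body and the Error type out. As a runnable sketch of the same idea, the version below makes a few assumptions that are not from the original: String stands in for the error type, Wordlist is a hypothetical module, and the function falls back to an empty list when enumeration fails.

```rust
pub trait Module {
    fn name(&self) -> String;
    fn description(&self) -> String;
}

pub trait SubdomainModule {
    // Assumption: a plain String stands in for the error type.
    fn enumerate(&self, domain: &str) -> Result<Vec<String>, String>;
}

// Hypothetical module: prepends a fixed list of labels to the domain.
struct Wordlist {
    prefixes: Vec<&'static str>,
}

impl Module for Wordlist {
    fn name(&self) -> String {
        String::from("wordlist")
    }

    fn description(&self) -> String {
        String::from("Guess subdomains from a fixed list of prefixes")
    }
}

impl SubdomainModule for Wordlist {
    fn enumerate(&self, domain: &str) -> Result<Vec<String>, String> {
        if domain.is_empty() {
            return Err(String::from("empty domain"));
        }
        Ok(self
            .prefixes
            .iter()
            .map(|prefix| format!("{}.{}", prefix, domain))
            .collect())
    }
}

// M must implement both traits: name() comes from Module,
// enumerate() comes from SubdomainModule.
fn enumerate_subdomains<M: Module + SubdomainModule>(module: &M, target: &str) -> Vec<String> {
    match module.enumerate(target) {
        Ok(subdomains) => subdomains,
        Err(err) => {
            eprintln!("{}: {}", module.name(), err);
            Vec::new()
        }
    }
}

fn main() {
    let module = Wordlist {
        prefixes: vec!["www", "mail"],
    };
    println!("{}: {}", module.name(), module.description());
    println!("{:?}", enumerate_subdomains(&module, "example.com"));
}
```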

Async Traits

When this post was written, async functions in traits were not natively supported by Rust (basic support was stabilized later, in Rust 1.75). Fortunately, David Tolnay has our back (one more time): we can use the async-trait crate.

use async_trait::async_trait;

#[async_trait]
pub trait HttpModule: Module {
    async fn scan(
        &self,
        http_client: &Client,
        endpoint: &str,
    ) -> Result<Option<HttpFinding>, Error>;
}

Generic traits

Traits can also have generic parameters:

use std::fmt::Display;

trait Printer<S: Display> {
    fn print(&self, to_print: S) {
        println!("{}", to_print);
    }
}

struct ActualPrinter{}

// implements Printer<S> for ActualPrinter, for any S that is Display
impl<S: Display> Printer<S> for ActualPrinter {}

fn main() {
    let s = "Hello";
    let n: i64 = 42;
    let printer = ActualPrinter{};

    printer.print(s);
    printer.print(n);
}

// output:

// Hello
// 42

And even better, you can implement a generic trait for a generic type:

use std::fmt::Display;

trait Printer<S: Display> {
    fn print(&self, to_print: S) {
        println!("{}", to_print);
    }
}

// implements Printer<S: Display> for any type T
impl<S: Display, T> Printer<S> for T {}

fn main() {
    let s = "Hello";
    let printer: i64 = 42;

    printer.print(s);
}

// Output:

// Hello
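One consequence of a trait having a type parameter is that a single type can implement it several times, once per parameter. The following sketch uses a hypothetical Describe trait and Port type to show this; the compiler picks the implementation from the type that is expected at the call site.

```rust
// Hypothetical trait for illustration: describe a value as an L.
trait Describe<L> {
    fn describe(&self) -> L;
}

struct Port(u16);

// First implementation: a human-readable label.
impl Describe<String> for Port {
    fn describe(&self) -> String {
        format!("port {}", self.0)
    }
}

// Second implementation for the same type:
// true when the port is in the privileged range (below 1024).
impl Describe<bool> for Port {
    fn describe(&self) -> bool {
        self.0 < 1024
    }
}

fn main() {
    let port = Port(443);

    // The type annotation selects which implementation is called.
    let text: String = port.describe();
    let privileged: bool = port.describe();

    println!("{} (privileged: {})", text, privileged);
}
```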

The `derive` attribute

When you have a lot of traits to implement for your types, it can quickly become tedious and make your code more complex.

Fortunately, Rust has something for us: the derive attribute.

By using the derive attribute, we are actually feeding our types to a derive macro, which is a kind of procedural macro.

Procedural macros take code as input (in this case, our type) and create more code as output, at compile time.

This is especially useful for data serialization and deserialization: just by deriving the Serialize and Deserialize traits from the serde crate, the (almost) universally used serialization library in the Rust world, we can serialize and deserialize our types to and from a lot of data formats: JSON, YAML, TOML, BSON and so on...

use serde::{Serialize, Deserialize};

#[derive(Debug, Clone, Serialize, Deserialize)]
struct Point {
    x: u64,
    y: u64,
}

Without much effort, we just implemented the Debug, Clone, Serialize and Deserialize traits for our struct Point.
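To get a feel for what a derive saves us from writing, here is a sketch that compares a derived type with a hand-written equivalent, using only standard library traits. The Derived and Manual struct names are hypothetical, and the manual impls are only roughly what the macro produces, not its exact expansion.

```rust
use std::fmt;

// The compiler generates the three impls for us.
#[derive(Debug, Clone, PartialEq)]
struct Derived {
    x: u64,
    y: u64,
}

// The same Debug and Clone behavior, written by hand.
struct Manual {
    x: u64,
    y: u64,
}

impl Clone for Manual {
    fn clone(&self) -> Self {
        Manual {
            x: self.x,
            y: self.y,
        }
    }
}

impl fmt::Debug for Manual {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.debug_struct("Manual")
            .field("x", &self.x)
            .field("y", &self.y)
            .finish()
    }
}

fn main() {
    let derived = Derived { x: 1, y: 2 };
    let manual = Manual { x: 1, y: 2 };

    println!("{:?}", derived.clone());
    println!("{:?}", manual.clone());
}
```

Every field added to Manual has to be added to both impl blocks by hand, while Derived keeps itself up to date.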

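One thing to note is that all the fields of your struct need to implement the traits. For a generic struct, the derived implementations carry that requirement as a bound on the type parameter:

use serde::{Serialize, Deserialize};

// The derived impls only apply when T implements the same traits:
// Point<T> is Debug only if T is Debug, Clone only if T is Clone, and so on.
#[derive(Debug, Clone, Serialize, Deserialize)]
struct Point<T> {
    x: T,
    y: T,
}

// You can also state the std bounds on the struct itself, so that
// a Point<T> cannot even be declared with a T that lacks them.
// (Deserialize takes a lifetime parameter, so a bare `T: Deserialize`
// bound does not compile; serde's derive infers the bounds it needs.)
use core::fmt::Debug; // Import the Debug trait

#[derive(Debug, Clone)]
struct Point<T: Debug + Clone> {
    x: T,
    y: T,
}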
