DEV Community

swyx
swyx

Posted on • Edited on

OCaml Speedrun! 🐫🐪

OCaml is the basis of ReasonML which has been getting a lot of buzz as a leading compile-to-JS language. There was a free OCaml workshop in NYC offered by Jane Street today so I decided to take it and share my notes here. Jane Street has moved billions of dollars for years running an OCaml codebase so they are an extremely credible source of expertise.

Fair warning: This article is not like a normal Dev.To article in that this is a guided walkthrough of a workshop - you are expected to code along or this will be completely useless to you. But with these 24 examples (taking about 2-3 hours) you should get a solid taste of the key language features in OCaml!

And a Disclaimer: I have prior experience working with Haskell so I might have some unconscious knowledge with static typed/functional languages here.



Installing OCaml on your system

Follow this: https://github.com/janestreet/install-ocaml. We had no issues following these instructions. I went for the "reason IDE" extension in VS Code for my dev environment but vim/emacs seem well supported too. Sublime is not supported.

Basic knowledge

opam is the package manager of OCaml. If you followed the instructions above you've already used it.

As part of the above process you also install utop which Jane Street recommends as a better "toplevel" than the default. A "toplevel" is also known as a REPL.

merlin is what is used under the hood for compiling/syntax highlighting/code completion.

We are not using "raw OCaml" - we are using Jane Street's Base flavor which overrides OCaml's stdlib with some of their opinions. This is what you will see in the first line of all the problem sets:

open! Base
Enter fullscreen mode Exit fullscreen mode

Module imports are all done like this. We'll see more of this later.

Going through the workshop

git clone https://github.com/janestreet/learn-ocaml-workshop

and open up /02-exercises. We're going to go through all of these!

Hello World: /01-introduction

As it says in problem.ml, just run jbuilder runtest and you should see the error:

 ✝  learn-ocaml-workshop/02-exercises/01-introduction> jbuilder runtest
Entering directory '/Users/swyx/ocaml/learn-ocaml-workshop'
         ppx 02-exercises/01-introduction/problem.pp.ml (exit 1)
(cd _build/default && ./.ppx/ppx_jane/ppx.exe --dump-ast --cookie 'library-name="problem_1"' -o 02-exercises/01-introduction/problem.pp.ml --impl 02-exercises/01-introduction/problem.ml)
File "02-exercises/01-introduction/problem.ml", line 25, characters 22-23:
Error: String literal not terminated
Enter fullscreen mode Exit fullscreen mode

so if you fix line 25: let () = Stdio.printf "Hello, World!" by adding that last quote, you get

 ✝  learn-ocaml-workshop/02-exercises/01-introduction> jbuilder runtest
Entering directory '/Users/swyx/ocaml/learn-ocaml-workshop'
         run alias 02-exercises/01-introduction/runtest
Hello, World!
Enter fullscreen mode Exit fullscreen mode

Joy to the world! Notice how a new .merlin file is added when you run jbuilder - this is the compiler at work.

Basic Data Types: /02-basic_types

Head to problem.ml again and give it a read. Your task is to implement the two functions on line 65 and 68:

let int_average x y = failwith "For you to implement"
(* val float_average : float -> float -> float *)
let float_average x y = failwith "For you to implement"
Enter fullscreen mode Exit fullscreen mode

If you run jbuilder again you will see errors because these functions are currently implemented with "failwith". Time to get implementing!

Notice that the type signatures are commented out. This folder also has a problem.mli file. This file declares interfaces for the associated file and happens to have the signatures you need so we don't need to worried about it.

solution

int_average can be solved with: let int_average x y = (x + y) / 2 which makes sense. but float_average needs the float specific operators (this is different from Haskell) or you will get this error:

File "02-exercises/02-basic_types/problem.ml", line 163, characters 27-29:
Error: This expression has type float but an expression was expected of type
         Base__Int.t = int
Enter fullscreen mode Exit fullscreen mode

Notice how if you actually go to line 163 you can see the test that generated that error. You solve this with the float specific operators (mentioned in line 13-15):

let float_average x y = (x +. y) /. 2.

Defining Functions: /03-define_functions

This one is pretty straight forward. I did like the fact that

In OCaml, outside of strings, whitespace and newlines are the same.

solution

let plus x y   = x + y

let times x y  = x * y

let minus x y  = x - y

let divide x y = x / y
Enter fullscreen mode Exit fullscreen mode

Calling Functions: /04-call_functions

Average of two numbers is adding and then halving.

solution

let average x y = half (add x y)
Enter fullscreen mode Exit fullscreen mode

Playing around with the multiline syntax and implicit return , this also works:

let average x y =
  let res = add x y in
  half res
Enter fullscreen mode Exit fullscreen mode

and so does this:

let average x y =
  let res = add
    x
    y in
  half
    res
Enter fullscreen mode Exit fullscreen mode

Functions as arguments: /05-twice

Functions as a first class citizen are a pretty well accepted concept everywhere now, I feel like.

solution

let add1 x = x + 1
let square x = x * x
let twice f x = f (f x)
let add2 = twice add1
let raise_to_the_fourth = twice square
Enter fullscreen mode Exit fullscreen mode

Pattern matching: /06-pattern-matching

Another familiar pattern from Haskell and is being proposed in Javascript. But needs a special keyword match _ with

solution

let non_zero x =
  match x with
  | 0 -> false
  | _ -> true
Enter fullscreen mode Exit fullscreen mode

Recursion: /07-simple_recursion

See: Recursion. recursive functions need to be declared with let rec

solution

let rec add_every_number_up_to x =
  (* make sure we don't call this on negative numbers! *)
  assert (x >= 0);
  match x with
  | 0 -> 0
  | _ -> x + (add_every_number_up_to (x-1))

(* Let's write a function to multiply every number up to x. Remember: [factorial 0] is 1 *)
let rec factorial x =
  assert (x >= 0);
  match x with
  | 0 -> 1
  | 1 -> 1
  | _ -> x * (factorial (x-1))
Enter fullscreen mode Exit fullscreen mode

Data Type: Linked Lists: /08-list_intro

This exercise pairs arrays with pattern matching and recursion. The tricky new bit here is immediately destructuring the list that you are matching, which tripped me up a bit. But the provided length example is instructive if you look at it carefully.

solution

let rec sum lst =
  match lst with
  | [] -> 0
  | hd :: tl -> hd + (sum(tl))
Enter fullscreen mode Exit fullscreen mode

Building Lists: /09-list_range

this is another recursive answer again. you want to make the range function recursive and then call itself all the way until from equals to_. i initially tried this:

let rec range from to_ =
  match from with
  | to_ -> []
  | _ -> (from :: (range (from + 1) to_))
Enter fullscreen mode Exit fullscreen mode

and this didnt work because the matching assigns to the to_ instead of compares with it which is annoying.

solution

let rec range from to_ =
  match from = to_ with
  | true -> []
  | false -> (from :: (range (from + 1) to_))
Enter fullscreen mode Exit fullscreen mode

the single = here is an "infix equality" operator which works fine for ints, which is why we use it here. but notice how they had to use a "PPX Rewriter" (see Extra Knowledge section) to implement the list comparison [%compare.equal: int list] in this same problem set.

Recursing through a List: /10-list_product

this one is pretty much the same as exercise 8 where you did the sum of the list.

solution

let rec product xs =
  match xs with
  | [] -> 1
  | a :: b -> a * product(b)
Enter fullscreen mode Exit fullscreen mode

Abstracting functions: /11-sum_product

This is a pretty tricky one. We are creating a function that creates a function, abstracting repeated behavior. From line 5-36 they walk you through an example of how it is done, and then from 47 onward you are expected to do this for a similar but different pattern of behavior. Good luck and pay attention to the patterns that are being used!

solution

let rec every answer combine xs =
  match xs with
  | [] -> answer
  | x :: ys -> combine x (every answer combine ys)

(* Now let's rewrite sum and product in just one line each using every *)
let simpler_sum xs     = every 0 plus xs
let simpler_product xs = every 1 times xs
Enter fullscreen mode Exit fullscreen mode

pretty neat! You can also pass the infix operator as a function (let simpler_sum xs = every 0 (+) xs) but you can't do this for the * operator because (*) collides with commenting in OCaml.

Float functions: /12-list_min

We encounter Float.max, Float.min, Float.infinity and Float.neg_infinity for the first time. pretty straight forward.

solution

let rec smallest xs =
  match xs with
  | [] -> Float.infinity
  | x :: ys -> Float.min x (smallest ys)
Enter fullscreen mode Exit fullscreen mode

Abstractions and Float functions: /13-largest_smallest

Combining the last two exercises - abstracting functions again and using the Float functions.

solution

let simpler_largest  xs = every Float.neg_infinity Float.max xs
let simpler_smallest xs = every Float.infinity Float.min xs
Enter fullscreen mode Exit fullscreen mode

Data Type: Variant Types aka Enums /14-variants

Variants kind of work like Enums except that they can actually carry data:

type card_value =
  | Ace
  | King
  | Queen
  | Jack
  | Number of int

let one_card_value : card_value = Queen
let another_card_value : card_value = Number 8
Enter fullscreen mode Exit fullscreen mode

and this is nice :)

solution

let card_value_to_score card_value =
  match card_value with
  | Ace      -> 11
  | King     -> 10
  | Queen    -> 10
  | Jack     -> 10
  | Number i -> i
Enter fullscreen mode Exit fullscreen mode

this also works for the "or" matching

let card_value_to_score card_value =
  match card_value with
  | Ace      -> 11
  | King
  | Queen
  | Jack     -> 10
  | Number i -> i
Enter fullscreen mode Exit fullscreen mode

Data Type: Tuples and Parameterized Types /15-tuples

Tuples are what they are in other languages, but their typedefs look a little weird

type int_and_string_and_char = int * string * char
let example : int_and_string_and_char = (5, "hello", 'A')
Enter fullscreen mode Exit fullscreen mode

Functions dont have to define their types before hand. they can return the same types of things that are passed to them:

type 'a pair = 'a * 'a`
Enter fullscreen mode Exit fullscreen mode

solution

you can destructure within the funciton definition:

(* this works *)
(* let add coord1 coord2 =
  let (a, b) = coord1 in
  let (c, d) = coord2 in
  (a+c, b+d) *)

(* this also works *)
let add (a, b) (c, d) = (a+c, b+d)
Enter fullscreen mode Exit fullscreen mode

and again

(* this works *)
(* let first pair =
  let (a, _) = pair in
  a *)
(* this too *)
let first (a, _) = a
(* this *)
let second (_,b) = b
Enter fullscreen mode Exit fullscreen mode

the inbuilt functions fst and snd also do the same things these do.

Labelled arguments /16-labelled_arguments

labelling arguments...

val divide : dividend:int -> divisor:int -> int
let divide ~dividend ~divisor = dividend / divisor
Enter fullscreen mode Exit fullscreen mode

Labelled arguments can be passed in in any order (!)

solution

let modulo ~dividend ~divisor = dividend - (divisor * divide ~dividend ~divisor)
Enter fullscreen mode Exit fullscreen mode

Data Type: Options /17-options

An ['a option] is either [None], meaning absence of data, or [Some x] meaning
the data exists, and that data specifically is [x].

solution

let safe_divide ~dividend ~divisor =
  match divisor = 0 with
  | true -> None
  | false -> Some (dividend / divisor)
Enter fullscreen mode Exit fullscreen mode

Anonymous functions /18-anonymous_functions

lambda functions! defined with the fun keyword. eg the double function:

(fun i -> 2 * i)
Enter fullscreen mode Exit fullscreen mode

ironically the question doesnt test this knowledge at all.

solution

let apply_if_nonzero f i =
  match i = 0 with
  | true -> 0
  | false -> f i
Enter fullscreen mode Exit fullscreen mode

List operations /19-list_operations

Now were being introduced to List operations: List.map, List.iter, List.fold_left, List.find, List.find_exn.

solution

This was my first gnarly answer:

let divide dividend divisor  = dividend / divisor
let modulo ~dividend ~divisor = dividend - (divisor * divide dividend divisor)
let mod2 x = modulo x 2
let ifEven x =
  match mod2 x with
  | 0 -> 1
  | _ -> 0
let num_even_ints ints =
  let first = List.map
    ~f:(fun x -> ifEven x)
    ints in
  sum_of_my_ints first
Enter fullscreen mode Exit fullscreen mode

but Jane Street's Core apparently has a filter and a count function:

Core.List.count ~f:(fun x -> x mod 2 = 0)
Enter fullscreen mode Exit fullscreen mode

Type Definitions! /20-reading_sigs

So far we havent had any practice writing our own type definitions so this is going to be tricky. we have to write our own typedefs in line 80-ish. There are two things to be careful of here:

  1. we have to return the abstract type t instead of hardcoding it to int even though the test is int
  2. labelled arguments make it into the typedef too!

Here you also see the module import syntax. We import prelude.ml by adding open! Prelude (note the capital first letter) at the start.

We also start defining scoped modules here with the module keyword, with a sig/end pair for types, and then struct/end pair for code:

module Example : sig
  (*  type defs *)
end = struct
  (*  code *)
end
Enter fullscreen mode Exit fullscreen mode

solutions

  val create: numerator:int -> denominator:int -> t
  val value: t -> float
Enter fullscreen mode Exit fullscreen mode

Folding cardio /21-writing_list_operations

a bunch of exercises here. i failed the first time because the straight append method a :: [b] was appending things in the wrong order, so I needed to use List.append to switch the order around because [b] :: a is not valid syntax. (you can also use List.fold_right)

solutions

  let map f lst = List.fold_left
    lst
    ~init:[]
    ~f:(fun acc x ->  List.append acc [f(x)])

  let iter f lst = List.fold_left
    lst
    ~init:()
    ~f:(fun acc x -> f(x))

  let filter f lst = List.fold_left
    lst
    ~init:[]
    ~f:(fun acc x ->
      match f(x) with
      | true -> List.append acc [x]
      | _ -> acc
    )
Enter fullscreen mode Exit fullscreen mode

Data Type: Immutable Records /22-records

new data type! and making a function that returns records.

solutions

let modify_person (person : person) =
  match person.first_name with
  | "Jan" -> {person with age = 30}
  | _ -> {person with number_of_cars = person.number_of_cars + 6}
Enter fullscreen mode Exit fullscreen mode

Data Type: Mutable Records /23-mutable_records

Mutable records are declared with:

type color =
  | Red
  | Yellow
  | Green [@@deriving compare]

type stoplight =
  { location      : string (* stoplights don't usually move *)
  ; mutable color : color  (* but they often change color *)
  } [@@deriving compare]
Enter fullscreen mode Exit fullscreen mode

and modified with:

let set_color stoplight color =
  stoplight.color <- color
Enter fullscreen mode Exit fullscreen mode

solutions

let advance_color stoplight =
  match stoplight.color with
  | Red -> set_color stoplight Green
  | Yellow -> set_color stoplight Red
  | Green -> set_color stoplight Yellow
Enter fullscreen mode Exit fullscreen mode

Data Type: refs /24-refs

they are declared with:

let x = ref 0
Enter fullscreen mode Exit fullscreen mode

and modified with:

let () =
  x := !x + 1
Enter fullscreen mode Exit fullscreen mode

this solution works without a ref:

let take op a b =
  match op a b with
  | true -> a
  | false -> b

let min_and_max lst =
  List.fold_left
    lst
    ~init:(Int.max_value, Int.min_value)
    ~f:(fun (curmin, curmax) x ->
      (take (<) curmin x, take (>) curmax x)
    )
Enter fullscreen mode Exit fullscreen mode

solutions

some notes on using a ref:

  • you should scope it in your function or you have a persistent state between functions
  • List.iter is List.map without returning a value which will have a warning.
let take op a b =
  match op a b with
  | true -> a
  | false -> b

let min_and_max lst =
  let min = ref Int.max_value in
  let max = ref Int.min_value in
  List.iter
    ~f:(fun x ->
      min := take (<) !min x;
      max := take (>) !max x;
      )
    lst;
  (!min, !max)
Enter fullscreen mode Exit fullscreen mode

*note: see Christophe's solution in the comments as well *

THAT'S ALL FOLKS!

Wasn't too bad was it? You can try tackling their "froggy" example next, but it is a lot of implementation specific stuff using the Js_of_ocaml library.

Extra knowledge

PPX Rewriters extend the base OCaml language with new syntax that will compile to raw ocaml. They are the "Babel" of OCaml. for example

assert([%compare.equal: int list]
        (5 : :[1;8;4])
        [5;1;8;4])
Enter fullscreen mode Exit fullscreen mode

let goes with in, and the ;; trick

lets that aren't paired with in can bleed to the next line of code which could be unrelated. you can trap errors by adding double semicolons so that the compiler knows you are done with a toplevel definition.

let min_and_max lst =
  let min = ref Int.max_value in
  let max = ref Int.min_value in
  List.iter
    ~f:(fun x ->
      min := take (<) !min x;
      max := take (>) !max x;
      )
    lst;
  (!min, !max)
;;

Enter fullscreen mode Exit fullscreen mode

Four ways to compare things

These are basically things that were broken in our test run of the workshop; you shouldn't encounter these but these are useful references for ways to invoke the OCaml syntax (not just for comparing)


  assert ([%compare.equal: string] s "hello");
  assert (String.equal s "hello");
  assert (String.(=) s "hello");
  assert String.(s = "hello");
Enter fullscreen mode Exit fullscreen mode

Top comments (5)

Collapse
 
christopheriolo profile image
Christophe Riolo

Thank you for sharing :)

I spotted a typo in exercise 16 : the function divide that you have given has labeled arguments (type dividend:int -> divisor:int -> int) but you use it without labels (int->int->int) :
let modulo ~dividend ~divisor = dividend - (divisor * divide dividend divisor)

You should instead write :
let modulo ~dividend ~divisor = dividend - (divisor * divide ~dividend ~divisor)

Collapse
 
christopheriolo profile image
Christophe Riolo

I mean, it works just fine, but it is less ambiguous. For example:

divide divisor dividend

would give the wrong result, whereas:

divide ~divisor ~dividend

would give the correct one.

Collapse
 
christopheriolo profile image
Christophe Riolo

And here's an implementation of exercise 24 with refs:

let min_and_max lst =
    (* We get the first element as the base for min and max
       We are assured that the list is not empty, but else it would throw an exception *)
    let min = ref (List.hd_exn lst) in
    let max = ref (List.hd_exn lst) in
    List.iter ~f:(fun elem ->
        if elem < !min then min := elem;
        if elem > !max then max := elem
    ) lst;
    !min, !max
;;
Thread Thread
 
swyx profile image
swyx

nice, thank you very much!!

Collapse
 
swyx profile image
swyx

wew! guess so! thank you! will update. sorry about that!