Paul J. Lucas

Posted on Aug 5, 2018

Go-tcha: When assigning via pointer changes your type

#go #pitfall

I’m currently in the process of porting my CHSM (Concurrent Hierarchical Finite State Machine) project that has C++ and Java implementations to Go. Porting involves both rewriting the run-time library in Go as well as augmenting the CHSM compiler to emit Go code.

Briefly, a CHSM is like an ordinary finite state machine: states can be entered and exited by receipt of events that cause transitions among the states. Unlike an ordinary finite state machine, a CHSM can have some parent states that have child states nested within them: entering a parent state in turn enters one of its child states. When a state has been entered but not yet exited, it is considered to be active.

To focus on the point of this article, the implementations presented are simplified versions of the real thing. For example, let’s implement State as:

type State interface {
    Enter()
    Exit()
    Name() string
}

type CoreState struct {
    name   string
    parent Parent
}

func (s *CoreState) Enter() {
    fmt.Printf("CoreState.Enter(%s)\n", s.Name())
    if s.parent != nil {
        s.parent.switchActiveChildTo(s)
    }
}

func (s *CoreState) Exit() {
    fmt.Printf("CoreState.Exit(%s)\n", s.Name())
}

func (s *CoreState) Name() string {
    return s.name
}

That’s mostly straightforward. Next, let’s implement Parent as:

type Parent interface {
    State

    switchActiveChildTo(State)
}

type CoreParent struct {
    CoreState

    children    []State           // A parent has zero or more children.
    activeChild State             // One child is the active one.
}

func (p *CoreParent) Enter() {
    fmt.Printf("CoreParent.Enter(%s)\n", p.Name())
    p.CoreState.Enter()           // Enter ourselves first.
    if p.activeChild == nil {     // Default to first child.
        p.activeChild = p.children[0]
    }
    p.activeChild.Enter()         // Enter activeChild.
}

func (p *CoreParent) Exit() {
    p.activeChild.Exit()          // Exit activeChild first.
    p.CoreState.Exit()            // Then exit ourselves.
    fmt.Printf("CoreParent.Exit(%s)\n", p.Name())
}

func (p *CoreParent) switchActiveChildTo(child State) {
    p.activeChild = child
}

The semantics of CHSMs include the ability to enter a child state directly, bypassing its parent. If this happens and the parent is already active, the child must request that the parent switch its active child state to itself, hence the switchActiveChildTo() method.

The Problem

Let’s write a test program:

func main() {
    outer := &CoreParent{ CoreState: CoreState{ name: "outer" } }
    inner := &CoreParent{ CoreState: CoreState{ name: "inner" } }
    s := &CoreState{ name: "s" }

    // Normally, the CHSM compiler emits code to set the parent/child pointers.
    outer.children = []State{ inner }
    inner.parent = outer
    inner.children = []State{ s }
    s.parent = inner

    outer.Enter()
    fmt.Println("-------------------------")
    outer.Exit()
}

When run, it prints:

CoreParent.Enter(outer)
CoreState.Enter(outer)
CoreParent.Enter(inner)
CoreState.Enter(inner)
CoreState.Enter(S)
-------------------------
CoreState.Exit(inner)
CoreState.Exit(outer)
CoreParent.Exit(outer)

That’s wrong! The calls to Exit() should pair with the calls to Enter(), but CoreState.Exit(s) and CoreParent.Exit(inner) are missing. Why? After many hours of debugging, I tracked it down to this:

func (s *CoreState) Enter() {
    // ...
        s.parent.switchActiveChildTo(s)
    // ...
}

func (p *CoreParent) switchActiveChildTo(child State) {
    p.activeChild = child
}

But what’s wrong with that? The child is telling its parent to set its active child to itself. How could that be wrong?

Background on Interfaces

In Go, the only way to get polymorphism is via interfaces. An ordinary variable in Go has two attributes: its static type and its value. The static type is the type the variable was declared as. An interface additionally has a dynamic type that is the type of the object to which the (pointer) value points:

// static type = int; value = 42
var n int = 42

// static type = State; dynamic type = CoreParent; value = &outer
var i State = &outer

When you call a method on an interface, the implementation actually calls the corresponding method on the object to which the interface’s pointer value points. The pointer value becomes the value of the method receiver; the dynamic type of the interface becomes the static type of the receiver. This means that once an interface value “decays” into a pointer, the polymorphism disappears.

The Reason for the Bug

When the test program is run, the following happens. On the left, the name, static type, and value of the receivers are shown:

1. p *CoreParent = &outer           | CoreParent.Enter(outer)
2. s *CoreState  = &outer.CoreState |   CoreState.Enter(outer)
3. p *CoreParent = &inner           |     CoreParent.Enter(inner)
4. s *CoreState  = &inner.CoreState |       CoreState.Enter(inner)
5. p *CoreParent = &outer           |         switchActiveChildTo(inner.CoreState)

At this point, outer.activeChild that should be &inner of type *CoreParent actually gets set to &inner.CoreState of type *CoreState. The confusing part is that both of those objects have the same name (because Name() of the embedded type CoreState is forwarded from CoreParent).

When Exit(outer) is called, the following happens:

1. p *CoreParent = &outer           | CoreParent.Exit(outer)
2. s *CoreState  = &outer.CoreState |   CoreState.Exit(outer)
3. s *CoreState  = &inner.CoreState |     CoreState.Exit(inner)

CoreState.Exit(s) is never called because CoreParent.Exit(inner) is never called — CoreState.Exit(inner) is called instead. To show this even clearer, we can augment CoreParent.Enter():

func (p *CoreParent) Enter() {
    // ...
    t0 := reflect.TypeOf(p.activeChild)
    p.activeChild.Enter()         // eventually calls switchActiveChildTo()
    t1 := reflect.TypeOf(p.activeChild)
    if t1 != t0 {
        fmt.Printf("%s type changed: %v != %v\n", p.activeChild.Name(), t1, t0)
    }
}

and it additionally prints:

inner type changed: *CoreState != *CoreParent

Strictly Speaking...

To prevent purists in the Go community from coming after me with pitchforks, I’ll point out that the type of activeChild did not actually change. What actually happened is that activeChild gets assigned the pointer to a different object: the CoreState object that’s embedded inside CoreParent — so of course the type changed.

Well, yes, but: if you extract the pointer value from the interface before and after the assignment and compare it such as:

func getPtrValue(i interface{}) uintptr {
    return (*[2]uintptr)(unsafe.Pointer(&i))[1]
}

func (p *CoreParent) Enter() {
    // ...
    a0 := getPtrValue(p.activeChild)
    p.activeChild.Enter()
    a1 := getPtrValue(p.activeChild)
    if a1 != a0 {
        fmt.Printf("%s address changed: %v != %v\n", p.activeChild.Name(), a1, a0)
    }
}

the program does not print inner address changed because it doesn’t. A CoreParent object in memory has the embedded CoreState at a zero offset. So, yes, activeChild was assigned a pointer to “different” object having a different type — but that object has the same address.

A distinction without a difference?

The Fix

Back to the Go port of CHSM: how can the run-time library be changed to do what I want? The way I fixed this was to change CoreState.Enter() so that it looked up a pointer to the original state’s object (of the original type) using a new CoreState.id field in a slice containing pointers to all states:

var allStates []State

func (s *CoreState) Enter() {
    fmt.Printf("CoreState.Enter(%s)\n", s.Name())
    if s.parent != nil {
        child := allStates[s.id]
        s.parent.switchActiveChildTo(child)
    }
}

// ...

func main() {
    outer := &CoreParent{ CoreState: CoreState{ name: "outer", id: 0 } }
    inner := &CoreParent{ CoreState: CoreState{ name: "inner", id: 1 } }
    s := &CoreState{ name: "s", id: 2 }

    allStates = []State{ outer, inner, s }

    // ...
}

DEV Community

Go-tcha: When assigning via pointer changes your type

The Problem

Background on Interfaces

The Reason for the Bug

Strictly Speaking...

The Fix

Top comments (0)

Read next

Adding filter query parameters in Go Huma

Using CloudEvents in Go

Building Secure RAG Applications with Go: An Introduction to GoRag

Golang básico - Comparação de Igualdade