DEV Community

Cover image for Go gRPC Clean architecture microservice with Prometheus, Grafana monitoring and Jaeger opentracing ⚑️
Alexander
Alexander

Posted on

Go gRPC Clean architecture microservice with Prometheus, Grafana monitoring and Jaeger opentracing ⚑️

Let's try to create closer to real world gRPC microservice with tracing and monitoring πŸ‘‹:

Source code u can find here
Core tools used what will be used: πŸš€
PostgreSQL as database
Redis for sessions and caching
Jaeger open source, end-to-end distributed tracing
Prometheus monitoring and alerting
Grafana for to compose observability dashboards with everything from Prometheus

First we need to start all necessary infrastructure containers:

run make local
Enter fullscreen mode Exit fullscreen mode

UI interfaces will be available on ports:

Jaeger UI: http://localhost:16686

Prometheus UI: http://localhost:9090

Grafana UI: http://localhost:3000



After sending any requests, you are able to monitoring of metrics at the dashboard of Prometheus.

For Prometheus go to http://localhost:9090/graph type gRPC and choose one. πŸ‘¨β€πŸ’»

You can import Grafana dashboard templates from the grafana directory, default login/password is admin and password admin

I like to use evans for simple testing gRPC.

In cmd folder let's init all dependencies and start the app.

Viper is very good and common choice as complete configuration solution for Go applications.

We use here config-local.yml file approach.

configPath := utils.GetConfigPath(os.Getenv("config"))
    cfg, err := config.GetConfig(configPath)
    if err != nil {
        log.Fatalf("Loading config: %v", err)
    }
Enter fullscreen mode Exit fullscreen mode

Next let's create logger, here i used Uber's Zap under the hood, important here is to create Logger interface for be able to replace logger in the future if it's need.

type Logger interface {
    InitLogger()
    Debug(args ...interface{})
    Debugf(template string, args ...interface{})
    Info(args ...interface{})
    Infof(template string, args ...interface{})
    Warn(args ...interface{})
    Warnf(template string, args ...interface{})
    Error(args ...interface{})
    Errorf(template string, args ...interface{})
    DPanic(args ...interface{})
    DPanicf(template string, args ...interface{})
    Fatal(args ...interface{})
    Fatalf(template string, args ...interface{})
}
Enter fullscreen mode Exit fullscreen mode

Setup postgres and redis

Usually production SQL db standard solution for these days is combination of sqlx and pgx.

Good Redis Go clients is go-redis and redigo, i used first.

func NewPsqlDB(c *config.Config) (*sqlx.DB, error) {
    dataSourceName := fmt.Sprintf("host=%s port=%s user=%s dbname=%s sslmode=disable password=%s",
        c.Postgres.PostgresqlHost,
        c.Postgres.PostgresqlPort,
        c.Postgres.PostgresqlUser,
        c.Postgres.PostgresqlDbname,
        c.Postgres.PostgresqlPassword,
    )

    db, err := sqlx.Connect(c.Postgres.PgDriver, dataSourceName)
    if err != nil {
        return nil, err
    }

    db.SetMaxOpenConns(maxOpenConns)
    db.SetConnMaxLifetime(connMaxLifetime * time.Second)
    db.SetMaxIdleConns(maxIdleConns)
    db.SetConnMaxIdleTime(connMaxIdleTime * time.Second)
    if err = db.Ping(); err != nil {
        return nil, err
    }

    return db, nil
}

func NewRedisClient(cfg *config.Config) *redis.Client {
    redisHost := cfg.Redis.RedisAddr

    if redisHost == "" {
        redisHost = ":6379"
    }

    client := redis.NewClient(&redis.Options{
        Addr:         redisHost,
        MinIdleConns: cfg.Redis.MinIdleConns,
        PoolSize:     cfg.Redis.PoolSize,
        PoolTimeout:  time.Duration(cfg.Redis.PoolTimeout) * time.Second,
        Password:     cfg.Redis.Password, // no password set
        DB:           cfg.Redis.DB,       // use default DB
    })

    return client
}
Enter fullscreen mode Exit fullscreen mode

And let's set up Jaeger:

func InitJaeger(cfg *config.Config) (opentracing.Tracer, io.Closer, error) {
    jaegerCfgInstance := jaegercfg.Configuration{
        ServiceName: cfg.Jaeger.ServiceName,
        Sampler: &jaegercfg.SamplerConfig{
            Type:  jaeger.SamplerTypeConst,
            Param: 1,
        },
        Reporter: &jaegercfg.ReporterConfig{
            LogSpans:           cfg.Jaeger.LogSpans,
            LocalAgentHostPort: cfg.Jaeger.Host,
        },
    }

    return jaegerCfgInstance.NewTracer(
        jaegercfg.Logger(jaegerlog.StdLogger),
        jaegercfg.Metrics(metrics.NullFactory),
    )
}
Enter fullscreen mode Exit fullscreen mode

And add the global tracer to our application:

    tracer, closer, err := jaegerTracer.InitJaeger(cfg)
    if err != nil {
        appLogger.Fatal("cannot create tracer", err)
    }
    appLogger.Info("Jaeger connected")

    opentracing.SetGlobalTracer(tracer)
    defer closer.Close()
Enter fullscreen mode Exit fullscreen mode

Prometheus has have 4 types of metrics: Counter, Gauge, Histogram, Summary

To expose Prometheus metrics in a Go application, you need to provide a /metrics HTTP endpoint.

You can use the prometheus/promhttp library's HTTP Handler as the handler function.

func CreateMetrics(address string, name string) (Metrics, error) {
    var metr PrometheusMetrics
    metr.HitsTotal = prometheus.NewCounter(prometheus.CounterOpts{
        Name: name + "_hits_total",
    })
    if err := prometheus.Register(metr.HitsTotal); err != nil {
        return nil, err
    }
    metr.Hits = prometheus.NewCounterVec(
        prometheus.CounterOpts{
            Name: name + "_hits",
        },
        []string{"status", "method", "path"},
    )
    if err := prometheus.Register(metr.Hits); err != nil {
        return nil, err
    }
    metr.Times = prometheus.NewHistogramVec(
        prometheus.HistogramOpts{
            Name: name + "_times",
        },
        []string{"status", "method", "path"},
    )
    if err := prometheus.Register(metr.Times); err != nil {
        return nil, err
    }
    if err := prometheus.Register(prometheus.NewBuildInfoCollector()); err != nil {
        return nil, err
    }
    go func() {
        router := echo.New()
        router.GET("/metrics", echo.WrapHandler(promhttp.Handler()))
        if err := router.Start(address); err != nil {
            log.Fatal(err)
        }
    }()
    return &metr, nil
}
Enter fullscreen mode Exit fullscreen mode

user.proto file

In gRPC documentation we can find good practice recommendations and naming conventions for writing proto files.

As you can see, each field in the message definition has a unique number. These field numbers are used to identify your fields in the message binary format, and should not be changed once your message type is in use. Note that field numbers in the range 1 through 15 take one byte to encode, including the field number and the field's type (you can find out more about this in Protocol Buffer Encoding).

message LoginRequest {
  string email = 1;
  string password = 2;
}

message LoginResponse {
  User user = 1;
  string session_id = 2;
}

service UserService{
  rpc Register(RegisterRequest) returns (RegisterResponse);
  rpc FindByEmail(FindByEmailRequest) returns (FindByEmailResponse);
  rpc FindByID(FindByIDRequest) returns (FindByIDResponse);
  rpc Login(LoginRequest) returns (LoginResponse);
  rpc GetMe(GetMeRequest) returns(GetMeResponse);
  rpc Logout(LogoutRequest) returns(LogoutResponse);
}
Enter fullscreen mode Exit fullscreen mode

generate your user.proto file πŸ€“

protoc --go_out=plugins=grpc:. *.proto
Enter fullscreen mode Exit fullscreen mode

it creates user.pb.go file with server and client interfaces what need to implement in our microservice:

// UserServiceServer is the service API for UserService service.
type UserServiceServer interface {
    Register(context.Context, *RegisterRequest) (*RegisterResponse, error)
    FindByEmail(context.Context, *FindByEmailRequest) (*FindByEmailResponse, error)
    FindByID(context.Context, *FindByIDRequest) (*FindByIDResponse, error)
    Login(context.Context, *LoginRequest) (*LoginResponse, error)
    GetMe(context.Context, *GetMeRequest) (*GetMeResponse, error)
    Logout(context.Context, *LogoutRequest) (*LogoutResponse, error)
}
Enter fullscreen mode Exit fullscreen mode

Then in server.go initialize the repository, use cases, metrics and so on then start gRPC server:

func (s *Server) Run() error {
    metrics, err := metric.CreateMetrics(s.cfg.Metrics.URL, s.cfg.Metrics.ServiceName)
    if err != nil {
        s.logger.Errorf("CreateMetrics Error: %s", err)
    }
    s.logger.Info(
        "Metrics available URL: %s, ServiceName: %s",
        s.cfg.Metrics.URL,
        s.cfg.Metrics.ServiceName,
    )

    im := interceptors.NewInterceptorManager(s.logger, s.cfg, metrics)
    userRepo := userRepository.NewUserPGRepository(s.db)
    sessRepo := sessRepository.NewSessionRepository(s.redisClient, s.cfg)
    userRedisRepo := userRepository.NewUserRedisRepo(s.redisClient, s.logger)
    userUC := userUseCase.NewUserUseCase(s.logger, userRepo, userRedisRepo)
    sessUC := sessUseCase.NewSessionUseCase(sessRepo, s.cfg)

    l, err := net.Listen("tcp", s.cfg.Server.Port)
    if err != nil {
        return err
    }
    defer l.Close()

    server := grpc.NewServer(grpc.KeepaliveParams(keepalive.ServerParameters{
        MaxConnectionIdle: s.cfg.Server.MaxConnectionIdle * time.Minute,
        Timeout:           s.cfg.Server.Timeout * time.Second,
        MaxConnectionAge:  s.cfg.Server.MaxConnectionAge * time.Minute,
        Time:              s.cfg.Server.Timeout * time.Minute,
    }),
        grpc.UnaryInterceptor(im.Logger),
        grpc.ChainUnaryInterceptor(
            grpc_ctxtags.UnaryServerInterceptor(),
            grpc_prometheus.UnaryServerInterceptor,
            grpcrecovery.UnaryServerInterceptor(),
        ),
    )

    if s.cfg.Server.Mode != "Production" {
        reflection.Register(server)
    }

    authGRPCServer := authServerGRPC.NewAuthServerGRPC(s.logger, s.cfg, userUC, sessUC)
    userService.RegisterUserServiceServer(server, authGRPCServer)

    grpc_prometheus.Register(server)
    http.Handle("/metrics", promhttp.Handler())

    go func() {
        s.logger.Infof("Server is listening on port: %v", s.cfg.Server.Port)
        if err := server.Serve(l); err != nil {
            s.logger.Fatal(err)
        }
    }()

    quit := make(chan os.Signal, 1)
    signal.Notify(quit, os.Interrupt, syscall.SIGTERM)

    <-quit
    server.GracefulStop()
    s.logger.Info("Server Exited Properly")

    return nil
}
Enter fullscreen mode Exit fullscreen mode

I found this is very good gRPC Middleware repository, but we easy can create our own, for example logger interceptor:

func (im *InterceptorManager) Logger(ctx context.Context, req interface{}, info *grpc.UnaryServerInfo, handler grpc.UnaryHandler) (resp interface{}, err error) {
    start := time.Now()
    md, _ := metadata.FromIncomingContext(ctx)
    reply, err := handler(ctx, req)
    im.logger.Infof("Method: %s, Time: %v, Metadata: %v, Err: %v", info.FullMethod, time.Since(start), md, err)

    return reply, err
}
Enter fullscreen mode Exit fullscreen mode

We can access grpc metadata in service handlers too, for example here we extract and validate session_id which client must send in the request context:

md, ok := metadata.FromIncomingContext(ctx)
    if !ok {
        return "", status.Errorf(codes.Unauthenticated, "metadata.FromIncomingContext: %v", grpc_errors.ErrNoCtxMetaData)
    }
sessionID := md.Get("session_id")
Enter fullscreen mode Exit fullscreen mode

So let's create unary service handler for creating the new user:

func (u *usersService) Register(ctx context.Context, r *userService.RegisterRequest) (*userService.RegisterResponse, error) {
    span, ctx := opentracing.StartSpanFromContext(ctx, "user.Create")
    defer span.Finish()

    user, err := u.registerReqToUserModel(r)
    if err != nil {
        u.logger.Errorf("registerReqToUserModel: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "registerReqToUserModel: %v", err)
    }

    if err := utils.ValidateStruct(ctx, user); err != nil {
        u.logger.Errorf("ValidateStruct: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "ValidateStruct: %v", err)
    }

    createdUser, err := u.userUC.Register(ctx, user)
    if err != nil {
        u.logger.Errorf("userUC.Register: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "Register: %v", err)
    }

    return &userService.RegisterResponse{User: u.userModelToProto(createdUser)}, nil
}
Enter fullscreen mode Exit fullscreen mode

On the first lines start tracing span. The β€œspan” is the primary building block of a distributed trace.

Each component of the distributed system contributes a span - a named, timed operation representing a piece of the workflow.

opentracing

span, ctx := opentracing.StartSpanFromContext(ctx, "user.Create")
defer span.Finish()
Enter fullscreen mode Exit fullscreen mode

let's check how it's look in Jaeger:

open http://localhost:16686/

Then we usually have to validate request input, for errors gRPC has packages status and codes

I found good practice to parse and log errors in handler layer, here i use ParseGRPCErrStatusCode method, which parse err and returns matched gRPC code.

Validator is good solution for validation.

user, err := u.registerReqToUserModel(r)
    if err != nil {
        u.logger.Errorf("registerReqToUserModel: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "registerReqToUserModel: %v", err)
    }

    if err := utils.ValidateStruct(ctx, user); err != nil {
        u.logger.Errorf("ValidateStruct: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "ValidateStruct: %v", err)
    }
Enter fullscreen mode Exit fullscreen mode

After request input validation call use case method which contains business logic and works with users repository:

createdUser, err := u.userUC.Register(ctx, user)
    if err != nil {
        u.logger.Errorf("userUC.Register: %v", err)
        return nil, status.Errorf(grpc_errors.ParseGRPCErrStatusCode(err), "Register: %v", err)
    }
Enter fullscreen mode Exit fullscreen mode

Inside a.user.Register(ctx, user) method we start new tracing span and call user repository methods:

func (u *userUseCase) Register(ctx context.Context, user *models.User) (*models.User, error) {
    span, ctx := opentracing.StartSpanFromContext(ctx, "UserUseCase.Register")
    defer span.Finish()

    existsUser, err := u.userPgRepo.FindByEmail(ctx, user.Email)
    if existsUser != nil || err == nil {
        return nil, grpc_errors.ErrEmailExists
    }

    return u.userPgRepo.Create(ctx, user)
}
Enter fullscreen mode Exit fullscreen mode

In user repository Create method we again start new tracing span and run our query
Important note here:
Good practice is always wrap err with some additional information, it's will make debugging much easier in the future πŸ‘
On Repository and UseCase levels usually we don't log errors, only wrap with the message and returns, because we logging errors on top layer in handlers.πŸ‘¨β€πŸ’»
So we don't need log the one error multiple time, already warped it with debug message whats went wrong.

func (r *UserRepository) Create(ctx context.Context, user *models.User) (*models.User, error) {
    span, ctx := opentracing.StartSpanFromContext(ctx, "UserRepository.Create")
    defer span.Finish()

    createdUser := &models.User{}
    if err := r.db.QueryRowxContext(
        ctx,
        createUserQuery,
        user.FirstName,
        user.LastName,
        user.Email,
        user.Password,
        user.Role,
        user.Avatar,
    ).StructScan(createdUser); err != nil {
        return nil, errors.Wrap(err, "Create.QueryRowxContext")
    }

    return createdUser, nil
}
Enter fullscreen mode Exit fullscreen mode

Finally, service handler must return response object generated by proto, here usually we need to create helpers for map our internal business logic models to response object for return it.

return &userService.RegisterResponse{User: u.userModelToProto(createdUser)}, nil
Enter fullscreen mode Exit fullscreen mode

Every app must be covered by tests, I didn't completely cover all code this one, but wrote some test of course.
For testing and mocking testify and gomock is very good tools.

Source code and list of all used tools u can find here πŸ‘¨β€πŸ’» :)
I hope this article is usefully and helpfully, I'll be happy to receive any feedbacks or questions :)

Top comments (2)

Collapse
 
dips2178 profile image
dips2178 • Edited

Hello,
Thanks for this great post. I did a "make local" & "make run"
When I run the evans app, I get below error for call Register :

userService.UserService@127.0.0.1:5000> call Register
email (TYPE_STRING) => test@gmail.com
first_name (TYPE_STRING) => test
last_name (TYPE_STRING) => test
password (TYPE_STRING) => password
role (TYPE_STRING) => engineer
avatar (TYPE_STRING) => Falcon
command call: rpc error: code = Internal desc = Register: Create.QueryRowxContext: ERROR: relation "users" does not exist (SQLSTATE 42P01)

What am I missing ?

Collapse
 
khannz profile image
Pavel Fiskovich

On my way to try it in prod 0/ Great job!