P1: Localized Vacuum Cleaner
Objective
The objective of this exercise was to program a high-end robotic vacuum cleaner to clean a house efficiently. To achieve this, the BSA (Backtracking Spiral Algorithm) coverage algorithm is used. Additionally, taking advantage of the fact that it is a high-end vacuum cleaner, a self-localization algorithm is also employed.
Procedure
1. Coordinate conversion
The robot is simulated in Gazebo’s 3D environment, and I need to convert its 3D world coordinates into 2D pixel coordinates. For this purpose, the following formula is applied:

However, since the goal is a 2D transformation, the z-coordinate can be discarded, and as the angular component remains unchanged, the formula can be reduced to:

To determine the scale, I recorded several pixel positions and their corresponding simulator coordinates. This was done roughly by eye, so multiple points were used to reduce error. The calculated scale is 101.5, with the x-scale equal in magnitude but negative compared to the y-scale, reflecting the axis inversion between pixel and simulator coordinates.

Using the reference pixel position (0, 0), which corresponds to world coordinates (5.6, −4.17), the translations were calculated as the differences between pixel and simulator coordinates, resulting in tx = −5.6 and ty = 4.17.

Now that we have calculated everything, the transformation matrix can be expressed as follows:

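A minimal Python sketch of that matrix and the resulting conversion is shown below. It is not the original code: it assumes the translation is applied in world units before scaling and that the x scale carries the negative sign, and the constant and function names are only illustrative.

import numpy as np

# Scale and translation obtained above; the x scale carries the negative sign.
SCALE_X, SCALE_Y = -101.5, 101.5
T_X, T_Y = -5.6, 4.17

# Homogeneous 2D transform: pixel = S * T * world
WORLD_TO_PIXEL = np.array([
    [SCALE_X, 0.0, SCALE_X * T_X],
    [0.0, SCALE_Y, SCALE_Y * T_Y],
    [0.0, 0.0, 1.0],
])

def world_to_pixel(x, y):
    u, v, _ = WORLD_TO_PIXEL @ np.array([x, y, 1.0])
    return int(round(u)), int(round(v))

print(world_to_pixel(5.6, -4.17))  # reference world point -> pixel (0, 0)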
2. Grid map
To divide the map into cells, the first step is to determine their size. Since the robot measures 35x35 pixels, the cells should be slightly larger without being excessive. Therefore, I decided to make each cell 37x37 pixels.
Next, the obstacles are expanded so that if the percentage of black pixels within a cell exceeds 12%, that cell is assigned a value of 0, corresponding to an obstacle. Otherwise, it is assigned a value of 2, representing a dirty cell.
# Cell values
CELL_COLORS = {
    0: 0,    # Obstacle: black
    1: 131,  # Clean: green
    2: 129,  # Dirty: orange
    3: 132,  # Return: blue
    4: 134,  # Critic: violet
    5: 130   # Plan: yellow
}
Each cell has a value between 0 and 5, depending on the type of cell it represents.
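To make this concrete, the grid construction could look roughly like the sketch below. It is not the original code: it assumes the map is a grayscale NumPy image where obstacles are pure black pixels.

import numpy as np

CELL_SIZE = 37          # pixels per cell side (the robot is 35x35 px)
OBSTACLE_RATIO = 0.12   # fraction of black pixels that marks a cell as an obstacle

def build_grid(map_img):
    rows = map_img.shape[0] // CELL_SIZE
    cols = map_img.shape[1] // CELL_SIZE
    grid = np.full((rows, cols), 2, dtype=int)   # 2 = dirty by default
    for r in range(rows):
        for c in range(cols):
            cell = map_img[r * CELL_SIZE:(r + 1) * CELL_SIZE,
                           c * CELL_SIZE:(c + 1) * CELL_SIZE]
            if np.count_nonzero(cell == 0) / cell.size > OBSTACLE_RATIO:
                grid[r, c] = 0                   # 0 = obstacle
    return grid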
3. Return and critical points
Among the different types of cells, two are essential: critical points and return points.
Critical points are determined based on the immediate neighbors of a cell (N, S, E, W). If all of its neighboring cells are obstacles and/or already cleaned cells, the cell becomes a critical point.
Return points are checkpoints where the vacuum can return once it reaches a critical point. These are generated in all immediate neighboring cells (N, S, E, W) that are neither obstacles nor already cleaned cells.
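A compact way to implement both checks over the grid values listed above could be the following sketch (the helper names are mine, and only the obstacle/clean/dirty states are considered for brevity):

NEIGHBORS = [(-1, 0), (1, 0), (0, 1), (0, -1)]   # N, S, E, W

def free_neighbors(grid, r, c):
    # Neighbors that are neither obstacles nor already cleaned cells (2 = dirty).
    result = []
    for dr, dc in NEIGHBORS:
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] == 2:
            result.append((nr, nc))
    return result

def is_critical(grid, r, c):
    # Critical point: every immediate neighbor is an obstacle or an already cleaned cell.
    return not free_neighbors(grid, r, c)

The return points around a cell are exactly the result of free_neighbors, and they are stored so that the robot has somewhere to go back to after reaching a critical point.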
4. Route planning
To plan the route, the BSA algorithm is used, where initially the entire map consists of obstacles and dirty cells. As the robot moves, cells are marked as cleaned, while critical points and return points are identified. The algorithm enables the robot to systematically traverse all accessible areas, avoiding obstacles and optimizing the path to cover the maximum space without unnecessary movements.
During the route planning phase, the algorithm evaluates the neighboring cells of the current position in a fixed order: North, East, South, and West. The first neighboring cell that is neither an obstacle nor already planned is selected as the next cell to visit.
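In code, that neighbor check could look roughly like this (the order list and the value checks are assumptions based on the description above):

ORDER = [(-1, 0), (0, 1), (1, 0), (0, -1)]   # North, East, South, West

def next_cell(grid, r, c, planned):
    # First neighbor, in the fixed order, that is neither an obstacle nor already planned.
    for dr, dc in ORDER:
        nr, nc = r + dr, c + dc
        if (0 <= nr < len(grid) and 0 <= nc < len(grid[0])
                and grid[nr][nc] != 0 and (nr, nc) not in planned):
            return nr, nc
    return None   # no candidate: the current cell is a critical point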
When the robot reaches a critical point, it cleans it and selects the return point based on the smallest Manhattan distance.
for rc in return_points:
    dist = abs(rc[0] - r) + abs(rc[1] - c)   # Manhattan distance to the current cell (r, c)
Once the return point is selected, the robot uses a Breadth-First Search (BFS) algorithm to determine the shortest path to that point. BFS explores the neighboring cells systematically, avoiding obstacles and already cleaned cells, ensuring that the robot reaches the return point efficiently while covering all accessible areas.
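A compact version of that search could be the following (a sketch, not the original implementation; here only obstacle cells are filtered out):

from collections import deque

def bfs_path(grid, start, goal):
    # Shortest path over the 4-connected grid from start to goal, skipping obstacles (0).
    queue = deque([start])
    parents = {start: None}
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:          # walk back through the parents map
                path.append(cell)
                cell = parents[cell]
            return path[::-1]
        r, c = cell
        for dr, dc in [(-1, 0), (1, 0), (0, 1), (0, -1)]:
            nr, nc = r + dr, c + dc
            if (0 <= nr < len(grid) and 0 <= nc < len(grid[0])
                    and grid[nr][nc] != 0 and (nr, nc) not in parents):
                parents[(nr, nc)] = cell
                queue.append((nr, nc))
    return None   # the return point is unreachable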
5. Movement
During the planning stage, the program stores in an array the positions of the centers of the cells in the order they should be visited. This ensures that the robot stays as centered as possible while moving through each cell, guaranteeing full cleaning coverage and reducing the risk of collisions with obstacles.
Once the robot has identified the next cell center to visit, it must decide which direction to move in. The robot can move in four possible directions — North, South, East, and West — and the decision is based on the difference between the robot’s current position and the position of the target cell center.
# diff_u, diff_v: differences between the robot's position and the target cell centre
if abs(diff_u) > abs(diff_v):
    if diff_u < 0:
        direction = "East"
    else:
        direction = "West"
else:
    if diff_v < 0:
        direction = "South"
    else:
        direction = "North"
To rotate the robot, the system uses the yaw angle. Knowing the ideal yaw value for each direction, the robot turns until it reaches the angle corresponding to the direction it needs to follow.
if direction == "North":
target_yaw = -np.pi/2
elif direction == "South":
target_yaw = np.pi/2
elif direction == "East":
target_yaw = np.pi
elif direction == "West":
target_yaw = 0.0
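A simple way to close that loop is a proportional controller on the yaw error. The sketch below assumes the exercise HAL exposes getPose3d(), setV() and setW(); the gain and tolerance values are placeholders.

import numpy as np

def normalize_angle(angle):
    # Wrap an angle into the range [-pi, pi].
    return np.arctan2(np.sin(angle), np.cos(angle))

def rotate_towards(target_yaw, kw=1.0, tolerance=0.05):
    # Returns True once the robot is facing the target direction.
    error = normalize_angle(target_yaw - HAL.getPose3d().yaw)
    if abs(error) > tolerance:
        HAL.setV(0.0)           # stop forward motion while turning
        HAL.setW(kw * error)    # proportional rotation command
        return False
    HAL.setW(0.0)
    return True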
Challenges
One of the main challenges I faced throughout the project was the coordinate transformations from 3D to 2D. I had to identify multiple reference points to minimize errors, determine the correct scale, and construct the transformation matrix accurately.
Another challenge was planning the return route from a critical point to a return point, as I had to implement a BFS algorithm to determine the optimal path.
Functionality
(In case the video doesn't play, try this link: https://youtu.be/wA_xdow-XGE)
P2: Rescue People
Objective
The objective of this exercise was to program the behavior of a drone that must go to an area where people are floating in the sea, recognize the faces of those people, and record their coordinates so that rescue services can save them.
Procedure
1. Coordinate conversion
The first step is to convert the GPS coordinates to local coordinates. To do this, I first transform the GPS data into UTM coordinates.

The next step is to interpret the UTM coordinates: from them we obtain the global positions of both the base and the rescue area, and we then convert these into the local reference frame, taking the base as the origin (0, 0) of the local system.
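A minimal version of that conversion, assuming the utm Python package is used for the GPS-to-UTM step, could look like this:

import utm

def gps_to_local(lat, lon, base_lat, base_lon):
    # GPS -> UTM for both points, then subtract the base so that it becomes (0, 0).
    east, north, _, _ = utm.from_latlon(lat, lon)
    base_east, base_north, _, _ = utm.from_latlon(base_lat, base_lon)
    # Depending on the simulator's axis convention, east/north may need to be
    # swapped or negated to match the local x/y axes.
    return east - base_east, north - base_north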
2. Camera FOV
The next fundamental step was to determine the drone camera’s FOV, since it is needed to calculate the spacing between the route points and, later, to determine the local coordinates of detected faces.
I determined the FOV by using the drone’s starting base as a reference and moving the drone a certain distance at a fixed height of 5 m until the base was no longer visible.

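From that measurement the FOV follows from simple trigonometry: if the base reaches the edge of the image after flying a horizontal distance d at altitude h, then FOV ≈ 2·atan(d / h). A small sketch of the calculation (d is whatever offset was measured in the simulator, not a real value here):

import math

def estimate_fov(d, h=5.0):
    # d: horizontal distance flown until the base reaches the image edge
    # h: flight altitude used during the test (5 m in this case)
    return 2.0 * math.atan(d / h)   # full field of view, in radians

The same relation gives the ground footprint of the camera at any altitude (roughly 2·h·tan(FOV/2)), which is the coverage that the spacing factor in the next step is applied to.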
3. Generate route
Knowing the coordinates of the rescue area, I defined a search zone. Considering the area covered by the drone’s camera, I divided this zone into points for the drone to follow, ensuring complete coverage of the search area. To avoid gaps, I set the spacing between points to 0.8 times the drone’s camera range, reducing the risk of leaving any areas unscanned.
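One way to lay those points out is a simple back-and-forth sweep over the search zone, as in the sketch below (the function and its bounds are illustrative; coverage is the ground footprint of the camera at the flight altitude):

def generate_scan_points(x_min, x_max, y_min, y_max, coverage):
    # Back-and-forth sweep with 0.8 x the camera coverage between points.
    step = 0.8 * coverage
    points, row, y = [], 0, y_min
    while y <= y_max:
        xs = []
        x = x_min
        while x <= x_max:
            xs.append((x, y))
            x += step
        if row % 2:
            xs.reverse()            # alternate direction on every row
        points.extend(xs)
        y += step
        row += 1
    return points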
4. FSM Behaviour
4.1 Reach the rescue zone
Knowing the position of the rescue area, using a position-based control is the simplest way to move the drone to that location.
# Rescue zone position (x, y, z) in the local frame
zone_coords = (40, -30, 5)
HAL.takeoff(5)
HAL.set_cmd_pos(zone_coords[0], zone_coords[1], zone_coords[2], 0)
4.2 Face detection
To detect people’s faces, I used the drone’s camera along with the Haar Cascade classifier. Since the faces appear relatively small due to the drone’s altitude, it was necessary to adjust the detection parameters. Specifically, I modified the minimum size and the number of neighbors.
faces = face_cascade.detectMultiScale(
    gray, scaleFactor=1.1, minNeighbors=4,
    minSize=(20, 20), maxSize=(100, 100)
)
To avoid capturing duplicate faces, I store the local coordinates of detected faces. If the distance between a new detection and an existing one is below a threshold, I assume it’s the same face and discard it to prevent duplicates.
for x, y in coords:
    distance = math.sqrt((x - x_new)**2 + (y - y_new)**2)
    if distance < coord_threshold:
        return True
return False
4.3 Select next point
Since the route points are stored in an array, moving to the next point is straightforward: simply pop the next point from the array and use position-based control to reach it.
if scan_points:
    current_scan_point = scan_points.pop(0)
    HAL.set_cmd_pos(current_scan_point[0],
                    current_scan_point[1],
                    5,
                    0)
4.4 Return to base
To return to the base, I again use position-based control, since the base’s position is known.
HAL.set_cmd_pos(0, 0, 1.5, 0)
HAL.land()
Challenges
- The only difficulty I faced was multiple detections of the same face caused by the detectMultiScale parameters. I solved it by applying a coordinate filter that removes duplicate detections.
Functionality
(In case the video doesn't play, try this link: https://youtu.be/-RhBi4Nvp_I)









