Abstract
Aerial manipulators based on conventional multirotors can conduct manipulation only in small roll and pitch angles due to the underactuatedness of the multirotor base. If the multirotor base is capable of hovering at arbitrary orientation, the robot can freely locate itself at any point in
Keywords
Introduction
Background and motivation
Mobile manipulators with various base platforms (quadruped (Sleiman et al., 2021), ball-balancing robot (Minniti et al., 2019), multirotor (Wang et al., 2023)) have been introduced to expand the workspace of the manipulator. In particular, thanks to the capability to locate itself in wider workspace, aerial manipulators utilizing aerial robots as a base have been actively studied (Ollero et al., 2022). However, existing aerial manipulators based on conventional multirotors can conduct manipulation only in small roll, pitch angles due to the underactuatedness of the multirotor base. If additional freedom exists for a multirotor base to hover at arbitrary orientation, the workspace of the manipulator can be considerably enlarged, and such enlarged workspace may enable a manipulation task that is originally not viable. Although aerial manipulators based on a fully actuated multirotor base (Ryll et al., 2019) are studied in Tognon et al. (2019) and Nava et al. (2020) to provide hoverability in non-zero roll, pitch angles, there still exist non-trivial regions in orientation space (e.g., near 90° pitch angle) where the multirotor cannot hover. Several studies have been proposed to enhance versatility by serially connecting multiple aerial robots (Zhao et al., 2018, 2023). However, when the task involves object grasping, these platforms require substantially large, collision-free workspace as they utilize aerial robot modules to enclose the object.
In this study, we aim to investigate a mobile manipulator that can locate the floating base in any 3D space with arbitrary position and orientation, as depicted in Figure 1. To achieve this goal, we consider an omnidirectional aerial manipulator (OAM) which is a combination of an omnidirectional multirotor (Allenspach et al., 2020) and a multi-degrees-of-freedom (multi-DoF) robotic arm. Our study focuses on addressing two main issues: developing a method (1) to stably control the position and orientation of the multirotor base regardless of the arm movement or manipulation task and (2) to enable whole-body motion planning while utilizing the omnidirectionality of the multirotor’s orientation and taking into account the surrounding environment for collision avoidance and manipulation. An omnidirectional aerial manipulator (OAM) conducting a precise manipulation task of grasping-and-pulling a bar (a) on the ground and (b) on a table. The whole-body motion is computed from the proposed cascaded motion planner, and the reference trajectory is tracked by the proposed geometric robust controller.
Problem description
First, from a control perspective, two types of techniques are required: geometric control technique allowing the control input to be defined at arbitrary orientation on
Next, from a planning perspective, it is essential to perform whole-body motion planning that considers both the position and orientation of the multirotor base, as well as the joint angles of the robotic arm. In doing so, it is necessary to: (1) consider the non-Euclidean configuration space
For clarity, we will provide a complete formulation of these control and planning problems in Problem formulation section.
Method overview and contribution
To resolve these problems, we first propose a geometric robust integral of the tanh of the error (gRITE) controller. The proposed controller augments an integral of the tanh of the error term to a geometric nonlinear PID controller (Goodarzi et al., 2013) which has been applied to most existing OAMs (Su et al., 2023; Zhao et al., 2023; Bodie et al., 2021b), and stability of the closed-loop system is formally analyzed. The added integral term is shown to be effective in suppressing the ultimate error bound, which can be made arbitrarily small with sufficiently large control gain. Next, we propose a two-step trajectory-optimization-based whole-body motion planning method. There exist two main computation bottlenecks in applying optimization-based algorithms which may hinder real-time computation and even convergence of the solution: (1) high-dimensional, non-convex search space and (2) non-Euclidean space of
We conduct comparative experiments for the proposed controller where the proposed one outperforms the counterpart (i.e., geometric nonlinear PID controller) in regulating both position and orientation in the presence of the robotic arm’s motion. Then, the proposed control and planning framework is experimentally validated where an omnidirectional aerial manipulator (OAM) (Figure 3) performs grasping-and-pulling tasks in various environments requiring precise manipulation. Mobile manipulation using the OAM is successfully executed in five different scenarios: (1) ground-basic, (2) ground-yaw, (3) ground-pitch, (4) table-far, and (5) table-close. Through the experiments, we demonstrate precise control performance regardless of disturbance, such as the movement of the robotic arm and ground effect, at arbitrary position and orientation of the base. Furthermore, the proposed planning algorithm is shown to effectively utilize whole-body motion including omnidirectionality of the floating base in conducting the manipulation task while taking into account multiple physical constraints in real-time faster than 10 Hz.
In summary, the main contribution of this work can be summarized as follows: • We present a gRITE (geometric Robust Integral of the Tanh of the Error) controller for an omnidirectional multirotor base to enable precise mobile manipulation of the OAM. We formally prove that the proposed controller guarantees arbitrarily small ultimate bound with sufficiently large control gains. • We present a two-step trajectory-optimization-based whole-body motion planning method for an omnidirectional aerial manipulator with a multi-DoF robotic arm. The proposed method is capable of online replanning in a confined space faster than 10 Hz and exploiting the entire space of • We conduct multiple experiments where an OAM with a multi-DoF robotic arm performs grasping-and-pulling an object either on the ground or on a table, which would not be possible for a conventional underactuated aerial manipulator with the same manipulator configuration. They demonstrate effectiveness and applicability of the proposed controller and planner.
Related work
Comparison with state-of-the-art works on omnidirectional aerial manipulators.
aTrajectory optimization (TO), inverse kinematics (IK), virtual kinematic chain (VKC), and model predictive path integral control (MPPI).
bFeed forward (FF) term compensates only the disturbance from the manipulator’s motion and requires the acceleration measurement.
cWhole-body motion is not jointly tackled in that only the joint angles are considered.
dSingularity issue may occur due to the use of Euler angles.
eNo joint angle is considered, but there is a possibility of extension to OAM with a multi-DoF manipulator.
Control
The first objective of this study is to design a geometric robust controller for an omnidirectional aerial manipulator (OAM). Accordingly, we first review robust/adaptive or geometric robust/adaptive control techniques applied to existing floating base robotic systems including multirotors and aerial/space manipulators, then investigate control methods applied to an OAM. Lastly, we discuss related work on the proposed control technique, which is the geometric robust integral of the tanh of the error (gRITE).
For the multirotor bases without considering robotic arms, various geometric robust/adaptive controllers have been proposed. Goodarzi et al. (2015) introduced an adaptive controller with performance analysis, while Bisheban and Lee (2021) extended this approach by proposing a neural network-based adaptive control method that handles not only constant parametric but also state-dependent uncertainties. Further advancing this direction, O’Connell et al. (2022) presented a framework that integrates learning-based and adaptive control techniques to accommodate state-independent uncertainties, such as wind disturbances. However, these adaptive control approaches still exhibit performance degradation when encountering time-varying uncertainties that are not included in the training data or lack parametric modeling. Recently, Wu et al. (2025) proposed an
Next, aerial/space manipulators inherently involve dynamic coupling effect between a floating base and a rigidly attached robotic arm (Kim et al., 2017), and the base can become unstable if this coupling effect is not suitably addressed in a controller (Huber et al., 2013). To resolve this problem, various studies on a conventional multirotor-based aerial manipulator (Lee et al., 2022; Kim et al., 2017; Lee et al., 2021; Liang et al., 2023) design robust controllers where the motion of the robotic arm is treated as external disturbance. However, their use of Euler angles in attitude control renders the controllers numerically unstable near 90° pitch angle, limiting their applicability to an OAM. Although Yu et al. (2020) propose a geometric robust controller using orientation error directly defined on
Studies on an OAM utilize a controller considering the non-Euclidean property of
This study proposes a gRITE controller, which is an extension to a RISE (robust integral of the sign of the error) controller first presented in Xian et al. (2004). The RISE control achieves asymptotic stabilizability for an uncertain system using a continuous control input, and it has been adopted in various platforms (Kamaldin et al., 2019; Deng and Yao, 2021; Shin et al., 2011). However, due to the derivative of the control input not being continuous, input chattering may occur when applying high gains. Accordingly, in the hardware perspective where desired force/torque is tracked by actuators, the actuators’ tracking performance may deteriorate, resulting in overall performance degradation. To alleviate such problem, Kidambi et al. (2021) replace the sign function with the tanh function when validating its controller in simulation, but analysis for the replaced tanh function is not conducted. The original stability analysis with the sign function can no longer be applied to our case where the sign function is substituted by the tanh function because there appears a residual term in the derivative of the Lyapunov function that cannot be shown to be non-positive. Xian and Zhang (2016) present formal stability analysis for a smooth counterpart of the RISE control using the tanh function, but the analysis is conducted only in
Whole-body motion planning
Our second objective is to present a whole-body motion planner exploiting the omnidirectionality of an OAM while abiding by various state constraints such as collision avoidance. Considering this objective, we first review whole-body planning techniques for ground robot-based mobile manipulators. Then, we investigate planning algorithms designed for aerial manipulators including OAMs. Lastly, we discuss related work on optimization-based motion planning where the whole configuration space of orientation, that is,
A whole-body motion generation algorithm developed for a mobile manipulator whose base is a planar wheeled robot can be found in Zhang et al. (2020). Planar mobile robots only demand single scalar values representing heading angles to fully describe the orientation whereas at least three variables are needed for the OAM’s base. Thus, it is not directly applicable to the OAM. Quadruped robots have widely been used as bases of mobile manipulators (Sleiman et al., 2021; Arcari et al., 2023; Chiu et al., 2022). However, since they conduct mobile manipulation while maintaining the pedals to have a stable contact with the ground, roll and pitch angles of the base orientation is restricted below 90°. Accordingly, they rely on Euler angles to represent the base orientation. Another widely studied mobile manipulation platform is humanoids. Burget et al. (2013) and Qin et al. (2023) leverage mobility such as standing up from a seated posture, and Murooka et al. (2021) incorporate bipedal locomotion for loco-manipulation. However, they either do not fully exploit bipedal walking, or when whole-body motion is considered, computation time is on the order of seconds even when using Euler angles due to the high-dimensional state space. If the full
Similarly, existing studies on aerial manipulators (Lee et al., 2020; Lee and Kim, 2017) have limitations in that they utilize Euler angles in designing a motion planner. Although there exist studies employing the whole configuration space of
Several studies have explored trajectory generation on
On the other hand, there exist studies that incorporate collision-free trajectory generation on
Problem formulation
List of notations and variables.
MoI: Moment of inertia.
The controller is designed with the following objectives: • Robustness to external disturbance and model uncertainty to ensure robustness against multiple sources of disturbance including the robotic arm’s motion, ground effect and model uncertainty arising from an object at the end-effector. • Well-defined control law in the entire state space without singularity to fully exploit the omnidirectionality of the aerial robot which can be hindered if using local coordinates such as Euler angles for representing orientation.
As for whole-body motion planning, the objectives are: • Collision-free trajectory generation to avoid both self-collision between the manipulator and the base, and collision with static obstacles in the environment. • Simultaneous exploration of the joint angles and base pose over the configuration space • Fast computation of trajectories to ensure real-time replanning capability for reacting to potential uncertainties.
Based on these objectives, we define the following control and planning problems under assumptions:
Geometric robust control
For any given smooth reference trajectory for the omnidirectional multirotor base (i.e., position and orientation of the base), design a geometric controller capable of bounding the state tracking errors arbitrarily small under the presence of disturbances and model uncertainties.
Whole-body motion planning
For the given goal pose of the end-effector, find a whole-body motion trajectory of the OAM (i.e., pose of the floating base and joint angles of the robotic arm) that utilizes the entire configuration space
External disturbances and their time-derivatives
All obstacles are static, known, and modeled as composites of multiple ellipsoids.
There exists at least one curve in
The first assumption is about the boundedness of the external disturbance and its time-derivatives, which is common among other papers tackling robust control against time-varying disturbances (Kim et al., 2017; Lee et al., 2022; Hua et al., 2021). The second is to allow us to formulate collision avoidance constraints with obstacles using finite dimensional parameters. The last assumption can be interpreted as the goal not being positioned within a fully obstructed region enclosed by obstacles. This is to ensure existence of a path from any initial position to the goal position of the end-effector.
To address the control problem, a geometric robust integral tanh of the error (gRITE) controller is proposed, and we show that the proposed method guarantees an arbitrarily small state error bound. For the planning problem, a two-step trajectory-optimization-based whole-body motion planning method is introduced, which considers the entire configuration space while maintaining real-time performance ( Overall algorithm flow for hardware experiments. EE and WB stand for end-effector and whole-body, respectively.
Controller design
In this section, we propose a geometric Robust Integral Tanh of the Error (gRITE) controller which guarantees ultimate boundedness of the closed-loop system with arbitrarily small ultimate bound by choosing proper control gains.
System dynamics
We consider the following system dynamics:
(1a) models translational dynamics and (1b) is for rotational dynamics. The motion of the robotic arm induces a change in the moment of inertia of the aerial manipulator
Control allocation
The OAM considered in this paper is illustrated in Figure 3. The omnidirectional multirotor base consists of six rotors and six servomotors. As visualized in Figure 3, actuator inputs are CAD for the omnidirectional aerial manipulator. Actuator inputs are visualized which are rotor thrust

The proposed control law: gRITE controller
In this subsection, we construct control laws for
Translational dynamics
We design a controller for the translational part first. We define error variables as
Rotational dynamics
Similar to the controller design for the translational dynamics, we take error variables first in designing a controller for the rotational counterpart as follows:
Stability analysis
Using the derived control laws for the translational and rotational dynamics (6a)–(6c) and (8a)–(8c), we analyze stability of each closed-loop system. Note that the following analysis is motivated by Xian and Zhang (2016).
Translational dynamics
From (1a), (6a)–(6c), dynamics of
Now, we define a Lyapunov candidate function V
t
as
Take
Take
Assume that the following scalar differential equation holds for any sufficiently smooth
For control gains satisfying
Briefly speaking, to satisfy the control gain conditions, one should take large enough
Rotational dynamics
From (1b) and (8a)–(8c), dynamics of
Now, we define a Lyapunov candidate function V
r
as
Assume that control gains satisfy the following:
Assume that control gains
Although the sign function can provide additional merit of asymptotic stability (Gu et al., 2022; Xian et al., 2004) if used instead of the tanh function in the control law (6), (8), it may result in input chattering. Considering the omnidirectional aerial robot where some actuators (i.e., servomotors) show unknown, non-negligible time delay in tracking the input command, such input chattering can deteriorate the tracking performance. The tanh function in the proposed control law relieves this problem by providing sufficient smoothness in the control input while endowing the necessary disturbance attenuation property with moderately high control gains of
As done in Xian and Zhang (2016), gRITE control only with the robust control law
Whole-body motion planning
This section overviews our whole-body trajectory generation algorithm designed for an OAM. We utilize both offline and online planning to enhance convergence to the predefined end-effector goal pose and to enable reactive real-time replanning with respect to potential disturbance. This allows the OAM to reach the predefined end-effector goal pose and refine the entire state trajectory in real time. In the initial stage, our algorithm focuses on the end-effector trajectory. The end-effector pose is modeled as a particle in a 3D Euclidean space with orientation. As a result, the globally optimal, collision-free end-effector pose trajectory can be obtained within a few seconds even under a setting where the problem is highly nonlinear due to the rotational motion, and the trajectory horizon is longer than 10 s. Then, our algorithm proceeds to local online whole-body motion planning. Here, the primary consideration is to consider the whole-body motion while tracking the previously determined optimal end-effector pose trajectory. Before presenting algorithmic details, we rewrite the Assumption 2 mathematically.
Only for notational simplicity, we describe an obstacle with a single ellipsoid from the following. That is,
Now, we formulate the following optimal control problems (OCPs) for our cascaded whole-body motion planner.
Offline end-effector trajectory generation
From Assumption 2, it becomes apparent that the end-effector trajectory avoids collisions with obstacles if and only if the following inequalities are satisfied:
The modified collision avoidance constraints (16) rely only on the optimization variables in the translational motion. Thus, we can construct the following decoupled jerk minimization problem.
Online whole-body motion planning
Unlike the offline end-effector trajectory generation, whole-body motion planning necessitates the consideration of collision avoidance among rigid bodies, identification of the optimal configuration among those achieving the same end-effector pose, and online replanning to react to uncertainties and refine the end-effector trajectory if unavoidable. This subsection provides details of how each of these considerations is incorporated into the OCP.
Collision avoidance constraint
Let us define the ith ellipsoid
When obstacles and robot components are all modeled with ellipsoids, collision avoidance can be ensured if there is no intersection between each pair of robot and obstacle ellipsoids. Let us consider a set generated by the Minkowski sum of two sets of ellipsoids as
Cost function
In comparison with ground robot-based mobile manipulators, aerial manipulators are more susceptible to disturbance caused by interaction forces between the end-effector and the target object during manipulation. To mitigate this, it is imperative to secure the manipulability of the robot arm. Consequently, we incorporate the following manipulability index into our cost function:
Kinematics-level nonlinear model predictive control
With the obstacle avoidance constraints (19) and the cost functions (21), we formulate and solve the following OCP at each replanning interval, employing a nonlinear model predictive control (NMPC) approach:
In scenarios where the manipulator is constrained to planar motion, the manipulability index ϕ
m
in (20) becomes 0 as
Experimental results
Setup
Main components of the system.
Three experiments are conducted. In the first experiment, we show the performance of the proposed controller by comparing the result that is obtained with baseline controllers. The objective is to regulate the pose of the multirotor base in the presence of the robotic arm’s motion, and better regulation performance can be observed with the proposed control law. The second and third experiments are to demonstrate the proposed framework in one precise manipulation task of grasping-and-pulling an object. We consider two different environment settings: grasping-and-pulling an object (1) on the ground and (2) on a table. Compared to a conventional aerial manipulator based on an underactuated multirotor base inheriting a limited workspace, the OAM equipped with the proposed framework could accomplish the task in both environments by leveraging omnidirectionality and the extended workspace.
Implementation details
Controller and control allocation parameters (diagonal elements for matrices).
We empirically find that the 2nd and 5th rotors depicted in Figure 3 go near saturation when the robot hovers at 90° pitch angle with the identity weight matrix
Our experiments involve grasping a target object. Thus, a certain criterion is needed to determine whether the object is firmly grasped or not. For the criterion, we choose the current of the gripper servomotor. Before the experiments, the threshold for successful grasping is determined by securing various test objects with the gripper and observing the required value for it. In actual experiments, a low-pass filter is applied to the gripper servomotor’s current. If the filtered value surpasses the pre-determined threshold after 1.5 s, the grasping is considered successful, and a new trajectory is planned for the subsequent mission, which is pulling. The command to grasp is given when the position error between the end-effector and the goal is less than 3 cm. If the criterion is not met after the command, the gripper is reopened. This process is repeated until successful grasping occurs.
Planner parameters and numerical integration method (block diagonal matrices for
Experiment 1: Controller comparison
The first experiment is to show effectiveness of the proposed controller under disturbance, in comparison with the baseline controllers: (1) Euler-angle-based disturbance-observer (DOB) (Lee et al., 2021) which is a nonlinear robust controller, (2) geometric PID controller (gPID) (Goodarzi et al., 2013; Su et al., 2023), (3) geometric
The baseline controllers share the same PD control structure, with an additional control term designed to reject disturbance. For fair comparison, we set the corresponding PD gains of the proposed controller to be identical to those used in the baseline controllers. Only the gains associated with the additional control terms were tuned. Although g
The experiment is carried out in two different settings: pose regulation at 0° pitch angle and −30° pitch angle. We intentionally oscillate the manipulator to provide external disturbance to the multirotor base. In all settings, only the first and second joints of the manipulator are commanded to move from −45° to 45° with a period of 10 s. As a representative example that most clearly shows the difference between controllers in the two settings, we first present comparison results between the gPID and the proposed controller in Figures 4(a) and (b). In both figures, the above composite images show the motions of the manipulator and the multirotor base, exhibiting the superior performance of the proposed controller. The below graphs in both figures display pose error Results of Experiment 1: Comparison between gPID and the proposed controller (gRITE).
In both experiments, while the maximum error in the translational motion does not exceed 0.02 m in the results obtained with the proposed controller that in the results obtained without the proposed controller nearly reaches 0.04 m. The performance gap is more significant in the rotation direction which can be found in the second row of the graphs in Figures 4(a) and (b). We presume that not the control law itself but the mechanical vibration of the manipulator and the resultant measurement noise in angular velocity are responsible for the jittering in τ2 during all experiments with or without the proposed control law. This is because such jittering does not occur in any other control inputs, and we have experienced mechanical vibration due to backlash and clearance of servomotors which may be improved by fabricating the OAM using high-end servomotors.
The comparison results, including all baseline controllers, are summarized in Figure 5 and Table 6. Due to the motion of the robotic arm, the external disturbance is most significant along the body y − axis. Accordingly, the disturbance attenuation capability of each controller is most clearly reflected in the bottom-left plots of Figures 5(a) and (b). The first, second, and third quartiles (Q1, Q2, Q3) of the gPID controller are more than twice as large as those of the other controllers. Thus, we can confirm that all other controllers are substantially robust to disturbance. For DOB, which is designed based on Euler angles, both position and orientation errors noticeably increase when the pitch angle is −30° compared to the 0° case. This implies the importance of fully accounting for the nonlinear structure of Box plot of Experiment 1 for comparison with baseline controllers. Performance metrics of Experiment 1. Bold numbers indicate the best performance (i.e., lowest error) among all controllers for each metric. aEuclidean norm of e
p
bGeodesic distance d
g
between the two rotation matrices R and R
d
, that is, 
Experiment 2: Grasping-and-pulling an object on the ground
The second experiment is grasping-and-pulling an object on the ground which can hardly be conducted with a conventional aerial manipulator whose manipulator is attached on the top as the OAM in Figure 1. We carry out three different scenarios which we call (a) ground-basic, (b) ground-yaw, and (c) ground-pitch. All scenarios and the data collected during the experiments are visualized in Figure 6. Compared to (a) ground-basic, the initial orientation of the OAM in (b) ground-yaw is 180° rotated in the yaw direction. (c) ground-pitch starts with the same orientation as (a) ground-basic, but the target end-effector orientation is 180° rotated in the pitch direction. Results of Experiment 2: Grasping-and-pulling an object on the ground. The notations (a), (b), and (c) indicate to which scenario the figure corresponds to: (a) ground-basic, (b) ground-yaw, and (c) ground-pitch. We use quaternion q = [q
w
, q
x
, q
y
, q
z
] to represent the orientation of the multirotor base.
We could confirm that the proposed whole-body motion planning algorithm can compute a collision-free and goal-reaching trajectory in all three scenarios having either different initial conditions or different target poses. For the ground-pitch scenario, we visualize the end-effector trajectory computed by the first-step offline planner (red), the whole-body trajectory generated by the second-step online planner (green) and the actual traversed end-effector trajectory (black) in Figure 7. Each ellipsoid represents each single rigid body comprising the OAM, used in deriving the collision avoidance constraints. We could also validate online replannability of the second-step NMPC-based planning algorithm in Table 7. The maximum computation time is less than 20 ms in all scenarios, indicating online replannability faster than 50 Hz. Visualization of trajectories computed by the whole-body motion planner in Experiment 2 setting (c) ground-pitch. The blue line indicates the motion of the multirotor base, and the red and green lines are end-effector trajectories computed by the offline and online planners. The black line is the actual trajectory traversed by the end-effector. Performance metrics of Experiments 2 and 3. aEuclidean norm of e
p
bGeodesic distance d
g
between the two rotation matrices R and R
d
, that is, 
The composite images at the top of Figure 6 show the motion of the OAM during experiments. Position
Quantitative results are summarized in Table 7, and a detailed analysis of state errors and computation time is illustrated in the box plots in Figure 12. All scenarios in experiment 2 exhibit position tracking errors in terms of both RMS and mean below 2 cm, and orientation tracking errors remain within 3°. These errors are computed using the Euclidean norm of e
p
and the geodesic distance between the desired and actual rotation matrices. Figure 12 further provides component-wise distributions of the position and orientation errors. For each element of e
p
and e
R
, the box plots capture the median and quartiles over time. This offers insight into axis-wise behavior of the tracking performance. For instance, the position error in z-direction
To compare the overall behaviors with and without the collision avoidance constraint, simulation results for the ground-basic scenario are shown in Figure 8. When the collision constraints (19) are not considered, the planned trajectory penetrates the ground, as shown in (b), and the constraint value drops significantly below zero, as seen in (c). In contrast, as illustrated in (a), when the constraints are imposed, the planner avoids collision by tilting the OAM’s base toward the target, effectively utilizing its omnidirectional flight capability. In this case, we can guarantee collision avoidance, as the minimum value among the collision avoidance constraints remains above zero for all time, as shown in (c). These results suggest that under the collision avoidance constraint, the planner can utilize the OAM’s omnidirectional flight capability to avoid collision and promote effective use of the extended workspace when reaching the target. Simulated behaviors in the ground-basic scenario under the same initial hovering condition, with (a) and without (b) the collision avoidance constraint in (19). The minimum constraint at time 
Experiment 3: Grasping-and-pulling an object on a table
The last experiment is to validate applicability of the proposed framework in a different setting of grasping-and-pulling an object on top of a table. The OAM should now additionally avoid collision with the table while accomplishing the task. Two different scenarios are considered: (a) table-far and (b) table-close. The results are summarized in Figure 9. The proposed whole-body motion planner computes a trajectory so that the OAM simultaneously tilts the pitch angle and stretches the manipulator to satisfy both the collision avoidance constraint and the goal-reaching objective. It is noticeable that when the object on the table is far from the robot (i.e., scenario (a)), to ensure safety, the proposed whole-body motion planner computes a trajectory not only to tilt the pitch angle and stretch the manipulator but also to stay above the table. Although a non-negligible ground effect from the table exists while the OAM being above the table in scenario (a), sufficient tracking performance to accomplish the precise manipulation task can be achieved with the proposed controller. Results of Experiment 3: Grasping-and-pulling an object on a table. The notations (a) and (b) indicate to which scenario the figure corresponds to: (a) table-far and (b) table-close. We use quaternion q = [q
w
, q
x
, q
y
, q
z
] to represent the orientation of the multirotor base.
The computation time taken for solving the second step NMPC is listed in Table 7. As an additional object to avoid exists compared to the Experiment 2, a little longer computation time is required, but still about 30 Hz on average is obtained, which is sufficient for online replanning. Trajectories computed from the offline and online planning algorithms and the actual traversed trajectory are visualized in Figure 10 for scenario (a). Visualization of trajectories computed by the whole-body motion planner in Experiment 3 setting (a) table-far. The blue line indicates the motion of the multirotor base, and the red and green lines are end-effector trajectories computed by the offline and online planner. The black line is the actual trajectory traversed by the end-effector.
Quantitative results are summarized in Table 7, and a detailed analysis of state errors and computation time is illustrated in the box plots in Figure 12. All scenarios in experiment 3 exhibit position tracking errors (in terms of both RMS and mean) below 1 cm, and orientation tracking errors remain within 2°. Figure 11 shows the effect of the collision avoidance constraint (19) in the table-far scenario. Similar to the ground-basic case, the OAM avoids collision with the table by ascending above it and simultaneously tilts its base to effectively reach the target located at the middle of the table. Simulated behaviors in the table-far scenario under the same initial hovering condition, with (a) and without (b) the collision avoidance constraint in (19). The minimum constraint at time Box plot for state errors (e
p
and e
R
) and NMPC computation time of Experiments 2 and 3.

Conclusion
In this work, we presented a control and planning framework to enable mobile manipulation with arbitrary base position and orientation. To achieve this objective, we first constructed an omnidirectional aerial manipulator composed of an omnidirectional multirotor and a multi-DoF robotic arm. Then, a geometric robust controller is proposed for the multirotor base which we call a geometric robust integral of the tanh of the error (gRITE) controller. The controller is designed to ensure sufficient performance in the presence of external disturbance including aerodynamic effect, interaction wrench, and the uncertain motion of the robotic arm. The stability showed that the proposed controller can guarantee arbitrarily small error bound by choosing sufficiently large control gains. Next, a two-step trajectory-optimization-based whole-body motion planning algorithm was proposed while taking omnidirectionality of the OAM and physical constraints including collision avoidance into account. We composed the two offline and online planning phases to formulate a numerically stable optimization problem.
The proposed control and whole-body motion planning framework is validated in hardware experiments. In the first experiment, we compared the proposed controller with a nonlinear PID controller. The proposed controller outperformed the counterpart by showing better regulation performance in the presence of external disturbance by the manipulator. The second and third experiments demonstrated effectiveness of the proposed framework in precise manipulation where the OAM conducted grasping-and-pulling of an object on (1) the ground and (2) a table. We accomplished precise manipulation even in the presence of external disturbance involving ground effect while whole-body motion was exploited in realizing abidance to physical constraints and task completion. During the experiments of precise manipulation, the OAM maintained stable flight around the pitch angle over 90° and even 180°, showing mobile manipulation in arbitrary base pose.
As a future work, we aim to explore collaborative transportation where multiple OAMs transport a common object while each agent exploiting the enlarged workspace. Compared to aerial manipulators based on conventional underactuated multirotors, the additional advantage of the OAMs’ enlarged workspace would enhance manipulability of the transporting object, particularly in confined environments. In structured environments, such as warehouses and automated manufacturing facilities, obstacle locations are typically known in advance. However, we also consider future research directions involving dynamic and unknown obstacles, which are common in settings like disaster response or human–robot coexistence. Our framework may be combined with high-confidence motion prediction for moving agents to address dynamic obstacles. Furthermore, for unknown environments with static obstacles, the detection of a potential collision using onboard sensors such as cameras can serve as a trigger to re-initiate the planner. This allows the robot to adaptively update its motion plan while preserving whole-body manipulation capabilities and ensuring collision avoidance. In addition to these, when manipulating heavy objects, it can be beneficial to consider the effects of gravity and the feasible wrench space. Thus, we also consider real-time adaptation of the OAM’s configuration to simultaneously minimize the adverse effects caused by the payload’s weight at the end-effector.
Supplemental Material
Footnotes
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (grant number RS- 2024-0436984). Byeongjun Kim was supported by the Hyundai MotorChung Mong-Koo Foundation.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
Note
Appendix
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
