A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics

Recent advances in Large Language Models (LLMs) have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason across a wide range of tasks and scenarios. Previous works have investigated various prompt engineering techniques for improving the performance of LLMs to accomplish tasks, while others have proposed methods that utilize LLMs to plan and execute tasks based on the available functionalities of a given robot platform. In this work, we consider both lines of research by comparing prompt engineering techniques and combinations thereof within the application of high-level task planning and execution in service robotics. We define a diverse set of tasks and a simple set of functionalities in simulation, and measure task completion accuracy and execution time for several state-of-the-art models. We make our code, including all prompts, available at url{https://github.com/AIS-Bonn/Prompt_Engineering}.

Citation information

Bode, Jonas; Pätzold, Bastian; Memmesheimer, Raphael; Behnke, Sven: A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics, 2024 IEEE-RAS 23rd International Conference on Humanoid Robots (Humanoids), 2024, https://ieeexplore.ieee.org/abstract/document/10769825, Bode.etal.2024a,

Associated Lamarr Researchers

lamarr institute person Behnke Sven - Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Prof. Dr. Sven Behnke

Area Chair Embodied AI to the profile