RAI is a flexible AI agent framework designed to handle tasks in generalized manner. It is easily extendable, allowing developers to adapt and integrate new functionalities and tools to meet their needs.
-
Follow installation instructions from Quick Start
-
Activate pre-commit hook for automatic code checking and formatting (it uses ../.pre-commit-config.yaml).
pre-commit install
from rai import ROS2Agent
agent = ROS2Agent() # vendor will be automatically initialized based on the config.toml
print(agent("What topics, services, and actions are available?"))
print(agent("Please describe the interfaces of two of the existing topics."))
print(agent("Please publish 'Hello RAI' to /chatter topic only once")) # make sure to listen first ros2 topic echo /chatter
Follow instructions to configure RAI identity for your robot.
To run the fully initialized Streamlit HMI, use the following command, replacing my_robot_whoami
with the name of the package you created:
export ROBOT_NAME=my_robot_whoami # set the name of created whoami package
ros2 launch rai_whoami rai_whoami.launch.py robot_description_package:=$ROBOT_NAME &
streamlit run src/rai_hmi/rai_hmi/text_hmi.py $ROBOT_NAME
To verify if Streamlit successfully loaded the configuration:
-
Expand the "System status" menu in the Streamlit interface.
-
Check for the following indicators:
- "robot_database": true
- "system_prompt": true
If both indicators are present and set to true, your configuration has been loaded correctly. You can now interact with your robot by:
-
Asking about its identity and purpose
-
Inquiring about its capabilities
-
Requesting information on the ROS topics it can access
This interaction will help you verify that the robot's 'whoami' package is functioning as intended and providing accurate information about your robot's configuration.
RAI has general capabilities to interact through ROS interfaces such as actions, services and topics. However, you can extend RAI with tools dedicated to what your robot needs to do. These functions should be decorated with @tool and should interact with your robot's API. See the example below.
from langchain.tools import tool, BaseTool
from pydantic import BaseModel, Field
from typing import Type
from myrobot import robot
# decorator based api, for basic tools
@tool
def pick_up_object(bbox_3d):
"""Tool used for picking up objects"""
return robot.pick_up_object(bbox_3d)
@tool
def scan_object():
"""Tool used for examining previously picked object"""
return robot.scan_object()
# class based api, useful when tools use objects like ROS 2 Node
class SayToolInput(BaseModel):
text: str = Field(..., description="Text to be said.")
class SayTool(BaseTool):
name: str = "say"
description: str = "Tool used for speaking in e.g. human-robot conversation"
robot: type[Robot]
args_schema: Type[SayToolInput] = SayToolInput
def _run(self, text: str):
return self.robot.speak(text)
def state_retriever():
"""
State retriever used for feeding state information to agent every iteration.
The state can consist of robot's logs, as well as any information that might
be useful to the robot's operations and agent's reasoning.
"""
return {"temperature": 30, "battery_state": 33, "logs": [...]}
Once you have implemented your tools, you can run the agent of choice with these new tools. Currently, the following agents are available:
-
state_based_agent A more advanced agent that provides:
- Automatic environment parsing via state retriever
- Support for asynchronous ROS 2 actions
- Continuous state monitoring and reasoning
- System prompt retrieval from the rai_whoami node
-
conversational_agent A simpler agent ideal for getting started, offering:
- Human-robot conversation capabilities
- Synchronous ROS 2 tool execution
- Straightforward implementation and usage
- Inline system prompt declaration
Choose the conversational_agent when beginning with RAI, and consider upgrading to state_based_agent for more complex applications requiring state awareness.
from myrobot import robot
from rai.agents.conversational_agent import create_conversational_agent
from rai.agents.state_based import create_state_based_agent
from rai.utils.model_initialization import get_llm_model
SYSTEM_PROMPT = "You are a robot with interfaces..."
llm = get_llm_model(
model_type="complex_model"
) # initialize your vendor of choice in config.toml
tools = [pick_up_object, scan_object, SayTool(robot=robot)]
conversational_agent = create_conversational_agent(
llm=llm, tools=tools, system_prompt=SYSTEM_PROMPT
)
state_based_agent = create_state_based_agent(
llm=llm, state_retriever=state_retriever, tools=tools
)
# run the agent of choice
conversational_agent.invoke({"messages": ["Please pick up an object and scan it."]})
state_based_agent.invoke({"messages": ["Please pick up an object and scan it."]})
Additional resources:
- Tracing configuration for genAI models and agents.
- Multimodal Messages definition.
- Available ROS 2 packages: ros packages.
- Human-Robot Interface through voice and text.