Large Language Model

GestureGPT: Zero-shot Interactive Gesture Understanding and Grounding with Large Language Model Agents

arXiv. Triple-agent LLM system that interprets gestures and grounds them to system actions.