Overview
DroidRun is an open-source mobile automation framework for driving device interactions with natural language commands. Its LLM-agnostic agent combines model inference, semantic retrieval, and device control to translate natural language instructions into touch events and workflows.
Key Features
- Generate and execute device action sequences from natural language task descriptions.
- LLM-agnostic design with pluggable model and retrieval components.
- CLI and integration APIs for scripting, pipeline automation, and CI/CD.
- Combines retrieval results with session context to improve the accuracy and robustness of generated actions.
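
The last feature above, combining retrieval results with session context, can be illustrated with a minimal sketch. This is not DroidRun's API: `embed`, `retrieve`, and `build_prompt` are hypothetical names, and the bag-of-words similarity is a toy stand-in for whatever embedding model and vector store a real deployment would use.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' (assumption: a real system uses a learned model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(task: str, documents: list[str], history: list[str], k: int = 2) -> str:
    """Merge the task, retrieved notes, and prior session steps into one prompt."""
    lines = ["Task: " + task, "Relevant notes:"]
    lines.extend("- " + note for note in retrieve(task, documents, k))
    lines.append("Previous steps:")
    if history:
        lines.extend("- " + step for step in history)
    else:
        lines.append("- (none)")
    return "\n".join(lines)


# Hypothetical knowledge snippets about the device UI.
NOTES = [
    "open settings then tap wifi to toggle wifi",
    "open camera and tap shutter to take a photo",
    "swipe down to open notifications",
]

if __name__ == "__main__":
    print(build_prompt("turn on wifi in settings", NOTES, ["opened settings"], k=1))
```

The prompt produced this way would then be handed to whichever model is plugged in; keeping retrieval and prompt assembly as plain functions makes either one easy to swap.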
Use Cases
- Automating mobile testing and regression validation to increase coverage and efficiency.
- Building mobile assistants that complete complex multi-step tasks via natural language.
- Rapidly validating mobile interaction logic and UX during product prototyping.
Technical Details
- End-to-end pipeline combining natural language understanding, vector retrieval, and device controllers.
- Pluggable adapters for models and retrieval systems, so components can be replaced or extended.
- Scriptable, engineering-friendly design for running on remote device pools and in CI/CD pipelines.
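
As a rough illustration of the pluggable-adapter idea, the sketch below wires a model adapter to a device controller through small interfaces. Everything here is an assumption made for illustration: `ModelAdapter`, `DeviceController`, `CannedModel`, `RecordingController`, `run_task`, and the JSON plan format are hypothetical and are not taken from DroidRun's codebase.

```python
import json
from dataclasses import dataclass, field
from typing import Protocol


class ModelAdapter(Protocol):
    """Hypothetical interface: anything that turns a prompt into a JSON plan."""
    def complete(self, prompt: str) -> str: ...


class DeviceController(Protocol):
    """Hypothetical interface for the device side."""
    def tap(self, x: int, y: int) -> None: ...
    def type_text(self, text: str) -> None: ...


class CannedModel:
    """Stand-in model that always returns the same plan (no real LLM call)."""
    def complete(self, prompt: str) -> str:
        return json.dumps([
            {"action": "tap", "x": 120, "y": 300},
            {"action": "type_text", "text": "hello"},
        ])


@dataclass
class RecordingController:
    """Stand-in controller that records actions instead of touching a device."""
    log: list[str] = field(default_factory=list)

    def tap(self, x: int, y: int) -> None:
        self.log.append(f"tap({x}, {y})")

    def type_text(self, text: str) -> None:
        self.log.append(f"type_text({text!r})")


def run_task(task: str, model: ModelAdapter, device: DeviceController) -> int:
    """Ask the model for a plan, dispatch each step, and return the step count."""
    plan = json.loads(model.complete(task))
    for step in plan:
        name = step["action"]
        if name == "tap":
            device.tap(step["x"], step["y"])
        elif name == "type_text":
            device.type_text(step["text"])
        else:
            raise ValueError(f"unsupported action: {name}")
    return len(plan)


if __name__ == "__main__":
    device = RecordingController()
    run_task("say hello", CannedModel(), device)
    print(device.log)
```

Because `run_task` depends only on the two protocols, a different model backend or a controller for a remote device pool could be substituted without changing the dispatch loop, which is also what makes the stand-ins usable in CI.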