Skip to content

hidatara-ds/pepper-robots

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Pepper Robot Management System

πŸ“‹ Project Overview

This project is a comprehensive, production-ready management system for Pepper humanoid robots, featuring advanced capabilities in AI integration, face recognition, natural language processing, and robotic control.

Copyright Notice: This software is proprietary research work. All rights reserved by the original creator. See LICENSE file for usage terms.


🎯 Project Objectives

This research project was developed to explore:

  • Full-Stack Development: Complete backend API with modern web interfaces
  • AI/ML Integration: Advanced face recognition and conversational AI capabilities
  • Cloud-Native Architecture: Integration with Google Cloud Platform services
  • Robotic Control Systems: Comprehensive movement and interaction management
  • Production-Ready Code: Well-structured, documented, and tested codebase

πŸ—οΈ System Architecture

The project consists of three main components, each demonstrating different aspects of modern software development:

1. Pepper Backend Management System (pepper-be/)

A Flask-based RESTful API backend providing comprehensive robot management capabilities:

  • User authentication and authorization
  • Face recognition and identity management
  • Robot movement and sequence control
  • AI conversation integration
  • SSH-based robot communication
  • Multi-language support (Indonesian/English)
  • Web-based administration interface

Key Technologies:

  • Python 3.11.1, Flask
  • SQLite database with SQLAlchemy ORM
  • JWT authentication
  • Google Cloud Storage integration
  • RESTful API design

2. AI Conversation Service (pepper-ai-discussion/)

An intelligent voice-based conversation system:

  • Speech-to-Text using Google Cloud Speech-to-Text API
  • Natural language processing with Google Vertex AI (Gemini)
  • Text-to-Speech with Google Cloud TTS
  • Session management and conversation history
  • Docker containerization for cloud deployment

Key Technologies:

  • Flask REST API
  • Google Vertex AI (Gemini 2.0 Flash)
  • Google Cloud Speech-to-Text & Text-to-Speech
  • Docker & Cloud Run deployment ready

3. Face Recognition Application (final test face reco app/)

Advanced face recognition system with cloud storage integration:

  • Real-time face detection and recognition
  • DeepFace-based facial analysis
  • Google Cloud Storage as distributed database
  • Automatic synchronization and caching
  • RESTful API for robot integration

Key Technologies:

  • Flask API
  • DeepFace library
  • OpenCV for image processing
  • Google Cloud Storage
  • VGGFace model integration

✨ Key Features

πŸ€– Robot Control

  • Movement sequence management
  • Dance and walk pattern control
  • Real-time robot state monitoring
  • SSH-based command execution

πŸ‘€ Face Recognition

  • Real-time face detection and identification
  • Multi-person recognition
  • Cloud-based face database
  • Automatic model synchronization

πŸ’¬ AI Conversation

  • Natural language understanding
  • Context-aware conversations
  • Voice input/output support
  • Multi-turn dialogue management

πŸ” Security & Authentication

  • JWT-based authentication
  • Password reset functionality
  • Secure credential management
  • Role-based access control

🌐 Web Interface

  • Responsive admin dashboard
  • Real-time status monitoring
  • Interactive robot control panels
  • Multi-language UI support

πŸ› οΈ Technical Stack

Backend

  • Python 3.11.1 - Core programming language
  • Flask - Web framework
  • SQLAlchemy - ORM for database management
  • JWT - Authentication tokens
  • OpenCV - Computer vision
  • DeepFace - Face recognition

Cloud Services

  • Google Cloud Storage - Distributed file storage
  • Google Cloud Speech-to-Text - Voice recognition
  • Google Cloud Text-to-Speech - Voice synthesis
  • Google Vertex AI (Gemini) - Natural language processing

Frontend

  • HTML5/CSS3 - Modern web standards
  • JavaScript/jQuery - Interactive UI
  • Bootstrap - Responsive design framework

DevOps

  • Docker - Containerization
  • Git - Version control
  • Virtual Environments - Dependency isolation

πŸ“ Project Structure

pepper-robots/
β”œβ”€β”€ pepper-be/                    # Main backend management system
β”‚   β”œβ”€β”€ app/
β”‚   β”‚   β”œβ”€β”€ controller/          # API controllers
β”‚   β”‚   β”œβ”€β”€ model/               # Database models
β”‚   β”‚   β”œβ”€β”€ services/            # Business logic services
β”‚   β”‚   β”œβ”€β”€ templates/           # Web UI templates
β”‚   β”‚   β”œβ”€β”€ static/              # CSS, JS, images
β”‚   β”‚   └── utils/               # Utility functions
β”‚   β”œβ”€β”€ tests/                   # Unit and integration tests
β”‚   └── docs/                    # API documentation
β”‚
β”œβ”€β”€ pepper-ai-discussion/         # AI conversation service
β”‚   β”œβ”€β”€ app.py                   # Main Flask application
β”‚   β”œβ”€β”€ Dockerfile               # Container configuration
β”‚   └── requirements.txt         # Python dependencies
β”‚
└── final test face reco app/    # Face recognition service
    β”œβ”€β”€ app.py                   # Face recognition API
    β”œβ”€β”€ gcs_handler.py           # Cloud storage integration
    └── pepper_client.py         # Robot client library

πŸš€ Getting Started

Prerequisites

  • Python 3.11.1 or higher
  • Google Cloud Platform account with appropriate APIs enabled
  • Access to Pepper robot (for full functionality testing)

Quick Start

  1. Clone the repository

    git clone <repository-url>
    cd pepper-robots
  2. Set up backend system

    cd pepper-be
    python -m venv venv
    venv\Scripts\activate  # Windows
    # source venv/bin/activate  # Linux/Mac
    pip install -r requirements.txt
  3. Configure environment variables

    • Set up .env file with database and API credentials
    • Configure Google Cloud credentials
  4. Run the application

    python run.py

For detailed setup instructions for each component, please refer to the individual README files in each subdirectory.


πŸ“Š Project Highlights

Code Quality

  • βœ… Modular architecture with separation of concerns
  • βœ… Comprehensive error handling
  • βœ… RESTful API design principles
  • βœ… Database normalization and optimization
  • βœ… Security best practices implementation

Innovation

  • πŸ”¬ Integration of multiple AI services
  • πŸ”¬ Cloud-native architecture
  • πŸ”¬ Real-time robot-human interaction
  • πŸ”¬ Scalable microservices design

Documentation

  • πŸ“š Inline code documentation
  • πŸ“š API endpoint documentation
  • πŸ“š Setup and deployment guides
  • πŸ“š Architecture diagrams and explanations

πŸŽ“ Learning Outcomes Demonstrated

This project showcases proficiency in:

  1. Software Engineering: Clean code, design patterns, architecture
  2. AI/ML Integration: Face recognition, NLP, speech processing
  3. Cloud Computing: GCP services, distributed systems
  4. Full-Stack Development: Backend APIs, frontend interfaces
  5. DevOps: Containerization, deployment automation
  6. Robotics: Robot control systems, sensor integration

πŸ“ Research Notes

This project represents original research and development work in the field of human-robot interaction and AI integration. All code, documentation, and architecture decisions were made with the intention of demonstrating:

  • Technical Competency: Advanced programming and system design skills
  • Best Practices: Industry-standard development methodologies
  • Innovation: Creative integration of modern technologies
  • Completeness: Full-stack, production-ready system

The codebase is production-ready and demonstrates real-world application development capabilities suitable for enterprise-level projects and academic research.


πŸ“„ License

This software is proprietary and protected by copyright law. See the LICENSE file for complete terms and conditions.

Copyright Β© 2025. All Rights Reserved.

This project is the intellectual property of the original creator and is intended for research and academic purposes. Any use, reproduction, or distribution requires explicit written permission from the copyright holder.


πŸ™ Citation & Attribution

If you reference this work in academic papers, research, or any publication, please cite appropriately:

[Author Name]. (2025). Pepper Robot Management System: 
A Comprehensive AI-Integrated Robotic Control Platform. 
[Research Institution/Affiliation].

This project represents significant original research and development effort in the field of human-robot interaction, AI integration, and robotic control systems.


About

A Cost-Effective Cloud-Offloading Architecture for Human-Robot Interaction: Integrating LLMs and Computer Vision on Resource-Constrained Humanoid Robots

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors