Attribute Based Pedestrian Detection


Real-Time Pedestrian Attribute Detection with React and Flask

This project demonstrates a user-friendly application that leverages real-time camera feeds to identify individuals based on specific attributes. It highlights pedestrians with desired characteristics through bounding boxes, enhancing surveillance efficiency.

Technology Stack:

  • Frontend: React
  • Backend: Flask
  • Deep Learning Models: YOLO variants

Project Structure:

  • notebooks: Jupyter Notebooks containing training runs for various YOLO models.
  • app: React and Flask application code for user interaction and model integration.

Data Acquisition and Annotation:

The image data used for training the model originated from public datasets:

Supported Pedestrian Attributes:

The application can detect a wide range of attributes, categorized as follows:

Category Attributes
Gender Male, Female
Upper Body T-shirt, Shirt, Coat
Upper Body Color Black, White, Red, Green, Blue, Yellow, Brown
Umbrella Yes, No
Handbag Yes, No
Backpack Yes, No
Lower Body Trousers, Skirt, Shorts
Lower Body Color Black, White, Red, Green, Blue, Yellow, Brown
Footwear Shoes, Boots
Glasses Yes, No
Cap/Helmet Yes, No

Project Workflow:

Project Workflow

Getting Started:

To run the application locally, you'll need to set up the required dependencies and environment. Please refer to the project's directory specific README files for detailed instructions.

Example Output: Example Output of the YOLO Model

Future Enhancements:

  • Improved Model Training: Train a larger YOLO variant on a significantly bigger dataset encompassing a wider range of attributes for enhanced real-world performance.
  • Granular Filtering: Implement more granular control over attribute filtering, allowing users to refine search criteria for precise results (e.g., clothing color, accessories, garment types).
  • User Prompt Integration: Explore user prompts for pedestrian identification. The application could extract attributes (e.g., "red hat, backpack") and perform attribute-based detection for a natural interaction method.