Skip to content

Latest commit

 

History

History
871 lines (608 loc) · 27.7 KB

File metadata and controls

871 lines (608 loc) · 27.7 KB

Computer Vision RPS

Milestone 1: Set up the environment

First thing starting this project was to setup and clone the Github repo, which was an easy task, as we done the same already in the previous project "Hangman".

Milestone 2: Create the computer vision system

Creating the computer vision system was a a breeze using Teachable Machine, which provides a simple way to record your data and train a deep learning model. Using our webcam, we recorded around 500 photos for each class (Rock, Paper, Scissors, Nothing), that were used to train the computer vision model.

After training was finished, we downloaded the model as "keras_model.h5" and "labels.txt" into the local repository and commit and push the changes to Github.

Milestone 3: Install the dependencies

Before running the program, we need to install the dependencies. We will set up a virtual environment and install opencv-python, tensorflow, and ipykernel using pip install.

Milestone 4: Create a Rock-Paper-Scissors game

Task 1: Store the user's and computer's choices

This code needs to randomly choose an option (rock, paper, or scissors) and then ask the user for an input.

Create another file called manual_rps.py that will be used to play the game without the camera.

You will need to use the random module to pick a random option between rock, paper, and scissors and the input function to get the user's choice.

Create two functions: get_computer_choice and get_user_choice.

The first function will randomly pick an option between "Rock", "Paper", and "Scissors" and return the choice.

The second function will ask the user for an input and return it.

import random

def get_computer_choice():
    
    '''The function "get_computer_choice" returns a randomly chosen move from a list of "Rock",
    "Paper", and "Scissors".
    
    Returns
    -------
        a randomly chosen move from the list of "Rock", "Paper", and "Scissors".
    
    '''
    
    moves = ["Rock", "Paper", "Scissors"]
    computer_choice =  random.choice(moves)
    
    return computer_choice

def get_user_choice():
    
    '''This function prompts the user to choose between Rock, Paper, or Scissors and returns their
    choice.
    
    Returns
    -------
        the user's choice of either "Rock", "Paper", or "Scissors".
    
    '''

    while True:
        user_choice = input("Please choose Rock, Paper or Scissors: ").capitalize()
        if user_choice in ["Rock", "Paper", "Scissors"]:
            return user_choice
        else:
            print("Please choose either Rock, Paper, or Scissors")

Task 2: Figure out who won

Using if-elif-else statements, the script should now choose a winner based on the classic rules of Rock-Paper-Scissors.

For example, if the computer chooses rock and the user chooses scissors, the computer wins.

Wrap the code in a function called get_winner and return the winner.

This function takes two arguments: computer_choice and user_choice.

If the computer wins, the function should print "You lost", if the user wins, the function should print "You won!", and if it's a tie, the function should print "It is a tie!".

import random

def get_computer_choice():
    
    '''The function "get_computer_choice" returns a randomly chosen move from a list of "Rock",
    "Paper", and "Scissors".
    
    Returns
    -------
        a randomly chosen move from the list of "Rock", "Paper", and "Scissors".
    
    '''
    
    moves = ["Rock", "Paper", "Scissors"]
    computer_choice =  random.choice(moves)
    
    return computer_choice


def get_user_choice():
    
    '''This function prompts the user to choose between Rock, Paper, or Scissors and returns their
    choice.
    
    Returns
    -------
        the user's choice of either "Rock", "Paper", or "Scissors".
    
    '''
    while True:
        user_choice = input("Please choose Rock, Paper or Scissors: ").capitalize()
        if user_choice in ["Rock", "Paper", "Scissors"]:
            return user_choice
        else:
            print("Please choose either Rock, Paper, or Scissors")


def get_winner(computer_choice, user_choice):
    
    '''The function takes in two choices (computer and user) and determines the winner based on the
    rules of rock-paper-scissors.
    
    Parameters
    ----------
    computer_choice
        The choice made by the computer in a game of rock-paper-scissors.
    user_choice
        The choice made by the user, which can be "Rock", "Paper", or "Scissors".
    
    '''
    
    if computer_choice == "Rock" and user_choice == "Paper":
        print("You won!")
        
    elif computer_choice == "Rock" and user_choice == "Scissors":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Rock":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Scissors":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Rock":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Paper":
        print("You lost!")
        
    elif computer_choice == user_choice:
        print("It's a tie!")

Task 3: Create a function to simulate the game

All of the code you've programmed so far relates to one thing: running the game - so you should wrap it all in one function.

Create and call a new function called play. Inside the function you will call all the other three functions you've created (get_computer_choice, get_user_choice, and get_winner)

Now when you run the code, it should play a game of Rock-Paper-Scissors, and it should print whether the computer or you won.

import random

def get_computer_choice():
    
    '''The function "get_computer_choice" returns a randomly chosen move from a list of "Rock",
    "Paper", and "Scissors".
    
    Returns
    -------
        a randomly chosen move from the list of "Rock", "Paper", and "Scissors".
    
    '''
    
    moves = ["Rock", "Paper", "Scissors"]
    computer_choice =  random.choice(moves)
    
    return computer_choice


def get_user_choice():
    
    '''This function prompts the user to choose between Rock, Paper, or Scissors and returns their
    choice.
    
    Returns
    -------
        the user's choice of either "Rock", "Paper", or "Scissors".
    
    '''
    while True:
        user_choice = input("Please choose Rock, Paper or Scissors: ").capitalize()
        if user_choice in ["Rock", "Paper", "Scissors"]:
            return user_choice
        else:
            print("Please choose either Rock, Paper, or Scissors")


def get_winner(computer_choice, user_choice):
    
    '''The function takes in two choices (computer and user) and determines the winner based on the
    rules of rock-paper-scissors.
    
    Parameters
    ----------
    computer_choice
        The choice made by the computer in a game of rock-paper-scissors.
    user_choice
        The choice made by the user, which can be "Rock", "Paper", or "Scissors".
    
    '''
    
    if computer_choice == "Rock" and user_choice == "Paper":
        print("You won!")
        
    elif computer_choice == "Rock" and user_choice == "Scissors":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Rock":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Scissors":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Rock":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Paper":
        print("You lost!")
        
    elif computer_choice == user_choice:
        print("It's a tie!")


def play():
    
    '''Plays the game by running the above functions.'''
    
    get_winner(get_computer_choice(), get_user_choice())


if __name__ == "__main__":
    play()

Milestone 5: Use the camera to play Rock-Paper-Scissors

Task 1: Putting all together

Replace the hard-coded user guess with the output of the computer vision model. Create a new file called camera_rps.py where you will write the new code.

Create a new function called get_prediction that will return the output of the model you used earlier.

Remember that the output of the model you downloaded is a list of probabilities for each class. You need to pick the class with the highest probability. So, for example, assuming you trained the model in this order: "Rock", "Paper", "Scissors", and "Nothing", if the first element of the list is 0.8, the second element is 0.1, the third element is 0.05, and the fourth element is 0.05, then, the model predicts that you showed "Rock" to the camera with a confidence of 0.8.

The model can make many predictions at once if given many images. In your case you only give it one image at a time. That means that the first element in the list returned from the model is a list of probabilities for the four different classes. Print the response of the model if you are unclear of this.

from RPS_Template import prediction
import numpy as np
import random

def get_computer_choice():
    
    '''The function "get_computer_choice" returns a randomly chosen move from a list of "Rock",
    "Paper", and "Scissors".
    
    Returns
    -------
        a randomly chosen move from the list of "Rock", "Paper", and "Scissors".
    
    '''
    
    moves = ["Rock", "Paper", "Scissors"]
    computer_choice =  random.choice(moves)
    
    return computer_choice
        return computer_choice


def get_prediction():

    '''Gets the user's choice of Rock, Paper, or Scissors using a machine learning model and webcam.
    
    Returns:
        The user's choice as a string.
    '''

    with open('labels.txt', 'r') as f:
        # Read the lines of the file and strip the newline characters
        lines = [line.strip() for line in f.readlines()]
    # Create an empty dictionary to store the indices and labels 
    labels = {}

    # Loop through the lines and split them into key-value pairs
    for line in lines:
        idx, label = line.split()
        labels[int(idx)] = label

    index = np.argmax(prediction)
    #confidence_score = prediction[0][index]
    print(f"You chose: {labels[index]}")

    if labels[index] in ["Rock", "Paper", "Scissors"]:
        return labels[index]
    
    #  In case it predicts "Nothing"
    else:
        print("Please choose Rock, Paper or Scissors!")
        self.get_prediction()


def get_winner(computer_choice, user_choice):
    
    '''The function takes in two choices (computer and user) and determines the winner based on the
    rules of rock-paper-scissors.
    
    Parameters
    ----------
    computer_choice
        The choice made by the computer in a game of rock-paper-scissors.
    user_choice
        The choice made by the user, which can be "Rock", "Paper", or "Scissors".
    
    '''
    
    if computer_choice == "Rock" and user_choice == "Paper":
        print("You won!")
        
    elif computer_choice == "Rock" and user_choice == "Scissors":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Rock":
        print("You lost!")
        
    elif computer_choice == "Paper" and user_choice == "Scissors":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Rock":
        print("You won!")
        
    elif computer_choice == "Scissors" and user_choice == "Paper":
        print("You lost!")
        
    elif computer_choice == user_choice:
        print("It's a tie!")


def play():
    
    '''Plays the game by running the above functions.'''
    
    get_winner(get_computer_choice(), get_user_choice())


if __name__ == "__main__":
    play()

play() output

Task 2: Count down

Here we add a count down over the camera feed and the model predicts on the final frame when count down reaches zero. I updated the RPS_Template.py and saved the new code in RPS_countdown.py.

import cv2
from keras.models import load_model
import numpy as np

model = load_model('keras_model_mat_2_day_night.h5')
cap = cv2.VideoCapture(0)
data = np.ndarray(shape=(1, 224, 224, 3), dtype=np.float32)

# Define countdown animation function
def countdown_animation(frame, seconds_left):

    '''Draws a countdown animation on the specified frame.
    
    Args:
        frame (ndarray): The frame to draw the countdown animation on.
        seconds_left (int): The number of seconds left in the countdown.
    '''

    # Draw countdown text on the frame
    font = cv2.FONT_HERSHEY_SIMPLEX
    text = str(seconds_left)
    textsize = cv2.getTextSize(text, font, 4, 5)[0]
    text_x = int((frame.shape[1] - textsize[0]) / 2)
    text_y = int((frame.shape[0] + textsize[1]) / 2)
    cv2.putText(frame, text, (text_x, text_y), font, 4, (0, 0, 255), 5)

# Set countdown timer
countdown_time = 5  # seconds
start_time = cv2.getTickCount() / cv2.getTickFrequency()

# Start countdown
while True:
    # Capture a frame from the camera
    ret, frame = cap.read()
    # Check if frame was successfully captured
    if not ret:
        print("Unable to capture frame")
        break
    # Calculate time elapsed
    time_elapsed = (cv2.getTickCount() / cv2.getTickFrequency()) - start_time
    # Check if countdown has finished
    if time_elapsed >= countdown_time:
        break
    # Display countdown animation
    countdown_animation(frame, countdown_time - int(time_elapsed))
    # Display the frame
    cv2.imshow("Countdown", frame)
    # Wait for a key press
    key = cv2.waitKey(1)
    if key == ord('q'):
        break

# Capture final frame
ret, frame = cap.read()
# Check if frame was successfully captured
if not ret:
    print("Unable to capture frame")
# Resize and normalize image
resized_frame = cv2.resize(frame, (224, 224), interpolation=cv2.INTER_AREA)
image_np = np.array(resized_frame)
normalized_image = (image_np.astype(np.float32) / 127.0) - 1 # Normalize the image
data[0] = normalized_image
# Predict on final frame
prediction = model.predict(data)
# Release the capture and destroy all windows
cv2.waitKey(500)
cap.release()
cv2.destroyAllWindows()

Countdown

Task 3: Repeat until a player gets three victories

At this stage we need to combine the two pieces of code: RPS-countdown and camera_rps. This is needed so that the RPS-countdown script runs anew inside play() function for every round of the game. After user or computer reaches 3 wins, the game is over and the final result is displayed on the screen.

import numpy as np
import random
import cv2
from keras.models import load_model

def get_computer_choice():
    
    '''The function randomly selects and returns a move (Rock, Paper, or Scissors) for the computer.
    
    Returns
    -------
        the computer's choice of either "Rock", "Paper", or "Scissors".
    
    '''
    
    moves = ["Rock", "Paper", "Scissors"]
    computer_choice =  random.choice(moves)
    print(f"Computer chose: {computer_choice}")
    return computer_choice


def countdown_animation(frame, seconds_left):
    
    '''This function draws a countdown text on webcam feed using OpenCV library in Python.
    
    Parameters
    ----------
    frame
        The current frame of the video or image on which the countdown animation will be drawn.
    seconds_left
        The number of seconds left in the countdown. This parameter is used to generate the text that will
    be displayed on the frame.
    
    '''
    
    # Draw countdown text on the frame
    font = cv2.FONT_HERSHEY_TRIPLEX
    text = str(seconds_left)
    textsize = cv2.getTextSize(text, font, 4, 5)[0]
    text_x = int((frame.shape[1] - textsize[0]) / 2)
    text_y = int((frame.shape[0] + textsize[1]) / 2)
    cv2.putText(frame, text, (text_x, text_y), font, 4, (0, 0, 255), 5)


def get_prediction():
    
    '''This function loads a pre-trained Keras model and uses it to predict the label of an image captured
    from a webcam after a countdown animation.
    
    Returns
    -------
        the label of the predicted object in the final frame captured by the camera.
    
    '''
    
    model = load_model('keras_model_mat_2_day_night.h5')
    with open('labels.txt', 'r') as f:
        # Read the lines of the file and strip the newline characters
        lines = [line.strip() for line in f.readlines()]

    # Create an empty dictionary to store the indices and labels 
    labels = {}
    
    # Loop through the lines and split them into key-value pairs
    for line in lines:
        idx, label = line.split()
        labels[int(idx)] = label
    
    cap = cv2.VideoCapture(0)
    data = np.ndarray(shape=(1, 224, 224, 3), dtype=np.float32)
    # Set countdown timer
    countdown_time = 5  # seconds
    start_time = cv2.getTickCount() / cv2.getTickFrequency()

    # Start countdown
    while True:
        # Capture a frame from the camera
        ret, frame = cap.read()
        
        # Check if frame was successfully captured
        if not ret:
            print("Unable to capture frame")
            break

        # Calculate time elapsed
        time_elapsed = (cv2.getTickCount() / cv2.getTickFrequency()) - start_time
        # Check if countdown has finished

        if time_elapsed >= countdown_time:
            break

        # Display countdown animation
        countdown_animation(frame, countdown_time - int(time_elapsed))
        # Display the frame
        cv2.imshow("Countdown", frame)
        # Wait for a key press
        key = cv2.waitKey(1)

        if key == ord('q'):
            break

    # Capture final frame
    ret, frame = cap.read()
    
    # Check if frame was successfully captured
    if not ret:
        print("Unable to capture frame")

    # Resize and normalize image
    resized_frame = cv2.resize(frame, (224, 224), interpolation=cv2.INTER_AREA)
    image_np = np.array(resized_frame)
    normalized_image = (image_np.astype(np.float32) / 127.0) - 1 # Normalize the image
    data[0] = normalized_image
    # Predict on final frame
    prediction = model.predict(data)
    index = np.argmax(prediction)
    #confidence_score = prediction[0][index]
    print(f"You chose: {labels[index]}")
    cap.release()
    cv2.destroyAllWindows()
    
    if labels[index] in ["Rock", "Paper", "Scissors"]:
        return labels[index]
    
    #  In case it predicts "Nothing"
    else:
        print("Please choose Rock, Paper or Scissors!")
        get_prediction()


def get_winner(comp, user):
    
    '''This function determines the winner of a rock-paper-scissors game between the computer and the user
    and updates the global variables for the number of wins for each player.
    
    Parameters
    ----------
    comp
        represents the computer's choice of rock, paper, or scissors
    user
        represents the user's choice of rock, paper, or scissors in a game against the computer.
    
    '''
    
    global computer_wins
    global user_wins
    
    if comp == "Rock" and user == "Paper":
        print("You won!")
        user_wins += 1
        
    elif comp == "Rock" and user == "Scissors":
        print("You lost!")
        computer_wins += 1
        
    elif comp == "Paper" and user == "Rock":
        print("You lost!")
        computer_wins += 1
        
    elif comp == "Paper" and user == "Scissors":
        print("You won!")
        user_wins += 1
        
    elif comp == "Scissors" and user == "Rock":
        print("You won!")
        user_wins += 1
        
    elif comp == "Scissors" and user == "Paper":
        print("You lost!")
        computer_wins += 1
        
    elif comp == user:
        print("It's a tie!")


def play():
    
    '''The function "play" sets the initial scores to 0 and runs a loop until either the computer or user
    wins three times, calling other functions to get user input and computer choice, and determine the
    winner of each round.
    
    '''
    
    global computer_wins
    global user_wins
    computer_wins = 0
    user_wins = 0
    
    while True:
        
        if computer_wins == 3:
            print(f"Game over! Computer won {computer_wins}:{user_wins}.")
            break
        
        elif user_wins == 3:
            print(f"Game over! You won {user_wins}:{computer_wins}.")
            break
        
        else:
            user = get_prediction()
            comp = get_computer_choice()
            get_winner(comp, user)


if __name__ == "__main__":
    play()

Final Output after 3 wins

Additionally we can also rewrite the code above into a class object like this:

import numpy as np
import random
import cv2
from keras.models import load_model

class RPS:
    
    '''Represents a game of Rock, Paper, Scissors.
    
    Attributes:
        computer_wins (int): The number of times the computer has won.
        user_wins (int): The number of times the user has won.
    '''
    
    def __init__(self):
        
        '''Initializes a new instance of the RPS class with the computer_wins and user_wins attributes set to 0.'''
        
        self.computer_wins = 0
        self.user_wins = 0

    def play(self):
        
        '''The function runs a loop until either the computer or user wins three times, 
        calling other functions to get user input and computer choice, and determine the
        winner of each round.
        
        '''
        
        while True:
            
            if self.computer_wins == 3:
                print(f"Game over! Computer won {self.computer_wins}:{self.user_wins}.")
                break
            
            elif self.user_wins == 3:
                print(f"Game over! You won {self.user_wins}:{self.computer_wins}.")
                break
            
            else:
                user = self.get_prediction()
                comp = self.get_computer_choice()
                self.get_winner(comp, user)

    def get_computer_choice(self):
    
        '''The function randomly selects and returns a move (Rock, Paper, or Scissors) for the computer.
        
        Returns
        -------
            the computer's choice of either "Rock", "Paper", or "Scissors".
        
        '''
        
        moves = ["Rock", "Paper", "Scissors"]
        computer_choice =  random.choice(moves)
        print(f"Computer chose: {computer_choice}")
        
        return computer_choice
    
    
    def countdown_animation(self, frame, seconds_left):
        
        '''This function draws a countdown text on a given frame using OpenCV library in Python.
        
        Parameters
        ----------
        frame
            The current frame of the video or image on which the countdown text will be drawn.
        seconds_left
            The number of seconds left in the countdown. This parameter is used to display the remaining time
        on the frame.
        
        '''
        
        # Draw countdown text on the frame
        font = cv2.FONT_HERSHEY_TRIPLEX
        text = str(seconds_left)
        textsize = cv2.getTextSize(text, font, 4, 5)[0]
        text_x = int((frame.shape[1] - textsize[0]) / 2)
        text_y = int((frame.shape[0] + textsize[1]) / 2)
        cv2.putText(frame, text, (text_x, text_y), font, 4, (0, 0, 255), 5)


    def get_prediction(self):
    
        '''This function loads a pre-trained Keras model and uses it to predict the label of an image captured
        from a webcam after a countdown animation.
        
        Returns
        -------
            the label of the predicted object in the final frame captured by the camera.
        
        '''
        
        model = load_model('keras_model_mat_2_day_night.h5')
        with open('labels.txt', 'r') as f:
            # Read the lines of the file and strip the newline characters
            lines = [line.strip() for line in f.readlines()]
        # Create an empty dictionary to store the indices and labels 
        labels = {}
        # Loop through the lines and split them into key-value pairs
        for line in lines:
            idx, label = line.split()
            labels[int(idx)] = label
        
        cap = cv2.VideoCapture(0)
        data = np.ndarray(shape=(1, 224, 224, 3), dtype=np.float32)
        # Set countdown timer
        countdown_time = 5  # seconds
        start_time = cv2.getTickCount() / cv2.getTickFrequency()

        # Start countdown
        while True:
            # Capture a frame from the camera
            ret, frame = cap.read()
            
            # Check if frame was successfully captured
            if not ret:
                print("Unable to capture frame")
                break
            
            # Calculate time elapsed
            time_elapsed = (cv2.getTickCount() / cv2.getTickFrequency()) - start_time
            # Check if countdown has finished
            
            if time_elapsed >= countdown_time:
                break
            
            # Display countdown animation
            self.countdown_animation(frame, countdown_time - int(time_elapsed))
            # Display the frame
            cv2.imshow("Countdown", frame)
            # Wait for a key press
            key = cv2.waitKey(1)
            
            if key == ord('q'):
                break

        # Capture final frame
        ret, frame = cap.read()
        
        # Check if frame was successfully captured
        if not ret:
            print("Unable to capture frame")
            
        # Resize and normalize image
        resized_frame = cv2.resize(frame, (224, 224), interpolation=cv2.INTER_AREA)
        image_np = np.array(resized_frame)
        normalized_image = (image_np.astype(np.float32) / 127.0) - 1 # Normalize the image
        data[0] = normalized_image
        # Predict on final frame
        prediction = model.predict(data)
        index = np.argmax(prediction)
        #confidence_score = prediction[0][index]
        print(f"You chose: {labels[index]}")
        cap.release()
        cv2.destroyAllWindows()
        
        if labels[index] in ["Rock", "Paper", "Scissors"]:
            return labels[index]
        
        #  In case it predicts "Nothing"
        else:
            print("Please choose Rock, Paper or Scissors!")
            self.get_prediction()


    def get_winner(self, comp, user):
        
        '''Explains the game rules and logic.'''
        
        if comp == "Rock" and user == "Paper":
            print("You won!")
            self.user_wins += 1
            
        elif comp == "Rock" and user == "Scissors":
            print("You lost!")
            self.computer_wins += 1
            
        elif comp == "Paper" and user == "Rock":
            print("You lost!")
            self.computer_wins += 1
            
        elif comp == "Paper" and user == "Scissors":
            print("You won!")
            self.user_wins += 1
            
        elif comp == "Scissors" and user == "Rock":
            print("You won!")
            self.user_wins += 1
            
        elif comp == "Scissors" and user == "Paper":
            print("You lost!")
            self.computer_wins += 1
            
        else:
            print("It's a tie!")


if __name__ == "__main__":
    rps_game = RPS()
    rps_game.play()

Conclusion

With the code above (and model training with Teachable Machine) we build created a nice Rock-Paper-Scissors game that we can play with our computer via a webcam. Around 3000 photos of each class was used to train the model to get a pretty reliable confidence score. Photos were taken at day/night (lighting variation) and in different clothes. We included a smooth count down animation on top of the camera video feed (not individual frames).

Game starts with opening a window with our webcam feed and counts down from 5. It then predicts the class from the last frame recorded when count down reaches 0. That prediction is then compared against the computer's choice and run through the game logic. It then adds a point to the user or computer. Game is over when either reaches 3 points.