Watson Brothers Entertainment Inc. (commonly known as Watson Bros. or abbreviated as WB) is an American film and entertainment studio headquartered at the Watson Bros. Studios complex in Armonk, New York. Founded in 1911 by two brothers, Thomas Jr. and Arthur Watson, the company established itself as a leader in the American film industry before diversifying into animation, television, and video games, and is one of the largest companies in the world.
They are pioneers in the use of technology and actively advocate for using it to improve their customers' experience.
Watson Bros. wants to better understand how customers feel about the movies they watch in its cinemas. Previous attempts had little success, mainly because of technological difficulty, high costs, and data privacy concerns. Now that Generative AI is being widely adopted, and knowing IBM's watsonx is the most trusted platform for Enterprise AI, they want to try again.
Watson Bros. has asked IBM to develop a solution that classifies movie reviews as either "Positive" or "Negative", based solely on the review text. To support this work, Watson Bros. is willing to provide access to a database containing a large collection of movie reviews.
You must engineer a prompt using one of the supported foundation models available with watsonx.ai to classify the movie reviews given to you, so that when a new review is entered, the system responds either Positive, Negative, or Unknown if it's impossible to determine.
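As a rough illustration of the prompt requirement above, the sketch below builds a few-shot prompt and maps the model's raw text back to one of the three allowed answers. It deliberately contains no watsonx.ai calls; the instruction wording and the helper names `build_prompt` and `normalize_label` are assumptions made for this example, not part of the provided notebook.

```python
from typing import List, Tuple

# The three answers the system is allowed to return.
LABELS = ("Positive", "Negative", "Unknown")

# Hypothetical instruction text; tune the wording for the model you choose.
INSTRUCTION = (
    "Classify the sentiment of the movie review as Positive or Negative. "
    "If the sentiment cannot be determined, answer Unknown. "
    "Answer with a single word."
)


def build_prompt(review: str, examples: List[Tuple[str, str]]) -> str:
    """Assemble a few-shot prompt from (review, label) examples plus the new review."""
    parts = [INSTRUCTION, ""]
    for text, label in examples:
        parts.append(f"Review: {text.strip()}")
        parts.append(f"Sentiment: {label}")
        parts.append("")
    # The prompt ends with an open "Sentiment:" slot for the model to complete.
    parts.append(f"Review: {review.strip()}")
    parts.append("Sentiment:")
    return "\n".join(parts)


def normalize_label(raw_output: str) -> str:
    """Map raw model text to Positive, Negative, or Unknown."""
    stripped = raw_output.strip()
    if not stripped:
        return "Unknown"
    # Only the first word is inspected; anything unexpected becomes Unknown.
    first_word = stripped.split()[0].strip(".,:;!\"'").lower()
    for label in LABELS:
        if first_word == label.lower():
            return label
    return "Unknown"
```

The string returned by `build_prompt` would be sent to whichever foundation model you select, and the generated text passed through `normalize_label` so that any unexpected output falls back to Unknown.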
Additionally, to prove your solution's performance, you must evaluate its results (based on the test dataset) on the following metrics:
- Accuracy
- Precision & Recall
- F1-Score
To do this, you have been given a train/test dataset in CSV format containing three columns: row number, review, and score, where score is 1 for Positive reviews and 0 for Negative ones. In addition, a Jupyter notebook sample has been provided to you, which you must improve so it meets the criteria mentioned above.
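The four metrics can be computed directly from the score column and the model's predictions. The sketch below is a minimal pure-Python version that treats Positive (1) as the positive class. It assumes predictions are encoded as 1, 0, or `None` for Unknown, and that an Unknown answer always counts as incorrect; both choices, and the function name `binary_metrics`, are assumptions of this example rather than requirements of the challenge.

```python
from typing import Dict, Optional, Sequence


def binary_metrics(
    y_true: Sequence[int], y_pred: Sequence[Optional[int]]
) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 with Positive (1) as the positive class.

    y_pred may contain None for "Unknown"; such predictions are never correct.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p != 1)  # missed positives
    correct = sum(1 for t, p in pairs if p == t)

    accuracy = correct / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

If a metrics library is already listed in the notebook's requirements, its implementations can be used instead; the explicit version here mainly makes the handling of Unknown predictions visible.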
.
├── assets/ # Repository Assets
│ ├── images/ # Images used in the present document
│ └── data/ # Train & Test datasets
├── notebook/ # Application source code
└── README.md # Present document
- Solution must be developed on the Jupyter Notebook present in this repository.
- Documentation must be in English.
- To meet the Safety, Quality, and Compliance requirements of the client's industry, it is imperative that this solution follow best development standards and practices for security and scalability.
- Access credentials to all services required for solution development will be provided to you by The Client via e-mail.
When developing your solution:
- Be sure to comment your code.
- Add markdown cells for explaining graphics, evaluation results, or any information that isn't self-explanatory.
- Do not share your access credentials with others.
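One common way to keep credentials out of the notebook is to read them from environment variables at run time. The sketch below shows the idea; the variable names `WATSONX_APIKEY`, `WATSONX_URL`, and `WATSONX_PROJECT_ID` and the helper `load_credentials` are hypothetical and should be adapted to whatever credentials you actually receive.

```python
import os
from typing import Dict, Mapping

# Hypothetical variable names; adjust to the credentials you are given.
REQUIRED_VARS = ("WATSONX_APIKEY", "WATSONX_URL", "WATSONX_PROJECT_ID")


def load_credentials(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Read credentials from the environment instead of hard-coding them."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing environment variables: " + ", ".join(missing)
        )
    return {name: env[name] for name in REQUIRED_VARS}
```

With this approach the notebook contains no secrets, which also makes it easier to remove credentials before submitting.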
You will be using Generative AI for this challenge. These models incur high costs and must be used carefully. Platform usage will be monitored throughout the challenge.
Non-challenge use of the platform is strictly prohibited and will result in challenge disqualification.
It is recommended you run this Jupyter notebook locally. You can do this via Visual Studio Code.
Create your local environment (one time only):
cd notebook/
python -m venv .venv
Note: if you get a "command not found: python" error, try with python3.
Activate your environment:
source .venv/bin/activate
Install dependencies (one time only):
pip install -r requirements.txt
Then, open your Jupyter notebook in Visual Studio Code and use it normally.
After the challenge is presented to you, you will receive an e-mail containing the access credentials you'll need to complete it. You will have one week to complete the challenge, starting from the moment the e-mail is sent to you.
Once you have finished the challenge, or your time has run out, you must deliver the following assets:
- Source code
  - Resulting Jupyter Notebook in .zip format. Remove any credentials before sending. Review implementation considerations before submission.
  - Do not fork this repo. Contribute locally instead.
- Showcase video
  - Video detailing what the challenge was about, how you implemented it, and what you learned from the experience.
  - Duration must be between two and five minutes. This will be taken into consideration during evaluation.
  - .mp4 format is preferred.
  - The video must be in English, so the client's Technical Leadership can understand it.
For privacy and security reasons, assets must be submitted via e-mail attachment, instead of forking or committing to this repository.
Communication, both for questions and delivery purposes, must be sent to:
- Gabriela Retamosa. Senior Build Lab Leader, SSA & MX. [email protected]
- Josefina R. Casanova. Build Lab Engagement Lead, Americas. [email protected]
- Sebastian Fripp. Associate Build Lab Engagement Lead, Americas. [email protected]
- Nadia dos Santos. Associate Build Lab Engagement Lead, Americas. [email protected]
- Josefina R. Casanova | IBM Build Lab Engagement Lead Americas