The data used for this analysis was scraped from the official SIH website and is publicly available here: GitHub link
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
Some default configuration to make the outputs look pretty:
plt.style.use('ggplot')
sns.set_palette('cool')
sns.set(rc={"figure.figsize":(12, 7)})
sns.set_context("paper", font_scale=1.5, rc={"lines.linewidth": 2.5})
pd.set_option('display.max_columns', 200)
df = pd.read_excel(r'sih-2023-result.xlsx', index_col = 0)
df
S.NO. | ORGANISATION | PSID | PS CATEGORY | TEAM ID | IDEA ID | TEAM NAME | TEAM LEADER NAME | COLLEGE ID | COLLEGE | NODAL CENTER | Winning Status | Prize Money |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | AICTE | SIH1458 | Software | 27179 | 50141 | Swasthya Sahayak | PRAVEEN KUSHWAHA | 158737 | AJAY KUMAR GARG ENGINEERING COLLEGE, GHAZIABAD... | O P Jindal University, Raigarh,Chhattisgarh,Ra... | Winner | 100000 |
2 | AICTE | SIH1459 | Software | 37933 | 48853 | Team.Phoenix | HARSH DHARIWAL | 103534 | JAYPEE INSTITUTE OF INFORMATION TECHNOLOGY, NO... | O P Jindal University, Raigarh,Chhattisgarh,Ra... | Winner | 100000 |
3 | AICTE | SIH1460 | Software | 16793 | 47672 | Bit Lords V1 | TANMAI KIRAN KAMAT | 139772 | SHRI VILE PARLE KELAVANI MANDALS DWARKADAS J. ... | O P Jindal University, Raigarh,Chhattisgarh,Ra... | Winner | 100000 |
4 | AICTE | SIH1461 | Software | 15122 | 18545 | AssetSentinels | PRATHAM POOJARI | 139723 | MAHAVIR EDUCATION TRUSTS SHAH AND ANCHOR KUTCH... | O P Jindal University, Raigarh,Chhattisgarh,Ra... | Winner | 100000 |
5 | AICTE | SIH1463 | Software | 28137 | 33801 | Sane No More | SNEHA JANARTHANAN | 150210 | SRI KRISHNA COLLEGE OF ENGINEERING AND TECHNOL... | O P Jindal University, Raigarh,Chhattisgarh,Ra... | Winner | 100000 |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
271 | Open Innovation | 0 | Software | 298 | 4826 | CodeElixir_T187 | AGNISHA BHATTA | 164138 | TECHNO INTERNATIONAL NEW TOWN | Veer Surendra Sai University of Technology, Od... | Second Runnerup | 50000 |
272 | AICTE, MIC-Student Innovation | SIH1484 | Software | 6451 | 46658 | SeedShare | APURVA NARAYAN | 149873 | SRI SIVASUBRAMANIYA NADAR COLLEGE OF ENGINEERI... | Prasad V Potluri Siddhartha Institute of Techn... | AWS Winner | 50000 |
273 | AICTE, MIC-Student Innovation | SIH1486 | Software | 11415 | 15645 | Destination Designer | GAURAV TIWARI | 143391 | NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY N... | Rungta College of Engineering and Technology, ... | AWS Winner | 50000 |
274 | AICTE, MIC-Student Innovation | SIH1488 | Software | 15372 | 21270 | CodeCopter | ADITYA HAKANI | 109898 | 3025-SHRI BHAGUBHAI MAFATLAL POLYTECHNIC, MUMB... | Sreenidhi Institute of Science & Technolog... | AWS Winner | 50000 |
275 | AICTE, MIC-Student Innovation | SIH1493 | Software | 18569 | 11066 | Grand Line | SHREYAS M MURAGODMATH | 102942 | KLE TECHNOLOGICAL UNIVERSITY,KARNATAKA,DHARWAD | GMR Institute Of Technology,Andhra Pradesh,Rajam | AWS Winner | 50000 |
275 rows × 12 columns
df.nunique()
ORGANISATION 36
PSID 228
PS CATEGORY 2
TEAM ID 275
IDEA ID 275
TEAM NAME 275
TEAM LEADER NAME 273
COLLEGE ID 201
COLLEGE 213
NODAL CENTER 48
Winning Status 7
Prize Money 5
dtype: int64
The dataset itself is clean, so it doesn't need much preprocessing.
We will jump directly to exploratory data analysis (EDA).
import locale
locale.setlocale(locale.LC_MONETARY, 'en_IN')
val = df['Prize Money'].sum()
print(locale.currency(val, grouping=True))
₹ 2,27,10,000.00
More than ₹2 crore was spent on prizes alone.
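`locale.setlocale` raises `locale.Error` on machines where the `en_IN` locale is not installed. As a fallback, here is a minimal sketch of a locale-independent formatter. `format_inr` is a hypothetical helper, not part of the original notebook, and it assumes the usual Indian grouping convention (the last three digits form one group, the rest are grouped in twos).

```python
def format_inr(amount):
    """Format a rupee amount with Indian digit grouping (hypothetical helper)."""
    sign = "-" if amount < 0 else ""
    whole, frac = f"{abs(amount):.2f}".split(".")
    # The last three digits form one group; everything before is grouped in twos.
    head, tail = whole[:-3], whole[-3:]
    groups = []
    while len(head) > 2:
        groups.insert(0, head[-2:])
        head = head[:-2]
    if head:
        groups.insert(0, head)
    return f"{sign}₹ {','.join(groups + [tail])}.{frac}"


total = 22710000  # the prize total printed above
print(format_inr(total))     # ₹ 2,27,10,000.00
print(total / 1e7, "crore")  # 1 crore = 10,000,000
```

Dividing by `1e7` gives the total in crore, which is where the "more than 2 crore" figure comes from.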
sns.countplot(x = df['PS CATEGORY'], palette='flare').set_title("Problem Statement Categories");
Software solutions outnumber hardware solutions by more than two to one.
It looks like the government needs more software solutions than hardware ones.
sns.boxplot(list(df.groupby(by='NODAL CENTER')['PSID'].count()), palette='flare').set(xticklabels=[], title="Average problem statements in nodal centers", ylabel="number of problems");
On average, 5-6 problem statements are assigned to each nodal center.
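Note that `.count()` tallies rows (winning entries) per centre, not distinct problem statements. The `df.nunique()` output shows 228 unique PSIDs across 275 rows, so the two measures can differ. A small sketch on a made-up frame (the values are illustrative, not taken from the dataset) shows the difference between `count` and `nunique`:

```python
import pandas as pd

# Toy data, not from the SIH results: two teams share problem P1 at centre A.
toy = pd.DataFrame({
    "NODAL CENTER": ["A", "A", "A", "B", "B"],
    "PSID": ["P1", "P1", "P2", "P3", "P4"],
})

# Named aggregation: one column for row counts, one for distinct PSIDs.
per_centre = toy.groupby("NODAL CENTER")["PSID"].agg(rows="count", distinct="nunique")
print(per_centre)
```

On the real data, swapping `toy` for `df` would show whether any centre hosts several winning entries for the same problem statement.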
fig, ax = plt.subplots(figsize=(7, 10))
vals = df['ORGANISATION'].value_counts()
sns.barplot(y=vals.index, x=vals.values, palette='flare', ax=ax).set(xlabel="problem statements", title="Problems by each ministry");
As expected, AICTE has the most problem statements because of its student innovation track, in which students can submit any idea of their own. It is followed by the Ministry of Power and the Ministry of Jal Shakti.
vals = df.groupby(by='ORGANISATION')['Prize Money'].sum().sort_values(ascending=False)[:6]
# Prize Money is stored in rupees; divide by 1e5 (1 lakh = 100,000) so the values match the axis label
sns.barplot(y=vals.index, x=vals.values / 1e5, palette='flare').set(xlabel="prize money (in lakhs)", ylabel="", title="The top 6 ministries in expenditure");
The number of problem statements from a ministry correlates directly with its expenditure on prizes: more problems, more money.
Or was it more money, more problems? 😉
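One way to put a number on this relationship is the Pearson correlation between the number of entries and the prize total per organisation. The sketch below uses a made-up frame with the same column names; the values are illustrative, and on the real data `toy` would be replaced by `df`.

```python
import pandas as pd

# Toy data, not from the SIH results.
toy = pd.DataFrame({
    "ORGANISATION": ["A", "A", "A", "B", "B", "C"],
    "Prize Money": [100000, 100000, 50000, 100000, 50000, 100000],
})

# Entries and total prize money per organisation.
per_org = toy.groupby("ORGANISATION")["Prize Money"].agg(entries="count", total="sum")
r = per_org["entries"].corr(per_org["total"])  # Pearson correlation by default
print(per_org)
print(f"Pearson r = {r:.2f}")
```

A high value is partly mechanical here, since every row in the results carries a prize, so more winning entries almost always means a larger total.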
import textwrap
def wrap_labels(ax, width, break_long_words=False):
    labels = []
    for label in ax.get_yticklabels():
        text = label.get_text()
        labels.append(textwrap.fill(text, width=width,
                                    break_long_words=break_long_words))
    ax.set_yticklabels(labels, rotation=0)
    return ax
vals = df['COLLEGE'].value_counts()[:6]
ax = sns.barplot(y=vals.index, x=vals.values, palette='flare')
ax = wrap_labels(ax, 50)
ax.set(title='Teams from each college', xlabel='number of teams');
A whopping 10 teams were sent by BRACTVIT, Pune.
if len(df.query('COLLEGE == `NODAL CENTER`')) > 0:
    print("Yeah nodal centres also participate")
else:
    print("No nodal centres don't participate")
No nodal centres don't participate
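The query above is a row-wise exact string match, so it only detects a team whose college string is identical to its own nodal centre string. Nodal centre names in this dataset carry a ",State,City" suffix, so a looser check may be worth trying. The sketch below is one possible approach on made-up rows; the `normalise` helper and its rule (lowercase, keep the text before the first comma) are assumptions, not part of the original analysis.

```python
import pandas as pd


def normalise(name):
    """Hypothetical normaliser: lowercase and keep only the text before the first comma."""
    return name.split(",")[0].strip().lower()


# Made-up rows that mimic the two column formats.
toy = pd.DataFrame({
    "COLLEGE": ["Alpha Institute of Technology", "Beta College of Engineering"],
    "NODAL CENTER": ["Gamma University,State,City",
                     "Alpha Institute of Technology,State,City"],
})

# Compare every college against the full set of nodal centres, not just its own row.
colleges = toy["COLLEGE"].map(normalise)
centres = set(toy["NODAL CENTER"].map(normalise))
participating = toy.loc[colleges.isin(centres), "COLLEGE"]
print(participating.tolist())
```

The comma rule is crude: names that contain a comma before the state (for example "..., Greater Noida,Uttar Pradesh,Greater Noida") would need a more careful rule, so treat any match or non-match as a hint rather than a conclusion.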
print('Problem Statements in each nodal centre')
vals = pd.DataFrame(df['NODAL CENTER'].value_counts())
vals.index.names = ['NODAL CENTRES']
vals.rename(columns = {'NODAL CENTER':'PROBLEM STATEMENTS'}, inplace = True)
vals
Problem Statements in each nodal centre
NODAL CENTRES | PROBLEM STATEMENTS |
---|---|
Lovely Professional University,Punjab,Jalandhar | 10 |
Nalla Malla Reddy Engineering College,Telangana,Hyderabad | 9 |
Gujarat Technological University, Ahmedabad,Gujarat,Ahmedabad | 8 |
P.S.N.A. College of Engineering and Technology,Tamil Nadu,Dindigul | 7 |
Sreenidhi Institute of Science & Technology,Telangana,Hyderabad | 7 |
The National Institute of Engineering,Karnataka,Mysuru | 7 |
Anand Institute of Higher Technology,Tamil Nadu,Chennai | 7 |
Galgotias University ,Uttar Pradesh,Greater Noida | 7 |
Chandigarh Engineering College-CGC Landran,Punjab,Mohali | 7 |
Amity University Uttar Pradesh, Noida,Uttar Pradesh,Noida | 7 |
Prasad V Potluri Siddhartha Institute of Technology,Andhra Pradesh,Vijayawada | 6 |
Oriental Institute of Science and Technology,Madhya Pradesh,Bhopal | 6 |
Sri Venkateswara College of Engineering and Technology ,Andhra Pradesh,Chittoor | 6 |
Techno Main Salt Lake,West Bengal,Kolkata | 6 |
Coimbatore Innovation and Business Incubator (Forge),Tamil Nadu,Coimbatore | 6 |
Noida Institute of Engineering and Technology, Greater Noida,Uttar Pradesh,Greater Noida | 6 |
Rungta College of Engineering and Technology, Bhilai,Chhattisgarh,Bhilai | 6 |
Aditya Engineering College,Andhra Pradesh,Surampalem | 6 |
IES College of Technology,Madhya Pradesh,Bhopal | 6 |
New Horizon College of Engineering, Bangalore,Karnataka,Bangalore | 6 |
G H Raisoni College of Engineering,Maharashtra,Nagpur | 6 |
VNR Vignana Jyothi Institute of Engineering & Technology,Telangana,Hyderabad | 6 |
Manav Rachna International Institute of Research and Studies,Haryana,Faridabad | 6 |
Poornima Institute of Engineering & Technology,Rajasthan,Jaipur | 6 |
Vidyavardhaka College of Engineering,Karnataka,Mysuru | 6 |
GMR Institute Of Technology,Andhra Pradesh,Rajam | 6 |
Lakshmi Narain College of Technology,Madhya Pradesh,Bhopal | 6 |
The Assam Royal Global University,Assam,Guwahati | 5 |
GIET University, Gunupur, Odisha,Odisha,Gunupur | 5 |
St. Joseph'S College of Engineering,Tamil Nadu,Chennai | 5 |
Techno India NJR Institute of Technology,Rajasthan,Udaipur | 5 |
O P Jindal University, Raigarh,Chhattisgarh,Raigarh | 5 |
Kolhapur Institute of Technology'S College of Engineering (Autonomous), Kolhapur,Maharashtra,Kolhapur | 5 |
Vignana Bharathi Institute of Technology,Telangana,Hyderabad | 5 |
QIS College of Engineering and Technology,Andhra Pradesh,Ongole | 5 |
P. R. Pote Patil College of Engineering & Management, Amravati,Maharashtra,Amravati | 5 |
Chandigarh Engineering College, Jhanjeri, Mohali,Punjab,Mohali | 5 |
Manipal University Jaipur,Rajasthan,Jaipur | 5 |
MIT Art, Design and Technology University, Pune,Maharashtra,Pune | 5 |
Bhilai Institute of Technology, Durg,Chhattisgarh,Durg | 5 |
Techno India Unversity,West Bengal,Kolkata | 4 |
Prin L. N. Welingkar Institute of Management Development & Research (PGDM),Maharashtra,Mumbai | 4 |
Shobhit Institute of Engineering and Technology ,Uttar Pradesh,Meerut | 4 |
C V Raman Global University,Odisha,Bhubaneswar | 4 |
Swami Keshvanand Institute of Technology, Management & Gramothan,Rajasthan,Jaipur | 4 |
Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal,Karnataka,Manipal | 4 |
Chennai Institute of Technology,Tamil Nadu,Chennai | 4 |
Veer Surendra Sai University of Technology, Odisha, Sambalpur | 4 |
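The counts in the table above can be cross-checked against the earlier figures (275 rows, 48 nodal centres, and the 5-6 average). A small sketch, with the counts transcribed by hand from the table:

```python
from statistics import mean, median

# Transcribed from the table above: count value -> number of centres with that count.
tally = {10: 1, 9: 1, 8: 1, 7: 7, 6: 17, 5: 13, 4: 8}
counts = [value for value, n in tally.items() for _ in range(n)]

print(len(counts))             # number of nodal centres
print(sum(counts))             # total rows in the results
print(round(mean(counts), 2))  # mean entries per centre
print(median(counts))          # median entries per centre
```

The totals come out to 48 centres and 275 rows, with a mean of about 5.73 and a median of 6, which agrees with the "5-6 per centre" reading of the box plot.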