You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Any form of commercial usage is not allowed!
Please cite the following paper if you publish your work:
Haw-Shiuan Chang, Hwai-Jung Hsu and Kuan-Ta Chen,
"Modeling Exercise Relationships in E-Learning: A Unified Approach,"
International Conference on Educational Data Mining (EDM), 2015.
Introduction
The dataset contains the problem log and exercise-related information on the Junyi Academy ( http://www.junyiacademy.org/ ), an E-learning platform established in 2012 on the basis of the open-source code released by Khan Academy. In addition, the annotations of exercise relationship we collected for building models are also available.
Meaning of Fields
junyi_Exercise_table.csv:
字段名
说明
name
Exercise name (The name is also an id of exercise, so each name is unique in the dataset). If you want to access the exercise on the website, please append this name after url, http://www.junyiacademy.org/exercise/ (e.g., http://www.junyiacademy.org/exercise/similar_triangles_1 ). Please note that Junyi Academy are constantly changing their contents as Khan Academy did, so some url of exercises might be unavaible when you access them.
live
Whether the exercise is still accessible on the website on Jan. 2015
prerequisite
Indicate its prerequisite exericse (parent shown in its knowledge map)
h_position
The coordiate on the x axis of the knowledge map
v_position
The coordiate on the y axis of the knowledge map
creation_date
The date this exercise is created
seconds_per_fast_problem
The website judge a student finish the exercise fast if he/she takes less then this time to answer the question. The number is manually assigned by the experts in Junyi Academy.
pretty_display_name
The chinese name of exercise shown in the knowledge map (Please use UTF-8 to decode the chinese characters)
short_display_name
Another chinese name of exercise (Please use UTF-8 to decode the chinese characters)
topic
The topic of each exercise, and the topic would be shown as a larger node in the knowledge map.
area:
The area of each exercise (Each area contains several topics)
The tab delimited format used in PSLC datashop, please refer to their document ( https://pslcdatashop.web.cmu.edu/help?page=importFormatTd )
The size of the text file is too large (9.1 GB) to analyze using tools of websites, so we compress the text file and put it as an extra file of the dataset. We also upload a small subset of data into the website for the illustration purpose. Note that there are some assumptions when converting the data into this format, please read the description of our dataset for more details.
Example
Anon Student Id
Session Id
Time
Student Response Type
Tutor Response Type
Level (Unit)
Level (Section)
Problem Name
Problem Start Time
Step Name
Outcome
Condition Name
Condition Type
Selection
Action
Input
KC (Exercise)
KC (Topic)
KC (Area)
CF (points_earned)
CF (earned_proficiency)
12884
148691
1420714809324
ATTEMPT
RESULT
telling-time
time_terminology
time_terminology--analog_word
1420714806324
time_terminology--analog_word
INCORRECT
Choose_Exercise
NA
NA
NA
NA
time_terminology
telling-time
arithmetic
0
0
12884
148691
1420714810324
ATTEMPT
RESULT
telling-time
time_terminology
time_terminology--analog_word
1420714809324
time_terminology--analog_word
INCORRECT
Choose_Exercise
NA
NA
NA
NA
time_terminology
telling-time
arithmetic
0
0
239464
93497
1403098400837
ATTEMPT
RESULT
multiplication-division
multiplication_1
multiplication_1--0
1403098398837
multiplication_1--0
CORRECT
Choose_Exercise
NA
NA
NA
NA
multiplication_1
multiplication-division
arithmetic
14
0
Questions and Collaboration:
1. If you have any question to this dataset, please e-mail to [email protected].
2. If you have intention to acquire more data which fit your research purpose, please contact Junyi Academy directly for discussing the further cooperation opportunites by emailing to [email protected]
Note:
1. The dataset we used in our paper (Modeling Exercise Relationships in E-Learning: A Unified Approach) is extracted from Junyi Academy on July 2014, and this dataset is extracted on Jan 2015. After applying our method on the new dataset, we got similar observation with that in our paper, even though this dataset contains more users and exercises.
2. After uncompress the original problem log and problem log using PLSC format, the text files will take around 2.6 GB and 9.1 GB respectively. Please prepare enough space in your disk.