Human mobility prediction with GPS trajectory and Twitter data

Requirements:

Python 3.7 or later
MySQL
Postgres
config.cfg
JUMAN++ or MeCab
- For Japanese tweets

You'll need a configuration file "config.cfg" with MySQL/Postgres connection information (replace * with your keys).

[X/Twitter database]
host = ****
user = ****
passwd = ****
db_name = ****

[GPS trajectory database]
host = ****
user = ****
passwd = ****
db_name = ****

Workflow

(optional: tmux)

tmux new -s prediction
tmux attach -t prediction
control + b, d to detach

Using Conda

(assuming Nvidia libraries are properly configured)

conda env create --name prediction --file environment.yml
conda activate prediction

Run code

Review settings.py
- Check settings and parameters of models.
- What is your target area (in xy coordinates)?
- What is your target training time span? (e.g. 2012-07-25 00:00:00 ~ 2012-07-31 23:59:59)
- What is your target test time span? (e.g. 2012-07-25 08:00:00 ~ 2012-07-25 08:59:59)
Preprocessing
1. Embedding
  1. Run preprocessing_1_embedding.py
2. X/Twitter topic modeling
  1. Run preprocessing_2_train_topic_modeling_local.py
  2. Run preprocessing_3_create_topic_features.py
3. GPS (mobile phones)
  1. Run preprocessing_4_create_experiment_profile_table.py
    - Load GPS trajectory from DB and create profile table.
  2. Run preprocessing_5_interpolated_gps_db.py
    - Load GPS from DB and process data for machine learning model.
Run experiment (training and evaluation)
1. Training
  1. Run prediction_1_gps_grid_baseline_ngram.py for runnning ngram baseline model.
  2. RUN prediction_2_gps_grid_baseline_rnn.py for running RNN baseline model.
  3. Run prediction_3_gps_grid.py for GPS only model.
  4. Run prediction_4_gps_grid_mode.py for GPS and mode model.
  5. Run prediction_5_gps_grid_topic.py for GPS and topic model.
  6. Run prediction_6_gps_grid_mode_topic.py for GPS, mode, and topic model.
  7. Run prediction_7_gps_grid_mode_topic_attnention.py for GPS, mode, and topic model with attention layer.
2. Evaluation
  1. Run evaluation_1_prediction.py
    - Evaluate prediction with Loss
  2. Run evaluation_2_prediction_cityemd.py
    - Evaluate prediction with CityEMD
  3. Run evaluation_3_prediction_small_target.py
    - Evaluate prediction in small target area

data

directory	Description
DATA_DIR_RAW = "../data/raw/"	Raw data. Ignored.
DATA_DIR_PROCESSED = "../data/processed/"	Processed data. Ignored.
DATA_DIR_INTERIM = "../data/interim/"	Interim data. Ignored.
OUTPUT_DIR = "../data/output/" + EXPERIMENT_PARAMETERS["EXPERIMENT_NAME"] + "/"	Output data. Ignored, but uploaded to Dropbox.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Human mobility prediction with GPS trajectory and Twitter data

Requirements:

Workflow

(optional: tmux)

Using Conda

Run code

data

Files

README.md

Latest commit

History

README.md

File metadata and controls

Human mobility prediction with GPS trajectory and Twitter data

Requirements:

Workflow

(optional: tmux)

Using Conda

Run code

data