Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open Dataset scripts & docs #10

Open
MrOrz opened this issue Nov 23, 2017 · 2 comments
Open

Open Dataset scripts & docs #10

MrOrz opened this issue Nov 23, 2017 · 2 comments
Assignees

Comments

@MrOrz
Copy link
Member

MrOrz commented Nov 23, 2017

Goal

From 1122 Johnson's goals
開放資料庫資料,讓統計人工智慧得以投入:
(a) 分析訊息組成,讓有興趣之人尋找「台灣人會對什麼產生疑惑」「有多少訊息來自境外、來自哪些地方」等問題的答案,或者是透過 social network graphical model 來理解帳號與關係的關聯性。
(b) 自動輔助分類與分領域。

Previous decisions & spec

http://beta.hackfoldr.org/cofacts/https%253A%252F%252Fhackmd.io%252Fs%252FSysG-Jxo- (準備 data 頁面)

Related discussion

https://g0v-tw.slackarchive.io/cofacts/page-9/ts-1497944951784497 (Web QA)
https://g0v-tw.slackarchive.io/cofacts/page-17/ts-1505700819000037 (Discussion)

Actionable steps

  1. Finalize the index & fields to put in dataset
  2. Design Schema for this dataset
  3. Build a script that outputs a bundled dataset
  4. Prepare a page (either hackmd or github page or README.md) to describe the dataset and the field
@MrOrz
Copy link
Member Author

MrOrz commented Apr 22, 2018

I think it's done for now. Let's close this.

@MrOrz MrOrz closed this as completed Apr 22, 2018
@MrOrz
Copy link
Member Author

MrOrz commented Feb 18, 2020

Advanced items

  1. Add setup script for google cloud storage. Executing the script can install gcs backup repository on specified elasticsearch container, given the gcs credential.
  2. Add backup script and instructions in README for periodic backups
  3. Add restore script and instructions in README for fetching latest backup from the same gcs repo and run opendata generation upon finish.
  4. Add Dockerfile that is capable of execute 2~3 on Google Cloud Run, so that we can automatically put opendata files on Google cloud storage.

@MrOrz MrOrz reopened this Feb 18, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants