-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
減少不必要的資料庫空間消耗 #35
Comments
ddio
added a commit
that referenced
this issue
Aug 16, 2019
ddio
added a commit
that referenced
this issue
Aug 16, 2019
Add pagination to lower memory usage, #35
ddio
added a commit
that referenced
this issue
Aug 17, 2019
ddio
added a commit
that referenced
this issue
Aug 17, 2019
Fix wrong cursor usage and add tgz packaging to reduce file size, #35
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
一開始為了簡化邏輯,把原始資料、結構化資料、歷史資料全都塞在資料庫裡,導致資料庫以每週 1.5G 的速度吃掉硬碟,但其實裡面大部分的東西,都是沒有用的。
目前資料庫裡會被用到的 tables
syncstateful
會拿來判斷物件花多久出租,須保留過去 60 天資料,確定儲存完整的 591 開放週期沒有用到的 tables
計畫
因為這兩個 table 在爬蟲進行時,還是會有頻繁的修改,所以在爬的時候,還是需要存在,所以:
目前所佔空間:
![2019-07-01 18-42-24 的螢幕擷圖](https://user-images.githubusercontent.com/271157/60431019-4e26b180-9c31-11e9-8461-c287e7436910.png)
The text was updated successfully, but these errors were encountered: