Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

problem about read data #2

Open
hzjry123 opened this issue Mar 21, 2018 · 3 comments
Open

problem about read data #2

hzjry123 opened this issue Mar 21, 2018 · 3 comments

Comments

@hzjry123
Copy link

No description provided.

@hzjry123
Copy link
Author

I have read train data successfully, but My memory usage is 4.3G which much bigger than you, could you help me fix that?
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 40428967 entries, 0 to 40428966
Data columns (total 24 columns):
id object
click bool
hour datetime64[ns]
C1 uint16
banner_pos uint16
site_id object
site_domain object
site_category object
app_id object
app_domain object
app_category object
device_id object
device_ip object
device_model object
device_type uint16
device_conn_type uint16
C14 uint16
C15 uint16
C16 uint16
C17 uint16
C18 uint16
C19 uint16
C20 uint16
C21 uint16
dtypes: bool(1), datetime64ns, object(10), uint16(12)
memory usage: 4.3+ GB

@hzjry123 hzjry123 changed the title Hi problem about read data Mar 21, 2018
@diogojapinto
Copy link
Owner

Hi! It's a bit strange. Can you validate if you are using Python 3 and an updated version of Pandas?

@hzjry123
Copy link
Author

That's my python version and pandas version:

python: 3.6.4.final.0
python-bits: 64
OS: Darwin
OS-release: 17.3.0
machine: x86_64
processor: i386
byteorder: little
LC_ALL: None
LANG: None
LOCALE: None.None

pandas: 0.22.0
pytest: 3.3.2
pip: 9.0.2
setuptools: 38.4.0
Cython: 0.27.3
numpy: 1.14.2
scipy: 1.0.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants