calculate prices from csv #96

scientes · 2021-12-13T14:10:37Z

This pr does 4 Things:

updates the year to 2021
All values are now stored as str in db to prevent floating point bugs in the future (there is no decimal in sqlite)
Tablenames are now Alphabeticaly sorted ETH/BNB -> BNB/ETH and converted to prevent two tables being created for the same pair
If there are two timestamps from the same platform the price between them is calculated when one is a buy operation and the other is a sell operation ([Binance] Calculate price from transaction statement #50)

updated year changed db format from float to str changed tablename format

provinzio

good idea to group the tables of opposite coin pairs. I am currently unsure, if this is a good idea. e.g. BTC/USD is like 60.000$ but USD/BTC would be 0.00001666666 which is unnecessary small.

Is it enough to save the decimals as string for that? Please check this explicitly.

src/book.py

src/price_data.py

Griffsano · 2021-12-15T22:28:25Z

Hey, I think the inversion of prices will get problematic for coins that are listed for 0 EUR (e.g. airdrops of some new tokens). Would it be possible to search the database for the inverse coin pair if the original cannot be found, before creating a new table?

provinzio · 2021-12-15T23:11:59Z

Hey, I think the inversion of prices will get problematic for coins that are listed for 0 EUR (e.g. airdrops of some new tokens). Would it be possible to search the database for the inverse coin pair if the original cannot be found, before creating a new table?

If the coin pair ABC/XYZ is valued at 0 Eur, the coin pair XYZ/ABC should be 0 Eur as well. Therefore creating the reciprocal requies to except the case when the pair is valued at 0 Eur. The reciprocal function in misc.py does exactly that. I mentioned it in the code review.

Furthermore, this PR sorts the coin pair alphabetically to form the table name. Therefore, each coin pair plus his "inverse coin pair" will have exactly one table. There wouldn't be the need to search for another table, as there is only this one.

Griffsano · 2021-12-16T09:40:30Z

You're right, it should also work with this function, basically overwriting inf by 0. Of course, if the inverse price can always be computed, the alphabetic sorting works and we don't have to search for the inverse coin pair in the database.

scientes · 2021-12-16T11:33:35Z

Hey @scientes,

good idea to group the tables of opposite coin pairs. I am currently unsure, if this is a good idea. e.g. BTC/USD is like 60.000$ but USD/BTC would be 0.00001666666 which is unnecessary small.

Is it enough to save the decimals as string for that? Please check this explicitly.

The default precision of decimal is 28
also small test:

>>> Decimal(1)/(Decimal(1) /Decimal(123456789123456789))
Decimal('123456789123456789.0000000000')
>>> Decimal(1)/(Decimal(1) /Decimal(123456789123456789.1))
Decimal('123456789123456784.0000000000')
>>> Decimal(1)/(Decimal(1) /Decimal(12345.1))
Decimal('12345.10000000000036379788071')

the max of string length in sqllite is 2^31-1 bytes which i think is more than enough

I'm not super sure how to fix that properly, i mean it's still more accurate than fetching the 1 minute average. The easiest would be just to up the precision on decimal. or we could make two columns in the db one for the price and one for if its inverted relative to the table name and we do not invert the price at all

provinzio · 2021-12-16T22:39:32Z

The precision should be high enough as it is right now. At least I don't know of a coin pair with such a huge factor. Binance shows the prices with 8 decimals places (at least in the GUI).

fixed reciprocal (both not finished)

scientes · 2021-12-25T14:19:41Z

@provinzio I added the migrations to the check_db function. but apparently its not used anywhere. is that the correct place for the migration or should i make an extra function for them?

src/book.py

src/price_data.py

src/book.py

src/price_data.py

* reciprocal price for API price data, note for CSV price data, address flake8 and mypy errors * remove duplicate reciprocal * remove set reciprocal False

Not all values in the list are strings

- Add comments to get_price_from_csv - Rename operations variables - CHANGE Calculated price to buy/sell instead of sell/buy

provinzio · 2022-01-02T21:21:17Z

I added some docstrings and comments and worked on an first idea of a patch system. Unfortunatly, I wasn't able to finish my work and am still unsure whether this is the best approach. I think that we are currently missing a central database class which handles the interaction between the code and our database system. We might want to factor out the database functions from price_data.py and some functions from patch_database.py and move them to a new database.py file. What do you think of that?

The patches in patch_database.py aren't finished, but my ground work should give an idea of what is missing. We need a create patches, so that (my) old databases with unsorted tablenames and float prices are converted to the new format with sorted tablenames and string prices.

Can you work on it?

* barebones config * RM debug print * ADD Comment to CALCULATE VIRTUAL SELL config * fixes in DB patch functions Co-authored-by: scientes <[email protected]> Co-authored-by: Jeppy <[email protected]>

Unfortunatly, there is no good place in book.py to create a db as preprocessing step as the platform string is at first defined in the read_function and not beforehand

provinzio · 2022-02-05T17:26:07Z

This PR definitly got quite out of scope quickly, but you two, @Griffsano and @scientes, handled it really good, so that we can now calculate the prices directly from CSV files and have an database versioning system!

While reviewing, I noticed, that we got the database type of the price column wrong. "STR" is not supported by sqllite. Please check your databases for the correct type, which should be at least VARCHAR(255). Otherwhise your prices will be stored with a maximal precision of 1e-9.

I added the functionality, that prices from CSV overwrite already existing prices in the database.

Again, good job you two!

scientes added 3 commits December 13, 2021 15:06

calculate prices from csv

43e1fe5

updated year changed db format from float to str changed tablename format

Merge branch 'main' into prices-from-csv

74077df

fixed linting except line to long

f2729c7

provinzio requested changes Dec 13, 2021

View reviewed changes

Griffsano mentioned this pull request Dec 13, 2021

Optimize getting Price data #14

Open

fixed for multiple coins

13fa82e

fixed reciprocal (both not finished)

provinzio marked this pull request as draft December 19, 2021 15:12

scientes added 2 commits December 25, 2021 14:37

added sql migration and extra function

9afb70a

bugfix (prices were inverted db) and migration fix

8e21d8a

Griffsano reviewed Dec 30, 2021

View reviewed changes

src/book.py Outdated Show resolved Hide resolved

Griffsano reviewed Dec 30, 2021

View reviewed changes

src/price_data.py Outdated Show resolved Hide resolved

scientes added 2 commits December 31, 2021 11:44

fixed missing inversion

9c52dd8

formatting

f74327d

scientes commented Dec 31, 2021

View reviewed changes

src/book.py Outdated Show resolved Hide resolved

src/price_data.py Outdated Show resolved Hide resolved

Prices from csv (#3)

5112a14

* reciprocal price for API price data, note for CSV price data, address flake8 and mypy errors * remove duplicate reciprocal * remove set reciprocal False

scientes marked this pull request as ready for review December 31, 2021 16:52

provinzio added 7 commits January 2, 2022 19:54

FIX type annotation of group_by

45b6a39

Not all values in the list are strings

REFACTOR add newline

7c0f924

REFACTOR get_price_from_csv

dc852ee

- Add comments to get_price_from_csv - Rename operations variables - CHANGE Calculated price to buy/sell instead of sell/buy

ADD docstring to _sort_pair

62b861c

RENAME reciprocal bool to inverted

5124ebe

ADD comment in get_price when price is missing

3d10f3d

TODO ADD rudimentary patch functions

30fe876

fix iosort

7265ee7

scientes marked this pull request as ready for review January 19, 2022 20:57

Griffsano mentioned this pull request Jan 26, 2022

Prices from csv scientes/CoinTaxman#5

Merged

Griffsano and others added 24 commits January 28, 2022 10:09

Prices from csv (#5)

7bb7bd8

* barebones config * RM debug print * ADD Comment to CALCULATE VIRTUAL SELL config * fixes in DB patch functions Co-authored-by: scientes <[email protected]> Co-authored-by: Jeppy <[email protected]>

Refactor logging patching info

179bb05

CHANGE get_tablenames does not query §version table per default

acc215b

UPDATE transfer platform parameter

1419746

REFACTOR function docstrings

d70559f

FIX mypy linting error

09a81e5

Reorder preamble of set_price_db

a06f735

UPDATE assert db exists before setting new price

e99a7aa

ADD helper functions to get patch func names/versions

cd33058

REFACTOR patch databases

b54c366

ADD create database in book.py when missing

63b3959

AUTOFORMAT database.py

c400a4c

Avoid circular import

78046f4

CHANGE update_version create §version table when missing

7b81aa9

Merge remote-tracking branch 'origin/main' into prices-from-csv

700ea26

UPDATE format of warning message

0f2e7fb

UPDATE reword debug message

a6fc0eb

ADD debug message when updating db version

7cae5c3

CHANGE Create db not in book, but on set_price when db is missing

f932e0c

Unfortunatly, there is no good place in book.py to create a db as preprocessing step as the platform string is at first defined in the read_function and not beforehand

UPDATE duplicate price warning, add db price for comparison

0b93b3b

AUTOFORMAT price_data

81b4e30

ADD comment to price_data when querying new price

53753d6

FIX database type for price

16dabab

ADD set_price_db parameter to overwrite already existing prices

c1538cf

FIX linting errors

9f906b1

provinzio merged commit e8a20f0 into provinzio:main Feb 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

calculate prices from csv #96

calculate prices from csv #96

scientes commented Dec 13, 2021 •

edited by provinzio

Loading

provinzio left a comment

Griffsano commented Dec 15, 2021

provinzio commented Dec 15, 2021

Griffsano commented Dec 16, 2021

scientes commented Dec 16, 2021

provinzio commented Dec 16, 2021

scientes commented Dec 25, 2021

provinzio commented Jan 2, 2022

provinzio commented Feb 5, 2022

calculate prices from csv #96

calculate prices from csv #96

Conversation

scientes commented Dec 13, 2021 • edited by provinzio Loading

provinzio left a comment

Choose a reason for hiding this comment

Griffsano commented Dec 15, 2021

provinzio commented Dec 15, 2021

Griffsano commented Dec 16, 2021

scientes commented Dec 16, 2021

provinzio commented Dec 16, 2021

scientes commented Dec 25, 2021

provinzio commented Jan 2, 2022

provinzio commented Feb 5, 2022

scientes commented Dec 13, 2021 •

edited by provinzio

Loading