-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate behavior #6
Conversation
Also set key_list=None
Just wanted to chime in here and say that your diagram and the way you structured things makes sense to me. From a high-level and after reviewing the code I do think the hierarchy should be inverted, i.e. the The changes you made to fix the |
Architecturally, I've summarized how I think we can improve things in #9. As for this PR, I think the ideas are sensible enough, e.g. moving more out of As for the issue with the dataframe growing in size, I think that can be addressed either in the current or existing architecture: the use of |
self.define_points([kdim1, kdim2], df[f'point_{kdim1}'], df[f'point_{kdim2}']) | ||
self.clear_edits() | ||
|
||
def add_schema_to_conn(self, conn): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The problem here I think is that I don't think all connectors should support generate_schema
. This code was on the SQLiteDB
connector as I think only SQLiteDB
should auto-generate schemas in this way (for convenience). For 'real' databases, you probably can't (and shouldn't) just create a bunch of tables automatically that are likely supposed to be transient.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't think that's necessarily true, I have seen many applications in customer environments that autogenerate tables whether that's Postgres or Snowflake.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suppose it can be optional..but I wouldn't expect every connector to support this and SQLite
would still be the default.
Observed that when you have identical
_field_df
in your database, it will drop duplicates. This is not a desired behavior.Example on main:
Though, if I do use drop duplicates (like in this PR), a new annotator with the same connector will pretty fast. This is why the second test is marked xfail.