Skip to content

Commit 2e3914f

Browse files
authored
Requirements document for the dataframe interchange protocol (#35)
* Add a summary document for the dataframe interchange protocol Summarizes the various discussions about and goals/non-goals and requirements for the `__dataframe__` data interchange protocol. The intended audience for this document is Consortium members and dataframe library maintainers who may want to support this protocol. The aim is to keep updating this till we have captured all the requirements and answered all the FAQs, so we can actually design the protocol after and verify it meets all our requirements. Closes gh-29 * Process some review comments * Process a few more review comments. * Link to Release callback semantics in Arrow C Data Interface docs * Add design requirements for column selection and df metadata * Edit the nested/heterogeneous dtypes non-requirement * Add requirements for chunking and memory layout description Also address some smaller review comments. * Add TBD notes on dataframe-array connection and from_dataframe Also add more details on the Arrow C Data Interface. * Address review comments * Add details on implementation options * Add details about the C implementation * Add an image of the dataframe model and its memory layout. * Add link to discussion on array-dataframe connection * Some more updates for review comments * Update table to indicate Arrow does support categoricals. * Add section on dtype format strings * Reflow some lines * Add a requirement on semantic meaning of NaN/NaT, and timezone detail * Textual tweak: say columns in a data frame are ordered * Update requirements document for recent decisions/insights
1 parent 6af8c2a commit 2e3914f

4 files changed

+998
-0
lines changed
20.9 KB
Loading

0 commit comments

Comments
 (0)