Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Blog post with DataFusion July - Sep 2024 #11631

Open
alamb opened this issue Jul 24, 2024 · 7 comments
Open

Blog post with DataFusion July - Sep 2024 #11631

alamb opened this issue Jul 24, 2024 · 7 comments
Assignees
Labels
enhancement New feature or request

Comments

@alamb
Copy link
Contributor

alamb commented Jul 24, 2024

Is your feature request related to a problem or challenge?

We have had good luck writing up quarterly updates for DataFusion, most recently:
https://datafusion.apache.org/blog/2024/07/24/datafusion-40.0.0/

See #9602

Describe the solution you'd like

Blog post

Describe alternatives you've considered

No response

Additional context

No response

@alamb alamb added the enhancement New feature or request label Jul 24, 2024
@alamb
Copy link
Contributor Author

alamb commented Jul 24, 2024

Here is my wishlist for things to write about in the next blog:

Also, of course, I would love to have more help writing a blog (maybe someone else could draft it 🤔 🎣 )

@dharanad
Copy link
Contributor

@alamb Thank you for considering me, but I think there may be some confusion - I wasn't involved in the work on Substrait. However, I'd be happy to contribute to a blog post on MAP once I've completed adding support for Arrays in #11436

@alamb
Copy link
Contributor Author

alamb commented Jul 24, 2024

@alamb Thank you for considering me, but I think there may be some confusion

Yes I was probably confused -- sorry about that

@Blizzara
Copy link
Contributor

@alamb for Substrait - maybe the work @Lordworms has been doing to add the TPC-H tests would be good at least? From my side, I don't know if there's any precise milestone as such - but maybe something around supporting VirtualTables, more literals and types, better interoperability with other substrait producers. (I do hope to write a separate blog post from our perspective if/when I've proven the whole setup I'm working on works and is faster, but we're not there yet unfortunately.)

@alamb
Copy link
Contributor Author

alamb commented Aug 5, 2024

Blog with #11627 performance high cardinality aggs / partial skipping

@alamb alamb self-assigned this Aug 20, 2024
@alamb
Copy link
Contributor Author

alamb commented Aug 22, 2024

It would also be cool to discuss efforts for chunked emission #11943 for (more) aggregage performance

@alamb
Copy link
Contributor Author

alamb commented Oct 15, 2024

My plan for this is that we will finish up enabling string view and then make that performance improvement be the headline for this post

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants