-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Presto functions #2262
Comments
Thanks for sharing the list of missing functions. Following functions have been added: JSON functions Working on the following functions: JSON functions |
I am going to start working on array_union. |
There was a prior PR for array_union #867 that was abandoned. Maybe you can check with @kagamiori about it. |
@aditi-pandit Thank you for bringing up this! It was a PR I didn't finish. I originally planned to rewrite it as a simple function once it's supported. (I just confirmed that it's not supported yet currently.) Do you need this function soon? Please feel free to take it over, or let me know if you want me to finish #867. |
There isn't an urgency for the array_union function rightaway. We were just picking stuff from Masha's list above. |
I will leave array_union to @kagamiori and instead start on array_average instead. |
Hi @pramodsatya! Looking for functions to pick up. Wondering if you are still working on truncate? Cheers, |
Hi @gosharz, I am not working on truncate function. Thanks for checking. |
@pramodsatya mind if I pick it up? |
No, please go for it. Thank you!
|
Adding truncate: #2862 |
Will also give a try to strrpos if nobody minds :) |
Here we go: #2903 |
Hi @mbasmanova, I'd like to claim this array function first: |
Looks like there is a duplicate WIP PR on the same udf, array_has_duplicates: #3320 I switched to shuffle: #3404 (ready for review) |
Hello @mbasmanova, I'm claiming functions: |
Hi @mbasmanova |
Sounds great. |
Hi @mbasmanova, I would like to claim the |
Thanks @mbasmanova, must've missed that. Then may I claim the |
Hi @mbasmanova, I would actually like to take the |
That's fine. Thanks. |
Hi @mbasmanova, I would also like to claim the following functions:
|
hello all , seems like a lot has been done !. Is there anything that I could be able to work on ? , I am a newbie to opensource , but I believe I have good skills in C++ , C , Python and such , I hope @mbasmanova or someone would be able to help me soon , cheers! |
I want to join this interesting work, |
@SANTHOSH-MAMIDISETTI @dusx1981 Welcome, folks. Would you provide some context re: your interest in Velox. Are you part of the teams that use Velox? If so, what are these team do? |
We are working on a distributed database, and we need to use things like ICompiledCall. |
Which array related functions can I claim? |
@dusx1981 Curious, which database do you work on? BTW, development of codegen in Velox has been paused long time ago. Consider, adding a family of map_top_n_xxx functions: https://prestodb.io/docs/current/functions/map.html#map_top_n_keys |
We are a project being worked on by a studio, and we are currently in the development stage, and we are also soliciting the name of the database. Do you mean adding functions to this framework?presto |
Curious which studio is this?
This link is in Chinese, which I unfortunately cannot read. You asked "Which array related functions can I claim?" and I suggested to pick up map_top_n_xxx Presto functions. |
Sure. I would need to understand a bit more about the system you are building to advise. What kind of problems are you looking to solve and what is your tentative solution? |
I just have a question, we want to use Go to implement a set of JVM compilation and execution logic like Presto. But you know that Java can directly generate bytecode at runtime and load it through ClassLoader, but like Go or C++, it can only be compiled and executed statically. So we want to refer to the implementation of Velox and implement a Go version. Volex stopped the development of codegen, is it because of performance reasons, is my idea above feasible? |
@dusx1981 What is the problem you are trying to solve? Are you looking to build a more efficient / faster query engine? Where do you think the speedup / efficiency will come from? We stopped codegen development primarily because codegen is harder to debug and develop. We do not believe it will be faster for analytical workloads. |
Another question, why not perform compiling and linking operations in memory instead of writing to files, which adds IO operations? Will this have a significant impact on performance? |
@dusx1981 I suggest to open a separate GitHub issue and continue discussion there. CC: @laithsakka |
|
* update velox to 6.14
Hi all, I am new to OLAP databases and I am extremely interested in Velox. Is there anything that I could work on? I am thinking of adding |
@Real-Chen-Happy Welcome! I suggest to start with #3728 or one of fuzzer-found issues: https://github.com/facebookincubator/velox/issues?q=is%3Aopen+is%3Aissue+label%3Afuzzer-found |
@Real-Chen-Happy array_frequency function has been added in #3807 |
Thank you for your reply! I will start to take a look at #3728 |
@Real-Chen-Happy Thanks. BTW, would you introduce yourself and share a bit about where / how you use Velox? Do you know anyone in Velox community who can help you onboard to the codebase and provide guidance on your first PRs? If not, please, create a GitHub issue to ask if anyone is willing to help with that. |
Yeah sure! I am Real and I have some experiences in OLTP systems. My current work is not related to Velox directly. Contributing to Velox is my personal interest because I believe the future of DBMS will be composable, and Velox will definitely play a key role. I would love to contribute my efforts in this area. I am new in this community, so let me know if anybody in this community is interested in providing some mentorship! #9262 |
Where should I find the functions that need to be supplemented? What I see in this link seems a bit outdated |
@Sutter099 A coverage map might be a good place to find functions still missing in Velox: https://facebookincubator.github.io/velox/functions/presto/coverage.html |
Velox includes many of the PrestoSQL functions, but a few are still missing. It would great to add these.
Function coverage map: https://facebookincubator.github.io/velox/functions/coverage.html
A subset of missing functions that would be most helpful to add.
Array functions
all_match (lambda function) #3356
any_match (lambda function)
array_average #2434
array_frequency #3807
array_has_duplicates #3320
array_normalize
array_remove
array_union
flatten
repeat
sequence
shuffle #3404
zip_with (lambda function) #2685
Map functions
map_from_entries #3417
map_normalize #9086
map_zip_with (lambda function) #2711
JSON functions
is_json_scalar #2291
json_array_contains #2299
json_array_get
json_array_length #2294
json_extract #5269
json_format #3525
json_parse #3663
json_size #3413
String functions
split_to_map
split_to_multimap
strrpos
Regular expression functions
regexp_split
Date and Time functions
timezone_hour
timezone_minute
week #2287
week_of_year #2287
Mathematical functions
truncate
The text was updated successfully, but these errors were encountered: