Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Epic] Complete pulling out special SQL planning from the Sql Parser #11207

Closed
13 tasks done
alamb opened this issue Jul 2, 2024 · 18 comments
Closed
13 tasks done

[Epic] Complete pulling out special SQL planning from the Sql Parser #11207

alamb opened this issue Jul 2, 2024 · 18 comments
Labels
enhancement New feature or request

Comments

@alamb
Copy link
Contributor

alamb commented Jul 2, 2024

Is your feature request related to a problem or challenge?

As discussed in #10534, @jayzhan211 added a UserDefinedSQLPlanner in #11180 so that the translation of certain SQL sytanx to LogicalPlans and Exprs are not hard coded in SqlToRel but instead are controlled by a UserDefinedSQLPlanner

Now that we have the pattern, we need to move the other remaining functionality that is hard coded (e.g. looking up a function "date_part" by name) in SqlToRel to the UserDefinedSQLPlanner

Describe the solution you'd like

To rewrite with sql planner

Describe alternatives you've considered

No response

Additional context

Discussion is here: #10534

@samuelcolvin
Copy link
Contributor

#11208 allows user defined sql planners to be defined.

@samuelcolvin
Copy link
Contributor

See datafusion-contrib/datafusion-functions-json#26 - support for custom SQL operators in datafusion-functions-json using #11208.

@dharanad
Copy link
Contributor

dharanad commented Jul 2, 2024

hello @alamb Just checking in on the remaining tasks. Is there anything specific we're waiting on before we create issues ?
If we're all set, i would be happy to jump in and get started to pick up few tasks.

@alamb
Copy link
Contributor Author

alamb commented Jul 2, 2024

hello @alamb Just checking in on the remaining tasks. Is there anything specific we're waiting on before we create issues ? If we're all set, i would be happy to jump in and get started to pick up few tasks.

Hi @dharanad I don't think there is anything from my perspective. Thank you for offering

In fact it seems as if @xinlifoobar has already started with #11215 ❤️

@dharanad
Copy link
Contributor

dharanad commented Jul 2, 2024

I've created issues for a couple of tasks. Please let me know if you think anything needs updating in the descriptions. I'm new here and learning from shadowing the experienced folks

@alamb
Copy link
Contributor Author

alamb commented Jul 2, 2024

I've created issues for a couple of tasks. Please let me know if you think anything needs updating in the descriptions. I'm new here and learning from shadowing the experienced folks

thank you @dharanad -- this is very helpful 🙏

@alamb
Copy link
Contributor Author

alamb commented Jul 2, 2024

FWIW in general @dharanad I have had the best luck with writing a description on tickets that requires as little context as possible (aka distill down what is needed into the the description, rather than assuming the new contributor will read the epic and get all the backstory)

The rationale for this duplication is to lower the barrier to new contrbutors

@dharanad
Copy link
Contributor

dharanad commented Jul 3, 2024

FWIW in general @dharanad I have had the best luck with writing a description on tickets that requires as little context as possible (aka distill down what is needed into the the description, rather than assuming the new contributor will read the epic and get all the backstory)

The rationale for this duplication is to lower the barrier to new contrbutors

Thanks for the feedback! I really appreciate. You're right, making the ticket description concise and self-contained will definitely help reduce the barrier for new contributors. I'll update the description to include the necessary context. Thanks you

@dharanad
Copy link
Contributor

dharanad commented Jul 3, 2024

Create issues for the remaining tasks, tried adding a description based on my understanding of the issue. Also update the same for the older ones

@samuelcolvin
Copy link
Contributor

Given how much UserDefinedSQLPlanner is being used for existing stuff within datafusion, perhaps it should be called just SQLPlanner or CustomSQLPlanner?

@alamb
Copy link
Contributor Author

alamb commented Jul 3, 2024

Given how much UserDefinedSQLPlanner is being used for existing stuff within datafusion, perhaps it should be called just SQLPlanner or CustomSQLPlanner?

I agree

Or maybe something like ExprPlanner 🤔 as it is being used to plan specific exprs.

@samuelcolvin
Copy link
Contributor

ExprPlanner sounds good.

@xinlifoobar
Copy link
Contributor

xinlifoobar commented Jul 4, 2024

Given #11220 and #11243, those are very similar APIs with UDF plans. I am trying to draft an API, e.g.,

    // Plan the user defined function, returns origin expression arguments if not possible
    fn plan_udf(
        &self,
        _sql: &sqlparser::ast::Expr,
        args: Vec<Expr>,
    ) -> Result<PlannerResult<Vec<Expr>>> {
        Ok(PlannerResult::Original(args))
    }

to uniform the usages.

I have created a draft PR #11263 to discuss this. The flaw here is that the parameter sql is partially borrowed and has to be cloned at the very beginning. Maybe we should consider using references if possible.

@xinlifoobar
Copy link
Contributor

Given #11220 and #11243, those are very similar APIs with UDF plans. I am trying to draft an API, e.g.,

    // Plan the user defined function, returns origin expression arguments if not possible
    fn plan_udf(
        &self,
        _sql: &sqlparser::ast::Expr,
        args: Vec<Expr>,
    ) -> Result<PlannerResult<Vec<Expr>>> {
        Ok(PlannerResult::Original(args))
    }

to uniform the usages.

I have created a draft PR #11263 to discuss this. The flaw here is that the parameter sql is partially borrowed and has to be cloned at the very beginning. Maybe we should consider using references if possible.

Eventually, I made this #11263, please let me know your thoughts. Thanks :)

CC @jayzhan211 @dharanad @alamb

@alamb
Copy link
Contributor Author

alamb commented Jul 6, 2024

ExprPlanner sounds good.

Filed #11304

@alamb
Copy link
Contributor Author

alamb commented Jul 15, 2024

I think we are pretty close to calling this done.

I just double checked and sql_compound_identifier_to_expr is the only thing that needs this treatment to remove the call to get_function_meta:

if let Some(udf) =
self.context_provider.get_function_meta("get_field")

That appears to be the last issue
https://github.com/search?q=repo%3Aapache%2Fdatafusion+get_function_meta+path%3A%2F%5Edatafusion%5C%2Fsql%5C%2F%2F&type=code

@alamb
Copy link
Contributor Author

alamb commented Jul 15, 2024

Filed #11473

@alamb
Copy link
Contributor Author

alamb commented Jul 22, 2024

I think we can claim we are done 🎉

thanks everyone

@alamb alamb closed this as completed Jul 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants