Skip to content

chore: Use native impl of soundex function#4824

Open
kazantsev-maksim wants to merge 60 commits into
apache:mainfrom
kazantsev-maksim:native_soundex
Open

chore: Use native impl of soundex function#4824
kazantsev-maksim wants to merge 60 commits into
apache:mainfrom
kazantsev-maksim:native_soundex

Conversation

@kazantsev-maksim

@kazantsev-maksim kazantsev-maksim commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Which issue does this PR close?

  • N/A

Rationale for this change

This PR migrates the soundex string function from Comet's codegen-based dispatch path to the shared datafusion-spark implementation, registering it as a native scalar UDF.

What changes are included in this PR?

  • Register the SparkSoundex scalar UDF in register_datafusion_spark_function so it is available to the DataFusion session context alongside the other Spark-compatible UDFs.
  • Change CometSoundEx from CometCodegenDispatch[SoundEx] to CometScalarFunction[SoundEx]("soundex")

How are these changes tested?

Significantly expand the test coverage with many additional inputs to exercise edge cases

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant