Skip to content

feat(Impala): support .distinct() for a subset of columns #10370

Open
@contang0

Description

@contang0

Is your feature request related to a problem?

At the moment Impala backend only supports .distinct() on a full table.

This works:

table.distinct()

This does not:

table.distinct(on=['col1', 'col2'])

Translation to backend failed
Error message: OperationNotDefinedError("Compilation rule for 'First' operation is not defined")

What is the motivation behind your request?

This forces me to write verbose workarounds.

.distinct() on a subset of a table is pretty fundamental, in my view.

Describe the solution you'd like

The on clause in .distinct() should work.

What version of ibis are you running?

10.5

What backend(s) are you using, if any?

Impala

Code of Conduct

  • I agree to follow this project's Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Labels

    featureFeatures or general enhancements

    Type

    No type

    Projects

    Status

    backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions