Customizable mining policies, Bulldozer refactoring and a first version of alphabet selection (binning) #41

Merged
merged 92 commits into from
Dec 4, 2024


mauro-milella
Member

Bulldozer now gets an entire dataset slice (a generic MineableData type) instead of just one instance.

Alphabet selection can now be performed using select_alphabet: this essentially performs binning by following a custom strategy. Since the returned objects are propositions (which can then be used to construct Items), this functionality could be moved to SoleData in the future.
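
To make the binning idea concrete, here is a minimal sketch in Python (not the package's Julia code). It assumes two common strategies, equal-width and quantile cuts, and represents each proposition as a plain `(feature, operator, threshold)` tuple. All names below (`equal_width_cuts`, `quantile_cuts`, `make_conditions`, `select_alphabet_sketch`) are hypothetical and do not reflect the actual signature of select_alphabet.

```python
from statistics import quantiles


def equal_width_cuts(values, nbins):
    """Return nbins-1 thresholds splitting [min, max] into equal-width bins."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / nbins
    return [lo + step * i for i in range(1, nbins)]


def quantile_cuts(values, nbins):
    """Return nbins-1 thresholds so each bin holds roughly the same number of values."""
    return quantiles(values, n=nbins, method="inclusive")


def make_conditions(feature, thresholds):
    """Turn thresholds into (feature, operator, threshold) tuples.

    The tuples are only a stand-in for propositions (assumption).
    """
    conditions = []
    for t in thresholds:
        conditions.append((feature, "<=", t))
        conditions.append((feature, ">", t))
    return conditions


def select_alphabet_sketch(feature, values, nbins, strategy=equal_width_cuts):
    """Bin `values` with the chosen strategy and return the resulting conditions."""
    return make_conditions(feature, strategy(values, nbins))
```

Passing the strategy as a function argument is one way to get the "custom strategy" behaviour: for instance, `select_alphabet_sketch("temp", readings, 4, strategy=quantile_cuts)` would switch to quantile-based bins without touching the rest of the code.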

The package now exports the mining policies defined in mining-policies.jl. They aim to reduce the number of Itemsets and ARules mined. To support them, the Miner constructor has been changed and two new getters, itemset_mining_policies and arule_mining_policies, have been added.

The policies related to Itemsets are applied before saving them in the Miner's freqitems structure.
The policies related to ARules are applied during the generaterules coroutine.
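
The two application points above can be sketched as follows. This is a rough Python illustration under stated assumptions, not the package's Julia implementation: a policy is assumed to be a predicate returning `True` when a candidate should be kept, and the concrete policies (`max_length_policy`, `min_confidence_policy`) as well as the `MinerSketch` class are invented for the example.

```python
def max_length_policy(max_len):
    """Hypothetical itemset policy: keep itemsets with at most max_len items."""
    def policy(itemset):
        return len(itemset) <= max_len
    return policy


def min_confidence_policy(threshold):
    """Hypothetical rule policy: keep rules whose confidence reaches threshold."""
    def policy(rule):
        return rule["confidence"] >= threshold
    return policy


def passes_all(candidate, policies):
    """A candidate is kept only if every policy accepts it."""
    return all(policy(candidate) for policy in policies)


class MinerSketch:
    """Stand-in for a miner holding two separate lists of policies (assumption)."""

    def __init__(self, itemset_policies=(), arule_policies=()):
        self.itemset_policies = list(itemset_policies)
        self.arule_policies = list(arule_policies)
        self.freqitems = []

    def save_itemset(self, itemset):
        # Itemset policies run before the itemset is stored.
        if passes_all(itemset, self.itemset_policies):
            self.freqitems.append(itemset)

    def generate_rules(self, candidate_rules):
        # Rule policies run lazily, while rules are being generated.
        for rule in candidate_rules:
            if passes_all(rule, self.arule_policies):
                yield rule
```

Filtering itemsets before they are stored keeps rejected candidates out of later stages, while filtering rules inside the generator means a rejected rule is never handed to the caller.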

mauro-milella and others added 30 commits October 25, 2024 15:22
`Bulldozer` supporting dataset slices instead of wrapping just a single instance
…-selection.jl`. `equicut`, `quantilecut`, `makeconditions` renamed.
@mauro-milella
Member Author

At this point, the code is more robust and the code in alphabet-selection is being integrated into SoleData. This can be closed.

@mauro-milella mauro-milella reopened this Dec 4, 2024
@mauro-milella mauro-milella merged commit ce4929c into main Dec 4, 2024
0 of 6 checks passed

1 participant