Skip to content

Conversation

@mauro-milella
Copy link
Member

Bulldozer now gets an entire dataset slice (a generic MineableData type) instead of just one instance.

Alphabet selection can now be performed using select_alphabet: this essentially performs binning by following a custom strategy. Since the returned objects are propositions (which can then be used to construct Items), this functionality could be moved to SoleData in the future.

The package now exports the mining policies defined in mining-policies.jl. They aim to reduce the amount of Itemsets and ARules mined. To support them, Miner constructor has been changed and two new getters itemset_mining_policies and arule_mining_policies.

The policies related to Itemsets are applied before saving them inside Miner freqitems structure.
The policies related to ARules are applied during generaterules coroutine.

mauro-milella and others added 30 commits October 25, 2024 15:22
`Bulldozer` supporting dataset slices instead of wrapping just a single instance
…-selection.jl`. `equicut`, `quantilecut`, `makeconditions` renamed.
@mauro-milella
Copy link
Member Author

A this point, code is more robust and the code in alphabet-selection is being integrated in SoleData. This can be closed.

@mauro-milella mauro-milella reopened this Dec 4, 2024
@mauro-milella mauro-milella merged commit ce4929c into main Dec 4, 2024
0 of 6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants