Could you explain more on Freq mode and non freq mode? #2

Open
xindi-dumbledore opened this issue Nov 6, 2018 · 1 comment

Comments

@xindi-dumbledore

Hi :) I'm wondering what exactly the freq mode and non-freq mode edge weights are. Also, non-freq mode seems much slower than freq mode to me (I always get stuck on "Building Cache"). Thanks!

@xyjprc
Owner

xyjprc commented Dec 2, 2018

Hi @xindi-dumbledore, Freq mode outputs the count of each subsequence, and non-freq mode outputs the normalized probability of each subsequence.

For example, given the observations
A -> B
A -> B
A -> C

Freq mode gives you
A -> B: 2
A -> C: 1

Non-freq mode gives you
A -> B: 0.666667
A -> C: 0.333333

Note that the choice of mode doesn't affect the higher-order rule extraction: under the hood, rule extraction always works on the normalized probabilities. Freq mode and non-freq mode are just conveniences for formatting the output.
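
As a rough illustration of the two output formats, here is a minimal Python sketch that reproduces the numbers in the example above. It is not the project's actual code: the function name `edge_weights` and the `freq` flag are hypothetical, and it only handles first-order transitions.

```python
from collections import Counter, defaultdict


def edge_weights(observations, freq=True):
    """Return edge weights for (source, target) pairs.

    Hypothetical helper for illustration only.
    freq=True  -> raw counts of each transition
    freq=False -> counts normalized by the total out of each source
    """
    counts = Counter(observations)
    if freq:
        return dict(counts)

    # Sum the counts of all transitions leaving each source node.
    totals = defaultdict(int)
    for (source, _target), count in counts.items():
        totals[source] += count

    # Divide each count by its source's total to get a probability.
    return {edge: count / totals[edge[0]] for edge, count in counts.items()}


observations = [("A", "B"), ("A", "B"), ("A", "C")]
freq_weights = edge_weights(observations, freq=True)
prob_weights = edge_weights(observations, freq=False)

for edge, weight in freq_weights.items():
    print(f"{edge[0]} -> {edge[1]}: {weight}")
for edge, weight in prob_weights.items():
    print(f"{edge[0]} -> {edge[1]}: {weight:.6f}")
```

In this sketch the probabilities for all edges leaving the same source sum to 1, while the counts are left untouched, so both outputs describe the same underlying data.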

It is odd that freq and non-freq mode would behave differently. Can you reproduce the behavior by closing Python and re-running both modes from scratch? Let me know if you still see the issue.
