About the input format #2

Open
huHHhhuhu opened this issue Mar 21, 2016 · 4 comments

@huHHhhuhu

Hi Flavio,

Sorry to bother you again with another question.

I'm wondering about the input format.
The README says the first column of “dt user from to” is the amount of time the user spent on “from” before going to “to”.
So is that amount of time measured in seconds?

For example, if I have the following .tsv file:
--------timestamp----------userID--itemID--
2009-04-08T01:53:56Z user01 item01
2009-04-08T01:57:47Z user01 item02

I should convert it into a line like this:
231 user01 item01 item02

However, if the time interval exceeds a day, the time value will become very large.
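The conversion described in this comment could be sketched in Python as follows. This is only an illustration under stated assumptions, not code from the repository: it assumes `dt` is measured in seconds (as the 231 in the example suggests), that the timestamps follow the `YYYY-MM-DDTHH:MM:SSZ` layout shown above, and that the output columns are tab-separated. The function name `to_transitions` is hypothetical.

```python
from datetime import datetime
from itertools import groupby

# Assumed timestamp layout, taken from the example rows above.
TIME_FORMAT = "%Y-%m-%dT%H:%M:%SZ"


def to_transitions(rows):
    """Convert (timestamp, user, item) rows into (dt, user, from, to) tuples.

    dt is the number of seconds between two consecutive events of the
    same user (an assumption; the unit is not stated in the README quote).
    """
    # Sort by user first, then by time, so each user's events are contiguous.
    parsed = sorted(
        (user, datetime.strptime(stamp, TIME_FORMAT), item)
        for stamp, user, item in rows
    )
    transitions = []
    for user, visits in groupby(parsed, key=lambda visit: visit[0]):
        visits = list(visits)
        # Pair every event with the next event of the same user.
        for (_, t_from, src), (_, t_to, dst) in zip(visits, visits[1:]):
            # total_seconds() also counts whole days, unlike timedelta.seconds.
            dt = int((t_to - t_from).total_seconds())
            transitions.append((dt, user, src, dst))
    return transitions


rows = [
    ("2009-04-08T01:53:56Z", "user01", "item01"),
    ("2009-04-08T01:57:47Z", "user01", "item02"),
]
for transition in to_transitions(rows):
    # Tab-separated output is an assumption about the expected file layout.
    print("\t".join(str(field) for field in transition))
```

For the two example rows this yields the single transition `231 user01 item01 item02`. Gaps longer than a day simply produce larger values, for example 172800 for a two-day gap.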

@flaviovdf
Owner

That's it. But this model does not consider time; the column is required as input simply because that's how I had my files set up for other models. You should look at:

https://github.com/flaviovdf/tribeflow
or
https://github.com/flaviovdf/prme

@huHHhhuhu
Author

Okay,
Thanks a lot !

@kjacks21

kjacks21 commented Nov 8, 2018

I think #4 tried to ask this, but since this issue is still open I'll ask it here. I also looked at the samples in https://github.com/flaviovdf/tribeflow and it still wasn't clear to me. Let's say I want to replicate the input data from Figure 1 of the fpmc paper (see below).

[Image: Figure 1 of the FPMC paper]

Would it be like the following?

0 user01 [a, b, c] [b, c]
0 user01 [b, c] [a,b]
0 user02 [a] [a, c]
0 user03 [d] [c, e]
0 user03 [c, e] [e]

or

0 \t user01 \t a b c \t b c
0 \t user01 \t b c \t a b
0 \t user02 \t a \t a c
0 \t user03 \t d \t c e
0 \t user03 \t c e \t e

In other words, it isn't clear how to represent the baskets. Thanks!

@flaviovdf
Owner

flaviovdf commented Nov 8, 2018 via email
