PA learning #1

pberko · 2021-07-23T10:28:34Z

Hello

Do you have a working example for pa_learning?

Thanks

vhavlena · 2021-07-23T13:02:20Z

Hello,
you can find an example of the input file at https://github.com/matousp/datasets/tree/master/scada-iec104/iec104-traffic

pberko · 2021-07-23T13:11:17Z

Thanks

The file looks different from what I had in mind.
I thought it should contain paths over a DFA alphabet.
How is this file extracted from a DFA?

vhavlena · 2021-07-23T13:23:34Z

It is quite tailored to our application. The input file is basically a long sequence of symbols. In the first step, based on expert knowledge, these symbols are grouped into strings. In the second step, the resulting multiset of strings is the input for DPA learning.
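
A minimal sketch of these two steps, for illustration only. The grouping rule here (fixed-size windows) and the names `split_into_strings` and `to_multiset` are hypothetical; the real grouping depends on application-specific expert knowledge and is not shown in this thread.

```python
from collections import Counter


def split_into_strings(symbols, window=3):
    """Step 1 (hypothetical rule): cut the symbol sequence into fixed-size windows.

    The real grouping is application-specific; this only stands in for it.
    """
    return [tuple(symbols[i:i + window]) for i in range(0, len(symbols), window)]


def to_multiset(strings):
    """Step 2: count how often each string occurs (a multiset of strings)."""
    return Counter(strings)


# A long sequence of symbols, as it might appear in the input file.
symbols = ["a", "b", "a", "a", "b", "a", "b", "b"]

strings = split_into_strings(symbols, window=3)
multiset = to_multiset(strings)

for string, count in multiset.items():
    print(string, count)
```

The multiset keeps the frequency of every string, which is the information a frequency-based learner needs.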

pberko · 2021-07-23T13:37:26Z

So in line 66 I get a list of strings, which are actually automaton paths?

pberko · 2021-07-23T13:43:38Z

@vhavlena
Another question: does the function

    """ Add string to frequency prefix tree """
    def add_string(self, string, label=0):
        act = self._root
        self._ini[act] = self._ini[act] + 1
        for i in range(len(string)):
            try:
                self.flanguages[act][tuple(string[i:])] += 1
            ....

accept only tuples as input (as in your example), or can it also take a simple string, e.g. "a a a"?

vhavlena · 2021-07-24T07:25:24Z

Yes. The list of strings is the input for learning. In your output, each line (list) is a string, and each pair of strings within it is a single symbol.
The variable string should be a list of arbitrary symbols; you can pass an arbitrary string there (in your case "aaa").
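
A small sketch of why a plain string also works, based only on the `tuple(string[i:])` expression visible in the excerpt above. The helper `suffix_keys` is hypothetical and is not part of the library. Note that a Python `str` is split into single characters, so spaces in "a a a" would be treated as symbols too.

```python
def suffix_keys(string):
    """Mimic the tuple(string[i:]) keys built in the add_string excerpt."""
    return [tuple(string[i:]) for i in range(len(string))]


from_str = suffix_keys("aaa")               # each character is a symbol
from_list = suffix_keys(["a", "a", "a"])    # each list item is a symbol
spaced = suffix_keys("a a a")               # spaces become symbols as well
split = suffix_keys("a a a".split())        # split first to drop the spaces

# Symbols may be any hashable value, for example pairs of strings.
pairs = suffix_keys([("x", "y"), ("z", "w")])

print(from_str == from_list)
print(spaced[0])
print(pairs[0])
```

If the symbols are separated by spaces, calling `.split()` before passing the string avoids learning the space as a symbol.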

pberko · 2021-07-25T16:59:09Z

Hello @vhavlena

I tried the algorithm with other samples, but I'm afraid I made a mistake, since the results differ from what I expected:

The alphabet is "a, b".
The input file contains paths generated from a "blackbox automaton".
Input file: https://github.com/pberko/detano/blob/master/train1clean.csv
https://github.com/pberko/detano/blob/master/blackbox111.pdf

I used the file as the input to pa_learning, but I got this output:
https://github.com/pberko/detano/blob/master/graphviz%20(2).pdf
which looks different.

vhavlena · 2021-07-26T08:52:40Z

Hello @pberko,

I am not sure you are doing anything wrong. I think there are several issues to consider. Your black-box automaton is a Markov chain, right? In that case you will likely get something different, because the output DPA has accepting probabilities, which may introduce some bias in your setting (maybe you can try specialised algorithms for learning MCs). The second issue is the learning parameters: different parameters may lead to different automata.
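
One way to read the first point, as a rough sketch: a DPA reserves part of the probability mass in each state for accepting (stopping), while a Markov chain spends all of it on transitions. The one-state automaton and all numbers below are made up for illustration and are not taken from the linked files.

```python
from fractions import Fraction

# Hypothetical one-state models; every number here is illustrative only.
mc_next = {"a": Fraction(1, 2), "b": Fraction(1, 2)}      # Markov chain: never stops
dpa_next = {"a": Fraction(9, 20), "b": Fraction(9, 20)}   # DPA: symbol probabilities
dpa_stop = Fraction(1, 10)                                # DPA: accepting probability


def path_probability(word, next_prob):
    """Probability of following the symbols of `word` from the single state."""
    p = Fraction(1)
    for symbol in word:
        p *= next_prob[symbol]
    return p


def string_probability(word, next_prob, stop_prob):
    """Probability of generating exactly `word` and then accepting."""
    return path_probability(word, next_prob) * stop_prob


mc_p = path_probability("ab", mc_next)
dpa_p = string_probability("ab", dpa_next, dpa_stop)

print(mc_p, dpa_p)
```

Because the two models distribute probability differently, a learned DPA need not have the same shape or weights as the Markov chain that produced the samples.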

vhavlena added a commit that referenced this issue Jan 4, 2022

typing #1

ed063e3
