turn / token plotting #38

Open
mdingemanse opened this issue Dec 15, 2023 · 3 comments

@mdingemanse

mdingemanse commented Dec 15, 2023

image

Code for these plots is in elpaco-annrev.R at
https://osf.io/qj2fd

@mdingemanse

another idea:

  • find top turns (e.g. mhmm, oh, yes)
  • pick a turn format of interest (e.g. oh)
  • find all standalone turns of this format (oh)
  • find this format as a token within turns (e.g., oh I see)
  • plot the standalone turns as one layer
  • plot the within-turn tokens of oh as another layer
  • plot as text
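
A minimal sketch of the selection steps above, written in Python purely for illustration (the scripts linked in this thread are in R). The function names `normalise`, `top_turns` and `split_layers`, the punctuation stripping, and the toy turns are all assumptions and are not taken from the playground code. The two index lists it returns would feed the two plot layers.

```python
from collections import Counter


def normalise(turn):
    """Lowercase a turn and split it into tokens, stripping basic punctuation."""
    tokens = [tok.strip(".,?!") for tok in turn.lower().split()]
    return [tok for tok in tokens if tok]


def top_turns(turns, k=3):
    """Return the k most frequent whole-turn formats with their counts."""
    counts = Counter(" ".join(normalise(t)) for t in turns)
    return counts.most_common(k)


def split_layers(turns, fmt):
    """Split turn indices into standalone uses of fmt and within-turn uses."""
    standalone, within = [], []
    for i, turn in enumerate(turns):
        tokens = normalise(turn)
        if tokens == [fmt]:
            standalone.append(i)   # the whole turn is just this format
        elif fmt in tokens:
            within.append(i)       # the format occurs as a token in a longer turn
    return standalone, within


# Toy data (hypothetical), in order of occurrence
turns = ["oh", "oh I see", "yes", "Oh!", "mhmm", "yes", "well oh okay", "oh"]

top = top_turns(turns, k=2)
standalone, within = split_layers(turns, "oh")
print(top)         # [('oh', 3), ('yes', 2)]
print(standalone)  # [0, 3, 7]
print(within)      # [1, 6]
```

Matching on whole tokens rather than substrings keeps a format like oh from also matching inside longer words; whether that is the right choice for a given corpus depends on its transcription conventions.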

@mdingemanse

mdingemanse commented Dec 15, 2023

@bvreede, I just pushed a commit to playground with code replicating the Annual Review plots for six randomly sampled IFADV sources.

Code here: https://github.com/elpaco-escience/playground/blob/main/plotting_topturns_and_tokens.R

One of six plots:

image

The top turns in interjections show that the purple circles here are very likely ja :D

> interjections
# A tibble: 10 × 5
   language utterance_stripped     n  rank total_turns
   <chr>    <chr>              <dbl> <dbl>       <int>
 1 dutch    ja                 77862     1       13364
 2 dutch    nee                 9041     2       13364
 3 dutch    mmm                 7341     3       13364
 4 dutch    oh                  6948     5       13364
 5 dutch    mm-hu               6646     6       13364
 6 dutch    uhu                 3086     8       13364
 7 dutch    uhm                 2937     9       13364
 8 dutch    uh                  2695    10       13364
 9 dutch    nou                 2038    14       13364
10 dutch    ah                  2018    15       13364

@mdingemanse

And just because we can, here is also a version with the interjections plotted (same repo, this commit).

image
