Quper

Folder Structure

├── README.md
├── LICENSE.txt
├── INSTALL.md
├── requirements.txt
├── src
│   ├── Quper.sh
│   ├── compliance of disclosure
│   │   ├── compliance_of_disclosure.py
│   │   ├── predict_content.py
│   │   ├── find_subtitle.py
│   │   ├── bys_classifier.pkl
│   │   ├── bys_tf.pkl
│   │   └── pp_example
│   ├── timeliness
│   │   └── timeline.py
│   ├── availability
│   │   ├── get_external_link.py
│   │   └── get_language_type.py
│   └── readability
│       ├── readability.py
│       ├── doubleNeg_obscure_qualifiers.py
│       ├── main_idea_location.py
│       └── pp_example
└── dataset
    ├── title.csv
    └── language_data_set.xlsx

Note: This tree includes only the main files.

Configuration environment

conda create -n Quper python

conda activate Quper

pip install -r requirements.txt

How to use Quper

cd src/

bash Quper.sh

Run this file to obtain the full output of Quper.

src

Compliance of disclosure

compliance_of_disclosure.py: Run this file to print the full results to the console. The output is a list of 0s and 1s, where 1 means the corresponding component is present in the privacy policy and 0 means it is absent. Privacy policies in unsupported formats or without subheadings are filtered out. By default, results are printed for the privacy policies in the pp_example folder. The component order is: COLLECT, COOKIE, SHARE, SECURITY, RIGHT, CHILDREN, REGION, UPDATE, HOW_USE, PROVIDER, RETENTION, DATA_USE.
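The 0/1 list is easier to read once it is mapped back to component names. The following is a minimal sketch and not part of Quper: the helper name decode_components and the sample list are hypothetical, and only the component order is taken from the description above.

```python
# Component order as documented for compliance_of_disclosure.py.
COMPONENTS = [
    "COLLECT", "COOKIE", "SHARE", "SECURITY", "RIGHT", "CHILDREN",
    "REGION", "UPDATE", "HOW_USE", "PROVIDER", "RETENTION", "DATA_USE",
]


def decode_components(flags):
    """Split a 0/1 list into (present, missing) component names."""
    if len(flags) != len(COMPONENTS):
        raise ValueError(f"expected {len(COMPONENTS)} flags, got {len(flags)}")
    present = [name for name, flag in zip(COMPONENTS, flags) if flag == 1]
    missing = [name for name, flag in zip(COMPONENTS, flags) if flag == 0]
    return present, missing


if __name__ == "__main__":
    # Hypothetical result for a single privacy policy.
    sample = [1, 1, 0, 1, 0, 0, 0, 1, 1, 0, 0, 1]
    present, missing = decode_components(sample)
    print("Present:", ", ".join(present))
    print("Missing:", ", ".join(missing))
```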

predict_content.py: Use the trained Bayesian classifier (bys_classifier.pkl) together with the feature vector transformer (bys_tf.pkl) to predict the presence of subtitles in the privacy policy. Please refer to predict_content.md for details on how to use it.
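For orientation, loading the two pickles and applying them could look roughly like the sketch below. It assumes that both files hold scikit-learn-style objects exposing transform and predict, which this README does not confirm; the helper names and the sample subtitles are hypothetical. predict_content.md remains the authoritative usage guide.

```python
import pickle


def load_pickle(path):
    """Load a pickled object. Only unpickle files from a trusted source."""
    with open(path, "rb") as fh:
        return pickle.load(fh)


def predict_titles(transformer, classifier, titles):
    """Vectorize subtitle strings and return one predicted label per title.

    Assumption: transformer.transform(list_of_str) and
    classifier.predict(features) follow the scikit-learn convention.
    """
    features = transformer.transform(titles)
    return list(classifier.predict(features))


if __name__ == "__main__":
    # Paths are relative to the "compliance of disclosure" folder.
    transformer = load_pickle("bys_tf.pkl")
    classifier = load_pickle("bys_classifier.pkl")
    # Hypothetical subtitles, for illustration only.
    print(predict_titles(transformer, classifier,
                         ["Information We Collect", "Children's Privacy"]))
```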

find_subtitle.py: Detect subtitle tags in the privacy policy document. Please refer to find_subtitle.md for details on how to use it.
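As an illustration of subtitle detection, the sketch below collects the text of HTML heading elements using only the Python standard library. Treating h1 to h6 as the subtitle tags is an assumption made for this example; find_subtitle.py may use different tags or heuristics, and the names HeadingCollector and find_headings are hypothetical.

```python
from html.parser import HTMLParser

# Assumption: subtitles are marked up as HTML heading elements.
HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingCollector(HTMLParser):
    """Collect (tag, text) pairs for every heading element in a page."""

    def __init__(self):
        super().__init__()
        self.headings = []
        self._current = None  # heading tag currently open, if any
        self._chunks = []     # text fragments seen inside that heading

    def handle_starttag(self, tag, attrs):
        if tag in HEADING_TAGS and self._current is None:
            self._current = tag
            self._chunks = []

    def handle_data(self, data):
        if self._current is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == self._current:
            # Join fragments and normalize whitespace.
            text = " ".join("".join(self._chunks).split())
            if text:
                self.headings.append((tag, text))
            self._current = None


def find_headings(html):
    parser = HeadingCollector()
    parser.feed(html)
    parser.close()
    return parser.headings


if __name__ == "__main__":
    page = "<h1>Privacy Policy</h1><p>Intro</p><h2>Data <b>We</b> Collect</h2>"
    for tag, text in find_headings(page):
        print(tag, text)
```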

bys_classifier.pkl: Title Bayesian classifier trained on dataset/title.csv.

bys_tf.pkl: Title Bayesian feature vector transformer trained on dataset/title.csv.

pp_example: Some examples of privacy policy documents.

Timeliness

timeline.py: Automated test script that retrieves the update history of a privacy policy web page by visiting the Wayback Machine website. The results include the initial release date, the last update date, and the number of updates of the privacy policy. The detailed results are printed to the console.
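One way to approximate these three values is to query the Wayback Machine CDX API and compare the content digests of successive captures. This is a sketch of that idea only, not the method used by timeline.py (which visits the website). The function names are hypothetical. The first capture date is only a proxy for the initial release date, and a digest change can also come from dynamic page content rather than a real policy update.

```python
import json
import urllib.parse
import urllib.request
from datetime import datetime

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def fetch_snapshots(url):
    """Return (timestamp, digest) pairs for successful captures of url."""
    query = urllib.parse.urlencode({
        "url": url,
        "output": "json",
        "fl": "timestamp,digest",
        "filter": "statuscode:200",
    })
    with urllib.request.urlopen(f"{CDX_ENDPOINT}?{query}", timeout=30) as resp:
        body = resp.read().decode("utf-8").strip()
    rows = json.loads(body) if body else []
    return [tuple(row) for row in rows[1:]]  # rows[0] is the header row


def summarize(snapshots):
    """Return first capture date, last change date, and number of changes."""
    if not snapshots:
        return None
    ordered = sorted(snapshots)  # 14-digit timestamps sort chronologically
    changes = [ordered[0]]
    for timestamp, digest in ordered[1:]:
        if digest != changes[-1][1]:  # content differs from previous version
            changes.append((timestamp, digest))

    def to_date(timestamp):
        return datetime.strptime(timestamp, "%Y%m%d%H%M%S").date()

    return {
        "first_seen": to_date(changes[0][0]),
        "last_update": to_date(changes[-1][0]),
        "updates": len(changes) - 1,
    }


if __name__ == "__main__":
    # Hypothetical URL; replace with a real privacy policy page.
    print(summarize(fetch_snapshots("example.com/privacy")))
```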

Availability

get_external_link.py: Find all the external links in the html page and check their status codes. Please refer to get_external_link.md for details on how to use it.
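The idea can be sketched with the standard library as follows. This is an illustration under assumptions, not the code in get_external_link.py: a link is treated as external when its host differs from the host of the page, and the names LinkCollector, external_links, and status_code are hypothetical. Some servers reject HEAD requests, so a real checker may need to fall back to GET.

```python
import urllib.error
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> element."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def external_links(html, page_url):
    """Return unique absolute http(s) links whose host differs from page_url's."""
    parser = LinkCollector()
    parser.feed(html)
    page_host = urlparse(page_url).netloc.lower()
    found = []
    for href in parser.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parts = urlparse(absolute)
        is_web = parts.scheme in ("http", "https")
        if is_web and parts.netloc.lower() != page_host and absolute not in found:
            found.append(absolute)
    return found


def status_code(url, timeout=10):
    """Return the HTTP status code for url, or None if the request fails."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx and 5xx responses still carry a status code
    except (urllib.error.URLError, OSError):
        return None  # DNS failure, timeout, refused connection, ...


if __name__ == "__main__":
    page = '<a href="/about">About</a> <a href="https://example.org/terms">Terms</a>'
    for link in external_links(page, "https://example.com/privacy"):
        print(link, status_code(link))
```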

get_language_type.py: Automated test script to get all supported languages in a privacy policy web page. Please refer to get_language_type.md for details on how to use it.

pp_example: Some examples of privacy policy documents.

Readability

readability.py: Calculate the ARI, FRES, LIX, Average Syllables per Word, Average Words per Sentence, Average Letters per Word, Sentence Count, Word Count, Reading Time (minutes), and Speaking Time (minutes) of a text.
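For reference, the three indices follow standard published formulas, sketched below. The tokenization and the syllable counter are simplifying assumptions for this example (letters only, vowel groups as syllables) and will not match the values from readability.py exactly; the function names are hypothetical.

```python
import re


def count_syllables(word):
    """Rough syllable estimate: number of vowel groups, at least one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def readability(text):
    """Compute ARI, FRES, and LIX with a simple regex tokenizer."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    letters = sum(len(w) for w in words)
    syllables = sum(count_syllables(w) for w in words)
    long_words = sum(1 for w in words if len(w) > 6)  # LIX: more than 6 letters

    words_per_sentence = len(words) / len(sentences)
    letters_per_word = letters / len(words)
    syllables_per_word = syllables / len(words)

    return {
        "sentence_count": len(sentences),
        "word_count": len(words),
        "avg_words_per_sentence": words_per_sentence,
        "avg_letters_per_word": letters_per_word,
        "avg_syllables_per_word": syllables_per_word,
        # Automated Readability Index
        "ARI": 4.71 * letters_per_word + 0.5 * words_per_sentence - 21.43,
        # Flesch Reading Ease Score
        "FRES": 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word,
        # LIX: sentence length plus percentage of long words
        "LIX": words_per_sentence + 100 * long_words / len(words),
    }


if __name__ == "__main__":
    for name, value in readability("The cat sat. The dog ran.").items():
        print(f"{name}: {value:.2f}")
```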

doubleNeg_obscure_qualifiers.py: Find all sentences in the text that contain double negatives or obscure qualifiers.
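A keyword-based version of this check could look like the sketch below. The word lists are illustrative assumptions, not the lists used by doubleNeg_obscure_qualifiers.py, and counting two negation words in one sentence is only a crude proxy for a double negative. All names are hypothetical.

```python
import re

# Illustrative word lists only (assumptions for this example).
NEGATIONS = {"not", "no", "never", "none", "nothing", "neither", "nor",
             "cannot", "without"}
OBSCURE_QUALIFIERS = {"may", "might", "some", "generally", "typically",
                      "sometimes", "certain"}


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokens(sentence):
    """Lowercase word tokens; contractions like "don't" yield a "not" token."""
    expanded = re.sub(r"n't\b", " not", sentence.lower())
    return re.findall(r"[a-z]+", expanded)


def flag_sentences(text):
    """Return (double_negative_sentences, qualifier_sentences)."""
    double_negatives, qualified = [], []
    for sentence in split_sentences(text):
        words = tokens(sentence)
        if sum(word in NEGATIONS for word in words) >= 2:
            double_negatives.append(sentence)
        if any(word in OBSCURE_QUALIFIERS for word in words):
            qualified.append(sentence)
    return double_negatives, qualified


if __name__ == "__main__":
    sample = ("We do not share data without consent. "
              "We may share some data. We collect your email.")
    negatives, qualifiers = flag_sentences(sample)
    print("Double negatives:", negatives)
    print("Obscure qualifiers:", qualifiers)
```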

main_idea_location.py: Determine the location of the central idea of the sentence. Please refer to main_idea_location.md for details on how to use it.

pp_example: Some examples of privacy policy documents.

dataset

title.csv: Used to train the title Bayesian classifier (bys_classifier.pkl) and the title Bayesian feature vector transformer (bys_tf.pkl).

language_data_set.xlsx: Used by get_language_type.py for comparison with the supported language versions of the acquired skill privacy policy.

UQ-Trust-Lab/Quper
