xikipedia

Wikipedia as a social media feed

Try it: xikipedia.org

About

Xikipedia is a pseudo social media feed that algorithmically shows you content from Simple Wikipedia. It is made as a demonstration of how even a basic non-ML algorithm with no data from other users can quickly learn what you engage with to suggest you more similar content. No data is collected or shared here, the algorithm runs locally and the data disappears once you refresh or close the tab.

Generating data

To run Xikipedia, you need the .json file that contains the data required. This repo already has a file for the Simple Wikipedia included, but you can also make your own by replacing the files in the process_data.py file with your own WikiMedia data dumps.

Algorithm

The algorithm used for Xikipedia is pretty simple. Each post has a set of categories, which consists of the post's Wikipedia category tree, and the pagelinks in the post. These categories have point scores assigned to them.

Here are the actions and their respective scores:

Scrolling past a post: -5
Liking a post: 50 + 4*posts_since_last_like
Clicking on an article: 75
Clicking on an image: 100

These scores are applied through the engagePost function in the code.

Each post has a base score, which is 0 by default. If a post has an image, it gets +5 on its base score. If you've already seen a post, its base score will be (3**(post_seen_times)-1) * -5000.

To get the next post in the feed, 10000 random posts are picked out from the data set. Then, one of three things will randomly happen:

(40% chance) The scores of all posts are summed together, and a random value is picked. It's kind of like picking a random value, except posts with higher scores have a higher likelyhood of getting picked.
(42% chance) The post with the highest score is shown.
(18% chance) A completely random post is shown.

The categories given names and surnames start off with a base score of -1000 due to how prevelant they would be otherwise.

Licensing

This project is licensed under AGPLv3. This license applies to the project itself, but not the included json file that contains data from Wikipedia. If you'd like to use this project, but can't use it due to its current license, let me know and I might relicense it.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
LICENSE		LICENSE
README.md		README.md
favicon.ico		favicon.ico
index.html		index.html
process_data.py		process_data.py
smoldata.json.br		smoldata.json.br

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

xikipedia

Try it: xikipedia.org

About

Generating data

Algorithm

Licensing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

xikipedia

Try it: xikipedia.org

About

Generating data

Algorithm

Licensing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages