CPR Search Tool - FastAPI

Summary

I thought this would be a simpler implementation than the Django one, so I took the opportunity to use FastAPI for a slightly larger project.

The data is stored in memory, so this could be the basis of a sharded application, though it may be worth dropping to a lower-level language for that. Otherwise, look into scalable databases, e.g. Redis, Elasticsearch (ES) or Aurora.

For a fully structured source package layout etc., see the Django implementation.

Installation

Local install

Make virtual environment e.g.

pip install virtualenvwrapper
mkvirtualenv cpr

Install python requirements

pip install --upgrade pip
pip install -r requirements.txt

Optional: download the spaCy model data

python -m spacy download en_core_web_md

Launch the server!

uvicorn main:app --reload

OR

Docker Install and run

Build and launch Docker container

docker build -t cpr2 . && docker run -p 8000:8000 -v %cd%:/usr/src/app cpr2

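That's it! Note that %cd% is Windows cmd syntax for the current directory; on Linux/macOS use "$(pwd)" instead.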

To stop the docker container

docker ps
docker stop <container id>

The first few characters of the container ID shown by docker ps are enough.

Endpoints

Hello World

Obligatory! http://127.0.0.1:8000/

Policies

List Policies

http://127.0.0.1:8000/policy

Get Particular Policy

http://127.0.0.1:8000/policy/10088

Search Policies with space delimited query

http://localhost:8000/policy/search/environment%20policy

Returns the documents with a jaccard_similarity value, ordered by descending similarity.
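As a rough illustration of the ranking described above, here is a minimal sketch of Jaccard similarity over space-delimited tokens. It is not taken from this repository's code: the function names and the policy_id/description field names are assumptions made for the example.

```python
def jaccard_similarity(query: str, text: str) -> float:
    """Size of the token intersection divided by the size of the token union."""
    query_tokens = set(query.lower().split())
    text_tokens = set(text.lower().split())
    union = query_tokens | text_tokens
    if not union:
        return 0.0  # both inputs empty: define the similarity as 0
    return len(query_tokens & text_tokens) / len(union)


def search(query: str, documents: list) -> list:
    """Attach a jaccard_similarity score to each document and sort descending."""
    scored = [
        # "description" is a hypothetical field holding the searchable text
        {**doc, "jaccard_similarity": jaccard_similarity(query, doc["description"])}
        for doc in documents
    ]
    return sorted(scored, key=lambda d: d["jaccard_similarity"], reverse=True)


# Hypothetical sample data, not from the real CSV
policies = [
    {"policy_id": 1, "description": "transport levy"},
    {"policy_id": 2, "description": "national environment policy"},
]
results = search("environment policy", policies)
```

Here the second policy shares two of three distinct tokens with the query, so it is ranked first.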

Search Policies with space delimited query using Spacy NLP

If you're not on Windows!

http://localhost:8000/policy/spacy_search/environment%20policy

The NLP processing is not preloaded at the moment, but its results are cached, so searches get faster with use.
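The caching behaviour mentioned above can be sketched with the standard library's functools.lru_cache. This is only an assumption about how such a cache might look, not the repository's actual code; process_text is a hypothetical stand-in for the expensive spaCy call.

```python
from functools import lru_cache

stats = {"calls": 0}  # counts how often the expensive step really runs


@lru_cache(maxsize=None)
def process_text(text: str) -> frozenset:
    """Hypothetical stand-in for an expensive NLP call on one document."""
    stats["calls"] += 1
    return frozenset(text.lower().split())


# The first call does the work; the repeat is served from the cache.
first = process_text("environment policy")
second = process_text("environment policy")
```

The first request for a given text pays the processing cost, and later requests for the same text reuse the stored result, which is why searches speed up with use.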

Docs

Autogenerated with FastAPI:

http://127.0.0.1:8000/docs (Swagger UI)
http://127.0.0.1:8000/redoc (ReDoc)

Tests

Install pytest

pip install pytest

Then run tests

pytest tests/tests.py

Things to do with more time.

Complete tests
Put/Patch/Delete
Pydantic/ORM etc
TF-IDF maybe?
Store vectors
NLP models, BERT/Google Universal Sentence Encoder?
Preload NLP for documents
Use a scalable DB e.g. ES, Redis, a serverless DB etc.
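
For the TF-IDF idea in the list above, one possible direction is sketched below in plain Python. It is an illustration only, assuming whitespace tokenisation and the basic tf * log(N / df) weighting; the function name is hypothetical and nothing here exists in the repository yet.

```python
import math
from collections import Counter


def tfidf_vectors(documents: list) -> list:
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    tokenised = [doc.lower().split() for doc in documents]
    n_docs = len(tokenised)
    # Document frequency: number of documents that contain each term
    doc_freq = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


# Hypothetical sample texts
vectors = tfidf_vectors(["environment policy", "transport policy", "environment law"])
```

Terms that appear in fewer documents get a higher weight, so in the second text "transport" outweighs the more common "policy". These vectors could then be stored rather than recomputed per request.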
