Add Flask Web App interface #29

fhallee · 2025-01-27T17:55:32Z

This PR introduces a web app to host the CityLex database, allowing users to access the data through a user-friendly interface. The app is functional and allows users to query and download data in long TSV format.

Next steps:

CELEX Integration: CELEX data has not yet been incorporated. We discussed adding password protection for CELEX data, but this functionality is not yet implemented.
Additional output formats: We could implement short format (join on word) TSV and SQLite dump options.
Feature script: features.py doesn’t convert the format for all tags and needs to be updated.

…lask_app

Merge branch 'master' into flask_app Needed to merge the latest changes from the master branch to integrate the "features.py" script functionality into app.py# the commit.

…e format conversion in Flask app

kylebgorman

Hi @fhallee, since you'll be back working on this before terribly long, I just gave you comments here, since I probably won't be dedicating much time to this in the next few weeks.

kylebgorman · 2025-02-11T22:03:19Z

citylex/populate.py

@@ -14,6 +14,8 @@
 import pandas  # type: ignore
 import requests

+from citylex.features import tag_to_tag


Could you use module-level imports, thus from citylex import features?

kylebgorman · 2025-02-11T22:05:50Z

citylex/populate.py

            cursor.execute(
                """
-                INSERT INTO morphology (
+                INSERT INTO features (


I just want to make sure I understand what you did here. It seems like you're still preserving the idea of features as a table with multiple sources, but when the source is CELEX you also convert that to UD and UM tags, right? And so on for other sources.

If that's right, is there any argument for doing that at DB creation time vs. on the fly as needed? I guess I could see either option as viable, but I probably would have done it on the fly.

If for sake of review you want to suppress this new feature, I'd be fine with that. We can come back to it later.

kylebgorman · 2025-02-11T22:08:20Z

citylex/populate.py

@@ -490,7 +495,7 @@ def main():
    parser = argparse.ArgumentParser(description="Creates a CityLex lexicon")
    parser.add_argument(
        "--db_path",
-        default="citylex.db",
+        default="data/citylex.db",


I'm going to suggest at this point that we just hardcode this; put DB_PATH = "data/citylex.db" at the top of the program (but after the imports). Do we ever want more than one copy?

kylebgorman · 2025-02-11T22:08:33Z

flask_app/app.py

+
+from flask import Flask, render_template, request, send_file
+
+def get_args():


I would also suggest we hardcode the path here.

kylebgorman · 2025-02-11T22:11:44Z

flask_app/app.py

+    cursor = conn.cursor()
+
+    if request.method == "GET":
+        return render_template('index.html')


"index.html"; don't mix ' and " without good reason

kylebgorman · 2025-02-11T22:21:36Z

flask_app/app.py

+                us_columns.append("raw_frequency")
+            if "subtlexus_freq_per_million" in selected_fields:
+                us_columns.append("freq_per_million")
+            us_query = f"SELECT {', '.join(us_columns)} FROM frequency WHERE source = 'SUBTLEX-US'"


What if you did this once (and not again on lines 85) by getting rid of the WHERE clause, then you sort on the basis of the result iterator, determining where it goes based on the source column?

kylebgorman · 2025-02-11T22:22:20Z

flask_app/app.py

+
+        writer.writerow(columns)  # Writes header
+
+        # Fetches and writes SUBTLEX-US data


I think from here on this is a bit over-commented, with most of them noting obvious details.

kylebgorman · 2025-02-11T22:23:26Z

flask_app/app.py

+                us_columns.append("freq_per_million")
+            us_query = f"SELECT {', '.join(us_columns)} FROM frequency WHERE source = 'SUBTLEX-US'"
+            cursor.execute(us_query)
+            us_results = cursor.fetchall()


You don't need to fetchall, that's going to pull in megabytes of data all at once when you only need the data incrementally. You can just say for row in cursor: I think...

kylebgorman · 2025-02-11T22:24:00Z

flask_app/app.py

+        output.seek(0)  # Moves the cursor to the beginning of the file
+
+        # Sends the file as a response
+        return send_file(


Nice; didn't know you could do this quite like that.

kylebgorman · 2025-02-11T22:24:17Z

flask_app/app.py

+                row_dict = dict(zip(elp_columns, row))
+                writer.writerow([row_dict.get(col, '') for col in columns])  # Write ELP data
+
+        output.seek(0)  # Moves the cursor to the beginning of the file


Understood re: comment, but why is this necessary?

kylebgorman

Just testing locally now. A few things:

Flask and dependencies need to be added to, minimally, requirements.txt. I'd probably pin it to use the current version of Flask (whatever you're testing on) and higher, since I bet it'll be forward-compatible for a while at least.
Since you moved the DB to the data subdirectory, a user who just calls, say, citylex --all-free will get an inscrutable error. But it's basically saying you don't have a subdirectory called data yet. I am wondering about whether we shouldn't move it after all...better to just create it in the user's working directory by default. Or you could make it mandatory to specify its location, if you'd rather take this the other way.
The website looks great (in minimalist terms) actually once I got it running. I don't think a ton of design work is necessary there.
There is a potential issue with the license buttons. If, e.g., "GNU GPL v3" is selected and I click on it, deselecting it, it will deselect all the GPL sources. That's WAI. But I sort of expected that if the reselected that button, it would select all the GPL sources, but it does nothing. Is that WAI or a bug?
The SQLite dump button doesn't seem to work but maybe you already knew that.

fhallee added 12 commits October 19, 2024 19:25

Added functionality to access SUBTLEX US data through a web app

af27c3b

Remove citylex.py (command line script)

09b1164

Added functionality for WikiPron to Flask app

329d1f1

Merge branch 'flask_app' of https://github.com/fhallee/citylex into f…

6b4b78a

…lask_app

Merge remote-tracking branch 'origin/master' into flask_app

6d64a3f

Merge branch 'master' into flask_app Needed to merge the latest changes from the master branch to integrate the "features.py" script functionality into app.py# the commit.

Organized files into flask_app folder

7eb0028

Added functionality for UDLexicons and UniMorph and integrated featur…

7f00c6a

…e format conversion in Flask app

Add ELP source and move tag_to_tag conversion to populate.py from app.py

e6f3a6f

Add .DS_Store to .gitignore and remove existing .DS_Store file

9668bef

Add script.js to handle checkbox interactions and dependencies

755c31d

Add comments to script.js

901fccb

Revert features.py to commit 09539b8

e6f714c

kylebgorman reviewed Feb 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Flask Web App interface #29

Add Flask Web App interface #29

fhallee commented Jan 27, 2025

kylebgorman left a comment

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman Feb 11, 2025

kylebgorman left a comment •

edited

Loading


		from flask import Flask, render_template, request, send_file

		def get_args():


		writer.writerow(columns) # Writes header

		# Fetches and writes SUBTLEX-US data

Add Flask Web App interface #29

Are you sure you want to change the base?

Add Flask Web App interface #29

Conversation

fhallee commented Jan 27, 2025

kylebgorman left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kylebgorman left a comment • edited Loading

Choose a reason for hiding this comment

kylebgorman left a comment •

edited

Loading