Jon colorado data (boulder) #2

jonbig · 2023-11-01T00:24:19Z

I first created an intermediate table where I did most of the transformations, then created tables for boulder races, offices, politicians, and race candidates. I used the existing tables as a guide and they appear to almost match. One area where I see differences is when it comes to the ID fields. I left those fields in models, but commented out.
I was under the impression those uuid fields are generated when we insert the rows into the tables, so maybe they just need to be inserted in a specific order so that the first uuid field is generated, and the rest of the uuid fields are based on that?
I commented out the insert statements.
I used sql fluff to lint the models.

wileymc · 2023-11-02T22:42:31Z

Some issues neeed to be addressed before merging this in:

lets remove the source .csv files from source control
needs to be linted (the sql fluff CI job is broken so it will pass regardless)
dbt run doesn't currently work for all models

wileymc · 2023-11-02T22:35:59Z

models/intermediate/co_boulder_city_filings.sql

+
+
+--fields for politician table
+----Where is the id field coming from?


Id's are generated on INSERT for pretty much every table (they are UUIDs which aren't sequential or dependent on one another)

Got it, that has been confusing me (and is why I had a few fields. commented out) because if the IDs are generated on insert they don't exist in my table yet, I won't be able to add them to the select, right?

Correct, they aren't needed for these staging SELECTs

wileymc · 2023-11-02T22:37:10Z

populist.code-workspace

add this file to .gitignore

models/staging/stg_co_boulder_city_offices.sql

models/sources/co_boulder_city/src_co_boulder_city_filings.sql

jonbig · 2023-11-02T23:17:33Z

I created the boulder_updated_filings table from the boulder_updated_filings.csv, that should be the only file needed to run the models.

wileymc · 2023-11-02T23:26:25Z

models/intermediate/co_boulder_city_filings.sql

+            ELSE 'district'
+        END AS election_scope,
+        CASE
+            WHEN office ILIKE '%Mayor%' THEN 'Mayor'


I'm not sure we want to set seat as "Mayor" in this case

You'll also need to join this model to our existing public.politician table so that we can deduplicate politicians and insert the race_candidate records properly. Look at what i did in the mn intermediate model

wileymc · 2023-11-02T23:54:38Z

models/intermediate/co_boulder_city_filings.sql

+LEFT JOIN transformed_filings AS tf ON f.email = tf.email
+LEFT JOIN transformed_filings_1 AS tf1 ON tf.email = tf1.email


i dont think these joins are needed as you can select from either of the CTEs in this final select statement to get exactly what you need

wileymc

I've pushed up a bunch of changes and dbt run is looking good so far! Lets get this merged then we can get the data into the public schema. You can click into my commits above to see what changes were needed to get this working.

jonbig added 6 commits October 31, 2023 16:18

working on creating boulder city pols and offices

0b6a468

working on creating boulder city pols and offices

d9ad0f8

progress on politicians table

19ea27f

progress on politicians table

b523d56

finishing pols and office table

9fa061c

finished races and race/candidate records

ce3de6b

wileymc requested changes Nov 2, 2023

View reviewed changes

jonbig and others added 3 commits November 2, 2023 18:59

working on creating boulder city pols and offices

8a6c100

Lint and clean up

03ab048

Rebase to main

88987fd

wileymc force-pushed the jon_colorado_data branch from 31de5e9 to 88987fd Compare November 2, 2023 23:07

wileymc reviewed Nov 2, 2023

View reviewed changes

Fix issues

a37dd3b

wileymc reviewed Nov 2, 2023

View reviewed changes

wileymc added 4 commits November 3, 2023 21:43

Join politicians, races, and offices to intermediate model

9b82be5

Add dir structure

ad588b9

Clean up seat and move seeds to seeds

a7a73bb

Delete vscode file

ac153f6

wileymc approved these changes Nov 7, 2023

View reviewed changes

jonbig merged commit cb07699 into main Nov 14, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jon colorado data (boulder) #2

Jon colorado data (boulder) #2

jonbig commented Nov 1, 2023

wileymc commented Nov 2, 2023

wileymc Nov 2, 2023

jonbig Nov 2, 2023

wileymc Nov 2, 2023

wileymc Nov 2, 2023

jonbig commented Nov 2, 2023 •

edited

Loading

wileymc Nov 2, 2023

wileymc Nov 2, 2023

wileymc Nov 2, 2023

wileymc left a comment •

edited

Loading



		--fields for politician table
		----Where is the id field coming from?

		LEFT JOIN transformed_filings AS tf ON f.email = tf.email
		LEFT JOIN transformed_filings_1 AS tf1 ON tf.email = tf1.email

Jon colorado data (boulder) #2

Jon colorado data (boulder) #2

Conversation

jonbig commented Nov 1, 2023

wileymc commented Nov 2, 2023

wileymc Nov 2, 2023

Choose a reason for hiding this comment

jonbig Nov 2, 2023

Choose a reason for hiding this comment

wileymc Nov 2, 2023

Choose a reason for hiding this comment

wileymc Nov 2, 2023

Choose a reason for hiding this comment

jonbig commented Nov 2, 2023 • edited Loading

wileymc Nov 2, 2023

Choose a reason for hiding this comment

wileymc Nov 2, 2023

Choose a reason for hiding this comment

wileymc Nov 2, 2023

Choose a reason for hiding this comment

wileymc left a comment • edited Loading

Choose a reason for hiding this comment

jonbig commented Nov 2, 2023 •

edited

Loading

wileymc left a comment •

edited

Loading