Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: updater-scripts #733

Draft
wants to merge 40 commits into
base: gql
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from 3 commits
Commits
Show all changes
40 commits
Select commit Hold shift + click to select a range
90145f5
base section script
adit-bala Oct 27, 2024
101301a
modularize data fetching
adit-bala Oct 28, 2024
f523109
new `Course` & `Class` Schema
adit-bala Oct 28, 2024
7099aac
`courseParser` & `classParser`
adit-bala Oct 30, 2024
7f4e9c9
Merge branch 'gql' into feat-updater-scripts
mathhulk Nov 2, 2024
537af49
don't use terms for `course` api
adit-bala Nov 3, 2024
7b9b141
add batching from @mathhulk
adit-bala Nov 3, 2024
e8a0f2b
insert results into the database
adit-bala Nov 3, 2024
3f81826
only delete relevant objects
adit-bala Nov 4, 2024
d43c524
add mongodb loader
adit-bala Nov 4, 2024
edc75d1
update optional fields for `course`
adit-bala Nov 5, 2024
a0a6122
don't include invalid data
adit-bala Nov 5, 2024
1d6f78f
simplify `course` and `class`
adit-bala Nov 7, 2024
ff86b7c
add required fields for section
adit-bala Nov 11, 2024
e7838e9
add testing script
adit-bala Nov 11, 2024
c2e46aa
init infra changes
adit-bala Nov 17, 2024
469e691
denote required fields in typescript
adit-bala Nov 17, 2024
676a797
decouple `datapuller` infra
adit-bala Nov 21, 2024
57e1931
feat: log total errors for datapuller
adit-bala Nov 21, 2024
9867932
rm dependecy
adit-bala Nov 21, 2024
a0e165c
fix: keep `datapuller` infra in `app`
adit-bala Nov 25, 2024
85d99ba
chore: migrate logic to one file
adit-bala Nov 25, 2024
489859d
chore: add encrypted env vars
adit-bala Nov 26, 2024
d8d099d
Merge branch 'gql' into feat-updater-scripts
adit-bala Nov 26, 2024
ad36e05
feat: logging + cleanup of logs
adit-bala Nov 26, 2024
c4b729a
fix: error types
adit-bala Nov 26, 2024
e4d8e4a
chore: more detailed logging
adit-bala Nov 26, 2024
93adeab
fix: proper define
adit-bala Nov 26, 2024
5a2f5ee
fix: remove `dev` check
adit-bala Nov 26, 2024
f4f30d6
fix additional {end}
maxmwang Dec 1, 2024
4eebf84
package-lock
maxmwang Dec 1, 2024
4226250
fix {{}} whitespacing
maxmwang Dec 1, 2024
16ce976
force string tag
maxmwang Dec 1, 2024
38a4716
fix helm chart
maxmwang Dec 1, 2024
0ebffb7
add datapuller to cd, specify cronjob timezone
maxmwang Dec 2, 2024
f74d8e1
specify datapuller image tags
maxmwang Dec 2, 2024
76fad25
feat: modular cronjob entrypoint script
adit-bala Dec 2, 2024
3285ff0
fix: include datapuller-prod?
adit-bala Dec 2, 2024
0eb3d4f
fix: add command to enable `datapuller-prod`
adit-bala Dec 4, 2024
0f026b9
fix: remove unused `env`
adit-bala Dec 4, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -15,10 +15,10 @@ COPY --from=datapuller-builder /datapuller/out/package-lock.json ./package-lock.
RUN ["npm", "install"]

COPY --from=datapuller-builder /datapuller/out/full/ .
ENTRYPOINT ["turbo", "run", "course", "--filter=datapuller"]
ENTRYPOINT ["turbo", "run", "section", "--filter=datapuller"]

FROM datapuller-dev AS datapuller-prod
ENTRYPOINT ["turbo", "run", "course", "--filter=datapuller", "--env-mode=loose"]
ENTRYPOINT ["turbo", "run", "section", "--filter=datapuller", "--env-mode=loose"]

# backend
FROM base AS backend-builder
Expand Down
2 changes: 1 addition & 1 deletion apps/datapuller/package.json
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
"private": true,
"scripts": {
"build": "tsc --noEmit",
"course": "tsx src/course.ts"
"section": "tsx src/section.ts"
},
"devDependencies": {
"@types/node": "^20.14.12",
Expand Down
43 changes: 0 additions & 43 deletions apps/datapuller/src/course.ts

This file was deleted.

40 changes: 40 additions & 0 deletions apps/datapuller/src/section.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
import { ISectionItem } from "@repo/common";
import { ClassSection, ClassesAPI } from "@repo/sis-api/classes";

import setup from "./shared";
import mapSectionToNewSection from "./shared/parser";
import { fetchPaginatedData } from "./shared/utils";

async function updateSections() {
const { config, log } = setup();
const classesAPI = new ClassesAPI();

const sections = await fetchPaginatedData<ISectionItem, ClassSection>(
classesAPI.v1,
"getClassSectionsUsingGet",
{ "term-id": "2248" },
{
app_id: config.sis.CLASS_APP_ID,
app_key: config.sis.CLASS_APP_KEY,
},
(data) => data.apiResponse.response.classSections || [],
mapSectionToNewSection
);

log.info(`Updated ${sections.length} sections for Spring 2024`);
}

const initialize = async () => {
const { log } = setup();
try {
log.info("\n=== UPDATE SECTIONS ===");
await updateSections();
} catch (error) {
log.error(error);
process.exit(1);
}

process.exit(0);
};

initialize();
4 changes: 4 additions & 0 deletions apps/datapuller/src/shared/config.ts
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,8 @@ export interface Config {
CLASS_APP_KEY: string;
COURSE_APP_ID: string;
COURSE_APP_KEY: string;
TERM_APP_ID: string;
TERM_APP_KEY: string;
};
}

Expand All @@ -37,6 +39,8 @@ export function loadConfig(): Config {
CLASS_APP_KEY: env("SIS_CLASS_APP_KEY"),
COURSE_APP_ID: env("SIS_COURSE_APP_ID"),
COURSE_APP_KEY: env("SIS_COURSE_APP_KEY"),
TERM_APP_ID: env("SIS_TERM_APP_ID"),
TERM_APP_KEY: env("SIS_TERM_APP_KEY"),
},
};
}
4 changes: 2 additions & 2 deletions apps/datapuller/src/shared/index.ts
Original file line number Diff line number Diff line change
Expand Up @@ -3,12 +3,12 @@ import { Logger } from "tslog";
import { loadConfig } from "./config";

export default function setup() {
loadConfig();
const config = loadConfig();

const log = new Logger({
type: "pretty",
prettyLogTimeZone: "local",
});

return { log };
return { log, config };
}
191 changes: 191 additions & 0 deletions apps/datapuller/src/shared/parser.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,191 @@
import { ISectionItem } from "@repo/common";
import { ClassSection } from "@repo/sis-api/classes";

function getRequiredField<T>(
value: T | undefined,
fieldName: string,
defaultValue: T
): T {
if (value === undefined || value === null) {
console.warn(`Missing required field: ${fieldName}`);
adit-bala marked this conversation as resolved.
Show resolved Hide resolved
return defaultValue;
}
return value;
}

function mapSeatReservation(
reservation: any
): ISectionItem["enrollment"]["reservations"][0] {
return {
number: getRequiredField(reservation.number, "reservation.number", 0),
requirementGroup: getRequiredField(
reservation.requirementGroup?.description,
"reservation.requirementGroup.description",
""
),
maxEnroll: getRequiredField(
reservation.maxEnroll,
"reservation.maxEnroll",
0
),
enrolledCount: getRequiredField(
reservation.enrolledCount,
"reservation.enrolledCount",
0
),
};
}

function mapInstructor(
instructor: any
): ISectionItem["meetings"][0]["instructors"][0] {
const name =
instructor.instructor?.names?.find((n: any) => n.type?.code === "PRF") ||
instructor.instructor?.names?.[0];

return {
printInScheduleOfClasses: getRequiredField(
instructor.printInScheduleOfClasses,
"instructor.printInScheduleOfClasses",
false
),
familyName: getRequiredField(name?.familyName, "instructor.familyName", ""),
givenName: getRequiredField(name?.givenName, "instructor.givenName", ""),
role: getRequiredField(instructor.role?.code, "instructor.role.code", ""),
};
}

function mapMeeting(meeting: any): ISectionItem["meetings"][0] {
return {
number: getRequiredField(meeting.number, "meeting.number", 0),
days: [
meeting.meetsMonday,
meeting.meetsTuesday,
meeting.meetsWednesday,
meeting.meetsThursday,
meeting.meetsFriday,
meeting.meetsSaturday,
meeting.meetsSunday,
].map((day, index) =>
getRequiredField(day, `meeting.day[${index}]`, false)
),
startTime: getRequiredField(meeting.startTime, "meeting.startTime", ""),
endTime: getRequiredField(meeting.endTime, "meeting.endTime", ""),
startDate: getRequiredField(meeting.startDate, "meeting.startDate", ""),
endDate: getRequiredField(meeting.endDate, "meeting.endDate", ""),
location: getRequiredField(
meeting.location?.description,
"meeting.location.description",
""
),
instructors: meeting.assignedInstructors?.map(mapInstructor) || [],
};
}

export default function mapSectionToNewSection(
original: ClassSection
): ISectionItem {
const courseId = getRequiredField(
original.class?.course?.identifiers?.find((i) => i.type == "cs-course-id")
?.id,
"courseId",
""
);

const newSection: ISectionItem = {
courseId,
classNumber: getRequiredField(original.class?.number, "classNumber", ""),
sessionId: getRequiredField(original.class?.session?.id, "sessionId", ""),
termId: getRequiredField(original.class?.session?.term?.id, "termId", ""),
sectionId: getRequiredField(original.id, "sectionId", 0),
number: getRequiredField(original.number, "number", ""),
component: getRequiredField(original.component?.code, "component.code", ""),
status: getRequiredField(original.status?.code, "status.code", ""),
instructionMode: getRequiredField(
original.instructionMode?.code,
"instructionMode.code",
""
),
printInScheduleOfClasses: getRequiredField(
original.printInScheduleOfClasses,
"printInScheduleOfClasses",
false
),
graded: getRequiredField(original.graded, "graded", false),
feesExist: getRequiredField(original.feesExist, "feesExist", false),
startDate: new Date(getRequiredField(original.startDate, "startDate", "")),
endDate: new Date(getRequiredField(original.endDate, "endDate", "")),
addConsentRequired: getRequiredField(
original.addConsentRequired?.code,
"addConsentRequired.code",
""
),
dropConsentRequired: getRequiredField(
original.dropConsentRequired?.code,
"dropConsentRequired.code",
""
),
primary: getRequiredField(
original.association?.primary,
"association.primary",
false
),
type: getRequiredField(original.type?.code, "type.code", ""),
combinedSections: getRequiredField(
original.combination?.combinedSections,
"combination.combinedSections",
[]
),
enrollment: {
status: getRequiredField(
original.enrollmentStatus?.status?.code,
"enrollmentStatus.status.code",
""
),
enrolledCount: getRequiredField(
original.enrollmentStatus?.enrolledCount,
"enrollmentStatus.enrolledCount",
0
),
minEnroll: getRequiredField(
original.enrollmentStatus?.minEnroll,
"enrollmentStatus.minEnroll",
0
),
maxEnroll: getRequiredField(
original.enrollmentStatus?.maxEnroll,
"enrollmentStatus.maxEnroll",
0
),
waitlistedCount: getRequiredField(
original.enrollmentStatus?.waitlistedCount,
"enrollmentStatus.waitlistedCount",
0
),
maxWaitlist: getRequiredField(
original.enrollmentStatus?.maxWaitlist,
"enrollmentStatus.maxWaitlist",
0
),
reservations:
original.enrollmentStatus?.seatReservations?.map(mapSeatReservation) ||
[],
},
exams:
original.exams?.map((exam) => ({
date: new Date(getRequiredField(exam.date, "exam.date", "")),
startTime: getRequiredField(exam.startTime, "exam.startTime", ""),
endTime: getRequiredField(exam.endTime, "exam.endTime", ""),
location: getRequiredField(
exam.location?.description,
"exam.location.description",
""
),
number: getRequiredField(exam.number, "exam.number", 0),
type: getRequiredField(exam.type?.code, "exam.type.code", ""),
})) || [],
meetings: original.meetings?.map(mapMeeting) || [],
};

return newSection;
}
60 changes: 60 additions & 0 deletions apps/datapuller/src/shared/utils.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,60 @@
export async function fetchPaginatedData<T, R>(
api: any,
method: string,
baseParams: Record<string, any>,
headers: Record<string, string>,
responseProcessor: (data: any) => R[],
itemProcessor: (item: R) => T
): Promise<T[]> {
const results: T[] = [];
let page = 1;
let retries = 1;

while (retries > 0) {
try {
const response = await api[method](
{
...baseParams,
"page-number": page,
"page-size": 100,
},
{ headers }
);

const data = await response.json();
const processedData = responseProcessor(data);

if (processedData.length === 0) {
break; // No more data to fetch
}
console.log("Mapping processedData with itemProcessor...");
const transformedData = processedData.map((item, index) => {
try {
return itemProcessor(item);
} catch (error) {
console.error(`Error processing item at index ${index}:`, error);
console.error("Problematic item:", JSON.stringify(item, null, 2));
throw error;
}
});

results.push(...transformedData);
console.log(`Processed ${processedData.length} items from page ${page}.`);
page++;
retries = 1; // Reset retries on successful fetch
} catch (error) {
console.log(`Error fetching page ${page} of data`);
console.log(`Unexpected error querying API. Error: "${error}"`);

if (retries === 0) {
console.log(`Too many errors querying API. Terminating update...`);
break;
}

retries--;
console.log(`Retrying...`);
adit-bala marked this conversation as resolved.
Show resolved Hide resolved
}
}

return results;
}
Loading
Loading