Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job data validation with Indeed website #229

Open
samshipengs opened this issue Dec 23, 2024 · 0 comments
Open

Job data validation with Indeed website #229

samshipengs opened this issue Dec 23, 2024 · 0 comments

Comments

@samshipengs
Copy link

samshipengs commented Dec 23, 2024

I'm trying to validate the (indeed) data I got from JobSpy with the job listings I see directly from Indeed website, given same search params,

such as:

  • radius 50km
  • same location
  • same job title
  • within last 24hrs
  1. I'm seeing some records that show date_posted for example 10 days old (see attachment), but when I go to indeed and search within last 24hrs, it shows up as well, which means shouldn't that listing have date_posted like either Dec 23 (today) or Dec 22? instead of Dec 18?
Screenshot 2024-12-23 at 12 25 07 PM
  1. In general is this a robust way to validate the data we get from JobSpy simply by looking up against the Indeed website, or there might be some discrepancies due to non-obvious things?

  2. I took a quick look of the code, and found this line:

f'location: {{where: "{self.scraper_input.location}", radius: {self.scraper_input.distance}, radiusUnit: MILES}}'

I'm searching the ca website, which would default me to km for radius, so for validation purposes, I suppose I should set the unit to be something KMS instead of MILES?

thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant