1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
Job Data Collection System Python (scraping)
Posted by:
External project from PeoplePerHour
Started:
18-Nov-2024 04:24 GMT
Description:
Expected duration: less than 1 week
Project Overview
We are seeking an experienced Python developer to optimize and enhance our job data collection system. The current Selenium-based approach needs to be replaced with a more efficient API-driven solution, incorporating sophisticated data management and robust error handling.

Key Requirements
- Strong Python programming skills with API integration experience
- Database design and implementation (PostgreSQL preferred)
- Experience with data versioning and delta tracking
- Familiarity with VPN handling for IP rotation
- Linux server deployment experience (Ubuntu)

Technical Specifications

Core Functionalities
1. API Integration
- Implement API-based job ID collection to replace current Selenium approach
- Design intelligent filtering system to manage data retrieval within API limitations
- Develop dynamic filter adjustment for optimal data collection

2. Database Design & Implementation
- Design and implement a PostgreSQL database structure
- Key data points to track:
- Job IDs and metadata
- First addition and update dates
- Full job details (JSON format)
- Update tracking and versioning
- Job availability status

3. Data Management
- Implement delta versioning for historical tracking
- Design system to handle regular job listing updates
- Ensure no data loss during updates

4. System Features
- Flexible time period selection for data retrieval
- Automatic filter optimization to work within API limitations
- IP rotation mechanism using NordVPN

Additional Requirements
- Comprehensive logging system
- Email notification system for errors and results
- Daily statistics tracking and reporting
- Server deployment on Ubuntu VPS

Technical Considerations
- System must handle large volumes of data efficiently
- Solution should be scalable and maintainable
- Must work within API rate limits and restrictions

Deliverables
1. Complete Python codebase
2. Database schema and implementation
3. Import of existing data
4. Deployment documentation
5. System documentation including error handling procedures

Skills Required
- Advanced Python programming
- API integration expertise
- Database design and optimization
- Linux server administration
- Network handling (VPN integration)

This is a complex project requiring a developer with strong system design skills and attention to detail. The ideal candidate will have experience with large-scale data collection and management systems.
Project ID:
3409082
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Storyboard Artist for Animated Web Series
Category: 2D Animation, 2D Animation Explainer Video, 2D Game Art, 2D Layout, 3D Animation, Adobe Animate, Adobe Creative Cloud, Animation, Caricature & Cartoons, Illustration
Budget: $10 - $15 USD
04 Mar 2026 17:04 GMT
GoDaddy E-Commerce Website Build
Category: AI Content Creation, ECommerce, GoDaddy, HTML, Payment Gateway Integration, Shopping Cart Integration, Web Development, Web Design
Budget: £20 - £250 GBP
04 Mar 2026 17:04 GMT
Excel Margin Formula Setup
Category: Data Analysis, Data Entry, Data Management, Data Processing, Excel, Excel Macros, Excel VBA, Visual Basic
Budget: ₹600 - ₹1200 INR
04 Mar 2026 17:03 GMT
UK Education Off-Page SEO (Authority) - Quality Links + Citation 04 Mar 2026 17:03 GMT
UI/UX Designer Needed for Dark Premium Discipline App 04 Mar 2026 17:03 GMT
Minute-Level Memecoin Price Research on historical data
Category: Cryptocurrency, Data Analysis, Data Collection, Data Visualization, Google Sheets, Market Research, Research, Trading
Budget: €6 - €12 EUR
04 Mar 2026 17:00 GMT
Technical Writer for Clinical Diagnostics
Category: Compliance, Content Writing, Medical Writing, Technical Documentation, Technical Writing
Budget: min $50 USD
04 Mar 2026 16:57 GMT
Basic Web Presence for Roofing
Category: Google Analytics, Google Search, Graphic Design, SEO, Web Design, Web Development, WordPress
Budget: £20 - £250 GBP
04 Mar 2026 16:57 GMT
Travel Agency Simulator (Android Mobile Game)
Category: Android, Game Design, Game Development, Mobile App Development, Simulation, Unity
Budget: $750 - $1500 USD
04 Mar 2026 16:55 GMT
School Administrator & Operations Coordinator
Category: Canva, CRM, Event Management, Project Management, Virtual Assistant
Budget: $30 - $250 NZD
04 Mar 2026 16:55 GMT
Classic Blog Template with Elementor
Category: Blog Design, Content Management System (CMS), Elementor, Email Marketing, Graphic Design, HTML, Web Design, Website Optimization, WordPress
Budget: $30 - $250 USD
04 Mar 2026 16:55 GMT
Full-Stack MERN E-Commerce Store Development -- 2
Category: AngularJS, JavaScript, MERN, MongoDB, Node.js, RESTful, RESTful API, Tailwind CSS
Budget: €3000 - €5000 EUR
04 Mar 2026 16:54 GMT
Adapt Company Signage into Logos
Category: Adobe Illustrator, Branding, Graphic Design, Illustration, Logo Design, Photoshop, Vector Design, Visual Design
Budget: £20 - £250 GBP
04 Mar 2026 16:54 GMT
Rwanda Medical & Dental License Verification
Category: Medical, Medical Writing, Research, Research Writing
Budget: $10 - $50 USD
04 Mar 2026 16:53 GMT
5-Page Promotional Brochure Design
Category: Adobe Creative Cloud, Adobe Illustrator, Adobe InDesign, Photoshop, Brochure Design, Graphic Design, Print Design
Budget: $30 - $250 USD
04 Mar 2026 16:51 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2025
1001 Freelance Projects