1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 28-Apr-2025 09:48 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: AI specialist for advanced scraping tool for housing websites
Posted by: External project from PeoplePerHour
Started: 17-Dec-2024 12:34 GMT
Description: I am looking for an AI specialist with extensive experience in AI to develop a Windows Service in C# that can do the following:
Every day, visit a list of approximately 800 URLs of real estate agency websites and navigate through the pages to search for newly listed properties added by the agencies.

Next, these property pages must be read, and the relevant data extracted to be stored in a fixed format in tables on an SQL server.

A number of data fields are mandatory, such as:

The direct URL of the property page within the real estate agency's website (to enforce uniqueness)
The city where the property is located
The street where the property is located
The property type, where the choice comes from our fixed list: entire home, apartment, studio, etc. The engine must select the closest match from our list
The number of rooms
The monthly rental price
Whether this price includes or excludes service charges
The date the property is available
The surface area in square meters
A list of URLs of the photos associated with the property
Additionally, there is a list of optional fields we would like to retrieve if the information is available:

Municipality
District
Postal code
House number
Number of bedrooms
Number of bathrooms
Year of construction
Is there a: garden, garage, rooftop terrace, balcony?
Condition of the property
Is the property furnished?
...and so on
A complete list will be provided.

The challenge lies in the fact that each real estate agency uses a different paging method and different page layouts. Furthermore, some agencies include all the information in one block of text, while others display much of the data in columns. This can also change unexpectedly. Therefore, the software must be resilient and capable of understanding how to navigate through the pages to look for new properties.

A second challenge is that some agencies include photos of other nearby properties under the details of a specific property. The tool must recognize that these photos do not belong to the property in question and should ignore them.

Preferably, we would use—due to cost considerations—an AI model that does not rely on a commercial API, unless doing so offers such significant benefits that it is worthwhile.

I would love to hear about your experience and how you would approach this. Specifically: which AI method/engine you would use and the flow of the software.
Project ID: 3413051
Project category:
Project budget:
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Website Development Needed 28-Apr-2025
01:55 GMT
Life-story 28-Apr-2025
01:50 GMT
Creative Scriptwriter Needed for Kid-Friendly GTA RP YouTube 28-Apr-2025
01:44 GMT
I need a landing page MVP. 28-Apr-2025
01:42 GMT
Personal Assistant For Removal Business 28-Apr-2025
01:40 GMT
Website Administrator 28-Apr-2025
01:39 GMT
Seeking Cognitive Aptitude Test Expert(CCAT/PICA) Expert! 28-Apr-2025
01:34 GMT
i need packaging designed for my protein balls 28-Apr-2025
01:33 GMT
Design Booklet style brochure for my business 15 PAGES 28-Apr-2025
01:33 GMT
Proofreader & Editor Needed for Non-Fiction Book (Trading) 28-Apr-2025
01:33 GMT
Book Layout Designer (Adobe InDesign) — Fast Project 28-Apr-2025
01:33 GMT
YouTube Collaborator for Antique Website 28-Apr-2025
01:33 GMT
Android Build for A mobile Application 28-Apr-2025
01:33 GMT
EUWEB 250427 - Wordpress/Elementor Developer/Admin/AI eCommerce 28-Apr-2025
01:29 GMT
Showroom Design 28-Apr-2025
01:29 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Short URL:
1001fp.com
Mobile version:
m.1001freelanceprojects.com
Copyright © 2005-2024 1001 Freelance Projects