1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 29-Mar-2025 08:22 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: AI specialist for advanced scraping tool for housing websites
Posted by: External project from PeoplePerHour
Started: 17-Dec-2024 12:34 GMT
Description: I am looking for an AI specialist with extensive experience in AI to develop a Windows Service in C# that can do the following:
Every day, visit a list of approximately 800 URLs of real estate agency websites and navigate through the pages to search for newly listed properties added by the agencies.

Next, these property pages must be read, and the relevant data extracted to be stored in a fixed format in tables on an SQL server.

A number of data fields are mandatory, such as:

The direct URL of the property page within the real estate agency's website (to enforce uniqueness)
The city where the property is located
The street where the property is located
The property type, where the choice comes from our fixed list: entire home, apartment, studio, etc. The engine must select the closest match from our list
The number of rooms
The monthly rental price
Whether this price includes or excludes service charges
The date the property is available
The surface area in square meters
A list of URLs of the photos associated with the property
Additionally, there is a list of optional fields we would like to retrieve if the information is available:

Municipality
District
Postal code
House number
Number of bedrooms
Number of bathrooms
Year of construction
Is there a: garden, garage, rooftop terrace, balcony?
Condition of the property
Is the property furnished?
...and so on
A complete list will be provided.

The challenge lies in the fact that each real estate agency uses a different paging method and different page layouts. Furthermore, some agencies include all the information in one block of text, while others display much of the data in columns. This can also change unexpectedly. Therefore, the software must be resilient and capable of understanding how to navigate through the pages to look for new properties.

A second challenge is that some agencies include photos of other nearby properties under the details of a specific property. The tool must recognize that these photos do not belong to the property in question and should ignore them.

Preferably, we would use—due to cost considerations—an AI model that does not rely on a commercial API, unless doing so offers such significant benefits that it is worthwhile.

I would love to hear about your experience and how you would approach this. Specifically: which AI method/engine you would use and the flow of the software.
Project ID: 3413051
Project category:
Project budget:
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Editing of horse training videos for online courses 29-Mar-2025
02:57 GMT
I need a press release manager who can get my articles posted 29-Mar-2025
02:56 GMT
Test CountBricks android app 29-Mar-2025
02:55 GMT
ROM Decompilation – Assembly-Level Projects 29-Mar-2025
02:52 GMT
UK Solicitor to read Will Trust Deed & answer a few questions 29-Mar-2025
02:43 GMT
I need help with fashion trend image research / image curation 29-Mar-2025
02:43 GMT
My Facebook and Instragram can not login please help me 29-Mar-2025
02:43 GMT
design a proffesional shopify shop 29-Mar-2025
02:43 GMT
I need data scraped of medical professionals. Fast turn around. 29-Mar-2025
02:43 GMT
My wordpress website needs redesign and mobile friendly version 29-Mar-2025
02:40 GMT
Allen 123[sharp]522 29-Mar-2025
02:39 GMT
Simple, yet functional website for events 28-Mar-2025
20:42 GMT
​​Agentic AI Assistant for Lead Engagement & Enquiry Management 28-Mar-2025
20:42 GMT
Creating an Amazon brand and packaging for a new product 28-Mar-2025
20:30 GMT
PLATFORM WITH PUBLIC LABEL RIGHTS COURSES 28-Mar-2025
20:08 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Short URL:
1001fp.com
Mobile version:
m.1001freelanceprojects.com
Copyright © 2005-2024 1001 Freelance Projects