1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 20-Sep-2025 03:12 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: Data acquisition
Posted by: External project from PeoplePerHour
Started: 10-Jan-2025 12:49 GMT
Description: We are a software company. For one of our projects we need to download
information from a website containing articles about medical topics.
The website contains cca. 10000 HTML pages of paged listing of articles
in Czech language. The list contains titles of articles, each title having
a link to the detail HTML page with the article text.
We need someone to produce wget and other scripts and download the titles of
all articles, parse the links from those titles, download the detailed pages
of the articles and distill the text that is shown in the page.
The titles as well as the detail pages mostly have the same structure so
this allows for an automated work. But it is not so in 100% cases, there may
be several types of structure so it may require some attention as to how
to distill the correct information.
The result of this work will be a set of static HTML files. You can view this
structure under
https://fomenot.com/z/dwld24/main.html
I.e. the result will contain the contents of the article separated into
paragraphs of normal text and captions (nothing else, no images or other
texts). We only want the main text of the article that is visible on the screen
for the user. No other text or html content.
Another result will be the raw HTML output for each of the detail pages
For accepting the output, we will do our check of the result. If we find errors,
we will give examples of these errors and we will expect the vendor to fix
all such errors in the result, not just those examples. If there are only a few
errors we may not be able to find them and it is ok. But if we find any we will
require correcting them.
We expect that the raw HTML files will be 100% error free (for these we will not
give examples, we just would demand fixing them). For the text-based results
we will give examples before demanding to fix them.

An example of such a source page you can find here:
https://www.idnes.cz/onadnes/zdravi/2
You can see a list of articles, each having a link leading to the detail
and then a paging control that can load more articles from the next page.
This is NOT the page we need to download but similar. Putting here the example
only that you understand what is the task.

Let us know if you could do it and for what price. We will provide the real links
to the selected candidate.
Project ID: 3416433
Project category:
Project budget:
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Smart Financial Assistant App
Category: Android, Flutter, IOS Development, IPad, IPhone, Mobile App Development, React Native, Web Development
Budget: $750 - $1500 USD
19-Sep-2025
21:53 GMT
Asia HR, Talent Acquisition Specialist, Recruitment Manager, Headhunter, Staffing Agent, and Recruitment Consultant Email Scraper Needed
Category: Automation, Data Collection, Data Entry, Data Management, Data Mining, Data Scraping, Email Marketing, Python, Web Scraping, Web Search
Budget: ₹600 - ₹1500 INR
19-Sep-2025
21:52 GMT
Cinematic Logo Animation & Sound
Category: 3D Animation, After Effects, Animation, Blender, Cinema 4D, Logo Animation, Motion Graphics, Sound Design
Budget: ₹600 - ₹1500 INR
19-Sep-2025
21:51 GMT
Fix Elementor Text Editor Loading Issue
Category: CSS, Debugging, Elementor, JavaScript, PHP, Web Development, Website Optimization, WordPress
Budget: $10 - $30 AUD
19-Sep-2025
21:50 GMT
Residential Quit Claim Deed
Category: Contracts, Legal, Legal Consultation, Legal Research, Legal Writing, Litigation, Patents, Property Law, Property Management
Budget: $10 - $30 USD
19-Sep-2025
21:50 GMT
Design 3D Printable Costume Armor
Category: 3D Animation, 3D CAD, 3D Design, 3D Modelling, 3D Printing, 3D Rendering, 3D Visualization, 3ds Max, Blender, Costume Design
Budget: $10 - $30 USD
19-Sep-2025
21:48 GMT
Mexico Virtual Assistant Bilingual Non Profit
Category: Customer Service, Data Entry, English Translation, Legal Translation, Spanish Translator, Translation, Virtual Assistant
Budget: $2 - $8 USD
19-Sep-2025
21:46 GMT
Saudi Google Ads Domination
Category: Google Ads, PPC Marketing, Search Engine Marketing (SEM)
Budget: $10 - $30 USD
19-Sep-2025
21:45 GMT
AI Movie Production
Category: 3D Animation, AI (Artificial Intelligence) HW / SW, AI Animation, AI Content Creation, Animation, Audio Services, Voice Acting, Voice Talent
Budget: $250 - $750 USD
19-Sep-2025
21:45 GMT
English-Urdu Product Page Translation
Category: Arabic Translator, Content Management System (CMS), Content Writing, English (US) Translator, English Translation, Hindi Translator, Translation, Urdu Translator
Budget: $10 - $30 USD
19-Sep-2025
21:45 GMT
Urgent YouTube Product Review Videos
Category: Adobe Premiere Pro, After Effects, Content Creation, Final Cut Pro, Video Editing, Video Production, Videography, YouTube
Budget: ₹1500 - ₹12500 INR
19-Sep-2025
21:44 GMT
WordPress Site Host Migration
Category: CPanel, Database Management, Linux, MySQL, PHP, SSL, Web Development, WordPress
Budget: $30 - $250 USD
19-Sep-2025
21:40 GMT
Startup Investor Pitch Deck
Category: Brochure Design, Business Analysis, Business Consulting, Business Development, Business Plan Writing, Corporate Identity, Graphic Design, Logo Design, Startup Consulting
Budget: $250 - $750 USD
19-Sep-2025
21:38 GMT
IT Resume Compilation for USA Based Employees
Category: Cloud Computing, Data Entry, Data Management, Data Science, Excel, IT Project Management, MySQL, Recruitment, Web Scraping, Web Search
Budget: $1500 - $3000 USD
19-Sep-2025
21:35 GMT
Modern Mexican Menu Redesign
Category: Adobe Creative Suite, Adobe Illustrator, Graphic Design, Logo Design, Menu Design, Photoshop, Typography, User Interface / IA
Budget: ₹75000 - ₹150000 INR
19-Sep-2025
21:33 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Short URL:
1001fp.com
Mobile version:
m.1001freelanceprojects.com
Copyright © 2005-2024 1001 Freelance Projects