1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 01-May-2024 22:49 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: PCRE based URL rewriting feature added to GNU wget
Posted by: External project from PeoplePerHour
Started: 24-Jul-2020 14:30 GMT
Description: Expected duration: less than 1 week
I'm looking for a new feature to be added to GNU Wget which will alter URLs before they are added to the download queue, based on user-provided PCRE (Perl Compatible Regular Expression) patterns.

The feature should support multiple PCREs, passed via command line arguments (and optionally via another mechanisms such as input file).

Example Syntax:
wget --no-clobber --recursive --page-requisites \
--url-replace='s/&foo=\d//g' \
--url-replace='s/AAA/BBB/gi' \
--url-replace='s/(a[0-9])\.bkp\.html$/\1.BACKUP.html/' \
"http://site.tld"

[ Note that "--url-replace" does not currently exist, and is there to demonstrate the feature desired ]

Given the above command, Wget should apply the patterns given with the "--url-replace" arguments early enough such that checks for the corresponding local file already existing (i.e. --no-clobber) are based on the post-replacement URL's value.

A Worked Example:

When recursively downloading / mirroring from the start page "https://site.tld/", using the "Example Syntax" command above:

- Wget finds the link "https://site.tld/aaa.html?a=1&foo=2&bar=3".
- This link is converted into "https://site.tld/BBB.html?a=1&bar=3" after applying the '--url-replace' patterns above (NB: there is a /gi modifier on the second pattern).
- If the local file "./site.tld/BBB.html?a=1&bar=3" already exists, the content will not be downloaded. Otherwise, wget will fetch it as normal.
- If Wget later finds a link to "https://site.tld/a3.bkp.html", this URL is converted to "https://site.tld/a3.BACKUP.html" before being downloaded and saved at "./site.tld/a3.BACKUP.html" (again, assuming that ./site.tld/a3.BACKUP.html does not already exist)


NOTE: If you can suggest another approach to the one above which achieves the same objective and fits within the project budget, I'm willing to consider it. As an example, passing URLs for changes via external script before adding to the queue might be a reasonable approach.


Delivery requirements:

- You will add the feature to the existing code available from https://ftp.gnu.org/gnu/wget/wget-latest.tar.gz (currently: 1.20.3)

- You will provide a zip file of the complete source code with the new feature added, and a patch file which can be applied the original (clean) wget-1.20.3 source code.

- The updated code will compile cleanly on a Fedora 32/CentOS 8 system with normal wget dependencies already installed via DNF/yum (pcre/pcre2/gnutls/nettle/zlib/libidn2)

- The usual commands "make dist clean; ./configure --enable-pcre; make" will work as expected, leaving a ready-to-execute "wget" binary under ./src/wget.
Project ID: 3034169
Project category:
Project budget:
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Bahis Sitesinde Canlı Destek- Call Center
Category: Article Writing, Technical Translation, Translation, Web Design
Budget: $15 - $25 USD
01-May-2024
22:03 GMT
Profitable Shopify Store Optimisation
Category: ECommerce, Shopify, Shopify Templates, Shopping Carts, Web Design
Budget: $15 - $25 CAD
01-May-2024
22:03 GMT
Laravel Developer for Law Firm Management
Category: HTML, Laravel, MySQL, PHP, Web Design
Budget: €30 - €250 EUR
01-May-2024
22:01 GMT
Bilingual Sales Specialist Needed
Category: Internet Marketing, Marketing, Sales, Spanish Translator, Telecom Sales
Budget: $8 - $15 USD
01-May-2024
22:01 GMT
E-commerce Website Builder Needed
Category: ECommerce, HTML, PHP, Web Design, WordPress
Budget: $30 - $250 USD
01-May-2024
22:00 GMT
High-Risk Payment Solutions Expert Required
Category: Web Development
Budget: $30 - $250 USD
01-May-2024
22:00 GMT
Study Level Airliner Creation for Flight Simulator
Category: 3D Animation, 3D Modelling, 3D Rendering, Aircraft Performance, Game Development
Budget: £3000 - £5000 GBP
01-May-2024
22:00 GMT
Retro Band Video - 70s Influence
Category: Animation, Video Editing, Video Production, Video Services, Videography
Budget: £250 - £750 GBP
01-May-2024
21:59 GMT
Modern Book Design for Solomon's Temple
Category: Brochure Design, Graphic Design, Illustration, Logo Design, Photoshop
Budget: $250 - $750 USD
01-May-2024
21:59 GMT
2D Animator & Voiceover for Entertainment Video
Category: After Effects, Animation, Audio Services, Commercials, Voice Talent
Budget: $30 - $250 USD
01-May-2024
21:57 GMT
Modern Minimalist Menu Redesign
Category: Graphic Design, Icon Design, Illustration, Logo Design, Photoshop
Budget: $30 - $250 USD
01-May-2024
21:55 GMT
Asistente Virtual para Transcripción y Diseño de Presentaciones en Canva
Category: Branding, Canva, Powerpoint, Transcription, Virtual Assistant
Budget: $250 - $750 USD
01-May-2024
21:55 GMT
Basic Math Tutoring for 120 Minutes
Category: Education & Tutoring, Mathematics, Matlab And Mathematica, Physics, Statistics
Budget: $15 - $25 USD
01-May-2024
21:54 GMT
Blockchain Wallet App Development
Category: Android, C, Programming, IPhone, Java, Mobile App Development
Budget: £10 - £15 GBP
01-May-2024
21:52 GMT
Perl Script Adjustment: Zero Results Handling
Category: HTML, Perl
Budget: $30 - $250 USD
01-May-2024
21:52 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Short URL:
1001fp.com
Mobile version:
m.1001freelanceprojects.com
Copyright © 2005-2022 1001 Freelance Projects