Background: Companies House data only includes registered office addresses. We require the actual trading addresses (principal place(s) of business) for analysis, marketing outreach, or compliance.
Objective: Build a pipeline that takes a list of UK company numbers (and optional SIC codes), and outputs a CSV with:
Company number
Company name
Number of employees
Turnover (where available)
SIC code(s)
Trading address (street, city, postcode)
2. Scope of Work
Core Data Ingestion
Download/ingest the monthly Companies House bulk CSV (or use the Companies House API) to get company number, name, postcode, SIC code(s).
Trading-Address Enrichment
Primary method: Parse iXBRL filings for .
Fallback method: Query a Places‐API (e.g. Google Places or Foursquare) by “company name + postcode” to retrieve formatted address.
Data Merging & Cleanup
Consolidate registered vs. trading address fields.
Standardize address formatting.
Deduplicate and log failures for manual review.
Export & Delivery
Export a final CSV with the key fields.
Provide a short one-page README describing usage and dependencies
4. Required Skills & Experience
Strong Python (or Node.js) coding for data pipelines.
Experience parsing XBRL/iXBRL (e.g. python-iXBRL or equivalent).
Familiar with REST-API consumption (Companies House, Google/Foursquare, OpenCorporates).
Familiarity with web-scraping frameworks (Scrapy, BeautifulSoup, Puppeteer) is a plus.
Data cleansing and address standardization best practices.
Docker and CLI scripting for packaging (optional but preferred).
Milestones:
Core data ingestion + sample of 50 records
iXBRL enrichment + fallback API integration
Data cleanup, export & documentation
Please include in your proposal:
Relevant past projects / GitHub samples (especially XBRL or address-enrichment work).
Confirmation you can deliver the three key deliverables.
Resolve Google Play App Rejection Category: Android, App Development, App Store Optimization, Digital Marketing, Internet Marketing, Mobile App Development, SEO, Technical Writing Budget: $30 - $250 USD
31-Mar-2026 22:01 GMT
Logo Design From Detailed Brief Category: Adobe Illustrator, Branding, Creative Design, Graphic Design, Illustration, Logo Design, Photoshop, Vector Design Budget: $30 - $250 CAD
31-Mar-2026 21:59 GMT
Expandable Incident Logging Spreadsheet Category: Data Analysis, Data Management, Data Processing, Data Visualization, Excel, Microsoft Access, Microsoft Office, Visual Basic Budget: $30 - $250 USD
31-Mar-2026 21:55 GMT
Optimize Pine Script Scalping Category: Backtesting, Data Analysis, Financial Analysis, Financial Modeling, Market Research, Pine Script, Risk Management, Statistical Analysis Budget: $10 - $30 USD
31-Mar-2026 21:52 GMT
Synology Video-Streaming Einrichtung Category: API, Cloud, Database Management, HTML, Video Streaming, Web Design, Web Development, Web Hosting Budget: €8 - €30 EUR
31-Mar-2026 21:52 GMT
Proactive Networking Partner Wanted Category: Account Management, Business Development, Full Stack Development, Lead Generation, Market Research, Project Management, Sales, Social Media Marketing Budget: €8 - €30 EUR
31-Mar-2026 21:50 GMT
AI Bluetooth PCB Firmware Development Category: AI (Artificial Intelligence) HW / SW, AI Development, Bluetooth, Bluetooth Low Energy (BLE), Electronics, Embedded Systems, Microcontroller Budget: $750 - $1500 USD