Part1 : Provided a dataset of volume sales of products from 2019 to 2022 run an extensive exploratory data analysis including the following: . 1. Data Quality & Structure Checks Missing values, duplicates, negative sales, outliers Date consistency (no gaps, proper frequency, handling holidays/weekends) 2. Descriptive Statistics Overall distribution of daily sales (mean, median, std, skewness, kurtosis) By dimension: product, customer Identify top products per customer by volume 3. Time Series Exploration Trend: long-term upward or downward movement Seasonality: daily/weekly patterns (weekdays vs weekends), monthly, quarterly, yearly cycles Rolling averages (7-day, 30-day) to smooth patterns 4. Visualization Layer Time series plots: raw daily sales, moving averages Boxplots: distribution of sales by weekday or month Histograms/density plots: sales distribution 5. Anomaly & Outlier Detection Unusual spikes/drops Use Z-scores or interquartile ranges to flag anomalies 6. Correlation & Drivers of Sales Correlation if needed 7. Performance Metrics (Baseline) Set benchmarks to prepare for forecasting models: Average daily sales per SKU/store Volatility (Coefficient of Variation) Baseline forecast error (e.g., naïve forecast MAPE)
EDA Deliverables : By the end of an extensive EDA, I should have: Clear understanding of demand patterns, seasonality, and anomalies Insights into drivers of sales (internal like price/promo, external like weather/events) Segmentation of products into high/medium/low performers A baseline performance snapshot to compare forecasting models against.
Part 2 : After cleaning the data based on the above analysis, run a linear regression-based model to prepare a sales volume forecast at product & customer level for 2022 in python or/and pyspark. Measure the accuracy by introducing quality measures and explain why have you introduced these measures.
SEO-Driven Tech Blog Series Category: Article Writing, Blog Writing, Content Marketing, Content Writing, Copywriting, Internet Marketing, Keyword Research, Link Building, Technical Writing Budget: $250 - $750 USD
26-Apr-2026 22:02 GMT
Modern Diagrams for Thought Leadership Models Category: Adobe Creative Cloud, Adobe Illustrator, Branding, Design, Graphic Design, Illustration, Infographics, Logo Design, Photoshop, Visual Design Budget: $250 - $750 AUD
26-Apr-2026 22:02 GMT
Real Estate Social Media Manager Category: Analytics, Content Creation, Digital Marketing, Facebook Marketing, Internet Marketing, Lead Generation, Market Research, Social Media Management, Social Media Marketing, YouTube Budget: $15 - $25 USD
26-Apr-2026 22:01 GMT
Vibrant SMD Award Show Slides Category: 2D Animation, 3D Animation, Adobe Illustrator, Photoshop, Adobe Premiere Pro, Graphic Design, Motion Graphics, Video Editing, Video Services, Videography Budget: $30 - $250 USD
Land Development for Multi-Family Housing Category: Business Analysis, Finance, Financial Analysis, Financial Research, Market Analysis, Real Estate, Risk Assessment, Urban Planning Budget: ₹12500 - ₹37500 INR
26-Apr-2026 21:40 GMT
Spotify Song Promotion Specialist Category: Advertising, Brand Management, Brand Marketing, Content Marketing, Music, Social Media Marketing, Spotify Ads, Twitter, YouTube Budget: ₹1500 - ₹12500 INR
26-Apr-2026 21:39 GMT
High-Detail PLA Parts Printing Category: 3D CAD, 3D Design, 3D Model Maker, 3D Modelling, 3D Printing, Brochure Design, Graphic Design, Photoshop Budget: €30 - €250 EUR
26-Apr-2026 21:39 GMT
HTML-to-Webflow Portfolio Conversion Category: CSS, HTML, HTML5, JavaScript, UI / User Interface, Web Design, Web Development, Webflow Budget: $15 - $25 USD
26-Apr-2026 21:36 GMT
GData Entry Transcription Assistant Category: Copyright, Data Analysis, Data Entry, Data Management, Excel, Google Sheets, JavaScript, PHP, Visual Basic Budget: ₹600 - ₹1500 INR