I need a freelancer to prepare benchmark questions and answers for testing a custom LLM’s reasoning ability.
Scope: Question Set: Collect 500–600 LLM benchmark questions with correct answers. Focus areas: logical, mathematical, commonsense, analytical, and multi-step reasoning. Deliver as JSON or CSV. Python Script: Load questions and send them to an LLM (I'll handle API integration). Compare model answers to correct ones. Output a simple accuracy report. Requirements: Knowledge of LLMs, reasoning datasets, or NLP is preferred. Clean, documented code. Use only open or original questions.
Artistic Video Editor for Urban Brand Category: Adobe Premiere Pro, After Effects, Animation, Cinematography, Motion Graphics, Video Editing, Video Production Budget: €60 - €100 EUR
Albanian-English Translation for Event Category: English (UK) Translator, English (US) Translator, English Grammar, Malay Translator, Slovakian Translator, Translation Budget: €18 - €36 EUR
27-Jun-2025 21:50 GMT
3D Car Model Adjustment Category: 3D Animation, 3D Modelling, 3D Rendering, 3ds Max, Solidworks Budget: £250 - £750 GBP