Maths Professor | $15/hr Remote

Crossing Hurdles · Posted 2026-04-26

Position: SwarmBench Task Engineer, Reasoning / Math
Type: Short-Term Contract (4 weeks)
Compensation: $15 per hour
Location: Remote
Commitment: 8 hours per day with 4 hours overlap with PST

Role Responsibilities
- Build multi-agent benchmark tasks requiring multi-step mathematical reasoning, proofs, and algorithmic problem-solving
- Design complex problems across domains like numerical analysis, combinatorics, optimization, and statistical inference
- Create precise problem statements with clear notation, definitions, and expected outputs
- Develop verification scripts to validate correctness (numerical tolerance, proof validity, algorithm outputs)
- Design decomposition strategies to break problems into parallel or independent sub-tasks
- Evaluate and improve AI model reasoning through structured benchmark creation
- Work with evaluation frameworks and agentic workflows for reasoning-heavy tasks

Requirements
- Strong experience in mathematics, quantitative research, or computational science (competition math or university-level exposure preferred)
- Proficiency in Python (NumPy, SciPy, SymPy or similar libraries)
- Experience writing mathematical proofs, derivations, or formal reasoning workflows
- Ability to create objective, verifiable problems (not subjective or open-ended)
- Familiarity with AI coding benchmarks (SWE-bench, Terminal-bench, etc.)
- Comfortable with Linux/terminal workflows, Git, and development environments
- Experience with Docker (writing Dockerfiles, building images, debugging containers)
- Strong understanding of numerical methods (tolerance, convergence, error bounds)
- High attention to detail and ability to work independently in a structured environment

Application Process
1. Apply / Easy Apply via LinkedIn
2. Fill out the application form shared via email
3. Complete the assessment (post-shortlisting; to be completed within 24 hours)
