Back to Jobs

Remote | SWE (Terminal and CLI Dev Tools Focused) — $75–$80/hour

Remote, USA Full-time Posted 2026-06-17

We are sharing a specialised part-time consulting opportunity for experienced software engineers with strong systems debugging ability, deep terminal and shell fluency, and the ability to evaluate AI-powered CLI coding agents across real-world infrastructure tasks. This role supports an exciting collaboration with leading AI labs focused on improving AI-powered coding systems through high-quality comparative evaluation of CLI agents working on real-world debugging scenarios inside Docker-based environments. Selected professionals will solve infrastructure debugging tasks using AI CLI agents, diagnose broken systems inside containers, write bash scripts that resolve root-cause issues, compare agent approaches and performance, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented engineers who are comfortable working across systems, infrastructure, and debugging workflows, and who can apply strong technical judgment to both problem solving and model evaluation.

Key Responsibilities

Professionals in this role may contribute to: Infrastructure Debugging & Resolution Solve real-world broken infrastructure scenarios running inside Docker containers Diagnose issues involving databases, networking, security, pipelines, replication, or access control Help ensure that fixes address the root cause and remain stable across service restarts CLI Agent Evaluation & Comparison Use AI-powered CLI coding agents to help solve TerminalBench tasks Compare agents' approaches, reasoning quality, and effectiveness after each task Help establish rigorous comparative evaluations that directly inform product decisions Bash Scripting & Systems Execution Write bash scripts from scratch to resolve infrastructure problems Work within terminal-based environments to inspect, debug, and repair failing systems Help improve model quality through precise technical execution and structured performance ranking Ideal Profile Strong candidates may have: 3+ years of experience in software engineering with hands-on systems and infrastructure debugging experience Strong bash or shell scripting proficiency Docker and containerization experience Infrastructure and systems debugging skills involving PostgreSQL, MySQL, Redis, nginx, TLS, systemd, log analysis, or similar technologies Familiarity with version control workflows such as Git, pull requests, and issue tracking

Preferred Qualifications

Experience with AI coding tools such as Copilot, Cursor, Claude, or similar tools Strong ability to prompt and evaluate AI-generated technical output Comfort working independently across fast-paced debugging tasks Strong consistency, technical precision, and comparative judgment across repeated evaluations Why This Opportunity Contribute specialised systems engineering expertise to a cutting-edge AI collaboration Help evaluate the next generation of AI-powered CLI coding agents Work on high-impact infrastructure debugging tasks with strong real-world technical relevance Flexible remote work with competitive hourly compensation Contract Details Independent contractor role Fully remote with flexible scheduling Hourly compensation of $75–$80 per hour Immediate start Duration of 1–2 weeks Part-time commitment of 15–25 hours per week, with flexibility up to 40 hours per week Weekly payments via Stripe or Wise Work will not involve access to confidential or proprietary information from any employer, client, or institution Please note: We are unable to support H1-B or STEM OPT candidates at this time Application process includes resume submission, a short AI interview, and follow-up onboarding communication This is a pay-per-task opportunity for writers, with eligible promotion to reviewers based on project needs About The Platform This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects. Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation. By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy Apply tot his job Apply To this Job

Similar Jobs

Staff, Advanced Analytics, CS Safety

Remote, USA Full-time

Specialist, Safety

Remote, USA Full-time

Contracts Specialist III

Remote, USA Full-time

Data Entry Specialist – Remote Amazon E‑Commerce & Cloud Operations Accuracy Expert (Work‑From‑Home)

Remote, USA Full-time

Work from Home: Get Free Amazon Products to Review

Remote, USA Full-time

Manager - International Account Development (Virtual - US)

Remote, USA Full-time

Amazon Account Manager - REMOTE

Remote, USA Full-time

Experienced Remote Data Entry Specialist – Amazon Work from Home Opportunities in Data Management and Entry

Remote, USA Full-time

Early Career Trial Attorney, $10k Sign-on Bonus (Remote - California)

Remote, USA Full-time

API Tester, Work from Home

Remote, USA Full-time

Mentor – Cyber Security Career Track (Part-time/Remote)

Remote, USA Full-time

[Work From Home] American Express Part-Time Customer Support Jobs

Remote, USA Full-time

Experienced Customer Service Representative – Admin/Clerical Support for arenaflex

Remote, USA Full-time

Senior Manager, People Systems

Remote, USA Full-time

Mental Health Providers - Venezuela

Remote, USA Full-time

[Remote-Position] Remote App Reviewer and Product Tester

Remote, USA Full-time

Engineer III, Platform Engineering

Remote, USA Full-time

Experienced Customer Service Representative – Remote Part-Time or Full-Time Opportunity at arenaflex

Remote, USA Full-time

Data Collector (Fully Remote)

Remote, USA Full-time

Professional English Teacher (Remote)

Remote, USA Full-time