CoinMarketCap Logo

CoinMarketCap

LLM Algorithm Engineer

Reposted 16 Days Ago
Be an Early Applicant
In-Office or Remote
13 Locations
Mid level
In-Office or Remote
13 Locations
Mid level
The role involves post-training of LLMs, model alignment, server operation for checkpoint routing, and building evaluation pipelines.
The summary above was generated by AI
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.

Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.

Top Skills

Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm

Similar Jobs

7 Hours Ago
Remote or Hybrid
6 Locations
Mid level
Mid level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Virtual Sales Representative promotes health products via digital platforms, builds customer relationships, and collaborates with teams to achieve sales goals while ensuring compliance and customer support.
Top Skills: Microsoft TeamsSalesforceVeeva EngageZoom
10 Hours Ago
Remote or Hybrid
2 Locations
Senior level
Senior level
Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
The Software Solution Architect supports sales by providing technical expertise, building customer relationships, delivering presentations, and ensuring effective implementation of software solutions.
Top Skills: Crm SoftwareMS OfficeSoftware ArchitectureSoftware Engineering
12 Hours Ago
Remote or Hybrid
8 Locations
Expert/Leader
Expert/Leader
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Staff Frontend Engineer leads technical initiatives, designs user-facing features, mentors engineers, and drives architectural direction for the Activation and Engagement Web team to support Square's Growth strategies.
Top Skills: GraphQLGrpcJavaJavaScriptJestNext.JsNode.jsPlaywrightProtocol BuffersReactRestTypescript

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account