Design, build, and maintain scalable data pipelines and architectures. Oversee AI-enhanced business intelligence and LLM initiatives while collaborating with stakeholders.
This is a full-time work from home opportunity for a star Data/ML Engineer from LATAM.
IDT(www.idt.net) is an American telecommunications company founded in 1990 and headquartered in New Jersey. Today it is an industry leader in prepaid communication and payment services and one of the world’s largest international voice carriers. We are listed on the NYSE, employ over 1300 people across 20+ countries, and have revenues in excess of $1.5 billion.
We are looking for a skilled Data/ML Engineer to join our BI team and take an active role in designing, building, and maintaining the end-to-end data pipeline, architecture and design that powers our warehouse, LLM-driven applications, and AI-based BI. If you're looking for a company that will give you the maximum flexibility in choosing a location to work, this opportunity is for you!
Responsibilities:
- Design, develop, and maintain scalable data pipelines to support ingestion, transformation, and delivery into centralized feature stores, model-training workflows, and real-time inference services.
- Build and optimize workflows for extracting, storing, and retrieving semantic representations of unstructured data to enable advanced search and retrieval patterns.
- Architect and implement lightweight analytics and dashboarding solutions that deliver natural language query experience and AI-backed insights.
- Define and execute processes for managing prompt engineering techniques, orchestration flows, and model fine-tuning routines to power conversational interfaces.
- Oversee vector data stores and develop efficient indexing methodologies to support retrieval-augmented generation (RAG) workflows.
- Partner with data stakeholders to gather requirements for language-model initiatives and translate into scalable solutions.
- Create and maintain comprehensive documentation for all data processes, workflows and model deployment routines.
- Should be willing to stay informed and learn emerging methodologies in data engineering, MLOps and LLM operations.
Requirements:
- 8+ years of experience as a Data Engineer with 2+ years focused on MLOps.
- Excellent English communication skills.
- Effective oral and written communication skills with BI team and user community.
- Demonstrated experience in utilizing python for data engineering tasks, including transformation, advanced data manipulation, and large-scale data processing.
- Deep understanding of vector databases and RAG architectures, and how they drive semantic retrieval workflows.
- Skilled at integrating open-source LLM frameworks into data engineering workflows for end-to-end model training, customization, and scalable inference.
- Experience with cloud platforms like AWS or Azure Machine Learning for managed LLM deployments.
- Hands-on experience with big data technologies including Apache Spark, Hadoop, and Kafka for distributed processing and real-time data ingestion.
- Experience designing complex data pipelines extracting data from RDBMS, JSON, API and Flat file sources.
- Demonstrated skills in SQL and PLSQL programming, with advanced mastery in Business Intelligence and data warehouse methodologies, along with hands-on experience in one or more relational database systems and cloud-based database services such as Snowflake/Redshift.
- Understanding of software engineering principles and skills working on Unix/Linux/Windows Operating systems, and experience with Agile methodologies.
- Proficiency in version control systems, with experience in managing code repositories, branching, merging, and collaborating within a distributed development environment.
- Interest in business operations and comprehensive understanding of how robust BI systems drive corporate profitability by enabling data-driven decision-making and strategic insights.
Pluses
- Experience with vector databases such as DataStax AstraDB, and developing LLM-powered applications using popular open source frameworks like LangChain and LlamaIndex–including prompt engineering, retrieval-augmented generation (RAG), and orchestration of intelligent workflows.
- Familiarity with evaluating and integrating open-source LLM frameworks–such as Hugging Face Transformers/LLaMA-4 across end-to-end workflows, including fine-tuning and inference optimization.
- Knowledge of MLOps tooling and CI/CD pipelines to manage model versioning and automated deployments.
Please attach CV in English.
The interview process will be conducted in English.
Only accepting applicants from LATAM.
Top Skills
Spark
AWS
Azure Machine Learning
Datastax Astradb
Hadoop
Hugging Face Transformers
Kafka
Langchain
Llamaindex
Plsql
Python
Redshift
Snowflake
SQL
Similar Jobs
Blockchain • Internet of Things • Payments • Cryptocurrency • Web3
As a Staff Software Engineer, you'll build scalable software for Data Products, improve architecture, and lead teams in decentralized infrastructure development.
Top Skills:
AWSC++GCPGoJavaKafkaPostgresPythonTerraformTypescript
Blockchain • Internet of Things • Payments • Cryptocurrency • Web3
As a Senior Software Engineer, you will develop scalable software for tokenization, design APIs, and enhance blockchain interoperability, while collaborating with an expert team.
Top Skills:
APIsBlockchainGoSmart ContractsSolidityWeb 3.0
Blockchain • Internet of Things • Payments • Cryptocurrency • Web3
As a Software Engineer, you will automate node deployments, enhance platform scalability and security while collaborating with different teams to integrate products seamlessly.
Top Skills:
BlockchainGoSolidityTypescriptWeb3
What you need to know about the Sydney Tech Scene
From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.