About Us
What if your work could drive change in a globally established industry, shaping processes that touch every corner of the world? At Forto, we are at the forefront of change, harnessing the power of AI to revolutionise logistics. We want to reinvent digital supply chains to be transparent, frictionless and sustainable. From day one, our mission has been to simplify global trade – creating a seamless and efficient logistics process.
Your Role & Mission
As a Data Scientist in the Data Science team at Forto, you will take ownership of production ML systems that extract structured intelligence from unstructured logistics data. You will be the first dedicated DS engineer on a rebuilding team, working closely with the Engineering Manager across three core workstreams: document data extraction (FlashDoc), vocabulary mapping, and rate sheet parsing, while using a combination of LLMs, custom models, and rule-based postprocessing. Your immediate priority is ensuring continuity of existing production systems, but equally important is driving step-change improvements in accuracy through disruptive methods and new technologies when the opportunity arises. Beyond document automation, the team's roadmap extends into traditional data science territory: demand forecasting, churn prediction, route optimization, and predictive analytics for logistics operations.
What You Will Do
Design, build, and maintain end-to-end ML pipelines for document extraction, classification, and data enrichment in production.
Develop and improve LLM-based extraction systems for complex logistics documents (packing lists, booking confirmations, invoices, rate sheets).
Build prompt evaluation frameworks and feedback-based optimization loops to systematically improve extraction accuracy.
Train custom in-house models using human-in-the-loop (HITL) data to move from assisted to fully automated extraction.
Build and maintain semantic similarity models that map free text to standardized TMS vocabulary across ports, terminals, container types, legal entities, and line items.
Contribute to rate sheet extraction: building carrier-specific parsing logic, postprocessing, and multi-file combination logic.
Improve pipeline reliability through redesign, testing, monitoring, and alerting for non-deterministic ML systems.
Evaluate and introduce disruptive approaches (new model architectures, fine-tuning strategies, novel evaluation methods) to achieve step-change accuracy improvements when incremental optimization plateaus.
Scope and build out the team's next generation of DS workstreams beyond document automation: demand forecasting, churn prediction, route optimization, and other predictive analytics use cases for Commercial and Logistics teams.
Partner with Product Managers to identify where DS can solve real user pain points, proactively surface opportunities from the data, and shape product roadmaps with a data-informed perspective.
Collaborate closely with Engineering teams on integration, infrastructure, and API design to ensure DS outputs are consumed reliably by downstream systems.
Manage stakeholder expectations: communicate what is feasible given capacity, set realistic timelines, flag risks early, and negotiate prioritization trade-offs across teams.
Required Skills and Experience
3+ years of professional experience in data science or machine learning engineering;
Ability to design, deploy, and maintain ML systems in production. This goes beyond model development and includes pipeline architecture, monitoring, reliability, and handling non-deterministic outputs at scale;
Ability to get up to speed quickly with new tools, technologies, and problem spaces;
Proficient use of agentic coding tools;
Strong proficiency in Python;
Hands-on experience with LLMs (prompting, fine-tuning, evaluation) and understanding of their limitations in production environments;
Strong foundation in classical data science and statistics: regression, classification, time series analysis, data leakage, experimental design, and hypothesis testing;
Strong analytical and problem-solving skills;
Strong stakeholder management skills.
Preferred Skills and Experience
Experience in logistics, supply chain, or freight forwarding domains;
Experience working directly with Product Managers and Engineering teams;
Familiarity with semantic similarity and entity resolution techniques;
Experience with human-in-the-loop (HITL) workflows and designing feedback loops for model improvement;
Experience with demand forecasting, time series modeling, or churn prediction in a business context;
Experience working in low-data-volume settings;
Experience with route or network optimization (cost, risk, or profitability modeling).
Don’t fit all of our criteria? That’s okay! We know that you might be hesitant to apply if you don’t meet all our requirements, but here at Forto, we pride ourselves on embracing diverse perspectives and celebrating potential. If you are passionate about this position and the Forto values, please apply anyway. There could be a place for you in this role - or another one that’s a perfect fit!
Why work with us?
Our team is hard-working, constantly seeking to maximise the impact of their work, but we put our people first, always winning with care. We value efficient systems and swift, direct communication. We want everyone to have their time to speak, so that diverse perspectives can always help drive us towards solutions.