Insights

AI Tech Blog

We build, test, and fine-tune AI solutions every day – and this is where we share our insights, experiences, and technical deep dives. No corporate fluff, no overpromises. Just real reflections on what works (and what doesn’t) when developing and deploying large language models (LLMs), optimizing AI workflows, and scaling machine learning systems.

Welcome to a blog written by data scientists, for data scientists – and for anyone curious about the real-world challenges of AI engineering, model evaluation, and practical MLOps.

Featured

Evaluating and Testing Large Language Models

Hampus Gustavsson, Senior Data Scientist at Todai

All software must pass a thorough test suite before it is released to production, and large language models (LLMs) are no exception. However, testing LLMs is not as straightforward as testing traditional software or even classical machine learning models.
