Do you want to help raise the overall AI maturity across DNB?

DNB

Dronning Eufemias gate 30, 0191 Oslo

About the job

Job title: AI Evaluation Engineer
Employment type: Permanent, full-time 100%
Number of positions: 1
Working language: English

Apply by Sunday 26 April

People are the very DNA of DNB. Since 1822, bright minds have worked together to find the best solutions for our customers. Today, DNB is much more than Norway's largest bank - we are a technology-driven financial institution that continuously connects people and ideas to knowledge and capital in new ways. Diversity is part of who we are, and inclusion is something we actively choose every single day. We promise to do our best to make you feel at home. A job at Norway's largest financial group offers professional challenges in an exciting work environment with many opportunities for development.

AI Evaluation Engineer

About us
AI Tech is DNB's new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact. We bring together deep technical expertise, modern AI platforms, and hands‑on delivery to scale agentic AI across the group. We move fast, learn fast, and deliver real outcomes.

We expect every member of AI Tech to be a role model in the everyday use of AI: using AI proactively in coding, automating tasks, improving documentation, and accelerating problem‑solving. You help raise the overall AI maturity across DNB by demonstrating what AI‑first engineering looks like.

What you will do

Designing evaluation frameworks for agentic systems that ensure quality, coverage, safety, efficiency, and regulatory compliance
Building LLM-as-judge pipelines: prompt design, calibration, consistency validation
Integrating evaluation into CI/CD: automated regression detection when context (prompts, data sources, etc.), models, or services change
Running evaluation experiments when there are relevant changes or additions
Constructing and maintaining evaluation datasets (inputs, ground truth, etc.)
Working with domain experts to translate requirements into measurable criteria
Developing shared evaluation tooling and patterns that teams across DNB can adopt

Background

Relevant background from evaluating AI / ML systems
Proficiency in Python for experiments and evaluation
Knowledge of observability, logging, and dataset curation
Experience running structured experiments spanning many roles
Familiarity with AI tooling for developer productivity (Claude Code, Copilot, etc.)
Experience with agentic evaluation techniques (using code, LLMs, and humans)
Experience with agentic evaluation solutions (AgentCore, Foundry, MLflow, etc.)

Tech stack example:
Python, Strands, AWS AgentCore, AWS Bedrock, MCP, MLflow, OTel, Docker, GitHub Actions

What you bring
You bring an evidence-driven mindset and back your claims and decisions with data and numbers. You act as a role model for rigorous and responsible AI testing, setting a high bar for quality, safety, and trustworthiness in everything you build. You communicate and collaborate effectively across roles and teams, and you take true ownership of the team's goals.

What's in it for you?
You'll work on challenging and meaningful tasks in a strong engineering culture with solid opportunities for professional growth and career development. We offer attractive pension and insurance schemes, as well as employee benefits on DNB's products. You'll also have access to company cabins across Norway, sports, cultural and social activities, and a wide range of employee discounts. We support flexibility in everyday work through flexible working hours, a hybrid way of working, extra days off, and reduced working hours from May to August (summertime).

Contact Persons:
Mari Sand Frogner, Service Owner | Mari.Sand.Frogner@dnb.no | +47 41475454
Olav Lognvik, Engineering Manager | Olav.Lognvik@dnb.no | +47 40019700
Application Deadline: 26.04.26

In this job application process, you only need to upload your resume (CV) and briefly answer some job-related questions. A cover letter is not required, but you can upload one as an attachment if desired.

In DNB, we carry out background checks to verify that the information provided in your CV and other documentation is correct. Background checks are generally performed by an external independent third party. Former employers are typically contacted to check previous positions and periods of employment, while educational institutions are asked to confirm marks. No background check will be conducted without your prior consent, and you will receive more detailed information about this, if applicable. For positions that require an authorisation and/or approval of suitability, a police certificate of good conduct will be required.

About the company

A job at Norway's largest financial group offers professional challenges in an exciting work environment with many opportunities. We need employees with diverse backgrounds and skills. Do you want to join the team?

Sector: Private
Website: http://www.dnb.no/

Ad details

Job ID: 314c1a1a-d0cf-426e-9579-98cb9660d44e
Last modified: 20 April 2026
Source: FINN
Reference: 460023351
