re:cinq Logore:cinq
Evals, reducing hallucinations, & AI-native development
0:000:00
PodcastJanuary 29, 2026

Evals, reducing hallucinations, & AI-native development

AGENTIC WORKFLOWSAI EVALSDOCUMENTATION REGISTRYMODEL HALLUCINATIONSAI NATIVESYSTEM STEERING

In this episode, Deejay sits down with Amy Heineike, founding AI engineer at TESSL, to explore the structural shift toward AI-native development. They discuss the necessity of machine-optimized documentation registries to eliminate agent hallucinations and the cultural transition from deterministic logic to a biological science mindset. Amy details the mechanics of building evaluation harnesses, the pitfalls of contradictory steering, and how the role of the software engineer is evolving into a high-level architect of intentional outcomes and anti-fragile systems.

Hosted by:
Deejay
Featuring:
Amy Heineike, Tessl

Episode Transcript

Daniel Jones (00:00) Amy Heineike, founding AI engineer at TESSL. ⁓ It's great to have you with us. What are you and the folks at TESSL doing at the moment? Amy (00:07) ⁓ hi, Daniel. great to be here. Yeah. So we are building tools to help people who are using coding agents every day. So we've rele...

Episode Highlights

  • TESL builds documentation registries to ground coding agents and stop API hallucinations in enterprise environments.
  • Moving to AI-native development requires shifting from deterministic logic to biological science and probabilistic experimentation.
  • Evaluations measure agent success across baskets of scenarios rather than traditional binary pass-fail unit tests.
  • Hyper-detailed task prompts paradoxically trigger models to ignore broader system instructions and core steering rules.
  • The software engineering role is evolving into a Product Engineer focused on high-level intentional outcomes.
  • System non-determinism acts as a feature enabling anti-fragility and escapes from logical local maxima.
  • Multi-pass agentic loops manage distinct concerns like security and performance more effectively than single prompts.