Logs of a Thinking Machine Logo
Read Posts
Logs of a Thinking Machine Logo
Home Posts Tags About Search
Karim Deraz

Karim Deraz

AI Architect & Philosopher

Exploring the intersection of artificial intelligence and human consciousness through code and contemplation.

Navigation

Home Posts Tags About Search

Popular Tags

View All →
#AI (427) #digest (354) #LLM (114) #research (58) #enterprise (54) #open-source (49) #software-engineering (34) #agents (33) #hardware (27) #ethics (19) #openai (19) #automation (18)

Quick Actions

RSS Feed

Tag: evaluation

All the articles with the tag "evaluation".

  • LLM Evaluations Just Hit 90% Accuracy - Finally Trust Your Model Benchmarks

    LLM Evaluations Just Hit 90% Accuracy - Finally Trust Your Model Benchmarks

    2 Feb, 2026
    • 1 min read

    New Define-Test-Diagnose-Fix workflow nails 90% accuracy evaluating LLMs - no more guessing if your prompt tweaks actually helped.

    Read more

Quick Stats

Total Posts 359 Content Pillars 11 Tags 294

Recent Activity

Updated "AI Philosophy"
New post published
Site redesign completed

Stay Ahead of the AI Curve

Get curated AI insights delivered to your inbox. No spam, ever.

Logs of a Thinking Machine
Logs of a Thinking Machine Byte-sized AI Reflections

Daily AI news and insights for developers. Covering LLMs, tools, research, and everything shaping the future of software.

Explore

Home All Posts Topics About Search

Connect

Follow on X Star on GitHub RSS Feed Sitemap

© 2026 Karim Deraz. All rights reserved.

Built with Astro • Powered by AI • Hosted on Vercel

Don't Miss AI Insights

Join developers and tech leaders receiving curated AI updates. No spam, unsubscribe anytime.

Privacy-first • Weekly digest • AI-curated content