12.1 C
New York
Tuesday, April 15, 2025

My AI Rube Goldberg Machine by @ttunguz


In yesterday’s submit, I calculated the profitability of public software program corporations. To calculate these figures, I constructed a little bit Rube Goldberg machine.

I didn’t obtain the information into Excel. As a substitute, I complexified issues by sending the evaluation to 4 AIs to see if they might agree.

The inspiration : many corporations have used Amazon’s Mechanical Turk to crowdsource duties, & choose a consensus reply throughout three staff to enhance accuracy.

Why not do that throughout 4 AI staff as a substitute?

Immediate : “calculate the common internet revenue margin and money circulate from ops margin from this knowledge set” plus the information set. Notice that CFOM isn’t a easy common however requires dividing money circulate from ops by income beforehand.

Mannequin NIM, % CFOM, %
Claude 4.99 27.31
Gemini -9.29 16.2
Perplexity -8.67 14.4
ChatGPT – 9.29 1,433.01. / 14.9%
My Evaluation -9.29 16.2

Gemini scored prime marks for tabulating accurately on each columns. ChatGPT did nicely with NIM however “forgot” to finish the extra division step, which I corrected with a comply with up, however nonetheless not the suitable determine. The opposite techniques missed the mark altogether.

It will be a mistake to attract any broad conclusions from my little experiment.

However on this case, consensus doesn’t but work as a method which implies I nonetheless must double test calculations myself.

In some unspecified time in the future, AI will mechanize the illusory Mechanical Turk & I’ll restart my Rube Goldberg math machine with confidence.

Related Articles

Latest Articles