In yesterday’s post, I calculated the profitability of public software companies. To calculate these figures, I built a little Rube Goldberg machine.
I didn’t download the data into Excel. Instead, I complexified things by sending the analysis to four AIs to see if they would agree.
The inspiration : many companies have used Amazon’s Mechanical Turk to crowdsource tasks, & pick a consensus answer across three workers to improve accuracy.
Why not try this across four AI workers instead?
Prompt : “calculate the average net income margin and cash flow from ops margin from this data set” plus the data set. Note that CFOM isn’t a simple average but requires dividing cash flow from ops by revenue beforehand.
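To make the extra division step concrete, here is a minimal Python sketch. The column names (`net_income_margin_pct`, `cash_flow_from_ops`, `revenue`) and the two sample rows are hypothetical, invented purely for illustration; they are not the data set from the post.

```python
# Hypothetical rows; the field names and values are illustrative assumptions.
companies = [
    {"net_income_margin_pct": -10.0, "cash_flow_from_ops": 20.0, "revenue": 100.0},
    {"net_income_margin_pct": 5.0, "cash_flow_from_ops": 30.0, "revenue": 200.0},
]


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def average_nim(rows):
    """NIM is already a percentage per company, so a simple average suffices."""
    return mean([row["net_income_margin_pct"] for row in rows])


def average_cfom(rows):
    """CFOM needs the division first: cash flow from ops / revenue per company."""
    margins = [row["cash_flow_from_ops"] / row["revenue"] * 100 for row in rows]
    return mean(margins)


def naive_cfom(rows):
    """The mistake: averaging raw cash flow without dividing by revenue."""
    return mean([row["cash_flow_from_ops"] for row in rows])


avg_nim = average_nim(companies)
avg_cfom = average_cfom(companies)
wrong_cfom = naive_cfom(companies)
```

Skipping the division returns an average cash flow figure in currency units rather than a margin in percent, which is the kind of slip that can produce an implausibly large number.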
Model | NIM, % | CFOM, % |
---|---|---|
Claude | 4.99 | 27.31 |
Gemini | -9.29 | 16.2 |
Perplexity | -8.67 | 14.4 |
ChatGPT | -9.29 | 1,433.01 / 14.9% |
My Analysis | -9.29 | 16.2 |
Gemini scored top marks for tabulating correctly on both columns. ChatGPT did well with NIM but “forgot” to complete the additional division step, which I corrected with a follow-up, but it still didn’t produce the right figure. The other systems missed the mark altogether.
It would be a mistake to draw any broad conclusions from my little experiment.
But in this case, consensus doesn’t yet work as a technique, which means I still need to double-check calculations myself.
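As a rough illustration of why consensus fails here, the sketch below applies a Mechanical Turk style vote to the four AI answers from the table. The `consensus` helper, the exact-match rule after rounding, and the three-vote threshold are my assumptions for the sketch, not a description of how Mechanical Turk or any of these tools work. ChatGPT’s CFOM uses its corrected follow-up figure.

```python
from collections import Counter


def consensus(answers, min_votes=3, ndigits=2):
    """Return the value that at least `min_votes` models agree on, else None.

    Agreement means an exact match after rounding to `ndigits` decimals.
    """
    counts = Counter(round(value, ndigits) for value in answers.values())
    value, votes = counts.most_common(1)[0]
    return value if votes >= min_votes else None


# Figures taken from the table above (percent).
nim = {"Claude": 4.99, "Gemini": -9.29, "Perplexity": -8.67, "ChatGPT": -9.29}
cfom = {"Claude": 27.31, "Gemini": 16.2, "Perplexity": 14.4, "ChatGPT": 14.9}

nim_consensus = consensus(nim)                # only two models agree, so None
nim_pair = consensus(nim, min_votes=2)        # a looser threshold finds -9.29
cfom_consensus = consensus(cfom)              # no two models agree, so None
```

Only two of the four models match on NIM, and none match on CFOM, so a three-vote rule yields no answer for either column.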
At some point, AI will mechanize the illusory Mechanical Turk & I’ll restart my Rube Goldberg math machine with confidence.