Is AI Getting Less Reliable? Check Its Math!

The latest update of ChatGPT-4o is having issues. I was working with it yesterday, researching and streamlining a couple of cosmetic formulas for some test batches. Repeatedly, it couldn't get the percentages to add up to 100%; the totals came to 96, 97.5, and 102. Each time I brought the issue to its attention, the bot apologized and then repeated the mistake. This went on for well over an hour. I use the paid version, not the free one, daily for editing, brainstorming, and content creation.

Here's one example:

I pointed out an issue: 72.5% + 18% + 5.5% = 96%, not 100%.

Response: Errors Identified:

  1. The total formula I provided previously adds up to 96%, not 100%. The missing 4% needs to be allocated correctly.
  2. Later, I incorrectly stated the oil phase as 23% instead of 18% and dropped 0.5% in the totals, creating confusion.
  3. The formula must be re-balanced to total 100% without these errors.
  • Water Phase: 75%
  • Oil Phase: 18%
  • Cool-Down Phase: 6%

Grand Total: 100%

The problem? These add up to 99%.
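A check like this takes only a few lines of code. The following is a minimal sketch, not part of my original exchange with the bot: the function name `check_total` is made up for illustration, and the figures are the two sets of percentages quoted above. It uses Python's `Decimal` type so that values like 72.5 and 5.5 add up exactly.

```python
from decimal import Decimal

def check_total(percentages, target=Decimal("100")):
    """Return (total, amount missing from target) for a list of percentages."""
    total = sum((Decimal(str(p)) for p in percentages), Decimal("0"))
    return total, target - total

# The two sets of figures quoted in the example above.
formulas = {
    "first formula": ["72.5", "18", "5.5"],
    "corrected formula": ["75", "18", "6"],
}

for name, percentages in formulas.items():
    total, missing = check_total(percentages)
    status = "OK" if missing == 0 else f"off by {missing}%"
    print(f"{name}: total = {total}% ({status})")
```

Run against these figures, neither set reaches 100%: the first comes to 96% and the "corrected" one to 99%. Pasting any formula the AI proposes into a check like this, or into a spreadsheet SUM, catches the error before a test batch is mixed.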

When we finally got the formulas to add up correctly, I asked the bot how to submit the issue to OpenAI. GPT-4o happily assisted me and thanked me. I followed its directions and submitted the transcript with the issues highlighted, along with a message to OpenAI.

I'm so glad I wasn't making medicine or doing something critical. We aren't going to stop using AI, but we need to be aware of the glitches, double-check any mathematical calculations, and fact-check the content.

I received an email back from OpenAI.

Note: Copyleaks AI Detector says the response is 100% written by AI, although there is a human name at the close. I will let the quality of the writing, or AI quality, speak for itself.

Here is what OpenAI replied:

Thank you for reaching out to OpenAI support regarding the outputs you received from our model. We understand that it can be frustrating when the results don't meet your expectations.

Kindly note that our models aim to provide helpful and accurate information, but they have certain limitations. The output can vary and might not always align with previous results or personal expectations. This variability is due to the complex nature of AI and its dependency on diverse and extensive training data. For more detailed information, you may find this article helpful: Does ChatGPT always tell the truth?

When it comes to image inputs, our models strive to interpret and respond to the content accurately. However, there are specific challenges, such as handling text in non-Latin alphabets or images with complex visual elements. Please refer to this FAQ for more insights: Image Inputs for ChatGPT FAQ

We are continually working to improve our models and appreciate your feedback as it helps us identify areas for enhancement. If you have specific examples or further questions, please do not hesitate to provide them, as they are invaluable for our ongoing development efforts.

Thoughts and considerations

The email raises more questions than it answers. I know AI is fallible and needs to be fact-checked. That's why I was checking each total it provided. I've researched AI and written two ebooks on its use. I've taught courses on best practices. But discovering that it can no longer be trusted to add correctly was a bit of a shock.

How can it reliably be used in science, medicine, or business automation? I know many people who use it to generate content. If it can't do addition, how accurate is the content?

I also received feedback from others on Facebook. One IT friend asked Google's AI what time a stock car race was. The response was 8 PM EST and 2 PM PDT. [Eastern and Pacific time are only 3 hours apart, not the 6 hours that answer implies. That could cause you to miss a race, game, or flight.]
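A time conversion like this is just as easy to verify. Here is a minimal sketch using Python's standard `datetime` module. The date is hypothetical, since only the 8 PM Eastern start time comes from my friend's example, and the fixed offsets assume standard time on both coasts (daylight saving time moves each by one hour, but the gap between them stays the same).

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC offsets for standard time (an assumption; DST shifts each by one hour).
EASTERN = timezone(timedelta(hours=-5), "EST")
PACIFIC = timezone(timedelta(hours=-8), "PST")

def to_pacific(eastern_time):
    """Convert a timezone-aware Eastern datetime to Pacific time."""
    return eastern_time.astimezone(PACIFIC)

# Hypothetical date; only the 8 PM start time comes from the example.
race_start = datetime(2025, 1, 18, 20, 0, tzinfo=EASTERN)
pacific_start = to_pacific(race_start)

claimed_hour = 14  # the 2 PM Pacific time the AI reported
print("Converted:", pacific_start.strftime("%I:%M %p %Z"))
print("Matches the AI's answer:", pacific_start.hour == claimed_hour)
```

The converted start time is 5 PM Pacific, so the 2 PM answer does not match.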

What about students or copy/content writers using AI for research?

What about solopreneurs, stressed about running their businesses and depending on AI to make their lives easier?

How flawed is what they are putting out there? Are they fact-checking and editing that content for accuracy? How much extra time is needed to validate everything provided?

Recommendations

There are both generalist and specialist AI tools. Generalist AIs are like your primary care physician. Specialist AIs are like neurosurgeons, cardiologists, or orthopedists: they focus on one area. So, users are best served when the right AI is employed for the task. Do you need a generalist like ChatGPT, a virtual assistant, a sales tool, a social media manager, or a writing tool like Jasper? Or do you need an app that specializes in images?

If you're considering new AI tools, look for free trials. Then, test the ability to provide accurate responses. Note that the quality of your responses depends on the quality of your prompt.

The new norm is subscription-based software, so it's essential to evaluate costs. While an annual plan may be cheaper, you may get trapped in a non-refundable year with a tool you don't like. If the free trial isn't long enough, pay by the month at first to make sure you want to keep that tool.

Remember to check out the Chrome extensions. The Chrome store offers multiple AI tools, many of which are free. These can assist you with edits, grammar, note-taking, screen recordings, and other productivity-related tasks.

Consider bundled tools. Some providers offer a collection of AI tools. Poe.ai is one of these. It offers access to dozens of AI tools. It has package options in the Apple store, the Google Play store, and on the Poe website. The latter has the most diverse options. There are some caveats, so check out the help articles before you buy. With the free and less expensive options, how much you can do with a tool is limited, and you have to upgrade to do more. [No, I am not an affiliate. As a consultant, I retain my independence.]

Another option is to get support to find what you need. AI guides and consultants are popping up everywhere. If you need assistance, feel free to message me.

Look for a consultant who asks questions and learns about you and your needs before making any offers.

© 2025 Judith Culp Pearson Wellness Copy | All Rights Reserved | A Division of Culp Pearson Enterprises, LLC.