Tech News : OpenAI : Powerful Voice Building Updates

ChatGPT’s creator and Microsoft-backed start-up OpenAI has announced the introduction of its new (beta) Realtime API tool which enables developers to create AI voice applications using a single set of instructions.

Multiple Steps Now Reduced To One Step 

The ‘Realtime API’ tool simplifies the process of creating AI-driven voice applications by integrating what used to be multiple steps – speech recognition, text generation, and speech synthesis – into a single API call.

Previously 

Previously, developers creating voice assistants had to navigate a multi-step process, starting with transcribing audio using automatic speech recognition tools like Whisper, then passing the text to a language model for processing and generating responses, and finally converting the output back to speech using a separate text-to-speech model. This approach often led to issues such as the loss of emotional nuance, accents, and emphasis, while also introducing noticeable latency that made the interaction slower and less natural than human conversation.

The Benefits 

The ability of the new Realtime API tool to reduce the process to a single API call significantly improves efficiency by lowering latency, preserving the natural flow of conversation, and simplifying development, enabling faster and more seamless voice interactions.

How Does Realtime API Work? 

The Realtime API tool works by establishing a persistent WebSocket connection that allows seamless message exchange with OpenAI’s GPT-4o model. This enables real-time, continuous communication, making it particularly useful for voice assistant applications. The API supports function calling, allowing the voice assistant to perform actions like placing orders or retrieving user-specific information for personalised responses. For example, a voice assistant could pull up a customer’s profile to tailor its conversation or execute tasks based on user input without switching between multiple models or systems, thereby streamlining the interaction for faster, more natural experiences.

Business Benefit for OpenAI 

Also, OpenAI’s rollout of advanced tools like the Realtime API is crucial for businesses that rely on its services to develop AI applications, which contribute significantly to OpenAI’s revenue. Creating a tool that makes it easier for companies to create efficient, cutting-edge solutions, reducing costs and development time therefore also helps OpenAI to retain clients and attract new business in a competitive market.

When And How Much? 

OpenAI says Realtime API began rolling out October 1 in public beta to all paid developers.

Pricewise, OpenAI says the Realtime API uses both text tokens and audio tokens, with text input tokens priced at $5 per 1M and $20 per 1M output tokens. Audio input is priced at $100 per 1M tokens and output is $200 per 1M tokens. OpenAI says this equates to approximately $0.06 per minute of audio input and $0.24 per minute of audio output.

How Is It For Privacy and Security? 

The Realtime API ensures safety and privacy through multiple layers of protection, including automated monitoring and human review of flagged inputs. It uses the same audio safety infrastructure as ChatGPT’s Advanced Voice Mode and OpenAI says it’s been rigorously tested to prevent high-risk gaps. OpenAI says it enforces strict usage policies, prohibiting harmful use like spam, and requires transparency in AI interactions and that user data is not used for model training without explicit permission.

How Can Developers Try It? 

OpenAI says, to get started with the Realtime API, developers can begin building by accessing the Playground (OpenAI’s web-based testing environment), using OpenAI’s documentation, and the reference client. Also, OpenAI says client libraries for essential audio components like echo cancellation and sound isolation have been developed in collaboration with LiveKit and Agora, and Twilio has also integrated the Realtime API with its Voice APIs, allowing seamless deployment of AI virtual agents for voice interactions.

Future Plans For Realtime AI 

Looking ahead, OpenAI plans to expand the Realtime API by adding new capabilities. Initially focused on voice, future updates will introduce additional modalities such as vision and video. They also plan to increase rate limits to accommodate larger deployments and integrate official SDK support for Python and Node.js. Other upcoming features include prompt caching to reduce costs and support for GPT-4o mini, enabling developers to create even more efficient application.

Other Very Good News For OpenAI 

It seems that introducing Realtime AI isn’t the only thing that OpenAI’s got to be pleased about at the moment following the news that OpenAI has nearly doubled its valuation to an eye-watering $157 billion after a (complex, multiple negotiations) funding round where it raised $6.6 billion from backers including Microsoft, SoftBank and Thrive Capital. However, as part of the deal, OpenAI’s investors can withdraw their funds if OpenAI doesn’t convert into a for-profit firm within two years.

What Does This Mean For Your Business? 

As OpenAI rolls out its Realtime API, the company is taking a significant step toward streamlining AI voice application development. By consolidating multiple tasks (speech recognition, language generation, and speech synthesis) into a single API call, OpenAI not only reduces complexity for developers but also greatly improves the naturalness and fluidity of real-time conversations. This efficiency will likely appeal to developers and businesses alike, who can now create more responsive and context-aware voice applications while saving time and resources.

Also, OpenAI’s apparent focus on privacy and security, combined with what appears to be a transparent pricing model, reflects a commitment to building trust with its users. For example, things like layered security protections, strict usage policies, and clear guidelines for AI interaction transparency are likely to reassure developers and end-users alike, particularly in a climate where data privacy is of paramount concern. OpenAI’s collaboration with key partners like Twilio, along with plans for future expansion into modalities such as vision and video, show the company’s forward-thinking approach and ambition to stay ahead of the competition at the forefront of AI technology.

For businesses, this means not only quicker deployment of voice-driven applications but also the potential for more personalised and effective customer interactions, paving the way for innovation across industries. With these advancements, the Realtime API could become a key tool for those looking to integrate sophisticated AI-driven voice solutions into their workflows, setting a new standard for efficiency in voice AI applications.

An Apple Byte : Microsoft Office 2024 For Macs (Subs-Free!)

Microsoft is now offering a no-subscription, standalone version of Microsoft Office 2024, available as a “one-time purchase for 1 PC or Mac.” 

As a one-time purchase version of Microsoft Office, this provides permanent access to Microsoft’s classic productivity suite, representing a longer-term cost-saving alternative to the subscription-based Microsoft 365.

Two versions of the one-time purchase Microsoft Office 2024 for Mac are now available:

1. Office Home 2024 is priced at £129.99, including Word, Excel, PowerPoint, and OneNote (for non-commercial use).

2. Office Home & Business 2024 is priced at £249.99, which includes everything in the Home version plus Outlook and rights for commercial use.

Both require a Microsoft account and an internet connection.

Unlike Microsoft 365, which includes cloud-based services, this standalone version does not provide access to services such as 1 TB of OneDrive storage, 60 Skype minutes per month, Microsoft chat and phone support, or regular feature updates.

However, the Office Home & Business 2024 version offers added functionality for those needing Outlook or the ability to use Office apps commercially.

As Microsoft highlights, “one-time purchases don’t have an upgrade option, which means if you plan to upgrade to the next major release, you’ll have to buy it at full price”. 

This makes it a more cost-effective choice upfront for users, though Mac users will need to consider whether they prefer the lower initial cost over the ongoing updates and services available with a Microsoft 365 subscription.

Security Stop Press : Beware of Iranian-Backed Spear Phishing Attacks

The UK’s National Cyber Security Centre (NCSC) and its US counterparts have issued a high-alert warning over a surge in spear-phishing attacks backed by Iran.

These attacks, linked to Iran’s Islamic Revolutionary Guard Corps (IRGC), are reportedly targeting politicians, journalists, activists, and others involved in Middle Eastern affairs.

The attackers are using social engineering techniques, impersonating trusted figures (e.g. colleagues or journalists) via email and messaging platforms to gain victims’ trust. The goal is to trick individuals into providing email credentials through fake log-in pages, thereby granting the attackers full access to sensitive communications and data. This poses significant risks not only to personal and business accounts but also to national security, as these compromised emails may be used to further cyber-espionage efforts.

The NCSC warns that these spear-phishing campaigns are escalating amidst geopolitical tensions, particularly as investigations into Iranian cyber operations grow. Recent developments include the indictment of three IRGC members in the US for hacking attempts related to political campaigns.

The advice is to implement multi-factor authentication (MFA), regularly update cybersecurity protocols, and to remain vigilant against unsolicited communications. Proactive steps, such as using the NCSC’s cyber defence tools, may also help mitigate the threat.

Sustainability-in-Tech : Taking Carbon Out Of … Seawater!

Amsterdam-based startup Brineworks is using innovative seawater electrolysis technology to tackle the issue of carbon emissions and reshape the landscape of carbon capture and sustainable fuel production.

In A Nutshell – Leveraging The Potential of Seawater 

Brineworks has developed a seawater electrolysis technology that extracts carbon dioxide (CO₂) and produces green hydrogen (H₂) directly from seawater. The company’s new approach is different because it leverages the ocean’s higher CO₂ concentration (about 150 times that of the atmosphere), thereby making the process more energy-efficient and cost-effective than traditional air capture methods, all while simultaneously generating green hydrogen as a valuable fuel byproduct.

What Problem Does The Technology Solve? 

Existing traditional direct air capture (DAC) methods (capturing carbon from the atmosphere) are costly and energy-intensive. For example, the International Energy Agency estimates costs between $230 to $630 (€210 to €570) per metric ton of CO₂ for traditional DAC methods. These high costs have, until now, been a significant barrier to large-scale adoption and meaningful impact on global emissions.

Brineworks’ Affordable Seawater Electrolysis 

Brineworks aims to disrupt this landscape by capturing CO₂ directly from seawater at under $100 per metric ton when scaled. Given that the ocean holds CO₂ concentrations about 150 times higher than the atmosphere, the energy required for extraction is substantially less. This makes the process not only more efficient but also more economically viable.

How It Works (Acidifying and Neutralising Seawater for CO₂ Extraction) 

The core of Brineworks’ technology lies in its innovative seawater electrolyser. Electrolysing seawater essentially involves passing an electric current through it, which splits it into different components such as hydrogen gas, oxygen, and ions, but essentially two main parts – an acid stream and a base stream. The ‘acid stream’ (i.e. an acidic solution) is used to lower the pH of seawater in a controlled environment to release dissolved carbon dioxide (CO₂). The ‘base stream’ refers to the alkaline solution that results from the same electrolysis process.

The system that Brineworks users, therefore, employs the acid stream from electrolysis to acidify seawater in a ‘closed system’, which liberates pure CO₂. This CO₂ can then be directly extracted and either sequestered for permanent carbon removal or used as a feedstock for producing e-fuels like e-methanol, e-methane, sustainable aviation fuel, and e-diesel.

After CO₂ extraction, the base stream (alkaline) is then used to neutralise the seawater before it’s released back into the ocean, ensuring minimal environmental impact, and maintaining the ocean’s pH balance. This management of seawater chemistry in the process helps prevent any ecological disruption while leveraging the ocean’s natural ability to absorb more CO₂ from the atmosphere, thereby aiding in global carbon reduction efforts.

Co-Production of Green Hydrogen 

An added advantage of Brineworks’ process is the substantial production of green hydrogen during electrolysis. For every ton of CO₂ extracted, significant amounts of H₂ are generated. This hydrogen can be stored and used as a clean energy / fuel source, providing an additional revenue stream, and enhancing the overall efficiency of the system.

Applications

Brineworks’ technology appears to hold real potential for industries that are challenging to electrify, particularly maritime shipping and aviation. For example:

– Maritime shipping is responsible for about 3 per cent of global emissions and relies on high-energy-density fuels for long-haul voyages. Electrification isn’t a feasible solution in the near term. The Brineworks technology could, therefore, supply the essential CO₂ and H₂ feedstocks for producing carbon-neutral e-fuels directly at ports, thereby reducing reliance on fossil fuels and cutting emissions.

– In the aviation industry, large commercial and cargo planes require massive amounts of energy relative to their weight, meaning that full electrification is currently impractical. Sustainable aviation fuels (SAFs) could offer a viable path to decarbonisation but are hindered by high production costs due to expensive feedstocks. By providing affordable CO₂ and H₂, the technology from Brineworks could lower the costs of SAFs, making them a more practical option for the industry.

Ocean-Based CO₂ Removal (Enhancing Natural Sinks) 

Beyond fuel production, Brineworks also supports ocean-based carbon dioxide removal techniques like Direct Ocean Capture (DOC) and Ocean Alkalinity Enhancement (OAE). These methods enhance the ocean’s capacity to absorb atmospheric CO₂ and combat ocean acidification. Brineworks’ technology provides a versatile and cost-effective means to implement these strategies at scale. For example:

– Direct Ocean Capture (DOC). By adjusting seawater chemistry through acidification and neutralisation, Brineworks can remove CO₂ from seawater, which is then replenished by atmospheric CO₂, effectively reducing greenhouse gases.

– Ocean Alkalinity Enhancement (OAE). The base stream from electrolysis is added to the ocean, increasing its alkalinity, and enhancing its ability to store CO₂ in a stable, dissolved form.

Scaling and Sustainability 

Brineworks is also keen to emphasise the scalability and sustainability in its design. The modular system uses earth-abundant, low-cost materials, thereby reducing capital expenditure (CAPEX) and maintenance costs. High-efficiency membranes and a reduction in precious metal usage also make the technology more accessible and easier to deploy in various settings.

By making the process decentralised, sustainable fuel production can occur anywhere there’s seawater, thereby freeing industries from reliance on geopolitical supply chains of fossil fuels. This could open up possibilities for nations worldwide to produce their own sustainable fuels, contributing to global decarbonisation efforts.

Not The Only Company Doing It 

It should be noted here that Brineworks isn’t the only company / organisation developing seawater electrolysis technology that uses ocean as a source for both carbon capture and green hydrogen production. Others with similar systems include:

– Los Angeles-based ‘Equatic’, which is a leading player in this space, utilising a seawater electrolysis process to simultaneously capture CO₂ from the atmosphere and produce hydrogen. Their technology involves oxygen-selective anodes (OSAs), which allow for efficient hydrogen production without producing harmful chlorine gas. Like Brineworks, Equatic’s process is designed to operate at a cost of under $100 per tonne of CO₂ removed, and they have major projects planned for commercial-scale carbon removal and hydrogen production.

– University of Adelaide researchers have also developed a method to split seawater into hydrogen and oxygen with nearly 100 per cent efficiency. Their approach does not require any pre-treatment, such as desalination or purification, making it a highly efficient and scalable option for green hydrogen production.

What Does This Mean For Your Organisation? 

Brineworks’ innovative approach to carbon capture through seawater electrolysis represents a significant leap forward in the race toward sustainable energy and emissions reduction. By harnessing the ocean’s higher carbon dioxide concentration, the company has developed a method that is not only more energy-efficient but also economically viable. The dual benefit of extracting CO₂ and producing green hydrogen simultaneously positions Brineworks as a key player in decarbonisation efforts, particularly in sectors like shipping and aviation, where electrification remains impractical due to the high energy demands of long-haul operations. The production of synthetic fuels such as e-methanol and e-diesel, using Brineworks’ CO₂ and hydrogen feedstocks, provides a viable solution for these industries to reduce emissions without compromising operational efficiency.

In maritime shipping, which accounts for 3 per cent of global emissions, Brineworks’ technology offers the potential for significant emission reductions by providing ports with the ability to produce carbon-neutral fuels on-site, minimising reliance on fossil fuels. Similarly, aviation (which faces immense challenges in full electrification) could benefit from cheaper and more accessible Sustainable Aviation Fuels (SAFs) made possible by the lower-cost feedstocks Brineworks provides. These fuels may be critical to reducing the carbon footprint of an industry that is responsible for a significant share of global emissions.

What sets Brineworks apart is its ability to achieve large-scale carbon removal while maintaining environmental integrity, ensuring that the delicate balance of ocean chemistry remains intact. This forward-thinking technology, which supports both direct CO₂ extraction and ocean-based carbon removal techniques, offers a sustainable, modular solution that can be deployed globally. In addition to shipping and aviation, industries such as energy production and heavy manufacturing could benefit from the technology’s ability to co-generate green hydrogen, enabling a transition away from fossil fuels while supporting energy storage and production needs.

Tech Tip – Use “Windows Key + X” to Open the Power User Menu

Access commonly used system tools and features like Device Manager, Disk Management, and Task Manager directly with a hidden Power User Menu, offering faster access to tools without navigating through multiple menus. Here’s how to quickly access it:

How To Open Power User Menu

– Press Win + X to bring up the Power User Menu.

How To Access Important Tools

– From the pop-up Power Menu, you can quickly select which important tools you’d like to access, such as Device Manager, Task Manager, Control Panel, and more.

Each week we bring you the latest tech news and tips that may relate to your business, re-written in an techy free style. 

Archives