Nanoveu Runs Dual AI Voice Models on One Chip at Under Half a Milliwatt

By Josua Ferreira -

EMASS fuses voice recognition and keyword detection on a single always-on chip

Nanoveu Limited (ASX: NVU, OTCQB: NNVUF) has reported that its EMASS unit demonstrated two AI voice functions, keyword detection and voice recognition, running simultaneously on its ECS-DoT edge AI co-processor at sub-milliwatt power, entirely on-device.

The demonstration marks the first time ECS-DoT has been shown running two AI functions at once, achieving 92% keyword-detection accuracy at roughly 4 ms latency while drawing 400–500 µW average power.

The significance for investors lies in what it proves: the platform can run multiple AI models concurrently, rather than a single, isolated voice block.

Headline figures from the demonstration include:

  • Two AI functions fused on one always-on co-processor
  • 92% top-1 / 97% top-3 keyword-detection accuracy
  • ~4 ms inference latency, fast enough to feel instantaneous to the user
  • 400–500 µW average power, on silicon independently benchmarked at up to 20x the energy efficiency of leading edge-AI chips (per MLPerf Tiny, 22 October 2024 announcement)
  • Audio never leaves the chip

ECS-DoT Voice Demonstration Performance Dashboard

What the demonstration actually achieved

EMASS demonstrated two complementary voice capabilities on the ECS-DoT co-processor. The first is keyword detection, which recognises spoken commands and wake words in real time. The second is voice recognition, which confirms whether the person speaking is the enrolled, authorised user.

Together, the functions allow a device to respond to voice instantly while gating or personalising that response to the right person, all on-chip. Both capabilities run always-on while the host processor stays asleep. ECS-DoT performs continuous listening and wakes the device’s main processor only on a recognised command or a verified user, complementing rather than replacing the host CPU.

The keyword-detection model is a compact 8-bit network that was tested over 1,500 times to measure how reliably it identifies the correct spoken keyword. The voice-recognition model produces a per-user voice signature matched against an enrolled template.

Both functions were implemented on a standard ECS-DoT evaluation board using a standard PCM digital MEMS microphone, with no additional specialised hardware required.

CEO commentary on the platform implications

Mark Goranson, Nanoveu’s CEO of Semiconductor Technologies

“Today we are demonstrating two AI voice models, keyword detection and voice recognition, running together on ECS-DoT at sub-milliwatt average power, which is significant in its own right. But the bigger point is what it proves about the platform: ECS-DoT is multi-sensor and multimodal, so the same always-on engine is built to run image and other sensor models on these devices too. That provides optionality for device makers to add layered, low-power intelligence across voice, vision and beyond, while the main processor stays asleep.”

The power-latency-privacy trade-off explained

Delivering always-on voice on a compact, power-constrained device has traditionally been limited by a three-way trade-off between power, latency and privacy.

Keeping a microphone pipeline and a capable processor continuously active to listen for commands increases power consumption and thermal load. Duty-cycling those processors to save power, on the other hand, introduces latency and increases the risk of missed activations. An always-listening microphone that processes audio off the sensor also creates a persistent privacy surface.

Why does this matter to investors? Voice is becoming a primary control surface for computing devices, with the Voice AI Agents market projected to reach US$47.5 billion by 2034, according to Markets.us (April 2025). The ability to resolve the trade-off carries commercial weight.

ECS-DoT addresses this by performing the always-on sensing itself, at sub-milliwatt average power, and waking the host processor only when a recognised command or verified user calls for it. Because keyword detection and voice recognition run on-device, audio never needs to leave the chip.

Property Always-on mic + host CPU Duty-cycled / wake-on-demand ECS-DoT co-processor
Power High & constant Partial <0.5 mW demonstrated
Latency / wake delay None Yes, but costly ~4 ms at co-processor
Privacy Audio off-sensor Partial Never leaves chip
Speaker verification Host only No Yes, per-user signature on-device

Beyond voice: a multimodal, multi-sensor platform

The strategic angle extends beyond this single voice demonstration. According to Nanoveu, the same engine running these voice models is built to run image, motion and other sensor models within a single ultra-low-power, always-on budget.

The company describes ECS-DoT as multi-sensor and multimodal by design. The capability demonstrated here forms the foundation for broader on-device, multimodal intelligence, rather than a single-function voice block. For device makers, this offers optionality to layer intelligence across voice, vision and beyond on one low-power co-processor.

ECS-DoT’s sub-milliwatt power envelope has already been validated in a different deployment context: live drone flight trials recorded up to 27.8% cruise efficiency gains, with the chip consuming less than 0.0002% of the drone’s total cruise energy across a 120-second flight.

The modalities the platform is designed to address include:

  • Voice — instant, personalised, hands-free control through keyword detection and voice recognition (demonstrated)
  • Vision — on-device image classification and presence or user recognition, with images never leaving the device
  • Other sensors — motion and additional sensor streams fused with audio and vision

Next steps toward commercial deployment

EMASS has outlined a roadmap to advance the always-on voice solution toward commercial deployment. The company notes this remains at the discussion and design-in stage.

  1. Technical development — expanding the keyword vocabulary, hardening keyword detection and voice recognition performance across users and operating conditions, and progressing toward production-ready reference designs for computing devices.

  2. Commercial engagement — active discussions with computing-device manufacturers, processors and platform integrators on deploying on-device voice modules.

  3. Design-ins — pursuing design-in engagements to integrate ECS-DoT-based voice processors into next-generation tablets, PCs and always-on consumer and industrial endpoints.

The company has positioned ECS-DoT as an always-on intelligence layer for next-generation devices across high-value markets, including tablets, PCs, and consumer, professional and industrial endpoints.

For investors tracking how EMASS is funding the path from demonstration to production-ready silicon, our detailed coverage of the $7.5M capital raise outlines the dual-track strategy combining 16nm chip manufacturing through TSMC with the live field validation program underpinning commercial readiness.

Don’t Miss the Next ASX Tech Breakout

Big News Blast delivers FREE breaking ASX tech news and in-depth analysis directly to your inbox within minutes of release. Over 20,000 subscribers are already getting ahead of the market the moment announcements drop. Click the “Free Alerts” button at StockWire X to start receiving real-time alerts today.


Frequently Asked Questions

What is the ECS-DoT chip and what does it do?

ECS-DoT is an edge AI co-processor developed by Nanoveu's EMASS unit that performs always-on AI inference — such as keyword detection and voice recognition — at sub-milliwatt power, allowing devices to listen and respond continuously without draining battery or sending audio off the chip.

What did the Nanoveu EMASS AI voice fusion demonstration actually prove?

The demonstration showed ECS-DoT running two AI models simultaneously for the first time — keyword detection at 92% top-1 accuracy and 4 ms latency, alongside voice recognition — all on a single always-on co-processor drawing just 400–500 µW, with no audio leaving the device.

How does ECS-DoT compare to existing always-on voice solutions in terms of power and privacy?

Unlike always-on microphone setups that keep a host CPU active at high constant power, or duty-cycled approaches that introduce latency, ECS-DoT draws under 0.5 mW while delivering 4 ms response times and keeping all audio processing on-chip — meaning audio never leaves the device.

What markets is Nanoveu targeting with its always-on voice technology?

EMASS is targeting computing-device manufacturers across tablets, PCs, and consumer and industrial endpoints, pursuing design-in engagements as the Voice AI Agents market is projected to reach US$47.5 billion by 2034.

How far is Nanoveu's ECS-DoT from commercial deployment in voice applications?

The company states that commercial engagement is currently at the discussion and design-in stage, with the roadmap still requiring expanded keyword vocabulary, performance hardening across real-world conditions, and progression to production-ready reference designs before commercial deployment.

Josua Ferreira
By Josua Ferreira
Partnership Director
Josua Ferreira holds a Bachelor of Commerce in Marketing and Advertising and brings a background in publication, business development, and ASX market storytelling. He has worked with listed companies across the resource sector and broader market, combining sharp commercial instincts with a genuine commitment to keeping investors informed.
Learn More
Companies Mentioned in Article

Breaking ASX Alerts Direct to Your Inbox

Join +20,000 subscribers receiving alerts.

Join thousands of investors who rely on StockWire X for timely, accurate market intelligence.

About the Publisher