Breaking News

DJI Agras T100, T70P and T25P Launches Globally Sony Introduces the RX1R III Razer Introduces Next-Generation Connectivity and Performance with New Thunderbolt 5 Dock and Core X V2 Transcend's New ESD420 Portable SSD Offers MagSafe Compatibility and Pro-Level Performance G.SKILL Trident Z5 DDR5 Memory and WigiDash Receives European Hardware Awards 2025

logo

  • Share Us
    • Facebook
    • Twitter
  • Home
  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map

Search form

Toshiba's Speech Recognition AI Technology Delivers User-specific Operation of Home Appliances

Toshiba's Speech Recognition AI Technology Delivers User-specific Operation of Home Appliances

Consumer Electronics Mar 9,2020 0

Toshiba Corp. has developed an AI technology that can bring fast recognition of speakers and keywords to all kinds of electronic products, without any need for internet connectivity and no need to rely on cloud resources for processing.

Home appliances integrating the technology will be able to register individual speakers with only three utterances, and to adjust operation in response to voice commands.

For example, an air-conditioner set in operation by a voice command will also adjust its temperature setting to suit the user who made the command.

Keyword detection and speaker recognition both require large numbers of calculations, which are typically executed remotely on a cloud platform or high-end devices like a smartphone. Making such capabilities a native feature of home appliances and other devices requires high-speed AI technology that can be embedded in the devices themselves. Toshiba says that its new AI can simultaneously and quickly execute keyword detection and speaker recognition, all without any need for network connectivity or remote processing power.

The technology has two core features.

The first feature is the use of the intermediate outputs during the keyword detection for effective speaker registration and recognition. The AI must first detect keywords by separating ambient noise from audio information. Its neural network does this by processing spoken input while absorbing the effects of ambient noise. Speaker registration and recognition are performed using the intermediate outputs of the neural network, an approach that suppresses the effects of ambient noise on speaker recognition, and also reduces the time required recognize the speaker. It secures high-speed operations with constrained resources.

The second feature is the use of data expansion methodology in the neural network. Data expansion is a method for learning from small amounts of data, in this case spoken utterances. By randomly assigning zero weight to connections between neural network nodes, simulated voice information can be generated, as if a speaker had spoken in various ways. Successful identification of individuals is based on the AI learning from their speech samples, and this method recognizes particular speakers even when only a small number of utterances are available. Toshiba has reduced the number of required speech samples to a point where the new AI technology can complete user registration with only three utterances.

Comparative evaluation based on three utterances per registered speaker found that Toshiba's method achieved an identification accuracy of 89% for 100 people, while accuracy of i-vector, a commonly used method for speaker recognition, remains at 71%. As devices such as home appliances are expected to have five to 10 registered speakers at most, this level of performance is considered sufficient for practical application. Furthermore, the amounts of computation and processing speed were measured on a server and confirmed that neither would be problematic, even in an embedded system.

As its next step, Toshiba will work toward incorporating the technology in embedded systems and investigating its utility in home appliances and other use cases. The company is also reviewing the opportunity to develop new services, such as application in the communication AI "RECAIUS™" developed by Toshiba Digital Solutions Corporation.

Tags: ToshibaArtificial IntelligenceAutomatic speech recognition (ASR)
Previous Post
Smartphone Sales in China Dropped Significantly in February
Next Post
Australia Launches Federal Court Action Against Facebook

Related Posts

  • Toshiba Canvio Flex 2TB

  • Toshiba expands storage evaluation services in EMEA with new HDD Innovation Lab

  • Toshiba Unveils New Canvio Flex and Canvio Gaming 2.5” Portable Hard Drives

  • Toshiba Collaborates with PROMISE Technology on Providing the Optimal Data Storage Technology for CERN’s Large Hadron Collider

  • Toshiba Announces 24TB CMR and 28TB SMR Enterprise Hard Disk Drives

  • Toshiba Canvio Flex 4TB

  • Toshiba Canvio Basics 1TB

  • Toshiba’s next-generation S300 Pro Surveillance HDDs for large-scale video surveillance systems

Latest News

DJI Agras T100, T70P and T25P Launches Globally
Drones

DJI Agras T100, T70P and T25P Launches Globally

Sony Introduces the RX1R III
Cameras

Sony Introduces the RX1R III

Razer Introduces Next-Generation Connectivity and Performance with New Thunderbolt 5 Dock and Core X V2
Gaming

Razer Introduces Next-Generation Connectivity and Performance with New Thunderbolt 5 Dock and Core X V2

Transcend's New ESD420 Portable SSD Offers MagSafe Compatibility and Pro-Level Performance
PC components

Transcend's New ESD420 Portable SSD Offers MagSafe Compatibility and Pro-Level Performance

G.SKILL Trident Z5 DDR5 Memory and WigiDash Receives European Hardware Awards 2025
Enterprise & IT

G.SKILL Trident Z5 DDR5 Memory and WigiDash Receives European Hardware Awards 2025

Popular Reviews

be quiet! Light Loop 360mm

be quiet! Light Loop 360mm

be quiet! Dark Mount Keyboard

be quiet! Dark Mount Keyboard

be quiet! Light Mount Keyboard

be quiet! Light Mount Keyboard

Noctua NH-D15 G2

Noctua NH-D15 G2

Soundpeats Pop Clip

Soundpeats Pop Clip

be quiet! Light Base 600 LX

be quiet! Light Base 600 LX

Crucial T705 2TB NVME White

Crucial T705 2TB NVME White

be quiet! Pure Base 501

be quiet! Pure Base 501

Main menu

  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map
  • About
  • Privacy
  • Contact Us
  • Promotional Opportunities @ CdrInfo.com
  • Advertise on out site
  • Submit your News to our site
  • RSS Feed