Toshiba's Voice Recognition Technology Can Distinguish Multiple Individual Speakers Without Training

Enterprise & IT Oct 25, 2016

Toshiba has developed a technology capable of precisely distinguishing the voices of individual speakers in real time, even when several people are speaking at once.

Typically, voice recognition reliability falls off when multiple people speak simultaneously. Technologies for separating simultaneous speech already exist, but they depend on the acoustic characteristics of the location and on recording conditions such as the positioning of the speakers, so they have needed dozens of minutes of recordings as training data before they can separate voices well.

Toshiba claims that its new technology achieves precise, real-time identification of speakers and separate voice capture, even when many voices are trying to be heard, and delivers high-precision recognition and transcription of each speaker, using a microphone array embedded in a single sound input device.

High-precision transcription alleviates the need for manually keeping minutes in business meetings, and allows an increased focus on analysis of customer opinions and the improvement of staff manuals. Transcriptions of meetings with customers from overseas can also be used for automatic translation systems.

A problem with previous sound-source separation systems is that they require many minutes of pre-recorded speech for system training in order to create a sufficiently precise separation filter for each speech source (person). Toshiba's method replaces this time-consuming direct learning for filter creation with learning of the spatial characteristics that represent speaker position, derived from the positioning of the microphones. This achieves high-performance separation supported by continuous filter updates according to the environment, and approximately double the separation precision of previous techniques. Toshiba says that when separating the simultaneous speech of two speakers, suppression of the second person's speech improved from 3 dB to 9 dB, an approximate doubling.
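The article does not say in which sense 3 dB to 9 dB amounts to a doubling. One reading that fits, offered here only as an assumption, is that the 6 dB gain is being treated as an amplitude ratio, since 6 dB corresponds to roughly a factor of two in amplitude (and roughly a factor of four in power). A minimal sketch of that arithmetic, using the standard decibel conversions:

```python
def db_to_amplitude_ratio(db):
    """Convert a decibel difference to an amplitude (sound pressure) ratio."""
    return 10 ** (db / 20)


def db_to_power_ratio(db):
    """Convert a decibel difference to a power ratio."""
    return 10 ** (db / 10)


# Suppression figures quoted in the article, in dB.
before_db = 3.0
after_db = 9.0

# Improvement in suppression of the second speaker.
gain_db = after_db - before_db

# Assumption: interpreting the improvement as a ratio, not as the raw dB figure.
amplitude_gain = db_to_amplitude_ratio(gain_db)
power_gain = db_to_power_ratio(gain_db)

print(f"{gain_db:.0f} dB = x{amplitude_gain:.2f} amplitude, x{power_gain:.2f} power")
```

On this reading, the interfering voice is left at about half the amplitude it had with the earlier technique.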

In operation, the new system rapidly determines the relative positions of the speakers by matching the time differences with which their voices arrive at each microphone against a table that associates those time differences with sound direction. This technique allows the capture and separation of each individual's voice, even when there are simultaneous utterances and without any previous recordings at the location.
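The idea of matching arrival-time differences against a direction table can be sketched as follows. This is not Toshiba's implementation, whose details the article does not give. It is a simplified illustration that assumes a two-microphone array, a far-field source, and a hypothetical 10 cm microphone spacing; all function and constant names are invented for the example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature
MIC_SPACING = 0.10      # metres between the two microphones (hypothetical)


def expected_delay(angle_deg, spacing=MIC_SPACING):
    """Far-field arrival-time difference (seconds) between two microphones
    for a source at angle_deg, measured from the array's broadside."""
    return spacing * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND


def build_table(step_deg=5):
    """Precompute (angle, delay) pairs covering -90 to +90 degrees."""
    return [(angle, expected_delay(angle)) for angle in range(-90, 91, step_deg)]


def lookup_direction(measured_delay, table):
    """Return the table angle whose expected delay is closest to the
    measured one."""
    return min(table, key=lambda entry: abs(entry[1] - measured_delay))[0]


table = build_table()

# Simulate a talker 30 degrees off broadside and recover the direction.
measured = expected_delay(30)
estimated = lookup_direction(measured, table)
print(f"delay {measured * 1e6:.1f} us -> about {estimated} degrees")
```

Because the table depends only on the microphone geometry, it can be built once in advance, which is consistent with the article's point that no recordings from the location are needed beforehand. A real system would use more microphones and would have to estimate the delays from noisy, overlapping speech.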

Toshiba continues to work on the new technology and plans to include it in its 2017 RECAIUS cloud-based service, which supports various human activities by understanding people's intentions from audio and visual recordings.

Tags: Toshiba