Toshiba's Voice Recognition Technology Can Distinguish Multiple Individual Speakers Without Training

Enterprise & IT, Oct 25, 2016

Toshiba has developed a technology capable of precisely distinguishing the voices of individual speakers in real time, even when several people are speaking at the same time.

Typically, voice recognition reliability falls off when multiple people speak simultaneously. Technologies for separating simultaneous speech do exist, but they depend on the acoustic characteristics of the room and on recording factors such as the positions of the speakers, so they have required dozens of minutes of recordings as training data before achieving optimal separation.

Toshiba claims that its new technology identifies speakers precisely and in real time and captures each voice separately, even when many people are talking at once, and that it delivers high-precision recognition and transcription of each speaker using a microphone array embedded in a single sound input device.

High-precision transcription alleviates the need for manually keeping minutes in business meetings, and allows an increased focus on analysis of customer opinions and the improvement of staff manuals. Transcriptions of meetings with customers from overseas can also be used for automatic translation systems.

A problem with previous sound-source separation systems is that they require many minutes of pre-recorded speech for system training in order to create a sufficiently precise separation filter for each speech source (person). Toshiba's method replaces this time-consuming direct learning for filter creation with learning of the spatial characteristics that represent speaker position information, derived from the positioning of the microphones. This achieves high-performance separation supported by continuous filter updates according to the environment, and approximately double the separation precision of previous techniques. Toshiba says that when separating the simultaneous speech of two speakers, the suppression of the second person's speech improved from 3 dB to 9 dB, which it describes as an approximate doubling.
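The article does not say which quantity the "doubling" refers to. As a rough check, and only as one possible reading, the sketch below converts the 6 dB difference between the quoted 3 dB and 9 dB figures into linear ratios. The function names are illustrative and do not come from Toshiba.

```python
def db_to_amplitude_ratio(db: float) -> float:
    """Convert a level difference in dB to a linear amplitude ratio."""
    return 10 ** (db / 20)


def db_to_power_ratio(db: float) -> float:
    """Convert a level difference in dB to a linear power ratio."""
    return 10 ** (db / 10)


# Figures quoted in the article: suppression improved from 3 dB to 9 dB.
before_db = 3.0
after_db = 9.0
improvement_db = after_db - before_db  # 6 dB of additional suppression

amplitude_factor = db_to_amplitude_ratio(improvement_db)  # close to 2
power_factor = db_to_power_ratio(improvement_db)          # close to 4

print(f"{improvement_db:.0f} dB = x{amplitude_factor:.2f} amplitude, "
      f"x{power_factor:.2f} power")
```

Under this reading, 6 dB of extra suppression roughly halves the residual amplitude of the interfering voice, which would be consistent with "an approximate doubling" in amplitude terms; in power terms the same figure is closer to a factor of four.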

In operation, the new system rapidly determines the relative positions of the speakers by matching the differences in the time at which each speaker's voice arrives at each microphone against a table that associates arrival-time differences with sound direction. This technique allows the capture and separation of each individual's voice, even when there are simultaneous utterances and without any previous recordings at the location.
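The table lookup described above can be illustrated with a minimal sketch. It is not Toshiba's implementation: it assumes a two-microphone array, a far-field source, a single talker, and a speed of sound of 343 m/s, and every function name and parameter is hypothetical. It shows only the direction lookup from an arrival-time difference, not the separation of overlapping voices.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s at room temperature (assumed)


def build_direction_table(mic_spacing_m, step_deg=5):
    """Map candidate directions (degrees) to the expected arrival-time
    difference between two microphones, using a far-field model."""
    return {
        angle: mic_spacing_m * math.sin(math.radians(angle)) / SPEED_OF_SOUND
        for angle in range(-90, 91, step_deg)
    }


def estimate_delay(sig_a, sig_b, sample_rate, max_lag):
    """Estimate how much later the sound reaches mic B than mic A
    (in seconds) with a brute-force cross-correlation."""
    best_lag, best_score = 0, float("-inf")
    n = len(sig_a)
    for lag in range(-max_lag, max_lag + 1):
        score = sum(
            sig_a[i] * sig_b[i + lag]
            for i in range(n)
            if 0 <= i + lag < n
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag / sample_rate


def lookup_direction(table, delay_s):
    """Return the table direction whose expected delay is closest."""
    return min(table, key=lambda angle: abs(table[angle] - delay_s))


# Synthetic test input: a short burst that reaches mic B 7 samples after mic A.
sample_rate = 48_000
burst = [1.0, -0.5, 0.25]
mic_a = [0.0] * 256
mic_b = [0.0] * 256
for k, value in enumerate(burst):
    mic_a[100 + k] = value
    mic_b[107 + k] = value

table = build_direction_table(mic_spacing_m=0.10)
delay = estimate_delay(mic_a, mic_b, sample_rate, max_lag=14)
angle = lookup_direction(table, delay)
print(f"delay = {delay * 1e6:.1f} us, direction = {angle} degrees")
```

With the 10 cm spacing assumed here, a 7-sample delay at 48 kHz maps to the 30-degree table entry. Because the lookup needs only the microphone geometry, no recordings from the room are required to build the table, which is the property the article highlights.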

Toshiba continues to develop the technology and plans to incorporate it in 2017 into RECAIUS, its cloud-based service that supports various human activities by understanding people's intentions from audio and visual recordings.

Tags: Toshiba