Twitter's New Search Architecture

Enterprise & IT Oct 8, 2010

Over the last few weeks, Twitter launched a new backend for search on twitter.com. Until very recently, Twitter’s real-time search engine was based on technology originally developed by Summize. However, scaling the old MySQL-based system had become increasingly challenging. About six months ago, Twitter decided to develop a new search architecture based on a highly efficient inverted index instead of a relational database. Twitter chose Lucene, a search engine library written in Java, as a starting point.

Twitter's demands on the new system are immense: with over 1,000 TPS (Tweets per second) and 12,000 QPS (queries per second), which amounts to over 1 billion queries per day, Twitter already puts a very high load on its machines. In addition to these scalability requirements, Twitter also needs to support extremely low indexing latencies (the time between when a Tweet is posted and when it becomes searchable) of less than 10 seconds. Since the indexer is only one stage of the pipeline a Tweet has to pass through, Twitter needed the indexer itself to have sub-second latency.
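The "over 1 billion queries per day" figure follows directly from the quoted query rate. A minimal sketch of the arithmetic, assuming (as a simplification) that the quoted per-second rates hold around the clock; the helper name `per_day` is purely illustrative:

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def per_day(rate_per_second: int) -> int:
    """Convert a sustained per-second rate into a daily total."""
    return rate_per_second * SECONDS_PER_DAY


# Rates quoted in the article; treated here as constant averages (an assumption).
queries_per_day = per_day(12_000)  # 1,036,800,000, i.e. just over 1 billion
tweets_per_day = per_day(1_000)    # 86,400,000

print(f"{queries_per_day:,} queries/day, {tweets_per_day:,} Tweets/day")
```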

However, Lucene has several shortcomings for real-time search. That’s why Twitter rewrote big parts of the core in-memory data structures, especially the posting lists, while still supporting Lucene’s standard APIs. This allows Twitter to use Lucene’s search layer almost unmodified. Some of the highlights of the changes include:

* significantly improved garbage collection performance
* lock-free data structures and algorithms
* posting lists that are traversable in reverse order
* efficient early query termination
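To illustrate why the last two points fit together, here is a toy sketch of an in-memory inverted index whose posting lists are read newest-first, so a query can stop as soon as it has enough results. This is not Twitter's implementation and does not use Lucene's API: the class `ReverseInvertedIndex` and its methods are hypothetical, the tokenizer is a naive whitespace split, and the sketch makes no attempt at lock-free concurrency or garbage collection tuning.

```python
from collections import defaultdict


class ReverseInvertedIndex:
    """Toy inverted index with append-only posting lists (illustrative only)."""

    def __init__(self):
        # term -> list of document ids in ascending (oldest-first) order
        self._postings = defaultdict(list)
        self._docs = []

    def add(self, text):
        """Index a document and return its id; ids grow with arrival order."""
        doc_id = len(self._docs)
        self._docs.append(text)
        # Naive tokenization (an assumption): lowercase, split on whitespace.
        for term in set(text.lower().split()):
            self._postings[term].append(doc_id)
        return doc_id

    def search(self, term, limit):
        """Return up to `limit` matching ids, newest first."""
        hits = []
        if limit <= 0:
            return hits
        # Walk the posting list backwards so the newest documents come first.
        for doc_id in reversed(self._postings.get(term.lower(), [])):
            hits.append(doc_id)
            if len(hits) >= limit:
                break  # early termination: older postings are never visited
        return hits


index = ReverseInvertedIndex()
index.add("lucene search")                # id 0
index.add("real-time search at twitter")  # id 1
index.add("search is fun")                # id 2
newest = index.search("search", limit=2)
print(newest)  # [2, 1]
```

Because new documents are only ever appended, reverse traversal yields results in reverse chronological order without a sort, and the cost of a query depends on the number of results requested rather than on the length of the posting list.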

Twitter estimates that it is using only about 5% of the available backend resources. The new indexer could also index roughly 50 times more Tweets per second than Twitter currently receives.

The first difference users might notice is the bigger index, which is now twice as long -- without making searches any slower. And, maybe most importantly, the new system is versatile and extensible, which will allow Twitter to build new features faster and better.

Tags: Twitter