Infobright
Encyclopedia
Infobright is a commercial provider of column-oriented
Column-oriented DBMS
A column-oriented DBMS is a database management system that stores its content by column rather than by row. This has advantages for data warehouses and library catalogues where aggregates are computed over large numbers of similar data items....

 relational database
Relational database
A relational database is a database that conforms to relational model theory. The software used in a relational database is called a relational database management system . Colloquial use of the term "relational database" may refer to the RDBMS software, or the relational database itself...

 software with a focus in machine-generated data
Machine-generated data
Machine-generated data is the generic term for information which was automatically created from a computer process, application, or other machine without the intervention of a human. However, there is some indecision as to the breadth of the term...

. The company's head office is located in Toronto
Toronto
Toronto is the provincial capital of Ontario and the largest city in Canada. It is located in Southern Ontario on the northwestern shore of Lake Ontario. A relatively modern city, Toronto's history dates back to the late-18th century, when its land was first purchased by the British monarchy from...

, Canada
Canada
Canada is a North American country consisting of ten provinces and three territories. Located in the northern part of the continent, it extends from the Atlantic Ocean in the east to the Pacific Ocean in the west, and northward into the Arctic Ocean...

. Most of its research and development is based in Warsaw
Warsaw
Warsaw is the capital and largest city of Poland. It is located on the Vistula River, roughly from the Baltic Sea and from the Carpathian Mountains. Its population in 2010 was estimated at 1,716,855 residents with a greater metropolitan area of 2,631,902 residents, making Warsaw the 10th most...

, Poland
Poland
Poland , officially the Republic of Poland , is a country in Central Europe bordered by Germany to the west; the Czech Republic and Slovakia to the south; Ukraine, Belarus and Lithuania to the east; and the Baltic Sea and Kaliningrad Oblast, a Russian exclave, to the north...

.

History

Infobright was founded in 2005. It became an open source
Open source
The term open source describes practices in production and development that promote access to the end product's source materials. Some consider open source a philosophy, others consider it a pragmatic methodology...

 company in September 2008, when it issued the first free release of its software. At the same time its community site was launched.
The company is funded by venture capital
Venture capital
Venture capital is financial capital provided to early-stage, high-potential, high risk, growth startup companies. The venture capital fund makes money by owning equity in the companies it invests in, which usually have a novel technology or business model in high technology industries, such as...

 investors Flybridge Capital Partners
Flybridge Capital Partners
Flybridge Capital Partners is a Boston-based private equity firm focused on early-stage venture capital investments. With $560 million under management across three funds, the firm invests in consumer, healthcare, energy and information technology markets and is led by a team with domain expertise...

, RBC Venture Partners, and Sun Microsystems
Sun Microsystems
Sun Microsystems, Inc. was a company that sold :computers, computer components, :computer software, and :information technology services. Sun was founded on February 24, 1982...

.

In 2009, Infobright was recognized as MySQL's Partner of the Year, and a Gartner
Gartner
Gartner, Inc. is an information technology research and advisory firm headquartered in Stamford, Connecticut, United States. It was known as GartnerGroup until 2001....

 Cool Vendor in Data Management and Integration. It is also certified for use with Sun's Unified Storage product line. It is the assignee of published patent applications on data compression, query optimization, and data organization.

Technology

Infobright's database software is integrated with MySQL
MySQL
MySQL officially, but also commonly "My Sequel") is a relational database management system that runs as a server providing multi-user access to a number of databases. It is named after developer Michael Widenius' daughter, My...

, but with its own proprietary data storage and query optimization layers.

Infobright uses a columnar approach to database design. When data is loaded into a table, it is broken into the groups of 216 rows, further decomposed into separate data packs for each of the columns. By breaking each column by the same number of rows, it maintains its integrity with other columns for the same entry. For example, row 1, column 1 is the first entry in the first datapack for column 1. Row 1 in column 2 is the first entry in the first datapack for column 2.

Each data pack is separately compressed
Data compression
In computer science and information theory, data compression, source coding or bit-rate reduction is the process of encoding information using fewer bits than the original representation would use....

 to approximately 10:1 on average.

Knowledge Grid

A metadata layer (called the Database Knowledge Grid) stores compact information about the contents and relationships between the data packs, replacing the concept of a traditional database index
Index (database)
A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of slower writes and increased storage space...

.

Query execution

The optimizer uses theories of rough set
Rough set
In computer science, a rough set, first described by a Polish computer scientist Zdzisław I. Pawlak, is a formal approximation of a crisp set in terms of a pair of sets which give the lower and the upper approximation of the original set...

s and Granular Computing
Granular computing
Granular computing is an emerging computing paradigm of information processing. It concerns the processing of complex information entities called information granules, which arise in the process of data abstraction and derivation of knowledge from information...

by categorizing which data packs need to be decompressed and by refining such categorization using partial results obtained from the Knowledge Grid and already decompressed data packs.

External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK