All Topics  
UltraSPARC T1

 

   Email Print
   Bookmark   Link






 

UltraSPARC T1



 
 
Sun Microsystems
Sun Microsystems

Sun Microsystems, Inc. is a multinational corporation vendor of computers, computer components, computer software, and information technology services, founded on February 24, 1982....
' UltraSPARC T1 microprocessor
Microprocessor

A microprocessor incorporates most or all of the functions of a central processing unit on a single integrated circuit . The first microprocessors emerged in the early 1970s and were used for electronic calculators, using Binary-coded decimal arithmetic on 4-bit Word ....
, known until its 14 November 2005 announcement by its development codename "Niagara", is a multithreading, multicore
Multicore

Multicore may refer to:* Multi-core * multicore cable* audio multicore cable...
 CPU
Central processing unit

A central processing unit is an electronic circuit that can execute computer programs. This broad definition can easily be applied to many early computers that existed long before the term "CPU" ever came into widespread usage....
. Designed to lower the energy consumption of server computers
Server (computing)

A server is a computer program that provides services to other computer programs , in the same or other computer. The physical computer that runs a server program is also often referred to as server....
, the CPU typically uses 72 W of power at 1.4 GHz.

The T1 is a new-from-the-ground-up SPARC
SPARC

SPARC is a Reduced Instruction Set Computer microprocessor instruction set Computer architecture originally designed in 1985 by Sun Microsystems....
 microprocessor implementation that conforms to the and executes the full SPARC V9 instruction set
Instruction set

An instruction set is a list of all the instruction , and all their variations, that a processor can execute.Instructions include:* Arithmetic such as add and subtract...
.






Discussion
Ask a question about 'UltraSPARC T1'
Start a new discussion about 'UltraSPARC T1'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Sun Microsystems
Sun Microsystems

Sun Microsystems, Inc. is a multinational corporation vendor of computers, computer components, computer software, and information technology services, founded on February 24, 1982....
' UltraSPARC T1 microprocessor
Microprocessor

A microprocessor incorporates most or all of the functions of a central processing unit on a single integrated circuit . The first microprocessors emerged in the early 1970s and were used for electronic calculators, using Binary-coded decimal arithmetic on 4-bit Word ....
, known until its 14 November 2005 announcement by its development codename "Niagara", is a multithreading, multicore
Multicore

Multicore may refer to:* Multi-core * multicore cable* audio multicore cable...
 CPU
Central processing unit

A central processing unit is an electronic circuit that can execute computer programs. This broad definition can easily be applied to many early computers that existed long before the term "CPU" ever came into widespread usage....
. Designed to lower the energy consumption of server computers
Server (computing)

A server is a computer program that provides services to other computer programs , in the same or other computer. The physical computer that runs a server program is also often referred to as server....
, the CPU typically uses 72 W of power at 1.4 GHz.

The T1 is a new-from-the-ground-up SPARC
SPARC

SPARC is a Reduced Instruction Set Computer microprocessor instruction set Computer architecture originally designed in 1985 by Sun Microsystems....
 microprocessor implementation that conforms to the and executes the full SPARC V9 instruction set
Instruction set

An instruction set is a list of all the instruction , and all their variations, that a processor can execute.Instructions include:* Arithmetic such as add and subtract...
. Sun has produced two previous multicore processors (UltraSPARC IV and IV+), but UltraSPARC T1 is its first microprocessor that is both multicore and multithreaded. The processor is available with four, six or eight CPU cores, each core able to handle four threads
Thread (computer science)

In computer science, a thread of execution is a Fork of a computer program into two or more Concurrency running task s. The implementation of threads and process es differs from one operating system to another, but in most cases, a thread is contained inside a process....
 concurrently. Thus the processor is capable of processing up to 32 threads concurrently.

Similar to how high-end Sun SMP
Symmetric multiprocessing

In computing, symmetric multiprocessing or SMP involves a multiprocessor computer-architecture where two or more identical processors can connect to a single shared main memory....
 systems work, the UltraSPARC T1 can be partitioned. Thus, several cores can be partitioned for running a single or group of processes and/or threads, whilst the other cores deal with the rest of the processes on the system.

Cores

The UltraSPARC T1 was designed from scratch as a multi-threaded, special-purpose processor, and thus introduces a whole new architecture for obtaining high performance. Rather than try to make each core as intelligent and optimized as they can, Sun's goal was to run as many concurrent threads as possible, and maximize utilization of each core's pipeline.

The T1's cores are less complex than those of current high end processors in order to allow 8 cores to fit on the same die. The cores do not feature out-of-order execution
Out-of-order execution

In computer engineering, out-of-order execution, OoOE, is a paradigm used in most high-performance microprocessors to make use of Instruction cycle that would otherwise be wasted by a certain type of costly delay....
, or a sizable amount of cache
CPU cache

A CPU cache is a cache used by the central processing unit of a computer to reduce the average time to access computer storage. The cache is a smaller, faster memory which stores copies of the data from the most frequently used main memory locations....
. Single-thread processors depend heavily on large caches for their performance because cache misses result in a wait while the data is fetched from main memory. By making the cache larger the probability of a cache miss is reduced, but the impact of a miss is still the same.

The T1 cores largely side-step the issue of cache misses by multithreading. Each core is a barrel processor
Barrel processor

A barrel processor is a Central processing unit that switches between Thread of execution on every Instruction cycle. This CPU design technique is also known as "interleaved" or "fine-grained" temporal multithreading....
, meaning it switches between available threads each cycle. When a long-latency event, such as cache miss occurs, the thread is taken out of rotation while the data is fetched into cache in the background. Once the long-latency event completes, the thread is made available for execution again. Sharing of the pipeline by multiple threads may make each thread slower, but the overall throughput (and utilization) of each core is much higher. It also means that the impact of cache misses is greatly reduced, and the T1 can maintain high throughput with a smaller amount of cache. The cache no longer needs to be large enough to hold all or most of the "working set", just the recent cache misses of each thread.

Benchmarks demonstrate this approach has worked very well on commercial (integer), multithreaded workloads such as Java
Java (programming language)

Java is a programming language originally developed by James Gosling at Sun Microsystems and released in 1995 as a core component of Sun Microsystems' Java ....
 application servers, Enterprise Resource Planning
Enterprise resource planning

Enterprise resource planning is an enterprise-wide information system designed to coordinate all the resources, information, and activities needed to complete business processes such as order fulfillment or billing....
 (ERP) application servers, email (such as Lotus Domino) servers, and web servers. These benchmarks suggest each core in the UltraSPARC T1 is more powerful than the circa 2001, single-core, single-threaded UltraSPARC III, and at a chip to chip comparison, significantly outperforms other processors on multithreaded integer workloads.

At the time of its release in December 2005, a single-chip, eight-core, 32-thread, 1.2 GHz UltraSPARC T1 server performed similarly to a two-socket, four-core, eight-thread, 1.9 GHz IBM POWER5 server, performed similarly to a four-socket, eight-core, sixteen-thread 3.0 GHz Intel Xeon "Paxville MP" server, and exceeded the performance of a four-socket, four-core, four-thread 1.6 GHz Intel Itanium server. Arguably, this made the UltraSPARC T1 the world's most powerful general-purpose commercial server processors, when considering multithreaded commercial workloads.

Studies by Intel show that even under full load, a typical x86 server CPU is idle 50 to 60% of the time. This is due to cache misses which all CPU architectures suffer from; they must wait for data to arrive from RAM. That is also why modern CPUs have larger cache, complex prefetch logic, etc. However, CPUs belonging to the T1 family do not suffer from this problem. Instead, as soon a T1 thread stalls due to a cache miss, the T1 switches thread in 1 clock cycle and continues to do work while waiting for the data. Typically on a modern CPU, a thread switch takes a much longer time than 1 clock cycle. This is the reason a T1 can work 95% of the time and only waits for data 5% of the time. Compare this to an x86 CPU at 3 GHz. Because the x86 CPU can only work at half speed due to cache misses, it can be compared to a 1.5 GHz CPU working at full speed. However, one of the T1 threads can compare to an Intel Pentium 3 CPU at 1 GHz in terms of computing power.

The T1 is slow on single threaded work but shines on multi-threaded work. A common mistake is that the T1 is not fully loaded when testing. When testing, typically it is loaded with small data, 1 GB or so. In that case an x86 CPU easily outperforms the T1. However, when the machine is heavily loaded with lots of data, the T1 will easily outperform the x86 CPU. The x86 CPU will stall but the T1 continues to work. The T1 degrades a magnitude slower than the x86 CPU. To fully take advantage of the T1, it must be loaded heavily. Otherwise it will not show its true potential.

Systems

The T1 processor can be found in the following products from Sun and Fujitsu Computer Systems
Fujitsu

is a Japanese company specializing in semiconductors, air conditioners, computers , telecommunications, and Service , and is headquartered in Minato, Tokyo, Tokyo....
:
  • Sun/Fujitsu/Fujitsu Siemens SPARC Enterprise
    SPARC Enterprise

    The SPARC Enterprise series is a range of UNIX server computers co-developed by Sun Microsystems and Fujitsu introduced in 2007. They are marketed and sold by Sun Microsystems, Fujitsu, and Fujitsu Siemens Computers under the common brand of SPARC Enterprise, superseding Sun's Sun Fire and Fujitsu's PRIMEPOWER server product lines....
     T1000 and T2000 servers
  • Sun Fire
    Sun Fire

    The Sun Fire server brand is a series of server computers introduced by Sun Microsystems in 2001. The Sun Fire branding coincided with the introduction of the UltraSPARC III processor, superseding the UltraSPARC II-based Sun Enterprise series....
     T1000 and T2000 servers
  • Sun Netra T2000 Server
  • Sun Netra CP3060 Blade
  • Sun Blade T6300 Server Module


Target market

The UltraSPARC T1 microprocessor is unique in its strength and weaknesses, and as such is targeted at specific markets. Rather than being used for high-end number-crunching
Supercomputer

A supercomputer is a computer that is at the frontline of current processing capacity, particularly speed of calculation. Supercomputers introduced in the 1960s were designed primarily by Seymour Cray at Control Data Corporation , and led the market into the 1970s until Cray left to form his own company, Cray Research....
 and ultra-high performance applications, the chip is targeted at network-facing high-demand servers, such as high-traffic web server
Web server

The term web server can mean one of two things:# A computer program that is responsible for accepting Hypertext Transfer Protocol requests from clients , and Server them HTTP responses along with optional data contents, which usually are web pages such as Hypertext Markup Language documents and linked objects ....
s, and mid-tier Java, ERP, and CRM application servers, which often utilize a large number of separate threads. One of the limitations of the T1 design is that a single floating point unit
Floating point unit

A floating-point unit is a part of a computer system specially designed to carry out operations on floating point numbers. Typical operations are addition, subtraction, multiplication, division , and square root....
 (FPU) is shared between all 8 cores, making the T1 unsuitable for applications performing a lot of floating point mathematics. However, since the processor's intended markets do not typically make much use of floating-point operations, Sun does not expect this to be a problem. Sun provides a tool for analysing an application's level of parallelism
Parallel computing

Parallel computing is a form of computing in which many calculations are carried out simultaneously, operating on the principle that large problems can often be divided into smaller ones, which are then solved Concurrency ....
 and use of floating point instructions to determine if it is suitable for use on a T1 or T2 platform.

In addition to web and application tier processing, the UltraSPARC T1 may be well suited for smaller database applications which have a large user count. One customer has published results showing that a MySQL
MySQL

MySQL is a relational database management system which has more than 11 million installations. The program runs as a server providing multi-user access to a number of databases....
 application running on an UltraSPARC T1 server ran 13.5 times faster than on an AMD Opteron server.

Virtualization

T1 is the first SPARC processor that supports the Hyper-Privileged execution mode. The SPARC Hypervisor runs in this mode, and it can partition a T1 system into 32 Logical Domains
Logical Domains

Logical Domains is a technology from Sun Microsystems released in May 2007 which offers a virtualization abstracted from all physical devices....
, each of which can run an operating system instance.

Currently, Solaris
Solaris Operating System

Solaris is a Unix-based operating system introduced by Sun Microsystems in 1992 as the successor to SunOS.Solaris is known for its scalability, especially on SPARC systems, and for originating many innovative features such as DTrace and ZFS....
 and Linux
Linux

Linux is a generic term referring to Unix-like computer operating systems based on the Linux kernel. Their development is one of the most prominent examples of free and open source software collaboration; typically all the underlying source code can be used, freely modified, and redistributed by anyone under the terms of the GNU GPL license...
 are supported, and FreeBSD
FreeBSD

FreeBSD is a Unix-like free software operating system descended from AT&T Unix via the Berkeley Software Distribution branch through the 386BSD and Berkeley Software Distribution#4.4BSD and descendants operating systems....
 support is under development.

Software licensing issues

Traditionally, commercial software suites like Oracle database
Oracle database

The Oracle Database consists of a relational database management system produced and marketed by Oracle Corporation. , Oracle had become a major presence in database computing....
 charge their customers based on the number of processors the software runs on. In early 2006, Oracle
Oracle Corporation

Oracle Corporation specializes in developing and marketing enterprise software products ? particularly database management systems. Through organic growth and a number of high-profile acquisitions, Oracle enlarged its share of the software market....
 changed the licensing model by introducing the processor factor. With a processor factor of .25 for the T1, an 8-core T2000 requires only a 2-CPU license.

In Q3 2006, IBM
IBM

International Business Machines Corporation, abbreviated IBM and nicknamed "Big Blue" , is a multinational corporation computer technology and consulting corporation headquartered in Armonk, New York, New York, United States....
 introduced the concept of Value Unit (VU) pricing. Each core of the T1 is 30 PVUs instead of the default value of 100 PVUs per core.

Weaknesses

The T1 is only available in uniprocessor systems, limiting vertical scalability in large enterprise environments; Sun has announced that the follow-on "Victoria Falls" processor will address this.

"Rock"


The UltraSPARC T1 is designed for single CPU systems only and is not capable of SMP. Future Sun CMT
Simultaneous multithreading

Simultaneous multithreading, often abbreviated as SMT, is a technique for improving the overall efficiency of superscalar Central processing unit with Multithreading ....
 UltraSPARC processors such as Rock will support multiple chip server architectures. The Rock processor targets traditional data facing workloads such as databases. As such, it is seen as the logical follow-on to Sun's SMP processors such as UltraSPARC IV, rather than a replacement for the UltraSPARC T1 or T2.

Rock also targets floating point workloads, unlike UltraSPARC T1. Sun has publicly disclosed a feature in the Rock processor called hardware scout
Hardware scout

Hardware scout is a technique that uses otherwise idle central processing unit execution resources to perform instruction prefetch during CPU cache misses....
, which uses multithreaded hardware to perform prefetching
Instruction prefetch

In computer architecture, instruction prefetch is a technique used in microprocessors to speed up the execution of a program by reducing wait states....
.

Rock is the world's first general purpose processor with hardware transactional memory
Transactional memory

Transactional memory attempts to simplify parallel programming by allowing a group of load and store instructions to execute in an atomicity way....
.

UltraSPARC T2


Formerly known by the codename Niagara 2, the follow-on to the UltraSPARC T1 supports eight threads per core, and each core has its own FPU.

UltraSPARC T2 Plus

In February 2007, Sun announced at its annual analyst summit that its third-generation simultaneous multithreading
Simultaneous multithreading

Simultaneous multithreading, often abbreviated as SMT, is a technique for improving the overall efficiency of superscalar Central processing unit with Multithreading ....
 design, code-named Victoria Falls, was taped out
Tape-out

In electronics design, tape-out or tapeout is the final stage of the design cycle of integrated circuits or printed circuit boards, the point at which the description of a circuit is sent for manufacture....
 in October 2006. A two-socket server (2 RU
Rack unit

A rack unit or U is a unit of measure used to describe the height of equipment intended for mounting in a 19-inch rack or a 23-inch rack ....
) will have 128 threads, 16 cores, and a 65× performance improvement over UltraSPARC III.

At the Hot Chips
Hot Chips

Hot Chips is an IEEE sponsored technological symposium which is held every year in August on Stanford University campus since 1989. The general emphasis of the conference are microprocessors and integrated circuits....
 19 conference, Sun announced that Victoria Falls will be in 2-way and 4-way servers. Thus, a single 4-way SMP server will support 256 concurrent hardware threads.

In April 2008, Sun released 2-way UltraSPARC T2 Plus servers, the SPARC Enterprise T5140 and T5240.

In October 2008, Sun released 4-way UltraSPARC T2 Plus SPARC Enterprise T5440 server.

Niagara 3

In October 2006, Sun disclosed that Niagara 3 will be built with a 45 nm process. According to an article in The Register from June 2008 the processor will have 16 cores with 16 threads each.

Open design

On March 21, 2006, Sun made the UltraSPARC T1 processor design available under the GNU General Public License
GNU General Public License

The GNU General Public License is a widely used free software license, originally written by Richard Stallman for the GNU project. The GPL is the most popular and well-known example of the type of strong copyleft license that requires derived works to be available under the same copyleft....
 via the OpenSPARC
OpenSPARC

OpenSPARC is an open source hardware project started in December 2005. The initial contribution to the project was Sun Microsystems' Register transfer level Verilog code for a full 64-bit, 32-thread microprocessor, the UltraSPARC T1 processor....
 project. The published information includes:
  • Verilog
    Verilog

    In the semiconductor and electronic design industry, Verilog is a hardware description language used to model Electronics#Electronic systems. Verilog HDL, not to be confused with VHDL, is most commonly used in the design, verification, and implementation of Digital circuit logic chips at the Register transfer level level of Abstraction...
     source code of the UltraSPARC T1 design;
  • Verification suite and simulation models;
  • ISA specification (UltraSPARC Architecture 2005);
  • The Solaris
    Solaris Operating System

    Solaris is a Unix-based operating system introduced by Sun Microsystems in 1992 as the successor to SunOS.Solaris is known for its scalability, especially on SPARC systems, and for originating many innovative features such as DTrace and ZFS....
     10 OS simulation images.


External links

  • – By Jessica Davis, Electronic News, 14 Nov 2005
  • by Linda Geppert, in IEEE Spectrum, January 2005
  • by Poonacha Kongetira, Kathirgamar Aingaran, Kunle Olukotun, in IEEE Micro, March-April 2005