当前位置: 首页 > news >正文

Computer Architecture

System Evaluation Metrics

Cost Metrics

The cost of a chip includes:

  • Design cost: non-recurring engineering (NRE), can be amortized well if there is high volume;
  • Manufacturing cost: depends on area;
    • Manufacturing Semiconductor Chips: Ingot → Wafer → Die (unpackaged chip) → Chip
    • To measure the production efficiency of semiconductor manufacturing, we use the metric yield: the portion of good chips per wafer.
  • Testing cost: depends on yield and test time;
  • Packaging cost: depends on die size, number of pins, power delivery, ...

The cost of a system includes:

  • Power cost;
  • Cooling cost;
  • Total Cost of Ownership (TCO) of datacenters:
    • Capital expenses (CAPEX): facilities, assembly & installation, compute, storage,
      networking, software, …
    • Operational expenses (OPEX): energy, rent, maintenance, employee salaries, …
  • System availability: Downtime is expensive​​ and results in a direct loss of revenue. Redundancy​​ (adding backup components) improves availability but also increases the initial capital cost.

Performance Metrics

Performance metrics:

  • Latency: time to complete a task;
  • Throughput: tasks completed per unit time;

Improving latency often reduces throughput, but not vice versa. For example, inter-task parallelization improves throughput but not latency of a task, while intra-task parallelization improves both.

Buffering/queuing/batching improves throughput but may hurt latency, leads to the tradeoff between latency and throughput.

Digital systems (e.g., processors) operate using a constant-rate clock:

  • Clock cycle time (CCT): duration of a clock cycle;
  • Clock frequency (rate): cycles per second.

To compute the execution time of a program, we first compute the number of instructions (IC), which is fixed for a given program. Then we compute the average number of cycles per instruction (CPI), which depends on the system architecture and implementation. All together, we have

\[\text{Execution Time} = \frac{\text{Instructions}}{\text{Program}} \times \frac{\text{Cycles}}{\text{Instruction}} \times \frac{\text{Time}}{\text{Cycle}}= \text{IC} \times \text{CPI} \times \text{CCT}. \]

Roughly speaking, software determines IC, ISA determines CPI, and microarchitecture/circuit determines CCT.

So far we only discuss the performance on processors. What about memory? It could be reflected on CPI. We know that

\[\text{Runtime}=\max(\text{#ops}/\text{processor throughput},\text{#bytes}/\text{memory bandwidth}). \]

Denote operational Intensity (OI) as \(\frac{\text{#ops}}{\text{#bytes}}\), we have

\[\begin{align*} \text{Perf}=&\ \text{#ops}/\text{Runtime}\\ =&\ \min(\text{processor throughput},\text{memory bandwidth}\times\text{operational intensity}). \end{align*} \]

Drawing the graph of performance vs. operational intensity, we have the roofline model (for certain system):

Power and Energy Metrics

Dynamic/active power: \(C\times V_{dd}^2\times f_{0\to 1}=\alpha C V_{dd}^2 f\), where \(C\) is the capacitance being switched, \(V_{dd}\) is the supply voltage, \(f_{0\to 1}\) is the frequency of 0-to-1 transitions, \(\alpha\) is the activity factor (the fraction of capacitance being switched), and \(f\) is the clock frequency.

Static/leakage power: \(V_{dd}I_{leak}\), where \(I_{leak}\) is the leakage current.

Therefore, total power is

\[\text{Power}=\alpha C V_{dd}^2 f + V_{dd} I_{leak}. \]

And

\[\text{Energy}=\text{Power}\times \text{Time}. \]

Limiting factors of power, energy, and power density:

  • Power is limited by infrastructure, e.g., power supply;
  • Power density is limited by thermal dissipation, e.g., fans, liquid cooling;
  • Energy is limited by battery capacity or electrical bill.

Power scaling:

  • Dennard scaling (1974-2005): If the feature size scales by \(1/S\), the supply voltage and current can scale by \(1/S\);
  • Post-Dennard scaling (2006-now): Power limits performance scaling (power wall), so we need to slow down frequency scaling or reduce chip utilization.

Normalize performance to power:

\[\text{Energy Efficiency}=\frac{\text{Performance}}{\text{Power}}=\frac{\text{Operations}/\text{Time}}{\text{Energy}/\text{Time}}=1/\frac{\text{Energy}}{\text{Operations}}. \]

For certain task, choose the "optimal" design to trade off performance and energy.

Scalability

Scalability measures the speedup achieved by using \(N\) processors compared to using just \(1\) processor.

Two settings to evaluate scalability:

  • Strong scaling: speedup on \(N\) processors with fixed total workload size
  • Weak scaling: speedup on \(N\) processors with fixed per-processor workload size

How to balance the workload?

  • Static load balancing: to partition input as evenly as possible
  • Dynamic load balancing, e.g., work dispatch, work stealing

Suppose that an optimization accelerates a fraction \(f\) of a program by a factor of \(S\), then the overall speedup is given by Amdahl's Law:

\[\text{Speedup}=\frac{1}{(1-f)+\frac{f}{S}}. \]

Benchmark

Benchmark is a carefully selected programs used to measure performance. And benchmark suite is a collection of benchmarks.

To report the average performance on a benchmark suite, we may use three types of means: arithmetic (for absolutes), geometric (for rates) and harmonic (for ratios).

http://www.sczhlp.com/news/129699/

相关文章:

  • 枚举子集
  • cv-css 快捷方式,将指定节点的计算样式获取下拉 获取tailwind网页样式成原生样式
  • 国内包装设计网站浏阳企业网站建设
  • 邯郸做移动网站多少钱现在做网站怎么样
  • 济南做企业网站公司专升本需要考些什么科目
  • 网站开发需要后台吗高新区做网站的公司
  • 溧阳市建设局网站北京海淀网站制作公司
  • 电子商务运营网站郴州网站设计
  • 免费网站怎么注册哪儿有网络推广培训
  • 企业网站建设需注意什么一般网站建设需求有哪些方面
  • 网站设计网站源码广州万户网络科技有限公司
  • 网站建设推广服务合同wordpress 获取ip
  • 网站开发实例模板全国免费分类信息发布平台
  • 网站怎么做免费seo搜索引擎哪家公司的网好
  • 前端电商网站开发周期百丽优购物官方网站
  • 如何开发网站软件app软件景观规划设计公司
  • Avalonia 学习笔记07. Control Themes(控件主题)
  • 廊坊网站关键词优化论文 网站建设可行性
  • WordPress电影公司网站主题专业站
  • 长沙本土网站制作公司用外链技术做视频网站
  • 国产手机做系统下载网站wordpress最新列表页
  • xp系统做局域网内网站明光网站建设
  • 大连网站建设仟亿科技网络建设公司哪家好
  • 住房和城乡建设部网站第九批app推广渠道有哪些
  • 林和西网站建设做中文网站公司
  • 电子商务网站建设开发文档网站做程序
  • 龙泉驿建设局网站一级造价师注册查询系统平台入口
  • matter 协议的架构;
  • matter 协议解析;
  • 9月23日