Skip to content

Commit 464f90f

Browse files
updated docs
1 parent 946c05c commit 464f90f

37 files changed

+691
-389
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -705,7 +705,7 @@ To use the latest [Cpp-Taskflow](https://github.com/cpp-taskflow/cpp-taskflow/ar
705705
+ AppleClang Xode Version at least v8
706706
+ Nvidia CUDA Toolkit and Compiler ([nvcc][nvcc]) at least v10.0 with -std=c++14
707707

708-
See the [C++ compiler support](https://en.cppreference.com/w/cpp/compiler_support) status.
708+
Cpp-Taskflow works on Linux, Windows, and Mac OS X. See the [C++ compiler support](https://en.cppreference.com/w/cpp/compiler_support) status.
709709

710710
<div align="right"><b><a href="#table-of-contents">[↑]</a></b></div>
711711

docs/CPU-vs-TPU-vs-GPU.png

103 KB
Loading

docs/Cookbook.js

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@ var Cookbook =
22
[
33
[ "C0: Project Motivation", "chapter0.html", [
44
[ "The Era of Multicore", "chapter0.html#TheEraOfMulticore", null ],
5+
[ "Heterogeneous Computing", "chapter0.html#C0HeterogeneousComputing", null ],
56
[ "Loop-level Parallelism", "chapter0.html#LoopLevelParallelism", null ],
67
[ "Task-based Parallelism", "chapter0.html#TaskBasedParallelism", null ],
7-
[ "Challenges of Task-based Parallel Programming", "chapter0.html#ChallengesOfTaskBasedParallelProgramming", null ],
88
[ "The Project Mantra", "chapter0.html#TheProjectMantra", null ]
99
] ],
1010
[ "C1: Static Tasking", "chapter1.html", [

docs/Reference.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -97,11 +97,11 @@ <h1><a class="anchor" id="RefConference"></a>
9797
Conference</h1>
9898
<ol>
9999
<li>
100-
T.-W. Huang, C.-X. Lin, G. Guo, and Martin D. F. Wong, "Cpp-Taskflow: Fast Task-based Parallel Programming using Modern C++," <em>IEEE International Parallel and Distributed Processing Symposium (IPDPS)</em>, pp. 974-983, Rio de Janeiro, Brazil, 2019. </li>
100+
T.-W. Huang, C.-X. Lin, G. Guo, and Martin D. F. Wong, &quot;<a href="ipdps19.pdf">Cpp-Taskflow: Fast Task-based Parallel Programming using Modern C++</a>,&quot; <em>IEEE International Parallel and Distributed Processing Symposium (IPDPS)</em>, pp. 974-983, Rio de Janeiro, Brazil, 2019. </li>
101101
<li>
102-
C.-X. Lin, T.-W. Huang, G. Guo, and M. D. F. Wong, "A Modern C++ Parallel Task Programming Library," <em>ACM Multimedia Conference (MM)</em>, pp. 2284-2287, Nice, France, 2019. </li>
102+
C.-X. Lin, T.-W. Huang, G. Guo, and M. D. F. Wong, &quot;<a href="mm19.pdf">A Modern C++ Parallel Task Programming Library</a>,&quot; <em>ACM Multimedia Conference (MM)</em>, pp. 2284-2287, Nice, France, 2019. </li>
103103
<li>
104-
C.-X. Lin, T.-W. Huang, G. Guo, and Martin D. F. Wong, "An Efficient and Composable Parallel Task Programming Library," <em>IEEE High-performance and Extreme Computing Conference (HPEC)</em>, pp. 1-7, Waltham, MA, 2019. </li>
104+
C.-X. Lin, T.-W. Huang, G. Guo, and Martin D. F. Wong, &quot;<a href="hpec19.pdf">An Efficient and Composable Parallel Task Programming Library</a>,&quot; <em>IEEE High-performance and Extreme Computing Conference (HPEC)</em>, pp. 1-7, Waltham, MA, 2019. </li>
105105
</ol>
106106
<h1><a class="anchor" id="RefJournal"></a>
107107
Journal</h1>

docs/chapter0.html

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -92,19 +92,26 @@
9292
<div class="title">C0: Project Motivation </div> </div>
9393
</div><!--header-->
9494
<div class="contents">
95-
<div class="textblock"><p>Cpp-Taskflow addresses a long-standing problem, <em>how can we make it easier for C++ developers to write efficient parallel and heterogeneous programs under complex task dependencies?</em></p>
95+
<div class="textblock"><p>Cpp-Taskflow addresses a long-standing problem, <em>how can we make it easier for C++ developers to quickly write parallel and heterogeneous programs with high performance scalability and simultaneous high productivity?</em></p>
9696
<h1><a class="anchor" id="TheEraOfMulticore"></a>
9797
The Era of Multicore</h1>
9898
<p>In the past, we embrace <em>free</em> performance scaling on our software thanks to advances in manufacturing technologies and micro-architectural innovations. Approximately for every 1.5 year we can speed up our programs by simply switching to new hardware and compiler vendors that brings 2x more transistors, faster clock rates, and higher instruction-level parallelism. However, this paradigm was challenged by the power wall and increasing difficulties in exploiting instruction-level parallelism. The boost to computing performance has stemmed from changes to multicore chip designs.</p>
9999
<div class="image">
100-
<img src="era_multicore.jpg" alt="era_multicore.jpg" width="50%"/>
100+
<img src="era_multicore.jpg" alt="era_multicore.jpg" width="60%"/>
101101
</div>
102102
<p>The above sweeping visualization (thanks to Prof. Mark Horowitz and his group) shows the evolution of computer architectures is moving toward multicore designs. Today, multicore processors and multiprocessor systems are common in many electronic products such as mobiles, laptops, desktops, and servers. In order to keep up with the performance scaling, it is becoming necessary for software developers to write parallel programs that utilize the number of available cores.</p>
103+
<h1><a class="anchor" id="C0HeterogeneousComputing"></a>
104+
Heterogeneous Computing</h1>
105+
<p>With the influence of artificial intelligence (AI) through new and merged workloads, heterogeneous computing becomes demanding and will continue to be heard for years to come. We have not just CPUs but GPUs, TPUs, FPGAs, and ASICs to accelerator a wide variety of scientific computing problems.</p>
106+
<div class="image">
107+
<img src="CPU-vs-TPU-vs-GPU.png" alt="CPU-vs-TPU-vs-GPU.png" width="60%"/>
108+
</div>
109+
<p>The question is: <em>How are we going to program these beasts?</em> Writing a high-performance sequential program is hard. Parallel programming is harder. Parallel programming of heterogeneous devices is extremely challenging if we care about performance and power efficiency. Programming models need to deal with productivity versus performance.</p>
103110
<h1><a class="anchor" id="LoopLevelParallelism"></a>
104111
Loop-level Parallelism</h1>
105112
<p>The most basic and simplest concept of parallel programming is <em>loop-level parallelism</em>, exploiting parallelism that exists among the iterations of a loop. The program typically partitions a loop of iterations into a set of of blocks, either fixed or dynamic, and run each block in parallel. Below the figure illustrates this pattern.</p>
106113
<div class="image">
107-
<img src="loop-level-parallelism.jpeg" alt="loop-level-parallelism.jpeg" width="30%"/>
114+
<img src="loop-level-parallelism.jpeg" alt="loop-level-parallelism.jpeg" width="50%"/>
108115
</div>
109116
<p>The main advantage of the loop-based approach is its simplicity in speeding up a regular workload in line with Amdahl's Law. Programmers only need to discover independence of each iteration within a loop and, once possible, the parallel decomposition strategy can be easily implemented. Many existing libraries have built-in support to write a parallel-for loop.</p>
110117
<h1><a class="anchor" id="TaskBasedParallelism"></a>
@@ -114,18 +121,15 @@ <h1><a class="anchor" id="TaskBasedParallelism"></a>
114121
<object type="image/svg+xml" data="task-level-parallelism.svg" width="60%">task-level-parallelism.svg</object>
115122
</div>
116123
<p>The above figure shows an example <em>task dependency graph</em>. Each node in the graph represents a task unit at function level and each edge indicates the task dependency between a pair of tasks. Task-based model offers a powerful means to express both regular and irregular parallelism in a top-down manner, and provides transparent scaling to large number of cores. In fact, it has been proven, both by the research community and the evolution of parallel programming standards, task-based approach scales the best with future processor generations and architectures.</p>
117-
<h1><a class="anchor" id="ChallengesOfTaskBasedParallelProgramming"></a>
118-
Challenges of Task-based Parallel Programming</h1>
119-
<p>Parallel programs are notoriously hard to write correctly, regardless of loop-based approach or task-based model. A primary reason is <em>data dependency</em>, some data cannot be accessed until some other data becomes available. This dependency constraint introduces a number of challenges such as data race, thread contention, and consistencies when writing a correct parallel program. Through the evolution of parallel programming standards, it has been proven that the most effective way to overcome these obstacles is a suitable task-based programming model. The programming model affects software developments in various aspects, such as programmability, debugging effort, development costs, efficiencies, etc.</p>
120124
<h1><a class="anchor" id="TheProjectMantra"></a>
121125
The Project Mantra</h1>
122-
<p>The goal of Cpp-Taskflow is simple - <em>We aim to help C++ developers quickly write parallel programs with high performance scalability and simultaneous high productivity</em>. We want developers to write simple and effective parallel code, specifically with the following objectives:</p>
126+
<p>The goal of Cpp-Taskflow is simple - <em>We help developers quickly write parallel programs with high performance scalability and simultaneous high productivity</em>. We want developers to write simple and effective parallel code, specifically with the following objectives:</p>
123127
<ul>
124128
<li>Expressiveness </li>
125129
<li>Readability </li>
126130
<li>Transparency</li>
127131
</ul>
128-
<p>In a nutshell, code written with Cpp-Taskflow explains itself. The transparency allows developers to forget about the difficult thread managements at programming time. They can focus on high-level implementation of parallel decomposition algorithms, leaving the concurrency details and scalability handled by Cpp-Taskflow. </p>
132+
<p>In a nutshell, code written with Cpp-Taskflow explains itself. The transparency allows developers to focus on the development of application algorithms and parallel decomposition strategies, rather than low-level, system-specific details. </p>
129133
</div></div><!-- contents -->
130134
</div><!-- doc-content -->
131135
<!-- start footer part -->

docs/hpec19.pdf

1020 KB
Binary file not shown.

docs/index.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -94,7 +94,7 @@
9494
<div class="contents">
9595
<div class="textblock"><h1><a class="anchor" id="ModernCppParallelTaskProgrammingLibrary"></a>
9696
Modern C++ Parallel Task Programming</h1>
97-
<p>Cpp-Taskflow helps you quickly write parallel task programs with high performance scalability and simultaneous high productivity. It is by far faster, more expressive, fewer lines of code, and easier for drop-in integration than existing parallel task programming libraries such as <a href="https://www.openmp.org/spec-html/5.0/openmpsu99.html">OpenMP Tasking</a> and Intel <a href="https://www.threadingbuildingblocks.org/tutorial-intel-tbb-flow-graph">Thread Building Block (TBB) FlowGraph</a>.</p>
97+
<p>Cpp-Taskflow helps you quickly write parallel task programs with high performance scalability and simultaneous high productivity. It is by far faster, more expressive, fewer lines of code, and easier for drop-in integration than existing parallel task programming frameworks such as <a href="https://www.openmp.org/spec-html/5.0/openmpsu99.html">OpenMP Tasking</a> and Intel <a href="https://www.threadingbuildingblocks.org/tutorial-intel-tbb-flow-graph">Thread Building Block (TBB) FlowGraph</a>.</p>
9898
<div class="image">
9999
<img src="performance.jpg" alt="performance.jpg" width="95%"/>
100100
</div>
@@ -142,7 +142,7 @@ <h1><a class="anchor" id="HeterogeneousTasking"></a>
142142
</div>
143143
<h1><a class="anchor" id="HowToInstallCppTaskflow"></a>
144144
How to Install Cpp-Taskflow?</h1>
145-
<p>Cpp-Taskflow is <em>header-only</em> and there is no need for installation. Simply download the source and copy the headers under the <code>taskflow</code> subdirectory to your project.</p>
145+
<p>Cpp-Taskflow is <em>header-only</em> and there is no need for installation. Simply download the source and copy the headers under the directory <code>taskflow/</code> to your project.</p>
146146
<div class="fragment"><div class="line">~$ git clone https://github.com/cpp-taskflow/cpp-taskflow.git</div><div class="line">~$ cd cpp-taskflow/</div><div class="line">~$ cp -r taskflow myproject/include/</div></div><!-- fragment --><h1><a class="anchor" id="ASimpleFirstProgram"></a>
147147
A Simple First Program</h1>
148148
<p>Here is a rather simple program to get you started.</p>
@@ -164,7 +164,7 @@ <h1><a class="anchor" id="SupportedCompilers"></a>
164164
<li>AppleClang Xode Version at least v8 </li>
165165
<li>Nvidia CUDA Toolkit and Compiler (nvcc) at least v10.0 with -std=c++14</li>
166166
</ul>
167-
<p>Cpp-Taskflow works on Linux, Windows, and OSX.</p>
167+
<p>Cpp-Taskflow works on Linux, Windows, and Mac OS X.</p>
168168
<h1><a class="anchor" id="License"></a>
169169
License</h1>
170170
<p>Cpp-Taskflow is open-source under permissive MIT license. </p>

0 commit comments

Comments
 (0)