
Commit e7aa0e3 (parent f8990db)

added tf::ScalablePipeline(lines)

27 files changed: 792 additions & 4696 deletions

README.md

Lines changed: 22 additions & 41 deletions

````diff
@@ -240,26 +240,6 @@ tf::Task cudaflow = taskflow.emplace([&](tf::cudaFlow& cf) {
 
 <p align="center"><img src="doxygen/images/saxpy_1_cudaflow.svg"></p>
 
-Taskflow also supports SYCL, a general-purpose heterogeneous programming model,
-to program GPU tasks in a single-source C++ environment using the task graph-based
-approach.
-
-```cpp
-tf::Task syclflow = taskflow.emplace_on([&](tf::syclFlow& sf){
-  tf::syclTask h2d_x = cf.copy(dx, hx.data(), N).name("h2d_x");
-  tf::syclTask h2d_y = cf.copy(dy, hy.data(), N).name("h2d_y");
-  tf::syclTask d2h_x = cf.copy(hx.data(), dx, N).name("d2h_x");
-  tf::syclTask d2h_y = cf.copy(hy.data(), dy, N).name("d2h_y");
-  tf::syclTask saxpy = sf.parallel_for(sycl::range<1>(N),
-    [=] (sycl::id<1> id) {
-      dx[id] = 2.0f * dx[id] + dy[id];
-    }
-  ).name("saxpy");
-  saxpy.succeed(h2d_x, h2d_y)
-       .precede(d2h_x, d2h_y);
-}, sycl_queue).name("syclFlow");
-```
-
 ## Compose Task Graphs
 
 Taskflow is composable.
@@ -371,6 +351,28 @@ tf::cudaTask cuda3 = cudaflow.sort( // sort a range of items on GPU
 );
 ```
 
+Additionally, %Taskflow provides composable graph building blocks for you to
+efficiently implement common parallel algorithms, such as parallel pipeline.
+
+@code{.cpp}
+// create a pipeline to propagate five tokens through three serial stages
+tf::Pipeline pl(num_parallel_lines,
+  tf::Pipe{tf::PipeType::SERIAL, [](tf::Pipeflow& pf) {
+    if(pf.token() == 5) {
+      pf.stop();
+    }
+  }},
+  tf::Pipe{tf::PipeType::SERIAL, [](tf::Pipeflow& pf) {
+    printf("stage 2: input buffer[%zu] = %d\n", pf.line(), buffer[pf.line()]);
+  }},
+  tf::Pipe{tf::PipeType::SERIAL, [](tf::Pipeflow& pf) {
+    printf("stage 3: input buffer[%zu] = %d\n", pf.line(), buffer[pf.line()]);
+  }}
+);
+taskflow.composed_of(pl)
+executor.run(taskflow).wait();
+@endcode
+
 
 # Supported Compilers
 
@@ -424,8 +426,6 @@ You are completely free to re-distribute your work derived from Taskflow.
 * * *
 
 [Tsung-Wei Huang]: https://tsung-wei-huang.github.io/
-[Chun-Xun Lin]: https://github.com/clin99
-[Martin Wong]: https://ece.illinois.edu/directory/profile/mdfwong
 [GitHub releases]: https://github.com/taskflow/taskflow/releases
 [GitHub issues]: https://github.com/taskflow/taskflow/issues
 [GitHub insights]: https://github.com/taskflow/taskflow/pulse
@@ -434,15 +434,7 @@ You are completely free to re-distribute your work derived from Taskflow.
 [Project Website]: https://taskflow.github.io/
 [cppcon20 talk]: https://www.youtube.com/watch?v=MX15huP5DsM
 [contributors]: https://taskflow.github.io/taskflow/contributors.html
-[OpenMP Tasking]: https://www.openmp.org/spec-html/5.0/openmpsu99.html
-[TBB FlowGraph]: https://www.threadingbuildingblocks.org/tutorial-intel-tbb-flow-graph
-[OpenTimer]: https://github.com/OpenTimer/OpenTimer
-[DtCraft]: https://github.com/tsung-wei-huang/DtCraft
 [totalgee]: https://github.com/totalgee
-[damienhocking]: https://github.com/damienhocking
-[ForgeMistress]: https://github.com/ForgeMistress
-[Patrik Huber]: https://github.com/patrikhuber
-[KingDuckZ]: https://github.com/KingDuckZ
 [NSF]: https://www.nsf.gov/
 [UIUC]: https://illinois.edu/
 [CSL]: https://csl.illinois.edu/
@@ -452,18 +444,7 @@ You are completely free to re-distribute your work derived from Taskflow.
 [cookbook]: https://taskflow.github.io/taskflow/pages.html
 [references]: https://taskflow.github.io/taskflow/References.html
 [PayMe]: https://www.paypal.me/twhuang/10
-[C++17]: https://en.wikipedia.org/wiki/C%2B%2B17
-[C++14]: https://en.wikipedia.org/wiki/C%2B%2B14
 [email me]: mailto:[email protected]
 [Cpp Conference 2018]: https://github.com/CppCon/CppCon2018
-[IPDPS19]: https://tsung-wei-huang.github.io/papers/ipdps19.pdf
 [TPDS21]: https://tsung-wei-huang.github.io/papers/tpds21-taskflow.pdf
-[cuda-zone]: https://developer.nvidia.com/cuda-zone
-[nvcc]: https://developer.nvidia.com/cuda-llvm-compiler
-[cudaGraph]: https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__GRAPH.html
-[Firestorm]: https://github.com/ForgeMistress/Firestorm
-[Shiva]: https://shiva.gitbook.io/project/shiva
-[PID Framework]: http://pid.lirmm.net/pid-framework/index.html
-[NovusCore]: https://github.com/novuscore/NovusCore
-[SA-PCB]: https://github.com/choltz95/SA-PCB
````

benchmarks/graph_pipeline/gold.cpp

Lines changed: 4 additions & 11 deletions

```diff
@@ -1,7 +1,5 @@
 #include "levelgraph.hpp"
 #include <fstream>
-//#include "matrix_calculation.hpp"
-
 
 
 int pipe_helper(
@@ -600,17 +598,12 @@ std::chrono::microseconds measure_time_gold(
       graph_pipeline_gold_16_pipes(graph);
       end = std::chrono::high_resolution_clock::now();
     break;
+
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
   }
 
-  //std::ofstream outputfile;
-  //outputfile.open("./build/benchmarks/tf_time.csv", std::ofstream::app);
-  //outputfile << num_threads << ','
-  //           << num_lines << ','
-  //           << pipes << ','
-  //           << size << ','
-  //           << elapsed.count()/1e3 << '\n';
-
-  //outputfile.close();
   return std::chrono::duration_cast<std::chrono::microseconds>(end - beg);
 }
```

benchmarks/graph_pipeline/omp.cpp

Lines changed: 4 additions & 15 deletions

```diff
@@ -29,10 +29,6 @@ void last_pipe_helper(LevelGraph& graph, const size_t i) {
   else {
     graph.node_at(lev, len).set_value(retval);
   }
-  //std::ofstream outputfile;
-  //outputfile.open("./omp_result_.txt", std::ofstream::app);
-  //outputfile << graph.node_at(lev, len).get_value() << '\n';
-  //outputfile.close();
 }
 
 
@@ -1413,18 +1409,11 @@ std::chrono::microseconds measure_time_omp(
       graph_pipeline_omp_16_pipes(graph);
       end = std::chrono::high_resolution_clock::now();
     break;
+
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
   }
-
-  //std::ofstream outputfile;
-  //outputfile.open("./omp_time.csv", std::ofstream::app);
-  //outputfile << num_threads << ','
-  //           << num_lines << ','
-  //           << pipes << ','
-  //           << graph.graph_size() << ','
-  //           << (std::chrono::duration_cast<std::chrono::microseconds>(end - beg).count())/1e3
-  //           << '\n';
-
-  //outputfile.close();
   return std::chrono::duration_cast<std::chrono::microseconds>(end - beg);
 }
 
```

benchmarks/graph_pipeline/taskflow.cpp

Lines changed: 4 additions & 17 deletions

```diff
@@ -3,8 +3,6 @@
 #include <taskflow/algorithm/pipeline.hpp>
 //#include "matrix_calculation.hpp"
 
-
-
 struct Input {
   size_t lev;
   size_t len;
@@ -68,11 +66,6 @@ struct FilterFinal {
   else {
     graph.node_at(lev, len).set_value(val);
   }
-
-  //std::ofstream outputfile;
-  //outputfile.open("./tf_result_.txt", std::ofstream::app);
-  //outputfile << graph.node_at(lev, len).get_value() << '\n';
-  //outputfile.close();
  }
};

@@ -597,17 +590,11 @@ std::chrono::microseconds measure_time_taskflow(
     case 16:
       elapsed = graph_pipeline_taskflow_16_pipes(graph, num_lines, num_threads);
     break;
-  }
 
-  //std::ofstream outputfile;
-  //outputfile.open("./tf_time.csv", std::ofstream::app);
-  //outputfile << num_threads << ','
-  //           << num_lines << ','
-  //           << pipes << ','
-  //           << graph.graph_size() << ','
-  //           << elapsed.count()/1e3 << '\n';
-
-  //outputfile.close();
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
+  }
   return elapsed;
 }
 
```

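Every benchmark driver in this commit receives the same hardening: the `switch` over the requested pipe count gains a `default:` case that throws rather than silently falling through when more than 16 pipes are requested. A minimal sketch of that dispatch pattern (the function name and case bodies are illustrative stand-ins, not the benchmark code):

```cpp
#include <stdexcept>

// Dispatch to a fixed set of generated pipeline functions and reject
// any pipe count the benchmark does not provide, as the commit does.
int run_benchmark(unsigned pipes) {
  switch (pipes) {
    case 1:  return 1;    // stand-in for graph_pipeline_*_1_pipes(...)
    case 16: return 16;   // stand-in for graph_pipeline_*_16_pipes(...)
    default:
      throw std::runtime_error("can support only up to 16 pipes");
  }
}
```

Without the `default:` case, an unsupported pipe count would leave the timing variables unset and the benchmark would return garbage measurements.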
benchmarks/graph_pipeline/tbb.cpp

Lines changed: 4 additions & 16 deletions

```diff
@@ -491,23 +491,11 @@ std::chrono::microseconds measure_time_tbb(
       graph_pipeline_tbb_16_pipes(graph, num_lines);
       end = std::chrono::high_resolution_clock::now();
     break;
+
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
   }
 
-  //std::ofstream outputfile;
-  //outputfile.open("./tbb_result.txt", std::ofstream::app);
-  //for (auto r:result) {
-  //  outputfile << r << '\n';
-  //}
-
-  //std::ofstream outputfile;
-  //outputfile.open("./tbb_time.csv", std::ofstream::app);
-  //outputfile << num_threads << ','
-  //           << num_lines << ','
-  //           << pipes << ','
-  //           << graph.graph_size() << ','
-  //           << (std::chrono::duration_cast<std::chrono::microseconds>(end - beg).count())/1e3
-  //           << '\n';
-  //outputfile.close();
-
   return std::chrono::duration_cast<std::chrono::microseconds>(end - beg);
 }
```

benchmarks/linear_pipeline/taskflow.cpp

Lines changed: 4 additions & 0 deletions

```diff
@@ -510,6 +510,10 @@ std::chrono::microseconds measure_time_taskflow(
     case 16:
       elapsed = parallel_pipeline_taskflow_16_pipes(pipes, num_lines, num_threads, size);
     break;
+
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
   }
 
   //std::ofstream outputfile;
```
benchmarks/linear_pipeline/tbb.cpp

Lines changed: 4 additions & 0 deletions

```diff
@@ -717,6 +717,10 @@ std::chrono::microseconds measure_time_tbb(
       parallel_pipeline_tbb_16_pipes(pipes, num_lines, size);
       end = std::chrono::high_resolution_clock::now();
     break;
+
+    default:
+      throw std::runtime_error("can support only up to 16 pipes");
+    break;
   }
 
   //std::ofstream outputfile;
```

docs/ParallelPipeline.html

Lines changed: 1 addition & 1 deletion

```diff
@@ -67,7 +67,7 @@ <h3>Contents</h3>
 <li><a href="#ParallelPipelineLearnMore">Learn More about Taskflow Pipeline</a></li>
 </ul>
 </div>
-<p>Taskflow provides a <em>task-parallel</em> pipeline programming framework for you to create a <em>pipeline scheduling framework</em> to implement pipeline algorithms. Pipeline parallelism refers to a parallel execution of multiple data tokens through a linear chain of pipes or stages. Each stage processes the data token sent from the previous stage, applies the given callable to that data token, and then sends the result to the next stage. Multiple data tokens can be processed simultaneously across different stages.</p><section id="ParallelPipelineIncludeHeaderFile"><h2><a href="#ParallelPipelineIncludeHeaderFile">Include the Header</a></h2><p>You need to include the header file, <code>taskflow/algorithm/pipeline.hpp</code>, for creating a pipeline scheduling framework.</p><pre class="m-code"><span class="cp">#include</span> <span class="cpf">&lt;taskflow/algorithm/pipeline.hpp&gt;</span><span class="cp"></span></pre></section><section id="UnderstandPipelineScheduling"><h2><a href="#UnderstandPipelineScheduling">Understand the Pipeline Scheduling Framework</a></h2><p>A <a href="classtf_1_1Pipeline.html" class="m-doc">tf::<wbr />Pipeline</a> object is a <em>composable</em> graph to create a <em>pipeline scheduling framework</em> through a module task in a taskflow (see <a href="ComposableTasking.html" class="m-doc">Composable Tasking</a>). Unlike the conventional pipeline programming frameworks (e.g., Intel TBB Parallel <a href="classtf_1_1Pipeline.html" class="m-doc">Pipeline</a>), Taskflow&#x27;s pipeline algorithm does not provide any data abstraction, which often restricts users from optimizing data layouts in their applications, but a flexible framework for users to customize their application data atop an efficient pipeline scheduling framework.</p><div class="m-graph"><svg style="width: 22.250rem; height: 22.688rem;" viewBox="0.00 0.00 356.00 363.08">
+<p>Taskflow provides a <em>task-parallel</em> pipeline programming framework for you to create a <em>pipeline scheduling framework</em>. Pipeline parallelism refers to a parallel execution of multiple data tokens through a linear chain of pipes or stages. Each stage processes the data token sent from the previous stage, applies the given callable to that data token, and then sends the result to the next stage. Multiple data tokens can be processed simultaneously across different stages.</p><section id="ParallelPipelineIncludeHeaderFile"><h2><a href="#ParallelPipelineIncludeHeaderFile">Include the Header</a></h2><p>You need to include the header file, <code>taskflow/algorithm/pipeline.hpp</code>, for creating a pipeline scheduling framework.</p><pre class="m-code"><span class="cp">#include</span> <span class="cpf">&lt;taskflow/algorithm/pipeline.hpp&gt;</span><span class="cp"></span></pre></section><section id="UnderstandPipelineScheduling"><h2><a href="#UnderstandPipelineScheduling">Understand the Pipeline Scheduling Framework</a></h2><p>A <a href="classtf_1_1Pipeline.html" class="m-doc">tf::<wbr />Pipeline</a> object is a <em>composable</em> graph to create a <em>pipeline scheduling framework</em> through a module task in a taskflow (see <a href="ComposableTasking.html" class="m-doc">Composable Tasking</a>). Unlike the conventional pipeline programming frameworks (e.g., Intel TBB Parallel <a href="classtf_1_1Pipeline.html" class="m-doc">Pipeline</a>), Taskflow&#x27;s pipeline algorithm does not provide any data abstraction, which often restricts users from optimizing data layouts in their applications, but a flexible framework for users to customize their application data atop an efficient pipeline scheduling framework.</p><div class="m-graph"><svg style="width: 22.250rem; height: 22.688rem;" viewBox="0.00 0.00 356.00 363.08">
 <g transform="scale(1 1) rotate(0) translate(4 359.0782)">
 <title>Taskflow</title>
 <g class="m-cluster">
```
