Add new post about eddl part 2

2026-06-13 08:08:10 -07:00 · 2021-10-31 18:18:46 -04:00
parent 6f4c32d4fb
commit f86bce1774
20 changed files with 1572 additions and 153 deletions
@@ -441,42 +441,42 @@ More details in design and implementation can be found in late posts.</p>

 <h2 id="why-do-we-need-training-on-edge">Why do we need training on edge?</h2>

-<p>Cloud is not trustworthy anymore. More and more facts supports that breach on cloud happens frequently than before.
-Nowadays, with more generated personal sensitive data has been uploaded to the cloud center, tech company know better to someones than user themselves.</p>
+<p>The cloud is no longer trustworthy. A growing body of evidence shows that cloud breaches happen more frequently than before.
+Nowadays, as more sensitive personal data is uploaded to cloud data centers, tech companies know users better than users know themselves.</p>

-<p>Researchers, no matter in industry on academia, are working in a way that still learning from users’ data but also keeping raw sensitive data under users’ control.
-Many publications already showed feasibility of only sharing after-trained model instead of raw data.
+<p>Researchers, whether in industry or academia, are looking for ways to keep learning from users’ data while keeping raw sensitive data under users’ control.
+Many publications have already shown the feasibility of sharing only the trained model instead of raw data.
 One recent popular study on this is google’s <a href="https://ai.googleblog.com/2017/04/federated-learning-collaborative.html">federated learning</a>.</p>

-<p>During investigated this problem, we found that let end user train their own data is safe, but sacrifice efficiency.
+<p>While investigating this problem, we found that letting end users train on their own data is safe but sacrifices efficiency.
 Since one end device has limited resources, training time and power consumption can be disappointing.
-We believe there must have a leverage between privacy and efficiency in some target scenarios.</p>
+We believe there must be a balance between privacy and efficiency in some target scenarios.</p>

-<p>Fortunately, we observed that users who belongs to the same campus, plant, firm and community always share similar interests.
+<p>Fortunately, we observed that users who belong to the same campus, plant, firm, or community always share similar interests.
 Therefore, these co-located users have similar demands in using AI-involved routines.
-Also, co-located users are easily targeted by same type of threats, such as ransomware to financial practitioners.</p>
+Also, co-located users are easily targeted by the same types of threats, such as ransomware aimed at financial practitioners.</p>

-<p>Think about this, sending features of a new malware app to cloud services in order to train a neural networks used by antivirus program.
-This process may takes long time and small amount of samples may not be recognized by the global neural networks model.
-With a customized local model trained and deployed on the edge can successfully counter the problem.
-With edge training as a supplement of cloud training can achieve better response time and let the whole system more flexible.</p>
+<p>Consider sending the features of a new malware app to cloud services to train a neural network used by antivirus programs.
+This process may take a long time, and a small number of samples may not be recognized by the global neural network model.
+A customized local model trained and deployed on the edge can counter this problem.
+Using edge training as a supplement to cloud training can achieve better response time and make the whole system more flexible.</p>

 <h2 id="why-training-on-edge-is-hard">Why training on edge is hard?</h2>

-<p>Since all co-located users’ device can be used for an edge training, issues and challenges occur as deploying this distributed system.</p>
+<p>Since all co-located users’ devices can be used for edge training, issues and challenges arise when deploying this distributed system.</p>

 <p>The first challenge is <strong>struggling workers</strong>.
-Training devices are heterogeneity, from limited IoT camera to high-end media center with powerful GPU.
-They are not designed to do machine learnings.
-So, a good edge-based distributed learning framework must can handle variety speeds in training tasks.</p>
+Training devices are heterogeneous, from limited IoT cameras to high-end media centers with powerful GPUs.
+They are not designed to do machine learning.
+So, a good edge-based distributed learning framework must be able to handle a variety of speeds in training tasks.</p>

 <p>The second challenge is how to <strong>scale up</strong> clusters.
-In a campus, thousands and more devices may contribute computing resources to the same training tasks.
-However, these devices may located in far not matter in physical or in network topology. 
-How can we well use them well, without struggled with endless transmission time remains a challenge.</p>
+On a campus, thousands of devices or more may contribute computing resources to the same training task.
+However, these devices may be far apart, whether physically or in network topology.
+How to make good use of them without struggling with endless transmission time remains a challenge.</p>

 <p>The third issue is frequently <strong>joining and exiting</strong> of devices.
-We can’t rely on each devices to faithfully working on training tasks rather than their original workload.
+We can’t rely on each device to faithfully work on training tasks rather than its original workload.
 Smartly schedule work balance and handle join/exit issues also need under consideration.</p>

 <h2 id="our-proposal">Our proposal</h2>
@@ -485,29 +485,29 @@ Smartly schedule work balance and handle join/exit issues also need under consid
  <li>
    <p>Dynamic training data distribution and runtime profiler</p>

-    <p>We design a dynamic training data distribution mechanism that helps to both the first and the third challenges.
-  Preprocessing data can be transmitted without leakage of raw sensitive information. 
-  This can helps with struggling workers who can train small batches in order to upload parameters with a similar training time.
-  Also, for extremely slow devices, join and exit of devices cases, dynamic data distribution and profiler can helps with keep global training parameters from polluted and staleness.</p>
+    <p>We design a dynamic training data distribution mechanism that helps with both the first and the third challenges.
+  Preprocessed data can be transmitted without leaking raw sensitive information.
+  This helps struggling workers, which can train on smaller batches so that they upload parameters after a similar training time.
+  Also, for extremely slow devices and for devices that join and exit, dynamic data distribution and the profiler help keep the global training parameters from being polluted or going stale.</p>

-    <p>To counter heterogeneity’s, more approaches were applied in our later research.
-  More details were introduced to runtime profiler in the later works.</p>
+    <p>To counter heterogeneity, more approaches were applied in our later research.
+  More details about the runtime profiler are introduced in our later work.</p>
  </li>
  <li>
    <p>Asynchronous and synchronous aggregation enabled</p>

-    <p>In our findings, asynchronous and synchronous parameter update have their pros and cons. 
-  Keeping sync all the time leads struggling worker issue unsolvable.
-  However, async’s harm to accuracy and convergence time also need attentions.
-  To carefully chose between these two update policies at the runtime is what we proposed to make use of their own advantages.</p>
+    <p>In our findings, asynchronous and synchronous parameter updates each have their pros and cons.
+  Staying synchronous all the time leaves the struggling worker issue unsolvable.
+  However, the harm asynchronous updates do to accuracy and convergence time also needs attention.
+  We propose carefully choosing between these two update policies at runtime to make use of the advantages of each.</p>
  </li>
  <li>
    <p>Leader role splitting</p>

-    <p>The idea is to let worker devices with higher bandwidth taking leader role during training.
-  Parameter updating does not require much computation but only need bandwidth. 
+    <p>The idea is to let worker devices with higher bandwidth take the leader role during training.
+  Parameter updating does not require much computation but does need a great deal of bandwidth.
  Devices with sufficient bandwidth can also work as virtual leader devices.
-  This approach helps with minimize physical devices we used and more leaders can further scale up workers limits.</p>
+  This approach helps minimize the number of physical devices used, and more leaders can further raise the limit on the number of workers.</p>
  </li>
 </ul>
 </div><section class="article__sharing d-print-none"></section><div class="d-print-none"><footer class="article__footer"><meta itemprop="dateModified" content="2021-10-13T16:53:20-04:00"><!-- start custom article footer snippet -->
@@ -515,7 +515,7 @@ Smartly schedule work balance and handle join/exit issues also need under consid
 <!-- end custom article footer snippet -->
 <div class="article__subscribe"><div class="subscribe"><i class="fas fa-rss"></i> <a type="application/rss+xml" href="/feed.xml">Subscribe</a></div>
 </div><div class="article__license"></div></footer>
-<div class="article__section-navigator clearfix"><div class="previous"><span>PREVIOUS</span><a href="/posts/generate-word-cloud-with-chinese-fenci">Generate Word Cloud Figures with Chinese-Tokenization and WordCloud python libraries</a></div></div></div>
+<div class="article__section-navigator clearfix"><div class="previous"><span>PREVIOUS</span><a href="/posts/generate-word-cloud-with-chinese-fenci">Generate Word Cloud Figures with Chinese-Tokenization and WordCloud python libraries</a></div><div class="next"><span>NEXT</span><a href="/posts/eddl-how-do-we-train-on-limited-edge-devices-part2">EDDL: How do we train neural networks on limited edge devices - PART 2</a></div></div></div>

 </div>