Minor improvement.

frankfliu · frankfliu · commit 0c44284be335 · 2021-10-14T08:34:32.000-07:00
diff --git a/chapter_deep-learning-computation/custom-layer.ipynb b/chapter_deep-learning-computation/custom-layer.ipynb
@@ -299,7 +299,6 @@
    "source": [
     "NDArray input = manager.randomUniform(0, 1, new Shape(2, 5));\n",
     "\n",
-    "linear.setInitializer(new XavierInitializer(), Parameter.Type.WEIGHT);\n",
     "linear.initialize(manager, DataType.FLOAT32, input.getShape());\n",
     "\n",
     "Model model = Model.newInstance(\"my-linear\");\n",
@@ -328,7 +327,6 @@
     "SequentialBlock net = new SequentialBlock();\n",
     "net.add(new MyLinear(8, 64)); // 64 units in -> 8 units out\n",
     "net.add(new MyLinear(1, 8)); // 8 units in -> 1 unit out\n",
-    "net.setInitializer(new XavierInitializer(), Parameter.Type.WEIGHT);\n",
     "net.initialize(manager, DataType.FLOAT32, input.getShape());\n",
     "\n",
     "Model model = Model.newInstance(\"lin-reg-custom\");\n",
diff --git a/chapter_deep-learning-computation/parameters.ipynb b/chapter_deep-learning-computation/parameters.ipynb
@@ -332,18 +332,15 @@
     "\n",
     "This setup has the advantage that we don't have to worry about our `setInitializer()` overriding our previous `initializer`s on internal blocks!\n",
     "\n",
-    "If you want to however, you can explicitly set an initializer for a `Parameter` by calling its `setInitializer()` function directly and passing in `true` to the overwrite input.\n",
-    "Simply loop over all the parameters returned from `getParameters()` and set their initializers directly!"
+    "If you want to, however, you can explicitly set an initializer for a `Parameter` by calling its `setInitializer()` function directly."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Let us begin by calling on built-in initializers. \n",
-    "The code below initializes all parameters \n",
-    "to a given constant value 1, \n",
-    "by using the `ConstantInitializer()` initializer. \n",
+    "Let us begin with the built-in initializers. The code below initializes all parameters \n",
+    "to the constant value 1 by using the `ConstantInitializer()` initializer. \n",
     "\n",
     "Note that this will not do anything currently since we have already set\n",
     "our initializer in the previous code block.\n",
@@ -430,7 +427,7 @@
    },
    "outputs": [],
    "source": [
-    "SequentialBlock net = getNet();\n",
+    "net = getNet();\n",
     "net.setInitializer(new NormalInitializer(), Parameter.Type.WEIGHT);\n",
     "net.initialize(manager, DataType.FLOAT32, x.getShape());\n",
     "Block linearLayer = net.getChildren().valueAt(0);\n",
@@ -444,7 +441,7 @@
    "source": [
     "We can also apply different initializers for certain Blocks.\n",
     "For example, below we initialize the first layer\n",
-    "with the `Xavier` initializer\n",
+    "with the `XavierInitializer` initializer\n",
     "and initialize the second layer \n",
     "to a constant value of 0.\n",
     "\n",
@@ -464,7 +461,7 @@
    },
    "outputs": [],
    "source": [
-    "SequentialBlock net = new SequentialBlock();\n",
+    "net = new SequentialBlock();\n",
     "Linear linear1 = Linear.builder().setUnits(8).build();\n",
     "net.add(linear1);\n",
     "net.add(Activation.reluBlock());\n",
@@ -485,15 +482,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Finally, we can loop over the `ParameterList` and set their initializers individually.\n",
-    "When setting initializers directly on the `Parameter`, you must pass in an `overwrite`\n",
-    "boolean along with the initializer to declare whether you want your current\n",
-    "initializer to overwrite the previous initializer if one has already been set.\n",
-    "Here, we do want to overwrite and so pass in `true`. \n",
-    "\n",
-    "For this example, however, since we haven't set the `weight` initializers before, there is no initializer to overwrite so we could pass in `false` and still have the same outcome.\n",
-    "\n",
-    "However, since `bias` parameters are automatically set to initialize at 0, to properly set our intializer here, we have to set overwrite to `true`."
+    "Finally, we can call `Parameter.setInitializer()` directly to set the initializer of each parameter individually."
    ]
   },
   {
@@ -502,30 +491,16 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "SequentialBlock net = getNet();\n",
+    "net = getNet();\n",
     "ParameterList params = net.getParameters();\n",
-    "for (int i = 0; i < params.size(); i++) {\n",
-    "    // Here we interleave initializers.\n",
-    "    // We initialize parameters at even indexes to 0\n",
-    "    // and parameters at odd indexes to 2.\n",
-    "    Parameter param = params.valueAt(i);\n",
-    "    if (i % 2 == 0) {\n",
-    "        // All weight parameters happen to be at even indices.\n",
-    "        // We set them to initialize to 0.\n",
-    "        param.setInitializer(new ConstantInitializer(0));\n",
-    "    }\n",
-    "    else {\n",
-    "        // All bias parameters happen to be at odd indices.\n",
-    "        // We set them to initialize to 2.\n",
-    "        param.setInitializer(new ConstantInitializer(2));\n",
-    "    }\n",
-    "}\n",
-    "net.initialize(manager, DataType.FLOAT32, x.getShape());\n",
     "\n",
-    "for (var param : net.getParameters()) {\n",
-    "    System.out.println(param.getKey());\n",
-    "    System.out.println(param.getValue().getArray());\n",
-    "}"
+    "params.get(\"01Linear_weight\").setInitializer(new NormalInitializer());\n",
+    "params.get(\"03Linear_weight\").setInitializer(Initializer.ONES);\n",
+    "\n",
+    "net.initialize(manager, DataType.FLOAT32, new Shape(2, 4));\n",
+    "\n",
+    "System.out.println(params.valueAt(0).getArray());\n",
+    "System.out.println(params.valueAt(2).getArray());"
    ]
   },
   {
@@ -563,7 +538,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "class MyInit implements Initializer {\n",
+    "static class MyInit implements Initializer {\n",
     "\n",
     "    public MyInit() {}\n",
     "\n",
@@ -593,7 +568,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "SequentialBlock net = getNet();\n",
+    "net = getNet();\n",
     "net.setInitializer(new MyInit(), Parameter.Type.WEIGHT);\n",
     "net.initialize(manager, DataType.FLOAT32, x.getShape());\n",
     "Block linearLayer = net.getChildren().valueAt(0);\n",
diff --git a/chapter_deep-learning-computation/read-write.ipynb b/chapter_deep-learning-computation/read-write.ipynb
@@ -60,7 +60,7 @@
     "try (FileOutputStream fos = new FileOutputStream(\"x-file\")) {\n",
     "    fos.write(x.encode());\n",
     "}\n",
-    "x;"
+    "x"
    ]
   },
   {
@@ -89,7 +89,30 @@
     "    // from a `FileInputStream` and return it as a `byte[]`.\n",
     "    x2 = NDArray.decode(manager, Utils.toByteArray(fis));\n",
     "}\n",
-    "x2;"
+    "x2"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "We can also store an `NDList` in a file and load it back:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "NDList list = new NDList(x, x2);\n",
+    "try (FileOutputStream fos = new FileOutputStream(\"x-file\")) {\n",
+    "    fos.write(list.encode());\n",
+    "}\n",
+    "try (FileInputStream fis = new FileInputStream(\"x-file\")) {\n",
+    "    list = NDList.decode(manager, Utils.toByteArray(fis));\n",
+    "}\n",
+    "list"
    ]
   },
   {
diff --git a/chapter_linear-networks/linear-regression-djl.ipynb b/chapter_linear-networks/linear-regression-djl.ipynb
@@ -284,7 +284,7 @@
    "source": [
     "DefaultTrainingConfig config = new DefaultTrainingConfig(l2loss)\n",
     "    .optOptimizer(sgd) // Optimizer (loss function)\n",
-    "    .optDevices(Engine.getInstance().getDevices(1)) // single GPU\n",
+    "    .optDevices(manager.getEngine().getDevices(1)) // single GPU\n",
     "    .addTrainingListeners(TrainingListener.Defaults.logging()); // Logging\n",
     "\n",
     "Trainer trainer = model.newTrainer(config);"
diff --git a/chapter_linear-networks/softmax-regression-djl.ipynb b/chapter_linear-networks/softmax-regression-djl.ipynb
@@ -245,7 +245,7 @@
    "source": [
     "DefaultTrainingConfig config = new DefaultTrainingConfig(loss)\n",
     "    .optOptimizer(sgd) // Optimizer\n",
-    "    .optDevices(Engine.getInstance().getDevices(1)) // single GPU\n",
+    "    .optDevices(manager.getEngine().getDevices(1)) // single GPU\n",
     "    .addEvaluator(new Accuracy()) // Model Accuracy\n",
     "    .addTrainingListeners(TrainingListener.Defaults.logging()); // Logging\n",
     "\n",
@@ -307,7 +307,7 @@
     "int numEpochs = 3;\n",
     "\n",
     "EasyTrain.fit(trainer, numEpochs, trainingSet, validationSet);\n",
-    "trainer.getTrainingResult().getValidateEvaluation(\"Accuracy\")"
+    "var result = trainer.getTrainingResult();"
    ]
   },
   {
diff --git a/chapter_linear-networks/softmax-regression-scratch.ipynb b/chapter_linear-networks/softmax-regression-scratch.ipynb
@@ -66,7 +66,6 @@
     "        .optLimit(Long.getLong(\"DATASET_LIMIT\", Long.MAX_VALUE))\n",
     "        .build();\n",
     "\n",
-    "\n",
     "FashionMnist validationSet = FashionMnist.builder()\n",
     "        .optUsage(Dataset.Usage.TEST)\n",
     "        .setSampling(batchSize, false)\n",
diff --git a/chapter_natural-language-processing-pretraining/word-embedding-dataset.ipynb b/chapter_natural-language-processing-pretraining/word-embedding-dataset.ipynb
@@ -40,12 +40,6 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "import ai.djl.Device;\n",
-    "import ai.djl.Model;\n",
-    "import ai.djl.engine.Engine;\n",
-    "import ai.djl.ndarray.*;\n",
-    "import ai.djl.ndarray.index.NDIndex;\n",
-    "\n",
     "import java.util.stream.*;\n",
     "import org.apache.commons.math3.distribution.EnumeratedDistribution;"
    ]
diff --git a/chapter_recurrent-modern/machine-translation-and-dataset.ipynb b/chapter_recurrent-modern/machine-translation-and-dataset.ipynb
@@ -90,6 +90,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
+    "import java.nio.charset.*;\n",
     "import java.util.zip.*;\n",
     "import java.util.stream.*;"
    ]
@@ -132,38 +133,22 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "public static StringBuilder readDataNMT() throws IOException {\n",
-    "    File file = new File(\"./fra-eng.zip\");\n",
-    "    if (!file.exists()) {\n",
-    "        InputStream inputStream =\n",
-    "                new URL(\"http://d2l-data.s3-accelerate.amazonaws.com/fra-eng.zip\").openStream();\n",
-    "        Files.copy(\n",
-    "                inputStream, Paths.get(\"./fra-eng.zip\"), StandardCopyOption.REPLACE_EXISTING);\n",
-    "    }\n",
-    "\n",
-    "    ZipFile zipFile = new ZipFile(file);\n",
+    "public static String readDataNMT() throws IOException {\n",
+    "    DownloadUtils.download(\n",
+    "            \"http://d2l-data.s3-accelerate.amazonaws.com/fra-eng.zip\", \"fra-eng.zip\");\n",
+    "    ZipFile zipFile = new ZipFile(new File(\"fra-eng.zip\"));\n",
     "    Enumeration<? extends ZipEntry> entries = zipFile.entries();\n",
-    "    InputStream stream = null;\n",
     "    while (entries.hasMoreElements()) {\n",
     "        ZipEntry entry = entries.nextElement();\n",
     "        if (entry.getName().contains(\"fra.txt\")) {\n",
-    "            stream = zipFile.getInputStream(entry);\n",
-    "            break;\n",
+    "            InputStream stream = zipFile.getInputStream(entry);\n",
+    "            return new String(stream.readAllBytes(), StandardCharsets.UTF_8);\n",
     "        }\n",
     "    }\n",
-    "\n",
-    "    String[] lines;\n",
-    "    try (BufferedReader in = new BufferedReader(new InputStreamReader(stream))) {\n",
-    "        lines = in.lines().toArray(String[]::new);\n",
-    "    }\n",
-    "    StringBuilder output = new StringBuilder();\n",
-    "    for (int i = 0; i < lines.length; i++) {\n",
-    "        output.append(lines[i] + \"\\n\");\n",
-    "    }\n",
-    "    return output;\n",
+    "    return null;\n",
     "}\n",
     "\n",
-    "StringBuilder rawText = readDataNMT();\n",
+    "String rawText = readDataNMT();\n",
     "System.out.println(rawText.substring(0, 75));"
    ]
   },
@@ -188,7 +173,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "public static StringBuilder preprocessNMT(String text) {\n",
+    "public static String preprocessNMT(String text) {\n",
     "    // Replace non-breaking space with space, and convert uppercase letters to\n",
     "    // lowercase ones\n",
     "\n",
@@ -204,7 +189,7 @@
     "        }\n",
     "        out.append(currChar);\n",
     "    }\n",
-    "    return out;\n",
+    "    return out.toString();\n",
     "}\n",
     "\n",
     "public static boolean noSpace(Character currChar, Character prevChar) {\n",
@@ -213,7 +198,7 @@
     "            && prevChar != ' ';\n",
     "}\n",
     "\n",
-    "StringBuilder text = preprocessNMT(rawText.toString());\n",
+    "String text = preprocessNMT(rawText);\n",
     "System.out.println(text.substring(0, 80));"
    ]
   },
@@ -281,7 +266,9 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "for (String[] subArr : target.subList(0, 6)) System.out.println(Arrays.toString(subArr));"
+    "for (String[] subArr : target.subList(0, 6)) {\n",
+    "    System.out.println(Arrays.toString(subArr));\n",
+    "}"
    ]
   },
   {
@@ -407,9 +394,11 @@
     "public static int[] truncatePad(Integer[] integerLine, int numSteps, int paddingToken) {\n",
     "    /* Truncate or pad sequences */\n",
     "    int[] line = Arrays.stream(integerLine).mapToInt(i -> i).toArray();\n",
-    "    if (line.length > numSteps) return Arrays.copyOfRange(line, 0, numSteps);\n",
+    "    if (line.length > numSteps) {\n",
+    "        return Arrays.copyOfRange(line, 0, numSteps);\n",
+    "    }\n",
     "    int[] paddingTokenArr = new int[numSteps - line.length]; // Pad\n",
-    "    for (int i = 0; i < paddingTokenArr.length; i++) paddingTokenArr[i] = paddingToken;\n",
+    "    Arrays.fill(paddingTokenArr, paddingToken);\n",
     "\n",
     "    return IntStream.concat(Arrays.stream(line), Arrays.stream(paddingTokenArr)).toArray();\n",
     "}\n",
@@ -451,19 +440,20 @@
    "outputs": [],
    "source": [
     "public static Pair<NDArray, NDArray> buildArrayNMT(\n",
-    "        ArrayList<String[]> lines, Vocab vocab, int numSteps) {\n",
+    "        List<String[]> lines, Vocab vocab, int numSteps) {\n",
     "    /* Transform text sequences of machine translation into minibatches. */\n",
     "    List<Integer[]> linesIntArr = new ArrayList<>();\n",
-    "    for (int i = 0; i < lines.size(); i++) {\n",
-    "        linesIntArr.add(vocab.getIdxs(lines.get(i)));\n",
+    "    for (String[] strings : lines) {\n",
+    "        linesIntArr.add(vocab.getIdxs(strings));\n",
     "    }\n",
     "    for (int i = 0; i < linesIntArr.size(); i++) {\n",
-    "        ArrayList<Integer> temp = new ArrayList<>();\n",
-    "        temp.addAll(Arrays.asList(linesIntArr.get(i)));\n",
+    "        List<Integer> temp = new ArrayList<>(Arrays.asList(linesIntArr.get(i)));\n",
     "        temp.add(vocab.getIdx(\"<eos>\"));\n",
-    "        linesIntArr.set(i, temp.stream().toArray(n -> new Integer[n]));\n",
+    "        linesIntArr.set(i, temp.toArray(new Integer[0]));\n",
     "    }\n",
     "\n",
+    "    NDManager manager = NDManager.newBaseManager();\n",
+    "\n",
     "    NDArray arr = manager.create(new Shape(linesIntArr.size(), numSteps), DataType.INT32);\n",
     "    int row = 0;\n",
     "    for (Integer[] line : linesIntArr) {\n",
@@ -498,19 +488,18 @@
     "public static Pair<ArrayDataset, Pair<Vocab, Vocab>> loadDataNMT(\n",
     "        int batchSize, int numSteps, int numExamples) throws IOException {\n",
     "    /* Return the iterator and the vocabularies of the translation dataset. */\n",
-    "    StringBuilder text = preprocessNMT(readDataNMT().toString());\n",
-    "    Pair<ArrayList<String[]>, ArrayList<String[]>> pair =\n",
-    "            tokenizeNMT(text.toString(), numExamples);\n",
+    "    String text = preprocessNMT(readDataNMT());\n",
+    "    Pair<ArrayList<String[]>, ArrayList<String[]>> pair = tokenizeNMT(text, numExamples);\n",
     "    ArrayList<String[]> source = pair.getKey();\n",
     "    ArrayList<String[]> target = pair.getValue();\n",
     "    Vocab srcVocab =\n",
     "            new Vocab(\n",
-    "                    source.stream().toArray(String[][]::new),\n",
+    "                    source.toArray(String[][]::new),\n",
     "                    2,\n",
     "                    new String[] {\"<pad>\", \"<bos>\", \"<eos>\"});\n",
     "    Vocab tgtVocab =\n",
     "            new Vocab(\n",
-    "                    target.stream().toArray(String[][]::new),\n",
+    "                    target.toArray(String[][]::new),\n",
     "                    2,\n",
     "                    new String[] {\"<pad>\", \"<bos>\", \"<eos>\"});\n",
     "\n",
@@ -582,13 +571,6 @@
     "1. Try different values of the `numExamples` argument in the `loadDataNMT` function. How does this affect the vocabulary sizes of the source language and the target language?\n",
     "1. Text in some languages such as Chinese and Japanese does not have word boundary indicators (e.g., space). Is word-level tokenization still a good idea for such cases? Why or why not?\n"
    ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": []
   }
  ],
  "metadata": {