
Task 15: PIRBN subdirectory 1D_sine_function #536

Merged · 18 commits · Oct 20, 2023
100 changes: 100 additions & 0 deletions docs/zh/examples/pirbn.md
@@ -0,0 +1,100 @@
# PIRBN

## 1. Background

We have recently found that, after training, a physics-informed neural network (PINN) tends to become a local approximator. This observation motivated the development of a new kind of network, the physics-informed radial basis network (PIRBN), which maintains the local approximation property throughout training. Unlike deep neural networks, a PIRBN contains only one hidden layer and a radial basis "activation" function. Under appropriate conditions, we prove that training a PIRBN with gradient descent converges to a Gaussian process. We also study the training dynamics of PIRBN through neural tangent kernel (NTK) theory, and we carry out a comprehensive investigation of initialization strategies for PIRBN. Based on numerical examples, we find that PIRBN is more effective than PINN at solving nonlinear partial differential equations with high-frequency features and ill-conditioned computational domains. Moreover, existing PINN numerical techniques, such as adaptive learning, decomposition, and different types of loss functions, are also applicable to PIRBN.

## 2. Problem Definition

With the help of the NTK and NTK-based adaptive training methods, the performance of PINNs on problems with high-frequency features can be significantly improved. As an example, consider a partial differential equation and its boundary conditions:

$$
\begin{aligned}
& \frac{\mathrm{d}^2}{\mathrm{~d} x^2} u(x)-4 \mu^2 \pi^2 \sin (2 \mu \pi x)=0, \text { for } x \in[0,1] \\
& u(0)=u(1)=0
\end{aligned}
$$

where $\mu$ is a constant that controls the frequency features of the PDE solution.
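With the sign convention used in the accompanying code (main.py constructs the forcing term as $-4\mu^2\pi^2\sin(2\mu\pi x)$, and analytical_solution.py uses the matching reference solution), the exact solution is

$$
u(x) = \sin(2\mu\pi x),
$$

so larger values of $\mu$ give the network a higher-frequency target to fit.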

## 3. Solving the Problem

The following sections explain, step by step, how to translate this problem into PaddlePaddle code and solve it with deep learning.
To keep the walkthrough focused, only the key steps such as model construction, equation construction, and computational-domain construction are described; for the remaining details, please refer to the [API documentation](../api/arch.md).

### 3.1 Model Construction

For the PIRBN problem, the network is set up with the following PaddlePaddle code:

``` py linenums="44"
--8<--
jointContribution/PIRBN/main.py:44:46
--8<--
```
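The radial basis layer uses a Gaussian activation. Following the `gaussian_function` implementation quoted in the review discussion further below, each of the $n_{neu}$ neurons evaluates

$$
\phi_i(x) = \exp\left(-b^2\,(x - x_{0,i})^2\right), \qquad x_{0,i} = c_0 + i\,\frac{c_1 - c_0}{n_{neu} - 1},
$$

so the centers $x_{0,i}$ are spaced evenly over $[c_0, c_1]$ and $b$ controls the width of each basis function. In main.py, $n_{neu} = 61$, $b = 10.0$, and $[c_0, c_1]$ extends the computational domain slightly on both sides.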

### 3.2 Data Construction

The training data for this case is constructed as shown below:

``` py linenums="18"
--8<--
jointContribution/PIRBN/main.py:18:41
--8<--
```
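For orientation, a minimal sketch of what the referenced lines produce (values taken from main.py, shown here for `right_by=0` and `mu=4`):

``` py
import numpy as np

ns = 50                                    # number of collocation points
x_eq = np.linspace(0.0, 1.0, ns)[:, None]  # PDE residual points, shape (50, 1)
x_bc = np.array([[0.0], [1.0]])            # boundary points for u(0) = u(1) = 0
mu = 4
y = -4 * mu**2 * np.pi**2 * np.sin(2 * mu * np.pi * x_eq)  # PDE forcing term
```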

### 3.3 Training and Evaluation Setup

The training and evaluation setup configures the loss computation and the fields to return, as shown in the code below:

``` py linenums="59"
--8<--
jointContribution/PIRBN/train.py:59:97
--8<--
```
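Schematically, assuming the usual mean-squared PINN formulation (the `loss_g` and `loss_b` histories plotted in analytical_solution.py correspond to the two terms), the training loss combines a PDE-residual term over the $n_s$ collocation points and a boundary term over the two boundary points:

$$
L = L_g + L_b, \qquad
L_g = \frac{1}{n_s}\sum_{i=1}^{n_s}\left(\frac{\mathrm{d}^2 u}{\mathrm{d} x^2}(x_i) - f(x_i)\right)^2, \qquad
L_b = \frac{1}{2}\sum_{j=1}^{2} u\left(x_j^{bc}\right)^2,
$$

where $f$ is the forcing term constructed in section 3.2.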

### 3.4 Hyperparameter Settings

Next we specify the number of training epochs; based on our experiments we use 20001 epochs.

``` py linenums="47"
--8<--
jointContribution/PIRBN/main.py:47:47
--8<--
```
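In main.py, the epoch count also fixes the iterations at which the NTK matrix $K_{gg}$ is recorded for the plots in section 5:

``` py
maxiter = 20001
output_Kgg = [0, int(0.1 * maxiter), maxiter - 1]  # record Kg at these iterations
```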

### 3.5 Optimizer Construction

The training process calls an optimizer to update the model parameters; here we choose the `Adam` optimizer with a `learning_rate` of 1e-3.

``` py linenums="40"
--8<--
jointContribution/PIRBN/train.py:40:42
--8<--
```
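For reference, a minimal sketch of constructing such an optimizer in PaddlePaddle; `model` here stands in for the instantiated network, while the actual construction lives in the train.py lines referenced above:

``` py
import paddle

# assumes `model` is a paddle.nn.Layer such as RBN_Net
opt = paddle.optimizer.Adam(learning_rate=1e-3, parameters=model.parameters())
```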

### 3.6 Model Training and Evaluation

Model training and evaluation are launched as follows:

``` py linenums="99"
--8<--
jointContribution/PIRBN/train.py:99:106
--8<--
```
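In main.py, this loop is driven through the `Trainer` class:

``` py
train_obj = train.Trainer(
    rbn_loss,
    x,
    y,
    learning_rate=0.001,
    maxiter=maxiter,
    adaptive_weights=adaptive_weights,
)
train_obj.fit(output_Kgg)
```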

## 4. Complete Code

``` py linenums="1" title="main.py"
--8<--
jointContribution/PIRBN/main.py
--8<--
```

## 5. Results

The PIRBN case was run with epoch=20001 and `learning_rate`=1e-3; the resulting loss is 0.13567.

## 6. References

- [Physics-informed radial basis network (PIRBN): A local approximating neural network for solving nonlinear PDEs](https://arxiv.org/abs/2304.06234)
- <https://github.com/JinshuaiBai/PIRBN>
39 changes: 39 additions & 0 deletions jointContribution/PIRBN/README.md
@@ -0,0 +1,39 @@
# Physics-informed radial basis network (PIRBN)

This repository provides numerical examples of the **physics-informed radial basis network** (**PIRBN**).

The physics-informed neural network (PINN) has recently gained increasing interest in computational mechanics.

This work starts by studying the training dynamics of PINNs via the neural tangent kernel (NTK) theory. Based on numerical experiments, we found:

- PINNs tend to be a **local approximator** during training
- For PINNs that fail to be a local approximator, the physics-informed loss can hardly be minimised through training

Inspired by these findings, we proposed the PIRBN, which naturally exhibits the local property. It has been demonstrated that the NTK theory is applicable to PIRBN. Besides, other PINN techniques can be directly migrated to PIRBNs.

Numerical examples include:

- 1D sine function (**Eq. 13** in the manuscript)

  **PDE**: $\frac{\partial^2}{\partial x^2}u(x)-4\mu^2\pi^2 \sin(2\mu\pi x)=0,\ x\in[0,1]$
**Contributor:** The earlier changes are all fine and meet the requirements, but a few points about the figure results are still unclear, so to spell them out, the figure metrics in the paper are:

1. Fig. 1: single-layer PINN (not PIRBN), Eq. (13), mu = 4, tanh activation; the paper shows it barely converges
2. Fig. 2: single-layer PINN (not PIRBN), Eq. (13), mu changed to 8, tanh activation; ill-conditioned, and the paper's figure shows very poor results
3. Fig. 3: single-layer PINN (not PIRBN), changed to Eq. (15), mu = 4, tanh activation; ill-conditioned, and the paper's figure shows very poor results
4. Fig. 6: single-layer PIRBN (the source-code implementation), changed to Eq. (15), mu changed to 8, changed to the Gaussian activation; excellent results, demonstrating that PIRBN is very useful
**Contributor:** Also, the formula here does not seem to match the source code.

**Contributor Author:** Fig. 1, Fig. 2 and Fig. 3 use tanh and are inconsistent with the paper; Fig. 6 changes the activation function and is consistent with the paper. (Result images for Fig. 1, Fig. 2, Fig. 3 and Fig. 6 attached.)

The formula is consistent with the source code. (Image attached.)

**Contributor Author:** Settings for the different activation functions:

``` py
    def gaussian_function(self, temp_x):
        x0 = (
            paddle.reshape(
                paddle.arange(self.n_neu, dtype=paddle.get_default_dtype()),
                (1, self.n_neu),
            )
            * (self.c[1] - self.c[0])
            / (self.n_neu - 1)
            + self.c[0]
        )
        x_new = temp_x - x0
        s = self.b * self.b
        return paddle.exp(-(x_new * x_new) * s)

    def tanh_function(self, temp_x):
        return paddle.tanh(temp_x)
```

**Contributor:** The source code may have a problem. I reproduced Fig. 1 with the reference implementation cited by this paper, PredictiveIntelligenceLab/PINNsNTK@18ef519 (image attached). The `jacobian` function may be the issue; consider checking it against that repository.

**BC**: $u(0)=u(1)=0.$

- 1D sine function (**Eq. 15** in the manuscript)

  **PDE**: $\frac{\partial^2}{\partial x^2}u(x-100)-4\mu^2\pi^2 \sin(2\mu\pi(x-100))=0,\ x\in[100,101]$

  **BC**: $u(100)=u(101)=0.$

For more details on the mathematical proofs and numerical examples, please refer to our paper.

# Link

<https://doi.org/10.1016/j.cma.2023.116290>

<https://github.com/JinshuaiBai/PIRBN>

# Environmental settings

```
pip install -r requirements.txt
```
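The 1D sine example can then be launched directly; main.py calls `sine_function_main` at module level, so a plain interpreter invocation should suffice:

```
python main.py
```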
84 changes: 84 additions & 0 deletions jointContribution/PIRBN/analytical_solution.py
@@ -0,0 +1,84 @@
import os

import matplotlib.pyplot as plt
import numpy as np
import paddle


def output_fig(train_obj, mu, b, right_by, activation_function, output_Kgg):
plt.figure(figsize=(15, 9))
rbn = train_obj.pirbn.rbn

output_dir = os.path.join(os.path.dirname(__file__), "output")
if not os.path.exists(output_dir):
os.mkdir(output_dir)

# Comparisons between the network predictions and the ground truth.
plt.subplot(2, 3, 1)
ns = 1001
dx = 1 / (ns - 1)
xy = np.zeros((ns, 1)).astype(np.float32)
for i in range(0, ns):
xy[i, 0] = i * dx + right_by
y = rbn(paddle.to_tensor(xy))
y = y.numpy()
y_true = np.sin(2 * mu * np.pi * xy)
plt.plot(xy, y_true)
plt.plot(xy, y, linestyle="--")
plt.legend(["ground truth", "predict"])
plt.xlabel("x")

# Point-wise absolute error plot.
plt.subplot(2, 3, 2)
xy_y = np.abs(y_true - y)
plt.plot(xy, xy_y)
plt.ylim(top=np.max(xy_y))
plt.ylabel("Absolute Error")
plt.xlabel("x")

# Loss history of the network during the training process.
plt.subplot(2, 3, 3)
loss_g = train_obj.loss_g
x = range(len(loss_g))
plt.yscale("log")
plt.plot(x, loss_g)
plt.plot(x, train_obj.loss_b)
plt.legend(["Lg", "Lb"])
plt.ylabel("Loss")
plt.xlabel("Iteration")

    # Visualise the NTK after initialisation: the normalised Kg at the 0th iteration.
plt.subplot(2, 3, 4)
index = str(output_Kgg[0])
K = train_obj.ntk_list[index].numpy()
plt.imshow(K / (np.max(abs(K))), cmap="bwr", vmax=1, vmin=-1)
plt.colorbar()
plt.title(f"Kg at {index}-th iteration")
plt.xlabel("Sample point index")

    # Visualise the NTK during training: the normalised Kg at the 2000th iteration.
plt.subplot(2, 3, 5)
index = str(output_Kgg[1])
K = train_obj.ntk_list[index].numpy()
plt.imshow(K / (np.max(abs(K))), cmap="bwr", vmax=1, vmin=-1)
plt.colorbar()
plt.title(f"Kg at {index}-th iteration")
plt.xlabel("Sample point index")

    # The normalised Kg at the 20000th iteration, i.e. at the end of training.
plt.subplot(2, 3, 6)
index = str(output_Kgg[2])
K = train_obj.ntk_list[index].numpy()
plt.imshow(K / (np.max(abs(K))), cmap="bwr", vmax=1, vmin=-1)
plt.colorbar()
plt.title(f"Kg at {index}-th iteration")
plt.xlabel("Sample point index")

plt.savefig(
os.path.join(
output_dir, f"sine_function_{mu}_{b}_{right_by}_{activation_function}.png"
)
)

# Save data
# scipy.io.savemat(os.path.join(output_dir, "out.mat"), {"NTK": a, "x": xy, "y": y})
36 changes: 36 additions & 0 deletions jointContribution/PIRBN/jacobian_function.py
@@ -0,0 +1,36 @@
import paddle


def flat(x, start_axis=0, stop_axis=None):
    # TODO Error if use paddle.flatten -> The Op flatten_grad doesn't have any gradop
    stop_axis = None if stop_axis is None else stop_axis + 1
    shape = x.shape

    # [3, 1] --flat--> [3]
    # [2, 2] --flat--> [4]
    temp = shape[start_axis:stop_axis]
    # The flattened length is the product of the folded dims. (Summing after
    # zeroing size-1 dims only works when at most one folded dim exceeds 1.)
    flat_size = 1
    for s in temp:
        flat_size *= s
    head = shape[0:start_axis]
    body = [flat_size]
    tail = [] if stop_axis is None else shape[stop_axis:]
    new_shape = head + body + tail
    x_flat = x.reshape(new_shape)
    return x_flat


def jacobian(y, x):
    J_shape = y.shape + x.shape
    J = paddle.zeros(J_shape)
    y_flat = flat(y)
    J_flat = flat(
        J, start_axis=0, stop_axis=len(y.shape) - 1
    )  # partially flatten J so that its leading axis matches y_flat
    for i, y_i in enumerate(y_flat):
        # retain_graph=True keeps the graph alive for the next loop iteration
        grad = paddle.grad(y_i, x, retain_graph=True, allow_unused=True)[
            0
        ]  # grad == d y_flat[i] / d x, with the same shape as x
        if grad is None:
            grad = paddle.zeros_like(x)
        J_flat[i] = grad
    return J_flat.reshape(J_shape)
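

# A minimal usage sketch of the helpers above (illustrative only; the example
# tensor values are arbitrary):
#
#     import paddle
#     from jacobian_function import jacobian
#
#     x = paddle.to_tensor([[1.0, 2.0], [3.0, 4.0]], stop_gradient=False)
#     y = x.exp()
#     J = jacobian(y, x)  # shape [2, 2, 2, 2]: J[i, j, k, l] == d y[i, j] / d x[k, l]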
51 changes: 51 additions & 0 deletions jointContribution/PIRBN/jacobian_test.py
@@ -0,0 +1,51 @@
import torch

__all__ = []


input = [[1.0, 2.0], [3.0, 4.0]]
# input = [[1.0, 2.0],]
print("input = ", input)
x = torch.tensor(input, requires_grad=True)
print("\ninput.shape = ", x.shape)

# y = x
def exp_reducer(x):
return x.exp()


J = torch.autograd.functional.jacobian(exp_reducer, x)
print("torch Jacobian matrix = ", J)
print("torch Jacobian shape = ", J.shape)


# paddle.fluid.core.set_prim_eager_enabled(True)
# input = ((1.0, 2.0),)
# x = paddle.to_tensor(input)
# print("\ninput.shape = ", x.shape)

# x.stop_gradient = False
# y = x.exp()
# J = jacobian(y, x)
# print("paddle Jacobian matrix= ", J)

# input_list = [
# ((1.0, 2.0),),
# ((1.0, 2.0),(3.0, 4.0)),
# ]
# @pytest.mark.parametrize("input", input_list)
# def test_matrix_jacobian(input):
# x = paddle.to_tensor(input)
# print("\ninput.shape = ", x.shape)
# x.stop_gradient = False
# y = x
# J = jacobian(y, x)
# test_result = J
# x = torch.tensor(input, requires_grad=True)
# def exp_reducer(x):
# return x
# J = torch.autograd.functional.jacobian(exp_reducer, x)
# expected_result = paddle.to_tensor(J.numpy())

# # check result whether is equal
# assert paddle.allclose(expected_result, test_result)
72 changes: 72 additions & 0 deletions jointContribution/PIRBN/main.py
@@ -0,0 +1,72 @@
import analytical_solution
import numpy as np
import pirbn
import rbn_net
import train

import ppsci

# set random seed for reproducibility
SEED = 2023
ppsci.utils.misc.set_random_seed(SEED)

# mu: constant controlling the frequency of the PDE solution (Fig. 1, page 5)
# right_by: shift of the computational domain, see Eq. (15), page 5
def sine_function_main(
mu, adaptive_weights, right_by=0, activation_function="gaussian"
):
    # Define the number of sample points
    ns = 50

    # Initialise the sample points' coordinates on [right_by, right_by + 1]
    # (right_by = 100 reproduces the shifted domain of Eq. (15))
    x_eq = np.linspace(right_by, right_by + 1.0, ns)[:, None]
    x_bc = np.array([[right_by + 0.0], [right_by + 1.0]])
    x = [x_eq, x_bc]

    # Forcing term of the PDE: d^2u/dx^2 = -4 mu^2 pi^2 sin(2 mu pi x)
    y = -4 * mu**2 * np.pi**2 * np.sin(2 * mu * np.pi * x_eq)
# Set up radial basis network
n_in = 1
n_out = 1
n_neu = 61
b = 10.0
c = [right_by - 0.1, right_by + 1.1]

# Set up PIRBN
rbn = rbn_net.RBN_Net(n_in, n_out, n_neu, b, c, activation_function)
rbn_loss = pirbn.PIRBN(rbn, activation_function)
maxiter = 20001
output_Kgg = [0, int(0.1 * maxiter), maxiter - 1]
train_obj = train.Trainer(
rbn_loss,
x,
y,
learning_rate=0.001,
maxiter=maxiter,
adaptive_weights=adaptive_weights,
)
train_obj.fit(output_Kgg)

# Visualise results
analytical_solution.output_fig(
train_obj, mu, b, right_by, activation_function, output_Kgg
)


# NOTE: adaptive_weights has no default value, so it must be passed in every call.
# Fig.1
sine_function_main(mu=4, right_by=0, activation_function="tanh", adaptive_weights=True)
# Fig.2
# sine_function_main(mu=8, right_by=0, activation_function="tanh", adaptive_weights=True)
# Fig.3
# sine_function_main(mu=4, right_by=100, activation_function="tanh", adaptive_weights=True)
# Fig.6
# sine_function_main(mu=8, right_by=100, activation_function="gaussian", adaptive_weights=True)