2018 01 10
- Bug fix: https://github.com/PaddlePaddle/Paddle/pull/7308#pullrequestreview-87361282
- Inference engine design doc: https://github.com/PaddlePaddle/Paddle/pull/7315#pullrequestreview-87330611
- Merge buildtools repo into paddle: https://github.com/PaddlePaddle/Paddle/pull/7326
- Fluid Inference: https://github.com/PaddlePaddle/Paddle/pull/7339#pullrequestreview-87633025
- Finishing up Android Camera Demo https://github.com/PaddlePaddle/Mobile/pull/61
[WIP] Unittests for MultiDevice
- https://github.com/PaddlePaddle/Paddle/pull/7396
- https://github.com/PaddlePaddle/Paddle/pull/7381
- https://github.com/PaddlePaddle/Paddle/pull/7292
[WIP] Performance tuning for LoD related op
- https://github.com/PaddlePaddle/Paddle/pull/7247
- https://github.com/PaddlePaddle/Paddle/pull/7240
- https://github.com/PaddlePaddle/Paddle/pull/7219
Performance enhancements
Code style enhancement
- Distributed Fluid image classification: https://github.com/PaddlePaddle/Paddle/pull/7435
- Async send recv: https://github.com/PaddlePaddle/Paddle/pull/7133
- Adapt warp-CTC grad op for gradient checking
- Add python API for warp-CTC op
- Review:
- Add learning curve plot script
- Profile the vgg16 and mnist models
Memory optimization on Fluid:
Memory optimization result (in bytes):
| Model | Before (bytes) | After (bytes) | Saving |
|---|---|---|---|
| VGG | 1729499136 | 1132920832 | 34.5% |
| Resnet | 1277345792 | 666173440 | 47.8% |
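
The Saving column follows from the Before and After figures as (before - after) / before. A minimal sketch that recomputes it from the byte counts in the table; the helper name `saving_percent` is hypothetical and used only for illustration, it is not part of Paddle:

```python
# Recompute the "Saving" column from the byte counts reported in the table.
# `saving_percent` is a hypothetical helper name used only for this sketch.

def saving_percent(before: int, after: int) -> float:
    """Return the memory saved as a percentage of `before`."""
    return (before - after) / before * 100.0


# (before, after) in bytes, copied from the table above.
results = {
    "VGG": (1729499136, 1132920832),
    "Resnet": (1277345792, 666173440),
}

for model, (before, after) in results.items():
    saved_mib = (before - after) / 2**20  # bytes -> MiB
    print(f"{model}: {saving_percent(before, after):.1f}% saved ({saved_mib:.0f} MiB)")
```

Reporting the absolute difference alongside the percentage makes it easier to compare models whose baselines differ.
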
Memory optimization transpiler:
- add memory optimization design doc
- add simple memory optimization transpiler
- Refine memory optimization transpiler
Memory optimization Demo:
- Memory optimization for fit a line demo
- memory optimization for vgg and resnet in image classification demo
Fix and enhance:
- add general memory usage interface for both CPU/CUDA
- refine batch norm python layer
- set stop gradient for mask in dropout layer
- Add and merge sequence erase operator
- Update and merge edit distance operator
- Update and merge the parsing part of profiler for Fluid
- Code Review:
- OpenBLAS:
- update Openblas benchmark: https://github.com/PaddlePaddle/Paddle/pull/7295/
- auto set openblas env in v1/v2: https://github.com/PaddlePaddle/Paddle/pull/7397
- Inference: [WIP] Create an independent project, similar to the capi dense example, to verify linking against the Fluid library.
- fix compile error in profiler.cc: https://github.com/PaddlePaddle/Paddle/pull/7290
- code review:
- Stop gradient when pool_type=='max': https://github.com/PaddlePaddle/Paddle/pull/7284
- Add grad_op_maker for sequence_pool: https://github.com/PaddlePaddle/Paddle/pull/7363
- Add flag use_mkl_packed: https://github.com/PaddlePaddle/Paddle/pull/7183/
- capi package: https://github.com/PaddlePaddle/Paddle/pull/7237, https://github.com/PaddlePaddle/Paddle/pull/7255
- fix the ds2 issue in MKLDNNLayer: https://github.com/PaddlePaddle/Paddle/pull/7395
- Error clip design doc and implementation:
- Test error clip on MNIST
- Review of adversarial sample generating model from Baidu security lab
- WhileOp code clean
- Other reviews:
- Fix adagrad op by removing broadcast of Eigen.
- Fix the docstring of 'filter groups' in img_conv_layer.
- Enhance learning curve plot script.
- ResNet-50 performance comparison with TensorFlow.
- Follow comments to merge the C-API documentation.
- Issues about plans for the NMT task.
- To implement NMT, some enhancements are required to the current implementation:
- Support padding_idx in the lookup_table_op: https://github.com/PaddlePaddle/Paddle/issues/7309
- Enhance matmul_op to support 4-D inputs: https://github.com/PaddlePaddle/Paddle/issues/7319
- A general purpose normalization operator: https://github.com/PaddlePaddle/Paddle/issues/7350
- TODO list for RNN search: https://github.com/PaddlePaddle/Paddle/issues/7357
- TODO list for Transformer: https://github.com/PaddlePaddle/Paddle/issues/7355
- TODO list for ConvS2S: https://github.com/PaddlePaddle/Paddle/issues/7353
- To implement NMT, some enhancements are required to the current implementation:
- Enhance reorder_lod_tensor_by_rank_op to support reordering Tensor (Merged)
- ConvS2S related paper and code reading
- PR
- https://github.com/PaddlePaddle/Paddle/pull/7429 book chapter 6, distributed version
- Issue
- Mobile
- Follow up on switching OCR's mobile inference library to the Paddle mobile inference library. At present, the Paddle mobile inference library outperforms OCR's inference library in both memory usage and computational performance.
- https://github.com/PaddlePaddle/Paddle/pull/7441
- Refine broadcast_scale layer
- Implement SE-ResNeXt for image classification
- Minor doc fixes
- Image
- Work plan on image: https://github.com/PaddlePaddle/Paddle/issues/7253
- SSD design on Fluid: https://github.com/PaddlePaddle/Paddle/issues/7402
- Code Review:
- Fix the docstring of 'filter groups' in img_conv_layer. https://github.com/PaddlePaddle/Paddle/pull/7301
- Add COWPtr and its unittest. https://github.com/PaddlePaddle/Paddle/pull/7240
- Correctly handle lod information of image operators: https://github.com/PaddlePaddle/Paddle/pull/7219
- Release the middle layer's output memory: https://github.com/PaddlePaddle/Paddle/pull/7159
- Add the parsing part for the profiling tool: https://github.com/PaddlePaddle/Paddle/pull/7043
- Add edit distance operator: https://github.com/PaddlePaddle/Paddle/pull/5300#pullrequestreview-87720377
- Finished end-to-end demo of how to integrate VisualDL into MXNet while training MNIST and visualize the accuracy using VisualDL’s Scalar component https://github.com/PaddlePaddle/VisualDL/tree/demo_mxnet
- Recorded a 5-min video for Baidu USA
- [WIP] Integrate VisualDL’s Graph into MXNet (blocked by onnx-mxnet conversion)
- Translated PaddlePaddle brochure
- update prior box operator
- SSD discussion
- Fluid Heterogeneous device support
- runtime switch for sync GPU operators
- fix init device pools
- add layout infer
- refine CUDNNDeviceContext
- benchmark
- add cifar10 resnet: https://github.com/dzhwinter/benchmark/pull/42
- refine benchmark repository: https://github.com/dzhwinter/benchmark/pull/41
- Reviews
- Add static_input [WIP]: https://github.com/PaddlePaddle/Paddle/pull/7434
- Fix sequence_pool: https://github.com/PaddlePaddle/Paddle/pull/7363, https://github.com/PaddlePaddle/Paddle/pull/7284
- Bug fix for backward: https://github.com/PaddlePaddle/Paddle/pull/7278
- Enhance shrink_memory_op: https://github.com/PaddlePaddle/Paddle/pull/7176
- Code Review
- https://github.com/dzhwinter/benchmark/pull/41#pullrequestreview-87140258
- https://github.com/dzhwinter/benchmark/pull/45#pullrequestreview-87408295
- https://github.com/dzhwinter/benchmark/pull/42#pullrequestreview-87719102
- parallel_do
- update distributed design doc: https://github.com/PaddlePaddle/Paddle/pull/5776
- variable split to multi-servers: https://github.com/PaddlePaddle/Paddle/pull/7249
- reviews https://github.com/PaddlePaddle/Paddle/pull/7133, https://github.com/PaddlePaddle/Paddle/pull/7276
- distributed benchmark (DOING)
PR
- Add design doc: concurrent data transfer and kernel execution
- [WIP] Add Channel
- Enable mnist.py to run on GPU
Code Review
- Feature/use cudnn
- fix layout transform
- Inference
- Format proposal for saving models in Fluid: https://github.com/PaddlePaddle/Paddle/issues/7329
- Design of InferenceProgramDesc: https://github.com/PaddlePaddle/Paddle/issues/7328
- Work on designing Inference to remove Pickle: https://github.com/PaddlePaddle/Paddle/issues/7221
- Survey on how TensorFlow and Caffe2 prefer storing models: https://github.com/PaddlePaddle/Paddle/issues/7222
- PR review
- NMT
- Read and discuss the Transformer paper (https://arxiv.org/abs/1706.03762) for NMT implementation in Fluid
- TODO list for RNN search: https://github.com/PaddlePaddle/Paddle/issues/7357
- TODO list for Transformer: https://github.com/PaddlePaddle/Paddle/issues/7355
- Fixing the html to not render io file in Fluid: https://github.com/PaddlePaddle/Paddle/issues/7335
- Inference
- Design and code for InferenceDesc https://github.com/PaddlePaddle/Paddle/pull/7367
- https://github.com/PaddlePaddle/Paddle/pull/7315
- Pytorch Survey https://github.com/PaddlePaddle/Paddle/issues/7224
- Plan Inference ProgramDesc optimization https://github.com/PaddlePaddle/Paddle/issues/7417
- NMT
- Study the Transformer paper (https://arxiv.org/abs/1706.03762) for NMT implementation in Fluid
- TODO list for RNN search: https://github.com/PaddlePaddle/Paddle/issues/7357
- TODO list for Transformer: https://github.com/PaddlePaddle/Paddle/issues/7355
- Refine Memory optimization Doc https://github.com/PaddlePaddle/Paddle/pull/7420
- Fix issues https://github.com/PaddlePaddle/Paddle/issues/7418 and https://github.com/PaddlePaddle/Paddle/issues/6957 in Elementwise Ops - PR https://github.com/PaddlePaddle/Paddle/pull/7445
- https://github.com/PaddlePaddle/Paddle/pull/7178/files#r160868932
- Fix Bias attr in Batch norm https://github.com/PaddlePaddle/Paddle/pull/7449
- PR Review - https://github.com/PaddlePaddle/Paddle/pull/7457#pullrequestreview-88257754
- Fluid
- List the operators that will be used in inference (for Movidius)
- Add design doc of inference API for Fluid
- Port WarpCTC Operator
- Review and discussion about storing format
- Add link from home page to book (https://github.com/PaddlePaddle/PaddlePaddle.org/pull/369)
- Research and planning for Master process dashboard (https://github.com/PaddlePaddle/Paddle/projects/41)
- Distributed Training
- understanding code
- reviews:
- Distributed label semantic role PR: https://github.com/PaddlePaddle/Paddle/pull/7425
- Fluid
- Mixed Device
- mixed device design https://github.com/PaddlePaddle/Paddle/pull/7412
- CPU/GPU transform (merged) https://github.com/PaddlePaddle/Paddle/pull/7085
- add back priority https://github.com/PaddlePaddle/Paddle/pull/7294
- reorganize data transform related code https://github.com/PaddlePaddle/Paddle/pull/7391
- [WIP] init complete data layout transform https://github.com/PaddlePaddle/Paddle/pull/7440
- beam search
- [WIP] NMT model https://github.com/PaddlePaddle/Paddle/pull/7340
- add lod tensor ToAbsOffset test https://github.com/PaddlePaddle/Paddle/pull/7372
- Topk share lod https://github.com/PaddlePaddle/Paddle/pull/7373
- change data type of beam_search op https://github.com/PaddlePaddle/Paddle/pull/7374
- fix GetDims bug https://github.com/PaddlePaddle/Paddle/pull/7340
- Review
- set stop gradient for mask in dropout layer: https://github.com/PaddlePaddle/Paddle/pull/7390
- refine batch norm python layer: https://github.com/PaddlePaddle/Paddle/pull/7348
- Add Fluid Compiler design doc https://github.com/PaddlePaddle/Paddle/pull/7178
- PR:
- Remove pickling tools for inference example: https://github.com/PaddlePaddle/Paddle/pull/7262
- Fluid documentation: https://github.com/PaddlePaddle/Paddle/pull/7151
- Fluid documentation: https://github.com/PaddlePaddle/Paddle/pull/7274
PR review:
- Fluid compiler: https://github.com/PaddlePaddle/Paddle/pull/7178
- Inference design: https://github.com/PaddlePaddle/Paddle/pull/7315
Others:
- NMT project: reading and discussing transformer paper
- TODO list for NMT: https://github.com/PaddlePaddle/Paddle/issues/7355
- Understanding fluid code for inference (https://github.com/PaddlePaddle/Paddle/issues/7145)
- Inference
- Discussions on the Fluid inference framework design
- Remove dependency on Pickle https://github.com/PaddlePaddle/Paddle/pull/7339
- Document and User Experience
- NMT
- Read and discuss the transformer paper
- To-do list for transformer: https://github.com/PaddlePaddle/Paddle/issues/7355
- Review
- https://github.com/PaddlePaddle/PaddlePaddle.org/pull/371 Initial setup for the VisualDL subdomain homepage
- Register visualdl.ai domain
- Paddleboard project direction discussion and exploration on debugger.
- Paddleboard visualization research and prototyping
- VisualDL design specification building, logo design