V1.1.88 Release
What's Changed
- Fixed multi_device compiling error in gcc 12 by @chenfeiyue-cfy in #501
- add custom op in lenet test by @zhengzhouheng in #504
- Fixed roi_align golden mismatch error by @chxin66 in #509
- Added minimum unit test by @chxin66 in #449
- Update x86_64_linux/include for 22Q3 by @chenfeiyue-cfy in #506
- Set RNN internal dtype by @xuke537 in #511
- Fixed BidirectionalSequenceLSTM bug by @chenfeiyue-cfy in #514
- add rnn by @MESeraph in #513
- Fixed average pool layout infer by @chxin66 in #508
- diabled a failed case in conv2d by @chxin66 in #517
- fix typo for graph_test.cc by @antkillerfarm in #516
- Add BUILD_WITH_BAZEL option, marco of VSI_FEAT_OP_XXX should behind h… by @MercuryChen in #518
- Added layout_inference for UnidirectionalRnn by @chenfeiyue-cfy in #519
- Fixed BidirectionalSequenceRnn bugs by @chenfeiyue-cfy in #520
- Added HashtableLookup Op by @chenfeiyue-cfy in #524
- Modified README.md about rnn&lstm by @chenfeiyue-cfy in #522
- Added invalidate handle marco by @chenfeiyue-cfy in #525
- Fix maxpoolgrad by @MercuryChen in #526
- Added EmbeddingLookup op by @chenfeiyue-cfy in #523
- Added pad_v2 & pad_v2 layout infer & unit_test by @chxin66 in #533
- fixed groupconv2d params in internal by @chenfeiyue-cfy in #535
- Fix groupconv both in TIM-VX by @chenfeiyue-cfy in #532
- Added EmbeddingLookup 4d support internal by @chenfeiyue-cfy in #534
- Fixed (grouped)conv2d layout infer bug by @chxin66 in #527
- Fixed bug for pad_v2 by @chxin66 in #540
- Added a case for fc by @chxin66 in #537
- mapped pool1d by @MESeraph in #536
- Fixed wrong type of hashtable_lookup unit test by @chenfeiyue-cfy in #538
- Added EmbeddingLookup op 4d lut unit test by @chenfeiyue-cfy in #531
- Added reduce_all layoutinfer & a reduce_all case by @chxin66 in #543
- Fixed Invalid read of size 8 reported by Valgrind by @HuanyuCai in #545
- fix depth2space mode enum by @MESeraph in #547
- update copyright information by @antkillerfarm in #549
- Fixed grouped_conv2d layout infer & Added cases by @chxin66 in #546
- complete custom op readme by @zhengzhouheng in #552
- Refine Depth2Space op by @chenfeiyue-cfy in #553
- Introduce CMAKE option TIM_VX_DBG_ENABLE_TENSOR_HNDL=ON by @sunshinemyson in #550
- Fixed deconv2d layout infer bug by @chxin66 in #557
- Added case for hardswish by @chxin66 in #560
- Fix mirror pad param mismatch by @zhengzhouheng in #568
- Vts inputs order bug by @chxin66 in #567
- remove tensor GetDataRef api by @zhengzhouheng in #569
- Fixed l2normalization layout infer bug by @chxin66 in #570
- Support remote platform by gRPC by @liyuenan2333 in #561
- Modified VSILOGE to VSILOGW when check consumed tensor by @chxin66 in #572
- Update internal & prebuilt-sdk for 23Q1 release by @chenfeiyue-cfy in #573
- Share const tensor in graph by @chxin66 in #574
- Update readme in ops by @chenfeiyue-cfy in #575
- Refine prelu layout inference by @chenfeiyue-cfy in #577
- Fixed ovxlib build error on android by @chxin66 in #578
- Add some tensor dtype convert APIs by @SCUWQ in #576
- Support build openssl from local repo by @chxin66 in #582
- Reload "==" operator for quantizations of two tensor by @chenfeiyue-cfy in #583
- Refine UnidirectionalGRU and GRUCell by @chenfeiyue-cfy in #587
- Added missed ops include header by @chenfeiyue-cfy in #584
- Fixed bug for pad test by @chxin66 in #588
- Add a case for local response norm by @chxin66 in #590
- Fixed LayerNormalization eps bug by @shijie-nv in #589
- optimization for tiny_yolov4 by @chxin66 in #591
- fixed instance norm layout bug & add its layoutinfer by @chxin66 in #593
- Added api json for each op to support acuity by @chenfeiyue-cfy in #596
- Added a case for resize_bilinear layoutinfer by @chxin66 in #595
- Added axis support for layernorm by @chenfeiyue-cfy in #602
- fixed reduce layoutinfer bug by @chxin66 in #605
- Fixed openssl android build bug by @chxin66 in #606
- Remove unused value in op layoutinfer by @chenfeiyue-cfy in #607
- Added new_axis_mask param for stridedslice by @chenfeiyue-cfy in #600
- Fixed viplite driver build issue by @chxin66 in #611
- Added case for gather by @chxin66 in #599
- Enable float16 bias convolution model runs on NN by @chenfeiyue-cfy in #612
- Optimize compilation process for openssl by @chxin66 in #613
- fixed yolov4 build issue by @chxin66 in #618
- Update prebuilt && internal for 23Q2 release by @chenfeiyue-cfy in #617
- Added roi_align layoutinfer & cases by @chxin66 in #615
- Fixed transient deconv1d generate wrong output shape bug by @chenfeiyue-cfy in #619
- Added axis param for TopK by @chenfeiyue-cfy in #610
- Remove confusing comment in depthwise conv test by @chenfeiyue-cfy in #621
- Integrate api trace into tim-vx source as an experimental feature. by @MercuryChen in #623
- Fixed conv2d grouped_conv2d deconv2d layoutinfer bug by @chxin66 in #622
- Fixed IExecutable object not bind with DeviceID by @sunshinemyson in #624
- Create self-hosted.yml by @zhongzhuonan in #625
- fixed prelu layoutinfer bug & added cases by @chxin66 in #628
- fix size compute bug in lrn by @chxin66 in #626
- Added op fusion for mean_stddev_normalization by @chenfeiyue-cfy in #629
- Added general Float16 support by @chenfeiyue-cfy in #631
- Code refinement for mean-stddev-normalization fuse by @chenfeiyue-cfy in #632
- Refine api trace code and document by @MercuryChen in #634
- Fixed misleading test case bug in deconv1d by @chenfeiyue-cfy in #633
- Fix data missing when use trace::Graph::CreateTensor by @xie-oritek in #636
- Add CumSum&LRN operator to trace module by @xie-oritek in #639
- Add int4/uint4 definition by @xie-oritek in #638
- Split replayer code from tracer.h by @MercuryChen in #642
- Refine Tensor::SetShape api to avoid compile warning using const ref by @xie-oritek in #640
- export Swap Handle API by @antkillerfarm in #635
- Fix the instance norm test input size bug by @zhengzhouheng in #645
- fixed tensor cache mismatch issue in yolo_v4_tiny by @chxin66 in #644
- build fix for export Swap Handle API by @antkillerfarm in #643
- Fixed unsupported float16 bias in fc by @chenfeiyue-cfy in #646
- fix quant param lost in Bidirectional lstm by @chxin66 in #649
- Add ScatterND_Update operator by @xie-oritek in #652
- Update internal to 1.1.88 release by @chenfeiyue-cfy in #657
New Contributors
- @HuanyuCai made their first contribution in #545
- @SCUWQ made their first contribution in #576
- @shijie-nv made their first contribution in #589
- @zhongzhuonan made their first contribution in #625
- @xie-oritek made their first contribution in #636
Full Changelog: v1.1.57...v1.1.88