Releases: pyscf/gpu4pyscf
Releases · pyscf/gpu4pyscf
v1.1.0
New Features
- Add esp charge and resp charge by @wxj6000 in #208
- New Rys kernel by @sunqm in #221
- Optimize nuclear gradients using new Rys kernel by @sunqm in #224
- GPU kernel for analytical hessian by @sunqm in #227
- Add QM/MM by @MoleOrbitalHybridAnalyst in #218
Improvements
- Improved compatiability with pyscf 2.7.0 by @wxj6000 in #216
- Add skipping SCF cycles by @kvkarandashev in #229
- Skip building gint, gvhf, ... when building libxc by @wxj6000 in #210
Bugfix
- Typo in build_wheels.sh by @wxj6000 in #209
- Typo in dft_driver.py by @wxj6000 in #220
- Bugfix: cusolver error when specifying gpu by @wxj6000 in #213
- Bugfix: error in int2c2e by @wxj6000 in #212
- Bugfix: inconsistent gradient with CPU. Improved to_cpu, uks gradient, and grid_response by @wxj6000 in #230
- Bugfix: recompute int3c2e in DF UHF by @wxj6000 in #226
New Contributors
- @MoleOrbitalHybridAnalyst made their first contribution in #218
- @kvkarandashev made their first contribution in #229
Full Changelog: v1.0.2...v1.1.0
v1.0.2
What's Changed
- Bugfix: append data in h5 file by @wxj6000 in #200
- Support customized CHELPG radii by @wxj6000 in #202
- Add cupy installation guide for developer installation instructions by @henryw7 in #204
- Bugfix: save density when spin unrestricted by @wxj6000 in #205
- Add chkfile support for pysisyphus by @henryw7 in #203
New Contributors
Full Changelog: v1.0.1...v1.0.2
v1.0.1
What's Changed
- Bugfix in rks.reset by @wxj6000 in #191. The bug leads to the failure of geometry optimization with direct SCF (#190)
- Bugfix when CUDA unified memory is disabled. Removed CUDA unified memory in libxc, and reduced the overhead in calling libxc @wxj6000 in #180, #189
- Bugfix and Improvement in opt_driver by @wxj6000 in #187 #197
- Support SMD in opt_driver and dft driver @liuyu-chem1996 in #196
- Support thermo calculation in dft_driver @liuyu-chem1996 in #192
New Contributors
- @liuyu-chem1996 made their first contribution in #192
Full Changelog: v1.0...v1.0.1
v1.0
GPU4PySCF v0.7.9 release
New Features
- Gradient and Hessian for SMD model @wxj6000 in #86
- Unrestricted HF and DFT @puzhichen @wxj6000 in #87 #128 #102 #118 #117 #119 #92
- Incore CCSD @sunqm in #93
- Incore FCI fci.direct_spin1 by @sunqm in #109
- MP2 and DF MP2 @wxj6000 in #123
- IR, NMR, and polarizability @puzhichen in #147
- Cartesian atomic orbitals @wxj6000 in #100
Improvements
- Support Cupy v13 and cutensor 2.0
- Support SM 90 in PyPi binary package
- Improve Krylov subspace solver in CPHF by @wxj6000 in #156
- Rys roots up to 9 by @wxj6000 in #125
- Refactor GPU4PySCF classes @wxj6000 @sunqm in #139 #91 #132
- Nightly build by @wxj6000 in #138 #140
- Improve memory efficiency in PCM model @wxj6000 in #150
- Support specifying gpu in multi-GPU environment by @wxj6000 in #158
- Release LibXC in a separate package gpu4pyscf-libxc-cuda @wxj6000 in #110
- Improve cart2sph with cuda kernel by @wxj6000 in #137
- Align setups with pyscf by @wxj6000 in #146
- Reduce pair data reads in cuda kernels by @wxj6000 in #157
Bugfixes
- Fix a bug in shell level screening by @wxj6000 in #161
- Fix double counting dispersion correction between PySCF v2.5 and GPU4PySCF v0.6.17 #111
New Contributors
- @wenyan4work made their first contribution in #108
Full Changelog: v0.6.17...v0.7.9
GPU4PySCF v0.6.17 release
New Features
- SMD solvent model
- Built-in serial DFTD3 and DFTD4, disable openmp for compatibility
- Unrestricted DFT (w/o density fitting)
- Zero-copy np array
Improvements
- Improve grids generation, vxc integration, AO evaluation of DFT.
- Reduced the initialization overhead in cupy arrays.
- Optimized 2nd derivative of hcore
- timer_debug2 for profiling cuda kernels
Bugfixes
- Fixed a bug in traspose_sum cuda kernel
- Fixed a bug in screen_index
New Contributors
- @t0saki made their first contribution in #79
- @puzhichen made their first contribution in #78
Full Changelog: v0.6.9...v0.6.17
GPU4PySCF v0.6.9 release
Features
- PCM SCF, analytical gradient, semi-analytical Hessian
- Analytical gradient for Direct SCF
- Workflow for PyPI binary release, support cuda 11.x and cuda 12.x
- 'to_cpu', send GPU4PySCF object to cpu PySCF
- Support various tensor contraction engine, cutensor, cuquantum, cupy, and opt_einsum.
- Chelpg charge
Improvements
- Compute Vxc with sparse AO
- Support analytical Hessian up to 168 atoms
- Improve the accuracy of Rys roots
Bugfixes
- Fixed a bug in dumping log file
GPU4PySCF v0.5 release
Bugfixes
- Align the initial guess with PySCF
- Fix CUDA _sync_threads in contract_rho
Improvements
- Compress CDERI in the 'ij' direction
- Save CDERI in CPU memory, and async data transfer between CPU and GPU
- Evaluate spherical AO on grids
- Make use of the sparsity in DFT integrals
- Tensor contraction with cupyx.cutensor
- Improve VV10 calculations
- Multiple streaming for gradient calculations
GPU4PySCF v0.4.3 release
Features
- Density Fitting SCF, Gradient, and Hessian
- DFT with CUDA LibXC
- LDA, GGA, mGGA without density laplacian, hybrid, and range-separated functionals
- Dispersion corrections with D3 and D4
- VV10 and its gradient on GPU
Improvements
- Support g orbitals for 4-center integrals
- SCF and DIIS on GPU, reduce data transfer
- Improved the efficiency of 4-center integral kernels
GPU4PySCF 0.1
Fix dimension bug in hf