Change the repository type filter
All
Repositories list
56 repositories
- A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
- SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Efficient vision foundation models for high-resolution generation and perception.
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Block-Sparse-Attention
Public- [NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
- [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
spvnas
Public archive[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolutiontemporal-shift-module
Public[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
- TinyChatEngine: On-Device LLM Inference Library
- [CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
- [CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs