Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205v1 | null |
2025-03-27 | Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples | Samra Irshad et.al. | 2503.21164v1 | null |
2025-03-26 | Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks | Tao Wu et.al. | 2503.20310v1 | null |
2025-03-25 | Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization | Weifei Jin et.al. | 2503.19591v1 | null |
2025-03-25 | Towards Imperceptible Adversarial Attacks for Time Series Classification with Local Perturbations and Frequency Analysis | Wenwei Gu et.al. | 2503.19519v1 | null |
2025-03-25 | Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent | Philip Doldo et.al. | 2503.19347v1 | null |
2025-03-26 | Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168v2 | null |
2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975v1 | null |
2025-03-20 | Narrowing Class-Wise Robustness Gaps in Adversarial Training | Fatemeh Amerehi et.al. | 2503.16179v1 | null |
2025-03-19 | Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement | Yuchen Ren et.al. | 2503.15404v1 | link |
2025-03-18 | Make the Most of Everything: Further Considerations on Disrupting Diffusion-based Customization | Long Tang et.al. | 2503.13945v1 | null |
2025-03-18 | Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models | Xiaojun Jia et.al. | 2503.12874v2 | null |
2025-03-18 | GSBA$^K$: | Md Farhamdur Reza et.al. | 2503.12827v2 | null |
2025-03-19 | Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization | Yechao Zhang et.al. | 2503.12793v2 | link |
2025-03-16 | Algebraic Adversarial Attacks on Explainability Models | Lachlan Simpson et.al. | 2503.12683v1 | null |
2025-03-20 | Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data | Lilin Zhang et.al. | 2503.11032v2 | null |
2025-03-13 | A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 | Zhaoyi Li et.al. | 2503.10635v1 | link |
2025-03-13 | Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology | Hashmat Shadab Malik et.al. | 2503.10629v1 | link |
2025-03-12 | Enhancing Adversarial Example Detection Through Model Explanation | Qian Ma et.al. | 2503.09735v1 | null |
2025-03-12 | AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks | Jin Li et.al. | 2503.09124v1 | null |
2025-03-09 | MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation | Yuzheng Wang et.al. | 2503.06559v1 | null |
2025-03-08 | Boosting the Local Invariance for Better Adversarial Transferability | Bohan Liu et.al. | 2503.06140v1 | null |
2025-03-06 | From Pixels to Trajectory: Universal Adversarial Example Detection via Temporal Imprints | Yansong Gao et.al. | 2503.04853v1 | null |
2025-03-05 | Adversarial Example Based Fingerprinting for Robust Copyright Protection in Split Learning | Zhangting Lin et.al. | 2503.04825v1 | null |
2025-03-06 | Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization | Shuang Liu et.al. | 2503.04315v1 | link |
2025-03-05 | Task-Agnostic Attacks Against Vision Foundation Models | Brian Pulfer et.al. | 2503.03842v1 | link |
2025-03-05 | Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution | Jizhao Zhu et.al. | 2503.03201v1 | null |
2025-03-04 | DDAD: A Two-pronged Adversarial Defense Based on Distributional Discrepancy | Jiacheng Zhang et.al. | 2503.02169v1 | null |
2025-03-03 | AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses | Nicholas Carlini et.al. | 2503.01811v1 | link |
2025-03-03 | Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Kyle Domico et.al. | 2503.01734v1 | null |
2025-03-02 | Improving the Transferability of Adversarial Attacks by an Input Transpose | Qing Wan et.al. | 2503.00932v1 | null |
2025-03-02 | AMUN: Adversarial Machine UNlearning | Ali Ebrahimpour-Boroojeny et.al. | 2503.00917v1 | null |
2025-02-28 | QFAL: Quantum Federated Adversarial Learning | Walid El Maouaki et.al. | 2502.21171v1 | null |
2025-02-27 | LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks | Joana C. Costa et.al. | 2502.20562v1 | null |
2025-03-04 | Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion | Yuan Bian et.al. | 2502.19697v2 | null |
2025-02-27 | Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack | Chenhe Gu et.al. | 2502.19672v1 | null |
2025-02-25 | Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation | Guang Lin et.al. | 2502.17972v1 | null |
2025-02-24 | Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation | Wenyuan Wu et.al. | 2502.17003v1 | null |
2025-02-20 | Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide | Xingyu Zhao et.al. | 2502.14833v1 | null |
2025-02-18 | Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training | Yuanfan Li et.al. | 2502.12734v1 | null |
2025-02-17 | Alignment and Adversarial Robustness: Are More Human-Like Models More Secure? | Blaine Hoak et.al. | 2502.12377v1 | null |
2025-02-16 | PAR-AdvGAN: Improving Adversarial Attack Capability with Progressive Auto-Regression AdvGAN | Jiayu Zhang et.al. | 2502.12207v1 | null |
2025-02-13 | Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks | Eylon Mizrahi et.al. | 2502.09110v1 | null |
2025-02-20 | Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Andrianos Michail et.al. | 2502.08638v3 | null |
2025-02-11 | Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models | Stanislav Fort et.al. | 2502.07753v1 | null |
2025-02-11 | CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models | Sen Peng et.al. | 2502.07225v1 | link |
2025-02-10 | SMAB: MAB based word Sensitivity Estimation Framework and its Applications in Adversarial Text Generation | Saurabh Kumar Pandey et.al. | 2502.07101v1 | link |
2025-02-04 | CoRPA: Adversarial Image Generation for Chest X-rays Using Concept Vector Perturbations and Generative Models | Amy Rafferty et.al. | 2502.05214v1 | null |
2025-02-07 | Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | Chashi Mahiul Islam et.al. | 2502.04679v1 | null |
2025-02-03 | FSPGD: Rethinking Black-box Attacks on Semantic Segmentation | Eun-Sol Park et.al. | 2502.01262v1 | link |
2025-02-03 | Converting MLPs into Polynomials in Closed Form | Nora Belrose et.al. | 2502.01032v1 | null |
2025-02-02 | "I am bad": Interpreting Stealthy, Universal and Robust Audio Jailbreaks in Audio-Language Models | Isha Gupta et.al. | 2502.00718v1 | null |
2025-01-28 | Bones of Contention: Exploring Query-Efficient Attacks Against Skeleton Recognition Systems | Yuxin Cao et.al. | 2501.16843v1 | null |
2025-01-26 | A general, flexible and harmonious framework to construct interpretable functions in regression analysis | Tianyu Zhan et.al. | 2501.15526v1 | link |
2025-01-25 | Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | Yu Qiao et.al. | 2501.15257v1 | null |
2025-01-25 | VideoPure: Diffusion-based Adversarial Purification for Video Recognition | Kaixun Jiang et.al. | 2501.14999v1 | link |
2025-01-24 | GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm | Hanrui Wang et.al. | 2501.14230v1 | null |
2025-01-23 | Reinforcement Learning Platform for Adversarial Black-box Attacks with Custom Distortion Filters | Soumyendu Sarkar et.al. | 2501.14122v1 | null |
2025-01-23 | Device-aware Optical Adversarial Attack for a Portable Projector-camera System | Ning Jiang et.al. | 2501.14005v1 | null |
2025-01-22 | Modality Unified Attack for Omni-Modality Person Re-Identification | Yuan Bian et.al. | 2501.12761v1 | null |
2025-01-21 | Extend Adversarial Policy Against Neural Machine Translation via Unknown Token | Wei Zou et.al. | 2501.12183v1 | null |
2025-01-21 | Enhancing Adversarial Transferability via Component-Wise Augmentation Method | Hangyu Liu et.al. | 2501.11901v1 | null |
2025-01-19 | Effectiveness of Adversarial Benign and Malware Examples in Evasion and Poisoning Attacks | Matouš Kozák et.al. | 2501.10996v1 | null |
2025-01-17 | CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers | Matan Ben-Tov et.al. | 2501.10013v1 | link |
2025-01-14 | Cross-Modal Transferable Image-to-Video Attack on Video Quality Metrics | Georgii Gotin et.al. | 2501.08415v1 | null |
2025-01-14 | VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models | Hui Kuurila-Zhang et.al. | 2501.07922v1 | link |
2025-01-23 | MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework | Ping Guo et.al. | 2501.07251v2 | null |
2025-01-13 | Protego: Detecting Adversarial Examples for Vision Transformers via Intrinsic Capabilities | Jialin Wu et.al. | 2501.07044v1 | null |
2025-01-03 | Towards Robust and Accurate Stability Estimation of Local Surrogate Models in Text-based Explainable AI | Christopher Burger et.al. | 2501.02042v1 | null |
2025-01-02 | Improving Robustness Estimates in Natural Language Explainable AI though Synonymity Weighted Similarity Measures | Christopher Burger et.al. | 2501.01516v1 | null |
2025-01-02 | AIM: Additional Image Guided Generation of Transferable Adversarial Attacks | Teng Li et.al. | 2501.01106v1 | null |
2025-01-10 | Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs | Linhao Huang et.al. | 2501.01042v2 | null |
2025-01-12 | Towards Adversarially Robust Deep Metric Learning | Xiaopeng Ke et.al. | 2501.01025v2 | null |
2025-01-02 | Boosting Adversarial Transferability with Spatial Adversarial Alignment | Zhaoyu Chen et.al. | 2501.01015v1 | null |
2025-01-01 | Everywhere Attack: Attacking Locally and Globally to Boost Targeted Transferability | Hui Zeng et.al. | 2501.00707v1 | null |
2024-12-31 | Extending XReason: Formal Explanations for Adversarial Detection | Amira Jemaa et.al. | 2501.00537v1 | null |
2024-12-30 | Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability | Hui Zeng et.al. | 2412.20807v1 | link |
2024-12-30 | Sample Correlation for Fingerprinting Deep Face Recognition | Jiyang Guan et.al. | 2412.20768v1 | link |
2024-12-28 | A Robust Adversarial Ensemble with Causal (Feature Interaction) Interpretations for Image Classification | Chunheng Zhao et.al. | 2412.20025v1 | null |
2024-12-27 | Standard-Deviation-Inspired Regularization for Improving Adversarial Robustness | Olukorede Fakorede et.al. | 2412.19947v1 | null |
2024-12-25 | Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path | Yuchen Ren et.al. | 2412.18844v1 | link |
2024-12-25 | Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors | Pham Phuc et.al. | 2412.18815v1 | link |
2024-12-25 | Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models | Yu-An Liu et.al. | 2412.18770v1 | link |
2024-12-24 | Efficient Contrastive Explanations on Demand | Yacine Izza et.al. | 2412.18262v1 | null |
2024-12-29 | ErasableMask: A Robust and Erasable Privacy Protection Scheme against Black-box Face Recognition Models | Sipeng Shen et.al. | 2412.17038v3 | null |
2024-12-22 | Breaking Barriers in Physical-World Adversarial Examples: Improving Robustness and Transferability via Robust Feature | Yichen Wang et.al. | 2412.16958v1 | link |
2024-12-22 | NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors | Ziqi Zhou et.al. | 2412.16955v1 | link |
2024-12-21 | PB-UAP: Hybrid Universal Adversarial Attack For Image Segmentation | Yufei Song et.al. | 2412.16651v1 | null |
2024-12-17 | Targeted View-Invariant Adversarial Perturbations for 3D Object Recognition | Christian Green et.al. | 2412.13376v1 | null |
2024-12-17 | Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs | Shiyu Hu et.al. | 2412.12626v1 | null |
2024-12-17 | Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script | Xi Cao et.al. | 2412.12478v1 | link |
2024-12-16 | Comprehensive Survey on Adversarial Examples in Cybersecurity: Impacts, Challenges, and Mitigation Strategies | Li Li et.al. | 2412.12217v1 | null |
2024-12-16 | Transferable Adversarial Face Attack with Text Controlled Attribute | Wenyun Li et.al. | 2412.11735v1 | null |
2024-12-15 | Unpacking the Resilience of SNLI Contradiction Examples to Attacks | Chetan Verma et.al. | 2412.11172v1 | link |
2024-12-15 | Learning Robust and Privacy-Preserving Representations via Information Theory | Binghui Zhang et.al. | 2412.11066v1 | link |
2024-12-13 | Err on the Side of Texture: Texture Bias on Real Data | Blaine Hoak et.al. | 2412.10597v1 | link |
2024-12-13 | Robust image classification with multi-modal large language models | Francesco Villani et.al. | 2412.10353v1 | null |
2024-12-13 | Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images | Yasamin Medghalchi et.al. | 2412.09910v1 | link |
2024-12-12 | TOAP: Towards Better Robustness in Universal Transferable Anti-Facial Retrieval | Yunna Lv et.al. | 2412.09692v1 | null |
2024-12-16 | Deep Learning Model Security: Threats and Defenses | Tianyang Wang et.al. | 2412.08969v2 | null |
2024-12-11 | DynamicPAE: Generating Scene-Aware Physical Adversarial Examples in Real-Time | Jin Hu et.al. | 2412.08053v1 | null |
2024-12-16 | Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection | Caiyun Xie et.al. | 2412.06727v2 | link |
2024-12-05 | Intriguing Properties of Robust Classification | Bernd Prach et.al. | 2412.04245v1 | null |
2024-12-04 | NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model | Xinheng Xie et.al. | 2412.03539v1 | null |
2024-12-03 | Sustainable Self-evolution Adversarial Training | Wenxuan Wang et.al. | 2412.02270v1 | null |
2024-12-02 | Traversing the Subspace of Adversarial Patches | Jens Bayer et.al. | 2412.01527v1 | null |
2024-12-01 | Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance | Chen-Wei Chang et.al. | 2412.00621v1 | null |
2024-11-30 | Robust Table Integration in Data Lakes | Daomin Ji et.al. | 2412.00324v1 | null |
2024-11-29 | Towards Class-wise Robustness Analysis | Tejaswini Medi et.al. | 2411.19853v1 | null |
2024-11-27 | Adversarial Training in Low-Label Regimes with Margin-Based Interpolation | Tian Ye et.al. | 2411.17959v1 | null |
2024-11-25 | Scaling Laws for Black box Adversarial Attacks | Chuan Liu et.al. | 2411.16782v1 | null |
2024-11-25 | Imperceptible Adversarial Examples in the Physical World | Weilin Xu et.al. | 2411.16622v1 | null |
2024-11-25 | Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification | Andre Kassis et.al. | 2411.16598v1 | link |
2024-11-24 | Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks | Peng Xie et.al. | 2411.15720v1 | null |
2024-11-23 | Enhancing the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation | Fengfan Zhou et.al. | 2411.15555v1 | null |
2024-11-23 | Improving Transferable Targeted Attacks with Feature Tuning Mixup | Kaisheng Liang et.al. | 2411.15553v1 | null |
2024-11-22 | Gradient Masking All-at-Once: Ensemble Everything Everywhere Is Not Robust | Jie Zhang et.al. | 2411.14834v1 | null |
2024-11-21 | Generating Realistic Adversarial Examples for Business Processes using Variational Autoencoders | Alexander Stevens et.al. | 2411.14263v1 | null |
2024-11-18 | Theoretical Corrections and the Leveraging of Reinforcement Learning to Enhance Triangle Attack | Nicole Meng et.al. | 2411.12071v1 | null |
2024-11-15 | Continual Adversarial Reinforcement Learning (CARL) of False Data Injection detection: forgetting and explainability | Pooja Aslami et.al. | 2411.10367v1 | null |
2024-11-15 | A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks | Benoit Coqueret et.al. | 2411.10174v1 | null |
2024-11-12 | IAE: Irony-based Adversarial Examples for Sentiment Analysis Systems | Xiaoyin Yi et.al. | 2411.07850v1 | null |
2024-11-12 | Chain Association-based Attacking and Shielding Natural Language Processing Systems | Jiacheng Huang et.al. | 2411.07843v1 | null |
2024-11-11 | Boosting the Targeted Transferability of Adversarial Examples via Salient Region & Weighted Feature Drop | Shanjun Xu et.al. | 2411.06784v1 | null |
2024-11-11 | Adversarial Detection with a Dynamically Stable System | Xiaowei Long et.al. | 2411.06666v1 | null |
2024-11-07 | Neural Fingerprints for Adversarial Attack Detection | Haim Fisher et.al. | 2411.04533v1 | link |
2024-11-05 | Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training | Junhao Dong et.al. | 2411.02871v1 | null |
2024-11-04 | Semantic-Aligned Adversarial Evolution Triangle for High-Transferability Vision-Language Attack | Xiaojun Jia et.al. | 2411.02669v1 | link |
2024-11-04 | LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection | Jinyin Chen et.al. | 2411.01889v1 | link |
2024-11-01 | Replace-then-Perturb: Targeted Adversarial Attacks With Visual Reasoning for Vision-Language Models | Jonggyu Jang et.al. | 2411.00898v1 | null |
2024-10-29 | CausAdv: A Causal-based Framework for Detecting Adversarial Examples | Hichem Debbi et.al. | 2411.00839v1 | null |
2024-10-31 | Protecting Feed-Forward Networks from Adversarial Attacks Using Predictive Coding | Ehsan Ganjidoost et.al. | 2411.00222v1 | null |
2024-10-31 | Noise as a Double-Edged Sword: Reinforcement Learning Exploits Randomized Defenses in Neural Networks | Steve Bakos et.al. | 2410.23870v1 | null |
2024-10-31 | Wide Two-Layer Networks can Learn from Adversarial Perturbations | Soichiro Kumano et.al. | 2410.23677v1 | link |
2024-10-30 | Teaching a Language Model to Distinguish Between Similar Details using a Small Adversarial Training Set | Chris Achard et.al. | 2410.23118v1 | null |
2024-10-30 | CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Mingkun Zhang et.al. | 2410.23091v1 | null |
2024-10-31 | One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks | Ji Guo et.al. | 2410.22725v2 | null |
2024-10-29 | On the Robustness of Adversarial Training Against Uncertainty Attacks | Emanuele Ledda et.al. | 2410.21952v1 | link |
2024-10-30 | Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models | Lu Yu et.al. | 2410.21802v2 | link |
2024-10-28 | Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models | Piotr Przybyła et.al. | 2410.20940v1 | null |
2024-10-29 | Transferable Adversarial Attacks on SAM and Its Downstream Models | Song Xia et.al. | 2410.20197v2 | link |
2024-10-26 | Adversarial Attacks Against Double RIS-Assisted MIMO Systems-based Autoencoder in Finite-Scattering Environments | Bui Duc Son et.al. | 2410.20103v1 | null |
2024-10-24 | GADT: Enhancing Transferable Adversarial Attacks through Gradient-guided Adversarial Data Transformation | Yating Ma et.al. | 2410.18648v1 | null |
2024-10-23 | Advancing NLP Security by Leveraging LLMs as Adversarial Engines | Sudarshan Srinivasan et.al. | 2410.18215v1 | null |
2024-10-22 | Detecting Adversarial Examples | Furkan Mumcu et.al. | 2410.17442v1 | null |
2024-10-21 | Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network | Suman Sapkota et.al. | 2410.16159v1 | null |
2024-10-21 | Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples | Kirill Lukyanov et.al. | 2410.15889v1 | null |
2024-10-20 | Efficient Model Extraction via Boundary Sampling | Maor Biton Dor et.al. | 2410.15429v1 | null |
2024-10-20 | PEAS: A Strategy for Crafting Transferable Adversarial Examples | Bar Avraham et.al. | 2410.15409v1 | null |
2024-10-19 | Adversarial Training: A Survey | Mengnan Zhao et.al. | 2410.15042v1 | null |
2024-10-18 | A Hybrid Defense Strategy for Boosting Adversarial Robustness in Vision-Language Models | Yuhan Liang et.al. | 2410.14911v1 | null |
2024-10-17 | MMAD-Purify: A Precision-Optimized Framework for Efficient and Scalable Multi-Modal Attacks | Xinxin Liu et.al. | 2410.14089v1 | null |
2024-10-13 | S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack | Yongxiang Liu et.al. | 2410.13891v1 | null |
2024-10-17 | Golyadkin's Torment: Doppelgängers and Adversarial Vulnerability | George I. Kamberov et.al. | 2410.13193v1 | null |
2024-10-17 | Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum | Nashrah Haque et.al. | 2410.13122v1 | link |
2024-10-16 | DAT: Improving Adversarial Robustness via Generative Amplitude Mix-up in Frequency Domain | Fengpeng Li et.al. | 2410.12307v1 | link |
2024-10-11 | On the Adversarial Transferability of Generalized "Skip Connections" | Yisen Wang et.al. | 2410.08950v1 | link |
2024-10-11 | Natural Language Induced Adversarial Images | Xiaopei Zhu et.al. | 2410.08620v1 | link |
2024-10-11 | Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data | Binghui Li et.al. | 2410.08503v1 | null |
2024-10-10 | Bilinear MLPs enable weight-based mechanistic interpretability | Michael T. Pearce et.al. | 2410.08417v1 | link |
2024-10-10 | Time Traveling to Defend Against Adversarial Example Attacks in Image Classification | Anthony Etim et.al. | 2410.08338v1 | null |
2024-10-10 | Understanding Adversarially Robust Generalization via Weight-Curvature Index | Yuelin Xu et.al. | 2410.07719v1 | null |
2024-10-09 | Understanding Model Ensemble in Transferable Adversarial Attack | Wei Yao et.al. | 2410.06851v1 | null |
2024-10-09 | Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models | Yubo Wang et.al. | 2410.06699v1 | null |
2024-10-08 | Hyper Adversarial Tuning for Boosting Adversarial Robustness of Pretrained Large Vision Models | Kangtao Lv et.al. | 2410.05951v1 | null |
2024-10-08 | TaeBench: Improving Quality of Toxic Adversarial Examples | Xuan Zhu et.al. | 2410.05573v1 | null |
2024-10-07 | AnyAttack: Towards Large-scale Self-supervised Generation of Targeted Adversarial Examples for Vision-Language Models | Jiaming Zhang et.al. | 2410.05346v1 | null |
2024-10-07 | LOTOS: Layer-wise Orthogonalization for Training Robust Ensembles | Ali Ebrahimpour-Boroojeny et.al. | 2410.05136v1 | null |
2024-10-06 | Suspiciousness of Adversarial Texts to Human | Shakila Mahjabin Tonni et.al. | 2410.04377v1 | null |
2024-10-05 | Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System | Ze Li et.al. | 2410.04017v1 | null |
2024-10-04 | SCA: Highly Efficient Semantic-Consistent Unrestricted Adversarial Attack | Zihao Pan et.al. | 2410.02240v2 | link |
2024-10-03 | MOREL: Enhancing Adversarial Robustness through Multi-Objective Representation Learning | Sedjro Salomon Hotegni et.al. | 2410.01697v2 | link |
2024-10-02 | On Using Certified Training towards Empirical Robustness | Alessandro De Palma et.al. | 2410.01617v1 | null |
2024-10-03 | Fake It Until You Break It: On the Adversarial Robustness of AI-generated Image Detectors | Sina Mavali et.al. | 2410.01574v2 | link |
2024-10-02 | Signal Adversarial Examples Generation for Signal Detection Network via White-Box Attack | Dongyang Li et.al. | 2410.01393v1 | null |
2024-09-29 | Adversarial Examples for DNA Classification | Hyunwoo Yoo et.al. | 2409.19788v1 | null |
2024-09-29 | MASKDROID: Robust Android Malware Detection with Masked Graph Representations | Jingnan Zheng et.al. | 2409.19594v1 | link |
2024-09-26 | Discovering New Shadow Patterns for Black-Box Attacks on Lane Detection of Autonomous Vehicles | Pedram MohajerAnsari et.al. | 2409.18248v1 | null |
2024-09-26 | Showing Many Labels in Multi-label Classification Models: An Empirical Study of Adversarial Examples | Yujiang Liu et.al. | 2409.17568v1 | link |
2024-09-24 | Adversarial Backdoor Defense in CLIP | Junhao Kuang et.al. | 2409.15968v1 | null |
2024-09-21 | Cloud Adversarial Example Generation for Remote Sensing Image Classification | Fei Ma et.al. | 2409.14240v1 | null |
2024-09-20 | ViTGuard: Attention-aware Detection against Adversarial Examples for Vision Transformer | Shihua Sun et.al. | 2409.13828v1 | null |
2024-09-20 | Efficient Visualization of Neural Networks with Generative Models and Adversarial Perturbations | Athanasios Karagounis et.al. | 2409.13559v1 | null |
2024-09-20 | Hidden Activations Are Not Enough: A General Approach to Neural Network Predictions | Samuel Leblanc et.al. | 2409.13163v1 | link |
2024-09-19 | Deep generative models as an adversarial attack strategy for tabular machine learning | Salijona Dyrmishi et.al. | 2409.12642v1 | link |
2024-09-19 | TEAM: Temporal Adversarial Examples Attack Model against Network Intrusion Detection System Applied to RNN | Ziyi Liu et.al. | 2409.12472v1 | null |
2024-09-19 | Enhancing 3D Robotic Vision Robustness by Minimizing Adversarial Mutual Information through a Curriculum Training Approach | Nastaran Darabi et.al. | 2409.12379v1 | link |
2024-09-12 | FedProphet: Memory-Efficient Federated Adversarial Training via Theoretic-Robustness and Low-Inconsistency Cascade Learning | Minxue Tang et.al. | 2409.08372v1 | null |
2024-09-12 | LoRID: Low-Rank Iterative Diffusion for Adversarial Purification | Geigh Zollicoffer et.al. | 2409.08255v1 | null |
2024-09-12 | Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models | Nikolai L. Kühne et.al. | 2409.07936v1 | null |
2024-09-10 | Unrevealed Threats: A Comprehensive Study of the Adversarial Robustness of Underwater Image Enhancement Models | Siyu Zhai et.al. | 2409.06420v1 | null |
2024-09-09 | Input Space Mode Connectivity in Deep Neural Networks | Jakub Vrabel et.al. | 2409.05800v1 | null |
2024-09-09 | Adversarial Attacks on Data Attribution | Xinhe Wang et.al. | 2409.05657v1 | null |
2024-09-09 | Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs | Yahya Jabary et.al. | 2409.05558v1 | link |
2024-09-08 | PIP: Detecting Adversarial Examples in Large Vision-Language Models via Attention Patterns of Irrelevant Probe Questions | Yudong Zhang et.al. | 2409.05076v1 | link |
2024-09-07 | Phrase-Level Adversarial Training for Mitigating Bias in Neural Network-based Automatic Essay Scoring | Haddad Philip et.al. | 2409.04795v1 | null |
2024-09-06 | Learning to Learn Transferable Generative Attack for Person Re-Identification | Yuan Bian et.al. | 2409.04208v1 | null |
2024-09-05 | Bypassing DARCY Defense: Indistinguishable Universal Adversarial Triggers | Zuquan Peng et.al. | 2409.03183v1 | null |
2024-09-05 | OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation | Włodzimierz Lewoniewski et.al. | 2409.02649v2 | null |
2024-09-02 | Adversarial Pruning: A Survey and Benchmark of Pruning Methods for Adversarial Robustness | Giorgio Piras et.al. | 2409.01249v1 | link |
2024-09-01 | Accurate Forgetting for All-in-One Image Restoration Model | Xin Su et.al. | 2409.00685v1 | null |
2024-09-01 | Comprehensive Botnet Detection by Mitigating Adversarial Attacks, Navigating the Subtleties of Perturbation Distances and Fortifying Predictions with Conformal Layers | Rahul Yumlembam et.al. | 2409.00667v1 | null |
2024-08-27 | Improving Adversarial Robustness in Android Malware Detection by Reducing the Impact of Spurious Correlations | Hamid Bostani et.al. | 2408.16025v1 | link |
2024-08-28 | Evaluating Model Robustness Using Adaptive Sparse L0 Regularization | Weiyou Liu et.al. | 2408.15702v1 | null |
2024-08-27 | TART: Boosting Clean Accuracy Through Tangent Direction Guided Adversarial Training | Bongsoo Yi et.al. | 2408.14728v1 | null |
2024-08-25 | On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective | Tal Alter et.al. | 2408.13809v1 | null |
2024-08-23 | Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting | Zhenyu Wang et.al. | 2408.13355v1 | null |
2024-08-23 | Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples | Zhenyu Wang et.al. | 2408.13341v1 | null |
2024-08-23 | Dynamic Label Adversarial Training for Deep Learning Robustness Against Adversarial Attacks | Zhenyu Liu et.al. | 2408.13102v1 | null |
2024-08-22 | Leveraging Information Consistency in Frequency and Spatial Domain for Adversarial Attacks | Zhibo Jin et.al. | 2408.12670v1 | link |
2024-08-22 | Query-Efficient Video Adversarial Attack with Stylized Logo | Duoxun Tang et.al. | 2408.12099v1 | null |
2024-08-20 | Revisiting Min-Max Optimization Problem in Adversarial Training | Sina Hajer Ahmadi et.al. | 2408.11218v1 | null |
2024-08-20 | Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models | Cong Wan et.al. | 2408.10571v1 | link |
2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021v1 | null |
2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839v1 | link |
2024-08-20 | Enhancing Adversarial Transferability with Adversarial Weight Tuning | Jiahao Chen et.al. | 2408.09469v2 | null |
2024-08-16 | LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | Mohamad Fares El Hajj Chehade et.al. | 2408.08824v1 | null |
2024-08-15 | Evaluating Text Classification Robustness to Part-of-Speech Adversarial Examples | Anahita Samadi et.al. | 2408.08374v1 | null |
2024-08-14 | Achieving Data Efficient Neural Networks with Hybrid Concept-based Models | Tobias A. Opsahl et.al. | 2408.07438v1 | link |
2024-08-12 | Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment | Kejia Zhang et.al. | 2408.06079v1 | null |
2024-08-12 | Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information | Mingkun Zhang et.al. | 2408.05900v1 | null |
2024-08-11 | Improving Adversarial Transferability with Neighbourhood Gradient Information | Haijing Guo et.al. | 2408.05745v1 | null |
2024-08-11 | StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model | Ziyin Zhou et.al. | 2408.05669v1 | link |
2024-08-10 | ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack | Ziyi Gao et.al. | 2408.05479v1 | null |
2024-08-08 | Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness | Stanislav Fort et.al. | 2408.05446v1 | null |
2024-08-09 | Adversarially Robust Industrial Anomaly Detection Through Diffusion Model | Yuanpu Cao et.al. | 2408.04839v1 | null |
2024-08-08 | Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit | Duanyi Yao et.al. | 2408.04310v1 | null |
2024-08-08 | Stability Analysis of Equivariant Convolutional Representations Through The Lens of Equivariant Multi-layered CKNs | Soutrik Roy Chowdhury et.al. | 2408.04277v1 | null |
2024-08-08 | Unveiling Hidden Visual Information: A Reconstruction Attack Against Adversarial Visual Information Hiding | Jonggyu Jang et.al. | 2408.04261v1 | null |
2024-08-07 | Enhancing Output Diversity Improves Conjugate Gradient-based Adversarial Attacks | Keiichiro Yamamura et.al. | 2408.03972v1 | link |
2024-08-07 | MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems | Renzhi Wang et.al. | 2408.03892v1 | null |
2024-08-05 | On the Robustness of Malware Detectors to Adversarial Samples | Muhammad Salman et.al. | 2408.02310v1 | null |
2024-08-04 | AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning | Xin Wang et.al. | 2408.01978v1 | link |
2024-08-06 | A Survey and Evaluation of Adversarial Attacks for Object Detection | Khoi Nguyen Tiet Nguyen et.al. | 2408.01934v2 | null |
2024-08-03 | ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features | Peng Cheng et.al. | 2408.01808v1 | null |
2024-08-03 | Joint Universal Adversarial Perturbations with Interpretations | Liang-bo Ning et.al. | 2408.01715v1 | null |
2024-08-03 | Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers | Weijie Zheng et.al. | 2408.01705v1 | null |
2024-08-02 | Trustworthy Machine Learning under Social and Adversarial Data Sources | Han Shao et.al. | 2408.01596v1 | null |
2024-08-01 | CERT-ED: Certifiably Robust Text Classification for Edit Distance | Zhuoqun Huang et.al. | 2408.00728v1 | null |
2024-08-01 | Securing the Diagnosis of Medical Imaging: An In-depth Analysis of AI-Resistant Attacks | Angona Biswas et.al. | 2408.00348v1 | null |
2024-08-01 | ADBM: Adversarial diffusion bridge model for reliable adversarial purification | Xiao Li et.al. | 2408.00315v1 | null |
2024-07-30 | AI Safety in Practice: Enhancing Adversarial Robustness in Multimodal Image Captioning | Maisha Binte Rashid et.al. | 2407.21174v1 | null |
2024-07-29 | Enhancing Adversarial Text Attacks on BERT Models with Projected Gradient Descent | Hetvi Waghela et.al. | 2407.21073v1 | null |
2024-07-30 | Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | Yunfeng Diao et.al. | 2407.20836v1 | null |
2024-07-30 | Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks | Hunmin Yang et.al. | 2407.20657v1 | null |
2024-07-30 | FACL-Attack: Frequency-Aware Contrastive Learning for Transferable Adversarial Attacks | Hunmin Yang et.al. | 2407.20653v1 | null |
2024-07-29 | Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter | Chao Liu et.al. | 2407.19981v1 | null |
2024-07-27 | EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection | Shigang Liu et.al. | 2407.19216v1 | null |
2024-07-25 | Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis | Cristian-Alexandru Botocan et.al. | 2407.18251v1 | link |
2024-07-23 | Algebraic Adversarial Attacks on Integrated Gradients | Lachlan Simpson et.al. | 2407.16233v1 | null |
2024-07-22 | Enhancing Transferability of Targeted Adversarial Examples: A Self-Universal Perspective | Bowen Peng et.al. | 2407.15683v1 | link |
2024-07-22 | Towards Robust Vision Transformer via Masked Adaptive Ensemble | Fudong Lin et.al. | 2407.15385v1 | null |
2024-07-18 | VeriQR: A Robustness Verification Tool for Quantum Machine Learning Models | Yanling Lin et.al. | 2407.13533v1 | null |
2024-07-17 | Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective | Zhaoxin Wang et.al. | 2407.12443v1 | null |
2024-07-17 | Context-Aware Fuzzing for Robustness Enhancement of Deep Learning Models | Haipeng Wang et.al. | 2407.12428v1 | link |
2024-07-17 | Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection | Youheng Sun et.al. | 2407.12292v1 | link |
2024-07-16 | Variational Randomized Smoothing for Sample-Wise Adversarial Robustness | Ryo Hase et.al. | 2407.11844v1 | null |
2024-07-16 | AEMIM: Adversarial Examples Meet Masked Image Modeling | Wenzhao Xiang et.al. | 2407.11537v1 | null |
2024-07-16 | Investigating Imperceptibility of Adversarial Attacks on Tabular Data: An Empirical Analysis | Zhipeng He et.al. | 2407.11463v1 | link |
2024-07-22 | Towards Robust Recommendation via Decision Boundary-aware Graph Contrastive Learning | Jiakai Tang et.al. | 2407.10184v2 | null |
2024-07-14 | Transferable 3D Adversarial Shape Completion using Diffusion Models | Xuelong Dai et.al. | 2407.10077v1 | null |
2024-07-12 | Soft Prompts Go Hard: Steering Visual Language Models with Hidden Meta-Instructions | Tingwei Zhang et.al. | 2407.08970v1 | link |
2024-07-11 | Boosting Adversarial Transferability for Skeleton-based Action Recognition via Exploring the Model Posterior Space | Yunfeng Diao et.al. | 2407.08572v1 | null |
2024-07-11 | Rethinking the Threat and Accessibility of Adversarial Attacks against Face Recognition Systems | Yuxin Cao et.al. | 2407.08514v1 | link |
2024-07-09 | A Hybrid Training-time and Run-time Defense Against Adversarial Attacks in Modulation Classification | Lu Zhang et.al. | 2407.06807v1 | null |
2024-07-09 | Countermeasures Against Adversarial Examples in Radio Signal Classification | Lu Zhang et.al. | 2407.06796v1 | null |
2024-07-09 | Improving the Transferability of Adversarial Examples by Feature Augmentation | Donghua Wang et.al. | 2407.06714v1 | null |
2024-07-09 | Universal Multi-view Black-box Attack against Object Detectors via Layout Optimization | Donghua Wang et.al. | 2407.06688v1 | null |
2024-07-08 | Non-Robust Features are Not Always Useful in One-Class Classification | Matthew Lau et.al. | 2407.06372v1 | null |
2024-07-07 | Rethinking Targeted Adversarial Attacks For Neural Machine Translation | Junjie Wu et.al. | 2407.05319v1 | link |
2024-07-06 | A Novel Bifurcation Method for Observation Perturbation Attacks on Reinforcement Learning Agents: Load Altering Attacks on a Cyber Physical Power System | Kiernan Broda-Milian et.al. | 2407.05182v1 | null |
2024-07-04 | Protecting Deep Learning Model Copyrights with Adversarial Example-Free Reuse Detection | Xiaokun Luan et.al. | 2407.03883v1 | null |
2024-07-04 | RobQuNNs: A Methodology for Robust Quanvolutional Neural Networks against Adversarial Attacks | Walid El Maouaki et.al. | 2407.03875v1 | null |
2024-07-03 | | Chao Zhou et.al. | 2407.03115v1 | null |
2024-07-03 | SPLITZ: Certifiable Robustness via Split Lipschitz Randomized Smoothing | Meiyu Zhong et.al. | 2407.02811v1 | null |
2024-07-04 | EvolBA: Evolutionary Boundary Attack under Hard-label Black Box condition | Ayane Tajima et.al. | 2407.02248v2 | null |
2024-07-02 | Secure Semantic Communication via Paired Adversarial Residual Networks | Boxiang He et.al. | 2407.02053v1 | null |
2024-06-28 | Deceptive Diffusion: Generating Synthetic Adversarial Examples | Lucas Beerens et.al. | 2406.19807v1 | null |
2024-06-18 | Saliency Attention and Semantic Similarity-Driven Adversarial Perturbation | Hetvi Waghela et.al. | 2406.19413v1 | null |
2024-06-27 | Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems | Zheng Fang et.al. | 2406.19311v1 | null |
2024-06-25 | Diffusion-based Adversarial Purification for Intrusion Detection | Mohamed Amine Merzouk et.al. | 2406.17606v1 | null |
2024-06-24 | ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks | Yoo Yeon Sung et.al. | 2406.16342v1 | null |
2024-06-28 | Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors | Peter Lorenz et.al. | 2406.15104v3 | link |
2024-06-20 | Enhancing robustness of data-driven SHM models: adversarial training with circle loss | Xiangli Yang et.al. | 2406.14232v1 | null |
2024-06-20 | Exploring Layerwise Adversarial Robustness Through the Lens of t-SNE | Inês Valentim et.al. | 2406.14073v1 | null |
2024-06-20 | Explainable AI Security: Exploring Robustness of Graph Neural Networks to Adversarial Attacks | Tao Wu et.al. | 2406.13920v1 | null |
2024-06-19 | Towards Trustworthy Unsupervised Domain Adaptation: A Representation Learning Perspective for Enhancing Robustness, Discrimination, and Generalization | Jia-Li Yin et.al. | 2406.13180v1 | null |
2024-06-17 | FullCert: Deterministic End-to-End Certification for Training and Inference of Neural Networks | Tobias Lorenz et.al. | 2406.11522v1 | null |
2024-06-17 | Obfuscating IoT Device Scanning Activity via Adversarial Example Generation | Haocong Li et.al. | 2406.11515v1 | null |
2024-06-16 | Improving Adversarial Robustness via Decoupled Visual Representation Masking | Decheng Liu et.al. | 2406.10933v1 | link |
2024-06-16 | Imperceptible Face Forgery Attack via Adversarial Semantic Mask | Decheng Liu et.al. | 2406.10887v1 | link |
2024-06-15 | Robust Image Classification in the Presence of Out-of-Distribution and Adversarial Samples Using Attractors in Neural Networks | Nasrin Alipour et.al. | 2406.10579v1 | null |
2024-06-14 | Adaptive Randomized Smoothing: Certifying Multi-Step Defences against Adversarial Examples | Saiyue Lyu et.al. | 2406.10427v1 | null |
2024-06-14 | Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis | Zhang Chen et.al. | 2406.10090v1 | null |
2024-06-14 | Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models | Changjiang Li et.al. | 2406.09669v1 | null |
2024-06-13 | Improving Adversarial Robustness via Feature Pattern Consistency Constraint | Jiacong Hu et.al. | 2406.08829v1 | null |
2024-06-12 | Adversarial Evasion Attack Efficiency against Large Language Models | João Vitorino et.al. | 2406.08050v1 | null |
2024-06-11 | On the Hölder Stability of Multiset and Graph Neural Networks | Yair Davidson et.al. | 2406.06984v1 | null |
2024-06-11 | Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples | Kailas Dayanandan et.al. | 2406.06967v1 | link |
2024-06-09 | MeanSparse: Post-Training Robustness Enhancement Through Mean-Centered Feature Sparsification | Sajjad Amini et.al. | 2406.05927v1 | link |
2024-06-08 | Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization | Jiancong Xiao et.al. | 2406.05372v1 | null |
2024-06-12 | ADBA:Approximation Decision Boundary Approach for Black-Box Adversarial Attacks | Feiyang Wang et.al. | 2406.04998v2 | link |
2024-06-06 | Interpreting the Second-Order Effects of Neurons in CLIP | Yossi Gandelsman et.al. | 2406.04341v1 | null |
2024-06-05 | A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models | Hamidreza Kamkari et.al. | 2406.03537v1 | null |
2024-06-05 | ZeroPur: Succinct Training-Free Adversarial Purification | Xiuli Bi et.al. | 2406.03143v1 | link |
2024-06-05 | DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain | Jun Liu et.al. | 2406.03017v1 | link |
2024-06-05 | Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized Smoothing | Youwei Shu et.al. | 2406.02309v2 | null |
2024-06-04 | Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation | Yaohua Liu et.al. | 2406.02064v1 | link |
2024-06-04 | SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks | Yi Pan et.al. | 2406.01894v1 | link |
2024-06-03 | Constraint-based Adversarial Example Synthesis | Fang Yu et.al. | 2406.01219v1 | null |
2024-06-02 | Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training | Jiacheng Zhang et.al. | 2406.00685v1 | link |
2024-05-28 | Improved Generation of Adversarial Examples Against Safety-aligned LLMs | Qizhang Li et.al. | 2405.20778v1 | null |
2024-05-31 | Query Provenance Analysis for Robust and Efficient Query-based Black-box Attack Defense | Shaofei Li et.al. | 2405.20641v1 | null |
2024-05-31 | Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization | Yisu Liu et.al. | 2405.20584v1 | null |
2024-05-30 | Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models | Hao Cheng et.al. | 2405.20090v1 | null |
2024-05-30 | HOLMES: to Detect Adversarial Examples with Multiple Detectors | Jing Wen et.al. | 2405.19956v1 | null |
2024-06-02 | PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models | Omead Pooladzandi et.al. | 2405.19376v2 | null |
2024-05-29 | Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior | Shuyu Cheng et.al. | 2405.19098v1 | link |
2024-06-02 | PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics | Sunay Bhat et.al. | 2405.18627v2 | null |
2024-05-28 | Towards Unified Robustness Against Both Backdoor and Adversarial Attacks | Zhenxing Niu et.al. | 2405.17929v1 | link |
2024-05-27 | Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training | Enes Altinisik et.al. | 2405.17130v1 | null |
2024-05-27 | Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models | Fengfan Zhou et.al. | 2405.16940v1 | null |
2024-05-27 | The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective | Nils Philipp Walter et.al. | 2405.16918v1 | null |
2024-05-25 | Enhancing Adversarial Transferability Through Neighborhood Conditional Sampling | Chunlin Qiu et.al. | 2405.16181v1 | null |
2024-05-24 | Robust width: A lightweight and certifiable adversarial defense | Jonathan Peck et.al. | 2405.15971v1 | link |
2024-05-24 | TrojanForge: Adversarial Hardware Trojan Examples with Reinforcement Learning | Amin Sarihi et.al. | 2405.15184v1 | null |
2024-05-23 | Generating camera failures as a class of physics-based adversarial examples | Manav Prabhakar et.al. | 2405.15033v1 | null |
2024-05-23 | How Does Bayes Error Limit Probabilistic Robust Accuracy | Ruihan Zhang et.al. | 2405.14923v1 | null |
2024-05-23 | Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds | Hanwei Zhang et.al. | 2405.14210v1 | null |
2024-05-23 | Learning to Transform Dynamically for Better Adversarial Transferability | Rongyi Zhu et.al. | 2405.14077v1 | null |
2024-05-20 | A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers | Tom Roth et.al. | 2405.11904v1 | null |
2024-05-27 | Adaptive Batch Normalization Networks for Adversarial Robustness | Shao-Yuan Lo et.al. | 2405.11708v2 | null |
2024-05-19 | Certified Robust Accuracy of Neural Networks Are Bounded due to Bayes Errors | Ruihan Zhang et.al. | 2405.11547v1 | null |
2024-05-18 | Revisiting the Robust Generalization of Adversarial Prompt Tuning | Fan Yang et.al. | 2405.11154v1 | null |
2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924v1 | null |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882v1 | link |
2024-05-15 | Properties that allow or prohibit transferability of adversarial attacks among quantized networks | Abhishek Shrestha et.al. | 2405.09598v1 | link |
2024-05-15 | Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer | Weifei Jin et.al. | 2405.09470v1 | null |
2024-05-14 | SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models | Raghuveer Peri et.al. | 2405.08317v1 | null |
2024-05-10 | Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach | Amira Guesmi et.al. | 2405.06278v1 | null |
2024-05-08 | Adversarial Threats to Automatic Modulation Open Set Recognition in Wireless Networks | Yandie Yang et.al. | 2405.05022v1 | null |
2024-05-07 | Revisiting character-level adversarial attacks | Elias Abad Rocamora et.al. | 2405.04346v1 | link |
2024-05-06 | On Adversarial Examples for Text Classification by Perturbing Latent Representations | Korn Sooksatra et.al. | 2405.03789v1 | null |
2024-05-06 | Is ReLU Adversarially Robust? | Korn Sooksatra et.al. | 2405.03777v1 | null |
2024-05-06 | Cutting through buggy adversarial example defenses: fixing 1 line of code breaks Sabre | Nicholas Carlini et.al. | 2405.03672v1 | null |
2024-05-06 | Exploring Frequencies via Feature Mixing and Meta-Learning for Improving Adversarial Transferability | Juanjuan Weng et.al. | 2405.03193v1 | link |
2024-05-03 | ProFLingo: A Fingerprinting-based Copyright Protection Scheme for Large Language Models | Heng Jin et.al. | 2405.02466v1 | link |
2024-05-03 | A Novel Approach to Guard from Adversarial Attacks using Stable Diffusion | Trinath Sai Subhash Reddy Pittala et.al. | 2405.01838v1 | null |
2024-05-02 | Position Paper: Beyond Robustness Against Single Attack Types | Sihui Dai et.al. | 2405.01349v1 | null |
2024-05-01 | ASAM: Boosting Segment Anything Model with Adversarial Tuning | Bo Li et.al. | 2405.00256v1 | link |
2024-04-30 | Provably Robust Conformal Prediction with Improved Efficiency | Ge Yan et.al. | 2404.19651v1 | link |
2024-04-27 | Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks | Yunzhen Feng et.al. | 2404.19640v1 | null |
2024-04-30 | AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples | Antonio Emanuele Cinà et.al. | 2404.19460v1 | null |
2024-04-29 | A Systematic Evaluation of Adversarial Attacks against Speech Emotion Recognition Models | Nicolas Facchinetti et.al. | 2404.18514v1 | link |
2024-04-27 | Adversarial Examples: Generation Proposal in the Context of Facial Recognition Systems | Marina Fuster et.al. | 2404.17760v1 | null |
2024-04-25 | Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach | Cristopher McIntyre-Garcia et.al. | 2404.17020v1 | link |
2024-04-24 | Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks | Erh-Chung Chen et.al. | 2404.15881v1 | null |
2024-04-24 | An Empirical Study of Aegis | Daniel Saragih et.al. | 2404.15784v1 | null |
2024-04-21 | Fermi-Bose Machine | Mingshan Xie et.al. | 2404.13631v1 | null |
2024-04-28 | Attack on Scene Flow using Point Clouds | Haniyeh Ehsani Oskouie et.al. | 2404.13621v2 | null |
2024-04-21 | Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion | Hongyu Zhu et.al. | 2404.13518v1 | null |
2024-04-20 | Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think | Haotian Xue et.al. | 2404.13320v1 | link |
2024-04-24 | Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives | Chenxi Yang et.al. | 2404.13277v2 | null |
2024-04-19 | How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples | Dren Fazlija et.al. | 2404.12653v1 | null |
2024-04-19 | AED-PADA:Improving Generalizability of Adversarial Example Detection via Principal Adversarial Domain Adaptation | Heqi Peng et.al. | 2404.12635v1 | null |
2024-04-18 | Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors | Raz Lapid et.al. | 2404.12120v1 | null |
2024-04-18 | Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement | Pushkar Shukla et.al. | 2404.11819v1 | null |
2024-04-18 | Efficiently Adversarial Examples Generation for Visual-Language Models under Targeted Transfer Scenarios using Diffusion Models | Qi Guo et.al. | 2404.10335v2 | null |
2024-04-16 | Towards a Novel Perspective on Adversarial Examples Driven by Frequency | Zhun Zhang et.al. | 2404.10202v1 | null |
2024-04-19 | Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models | Peifei Zhu et.al. | 2404.09401v2 | null |
2024-04-14 | Counteracting Concept Drift by Learning with Future Malware Predictions | Branislav Bosansky et.al. | 2404.09352v1 | null |
2024-04-09 | Towards Building a Robust Toxicity Predictor | Dmitriy Bespalov et.al. | 2404.08690v1 | null |
2024-04-11 | Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization | Runqi Lin et.al. | 2404.08154v1 | link |
2024-04-11 | Persistent Classification: A New Approach to Stability of Data and Adversarial Examples | Brian Bell et.al. | 2404.08069v1 | null |
2024-04-10 | Logit Calibration and Feature Contrast for Robust Federated Learning on Non-IID Data | Yu Qiao et.al. | 2404.06776v1 | null |
2024-04-11 | On adversarial training and the 1 Nearest Neighbor classifier | Amir Hagai et.al. | 2404.06313v2 | link |
2024-04-08 | David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge | Miguel Costa et.al. | 2404.05688v1 | link |
2024-04-08 | Certified PEFTSmoothing: Parameter-Efficient Fine-Tuning with Randomized Smoothing | Chengyan Fu et.al. | 2404.05350v1 | null |
2024-04-08 | BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack | Viet Quoc Vo et.al. | 2404.05311v1 | null |
2024-04-08 | Out-of-Distribution Data: An Acquaintance of Adversarial Examples -- A Survey | Naveen Karunanayake et.al. | 2404.05219v1 | null |
2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159v1 | null |
2024-04-06 | Learning Minimal NAP Specifications for Neural Network Verification | Chuqin Geng et.al. | 2404.04662v1 | null |
2024-04-05 | Reliable Feature Selection for Adversarially Robust Cyber-Attack Detection | João Vitorino et.al. | 2404.04188v1 | null |
2024-04-03 | Adversarial Attacks and Dimensionality in Text Classifiers | Nandish Chattopadhyay et.al. | 2404.02660v1 | null |
2024-04-03 | Unsegment Anything by Simulating Deformation | Jiahao Lu et.al. | 2404.02585v1 | link |
2024-04-02 | One Noise to Rule Them All: Multi-View Adversarial Attacks with Universal Perturbation | Mehmet Ergezer et.al. | 2404.02287v1 | link |
2024-04-02 | Multi-granular Adversarial Attacks against Black-box Neural Ranking Models | Yu-An Liu et.al. | 2404.01574v1 | null |
2024-03-30 | STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-box Scenario | Renyang Liu et.al. | 2404.00362v1 | null |
2024-04-05 | On Inherent Adversarial Robustness of Active Vision Systems | Amitangshu Mukherjee et.al. | 2404.00185v2 | null |
2024-03-28 | Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset | Janis Goldzycher et.al. | 2403.19559v1 | link |
2024-03-27 | CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection | Jiayi Zhu et.al. | 2403.18554v1 | null |
2024-03-26 | DataCook: Crafting Anti-Adversarial Examples for Healthcare Data Copyright Protection | Sihan Shang et.al. | 2403.17755v1 | null |
2024-03-24 | Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals | Rui Zheng et.al. | 2403.16176v1 | null |
2024-03-22 | Robust optimization for adversarial learning with finite sample complexity guarantees | André Bertolace et.al. | 2403.15207v1 | null |
2024-03-21 | Diffusion Attack: Leveraging Stable Diffusion for Naturalistic Image Attacking | Qianyu Guo et.al. | 2403.14778v1 | null |
2024-03-21 | Few-Shot Adversarial Prompt Learning on Vision-Language Models | Yiwei Zhou et.al. | 2403.14774v1 | null |
2024-03-21 | Reversible Jump Attack to Textual Classifiers with Modification Reduction | Mingze Ni et.al. | 2403.14731v1 | link |
2024-03-19 | As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Anjun Hu et.al. | 2403.12693v1 | null |
2024-03-19 | Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory | Sensen Gao et.al. | 2403.12445v1 | null |
2024-03-18 | SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator | Javad Rafiei Asl et.al. | 2403.11833v1 | null |
2024-03-18 | Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM | Linyu Tang et.al. | 2403.11448v1 | null |
2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297v1 | null |
2024-03-16 | Understanding Robustness of Visual State Space Models for Image Classification | Chengbin Du et.al. | 2403.10935v1 | null |
2024-03-19 | Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples | Ziqi Zhou et.al. | 2403.10801v2 | link |
2024-03-14 | Counter-Samples: A Stateless Strategy to Neutralize Black Box Adversarial Attacks | Roey Bokobza et.al. | 2403.10562v1 | null |
2024-03-15 | Towards Non-Adversarial Algorithmic Recourse | Tobias Leemann et.al. | 2403.10330v1 | null |
2024-03-14 | An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models | Haochen Luo et.al. | 2403.09766v1 | link |
2024-03-12 | Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation | Chengxing Jia et.al. | 2403.07261v1 | link |
2024-03-11 | Overcoming the Paradox of Certified Training with Gaussian Smoothing | Stefan Balauca et.al. | 2403.07095v1 | null |
2024-03-11 | Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification | Shuai Li et.al. | 2403.06798v1 | null |
2024-03-11 | PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor | Jaewon Jung et.al. | 2403.06668v1 | null |
2024-03-11 | Real is not True: Backdoor Attacks Against Deepfake Detection | Hong Sun et.al. | 2403.06610v1 | null |
2024-03-08 | Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds | Tianrui Lou et.al. | 2403.05247v1 | null |
2024-03-08 | Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples | Eda Yilmaz et.al. | 2403.05181v1 | null |
2024-03-06 | Improving Adversarial Training using Vulnerability-Aware Perturbation Budget | Olukorede Fakorede et.al. | 2403.04070v1 | null |
2024-03-05 | Towards Robust Federated Learning via Logits Calibration on Non-IID Data | Yu Qiao et.al. | 2403.02803v1 | null |
2024-03-04 | Robustness Bounds on the Successful Adversarial Examples: Theory and Practice | Hiroaki Maeshima et.al. | 2403.01896v1 | null |
2024-03-04 | One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models | Lin Li et.al. | 2403.01849v1 | link |
2024-03-02 | SAR-AE-SFP: SAR Imagery Adversarial Example in Real Physics domain with Target Scattering Feature Parameters | Jiahao Cui et.al. | 2403.01210v1 | null |
2024-02-29 | Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification | Sonal Joshi et.al. | 2402.19355v1 | null |
2024-02-29 | Pointing out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials | Gennaro Nolano et.al. | 2402.19076v1 | null |
2024-02-29 | How to Train your Antivirus: RL-based Hardening through the Problem-Space | Jacopo Cortellazzi et.al. | 2402.19027v1 | null |
2024-02-29 | MPAT: Building Robust Deep Neural Networks against Textual Adversarial Attacks | Fangyuan Zhang et.al. | 2402.18792v1 | null |
2024-02-29 | Enhancing the "Immunity" of Mixture-of-Experts Networks for Adversarial Defense | Qiao Han et.al. | 2402.18787v1 | null |
2024-02-27 | Adversarial example soups: averaging multiple adversarial examples improves transferability without increasing additional generation time | Bo Yang et.al. | 2402.18370v1 | null |
2024-02-28 | Catastrophic Overfitting: A Potential Blessing in Disguise | Mengnan Zhao et.al. | 2402.18211v1 | null |
2024-02-27 | LLM-Resistant Math Word Problem Generation via Adversarial Attacks | Roy Xie et.al. | 2402.17916v1 | link |
2024-02-28 | Black-box Adversarial Attacks Against Image Quality Assessment Models | Yu Ran et.al. | 2402.17533v2 | null |
2024-02-27 | Extreme Miscalibration and the Illusion of Adversarial Robustness | Vyas Raina et.al. | 2402.17509v1 | null |
2024-02-27 | Conformal Shield: A Novel Adversarial Attack Detection Framework for Automatic Modulation Classification | Tailai Wen et.al. | 2402.17450v1 | null |
2024-02-27 | Robustness-Congruent Adversarial Training for Secure Machine Learning Model Updates | Daniele Angioni et.al. | 2402.17390v1 | null |
2024-02-25 | An Adversarial Robustness Benchmark for Enterprise Network Intrusion Detection | João Vitorino et.al. | 2402.16912v1 | null |
2024-02-26 | Improving the JPEG-resistance of Adversarial Attacks on Face Recognition by Interpolation Smoothing | Kefu Guo et.al. | 2402.16586v1 | null |
2024-02-25 | From Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text Embeddings | Hao Wang et.al. | 2402.16006v1 | null |
2024-02-23 | Distilling Adversarial Robustness Using Heterogeneous Teachers | Jieren Deng et.al. | 2402.15586v1 | null |
2024-02-23 | Deep Networks Always Grok and Here is Why | Ahmed Imtiaz Humayun et.al. | 2402.15555v1 | null |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429v1 | link |
2024-02-22 | SoK: Analyzing Adversarial Examples: A Framework to Study Adversary Knowledge | Lucas Fenaux et.al. | 2402.14937v1 | null |
2024-02-22 | Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off | Futa Waseda et.al. | 2402.14648v1 | null |
2024-02-26 | AttackGNN: Red-Teaming GNNs in Hardware Security Using Reinforcement Learning | Vasudev Gohil et.al. | 2402.13946v2 | null |
2024-02-22 | Robustness of Deep Neural Networks for Micro-Doppler Radar Classification | Mikolaj Czerkawski et.al. | 2402.13651v2 | null |
2024-02-20 | QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems | Jinjing Shi et.al. | 2402.12950v1 | link |
2024-02-19 | Query-Based Adversarial Prompt Generation | Jonathan Hayase et.al. | 2402.12329v1 | null |
2024-02-19 | Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training | Leo Hyun Park et.al. | 2402.12187v1 | null |
2024-02-19 | Stealing the Invisible: Unveiling Pre-Trained CNN Models through Adversarial Examples and Timing Side-Channels | Shubhi Shukla et.al. | 2402.11953v1 | null |
2024-02-16 | DART: A Principled Approach to Adversarially Robust Unsupervised Domain Adaptation | Yunjuan Wang et.al. | 2402.11120v1 | null |
2024-02-16 | Zero-shot sampling of adversarial entities in biomedical question answering | R. Patrick Xian et.al. | 2402.10527v1 | null |
2024-02-16 | Theoretical Understanding of Learning from Adversarial Perturbations | Soichiro Kumano et.al. | 2402.10470v1 | link |
2024-02-15 | Exploring the Adversarial Capabilities of Large Language Models | Lukas Struppek et.al. | 2402.09132v2 | null |
2024-02-13 | Faster Repeated Evasion Attacks in Tree Ensembles | Lorenzo Cascioli et.al. | 2402.08586v1 | null |
2024-02-12 | Understanding Deep Learning defenses Against Adversarial Examples Through Visualizations for Dynamic Risk Assessment | Xabier Echeberria-Barrio et.al. | 2402.07496v1 | null |
2024-02-11 | A Random Ensemble of Encrypted Vision Transformers for Adversarially Robust Defense | Ryota Iijima et.al. | 2402.07183v1 | null |
2024-02-05 | Adversarial Text Purification: A Large Language Model Approach for Defense | Raha Moraffah et.al. | 2402.06655v1 | null |
2024-02-08 | Comprehensive Assessment of Jailbreak Attacks Against LLMs | Junjie Chu et.al. | 2402.05668v1 | null |
2024-02-07 | Adversarial Robustness Through Artifact Design | Tsufit Shua et.al. | 2402.04660v1 | null |
2024-02-06 | Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping | Qinliang Lin et.al. | 2402.03951v1 | link |
2024-02-05 | Arabic Synonym BERT-based Adversarial Examples for Text Classification | Norah Alshahrani et.al. | 2402.03477v1 | link |
2024-02-05 | Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics | Shuai Li et.al. | 2402.03095v1 | link |
2024-02-05 | A Generative Approach to Surrogate-based Black-box Attacks | Raha Moraffah et.al. | 2402.02732v1 | null |
2024-02-04 | DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision Transformers | Oryan Yehezkel et.al. | 2402.02554v1 | null |
2024-02-02 | | Antonio Emanuele Cinà et.al. | 2402.01879v1 | link |
2024-02-02 | HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text | Han Liu et.al. | 2402.01806v1 | link |
2024-02-02 | STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition | Yi Chang et.al. | 2402.01227v1 | null |
2024-02-01 | Improving QA Model Performance with Cartographic Inoculation | Allen Chen et.al. | 2401.17498v2 | null |
2024-01-30 | Single Word Change is All You Need: Designing Attacks and Defenses for Text Classifiers | Lei Xu et.al. | 2401.17196v1 | null |
2024-01-29 | LESSON: Multi-Label Adversarial False Data Injection Attack for Deep Learning Locational Detection | Jiwei Tian et.al. | 2401.16001v1 | null |
2024-01-24 | Boosting the Transferability of Adversarial Examples via Local Mixup and Adaptive Step Size | Junlin Liu et.al. | 2401.13205v1 | null |
2024-01-24 | Compositional Generative Inverse Design | Tailin Wu et.al. | 2401.13171v1 | link |
2024-01-23 | Fast Adversarial Training against Textual Adversarial Attacks | Yichen Yang et.al. | 2401.12461v1 | null |
2024-01-25 | The Surprising Harmfulness of Benign Overfitting for Adversarial Robustness | Yifan Hao et.al. | 2401.12236v2 | null |
2024-01-21 | How Robust Are Energy-Based Models Trained With Equilibrium Propagation? | Siddharth Mansingh et.al. | 2401.11543v1 | null |
2024-02-02 | Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion | Aly M. Kassem et.al. | 2401.11373v2 | null |
2024-01-19 | Explainable and Transferable Adversarial Attack for ML-Based Network Intrusion Detectors | Hangsheng Zhang et.al. | 2401.10691v1 | null |
2024-01-19 | PuriDefense: Randomized Local Implicit Adversarial Purification for Defending Black-box Query-based Attacks | Ping Guo et.al. | 2401.10586v1 | null |
2024-01-18 | Marrying Adapters and Mixup to Efficiently Enhance the Adversarial Robustness of Pre-Trained Language Models for Text Classification | Tuc Nguyen et.al. | 2401.10111v1 | null |
2024-01-19 | Hijacking Attacks against Neural Networks by Analyzing Training Data | Yunjie Ge et.al. | 2401.09740v2 | link |
2024-01-17 | PPR: Enhancing Dodging Attacks while Maintaining Impersonation Attacks on Face Recognition Systems | Fengfan Zhou et.al. | 2401.08903v1 | null |
2024-01-16 | Robust Localization of Key Fob Using Channel Impulse Response of Ultra Wide Band Sensors for Keyless Entry Systems | Abhiram Kolli et.al. | 2401.08863v1 | null |
2024-01-16 | Bag of Tricks to Boost Adversarial Transferability | Zeliang Zhang et.al. | 2401.08734v1 | null |
2024-01-16 | A Generative Adversarial Attack for Multilingual Text Classifiers | Tom Roth et.al. | 2401.08255v1 | null |
2024-01-13 | Exploring Adversarial Attacks against Latent Diffusion Model from the Perspective of Adversarial Transferability | Junxi Chen et.al. | 2401.07087v1 | null |
2024-01-17 | Adversarial Examples are Misaligned in Diffusion Model Manifolds | Peter Lorenz et.al. | 2401.06637v3 | null |
2024-01-11 | GE-AdvGAN: Improving the transferability of adversarial samples by gradient editing-based adversarial generative model | Zhiyu Zhu et.al. | 2401.06031v1 | link |
2024-01-12 | Bound Tightening using Rolling-Horizon Decomposition for Neural Network Verification | Haoruo Zhao et.al. | 2401.05280v2 | link |
2024-01-09 | Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness | Sibo Wang et.al. | 2401.04350v1 | null |
2024-01-06 | Data-Dependent Stability Analysis of Adversarial Training | Yihan Wang et.al. | 2401.03156v1 | null |
2024-01-13 | Enhancing targeted transferability via feature space fine-tuning | Hui Zeng et.al. | 2401.02727v2 | link |
2024-01-05 | A Random Ensemble of Encrypted models for Enhancing Robustness against Adversarial Examples | Ryota Iijima et.al. | 2401.02633v1 | null |
2024-01-08 | Vulnerabilities Unveiled: Adversarially Attacking a Multimodal Vision Language Model for Pathology Imaging | Jai Prakash Veerla et.al. | 2401.02565v2 | null |
2024-01-02 | JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial Example | Benedetta Tondi et.al. | 2401.01199v1 | link |
2023-12-30 | CamPro: Camera-based Anti-Facial Recognition | Wenjun Zhu et.al. | 2401.00151v1 | link |
2023-12-28 | BlackboxBench: A Comprehensive Benchmark of Black-box Adversarial Attacks | Meixi Zheng et.al. | 2312.16979v1 | link |
2023-12-28 | Attack Tree Analysis for Adversarial Evasion Attacks | Yuki Yamaguchi et.al. | 2312.16957v1 | null |
2023-12-26 | From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems | Gulsum Yigit et.al. | 2312.16156v1 | null |
2023-12-25 | GanFinger: GAN-Based Fingerprint Generation for Deep Neural Network Ownership Verification | Huali Ren et.al. | 2312.15617v1 | null |
2023-12-21 | AutoAugment Input Transformation for Highly Transferable Targeted Attacks | Haobo Lu et.al. | 2312.14218v1 | null |
2023-12-21 | Where and How to Attack? A Causality-Inspired Recipe for Generating Counterfactual Adversarial Examples | Ruichu Cai et.al. | 2312.13628v1 | null |
2023-12-20 | LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate | Tao Wu et.al. | 2312.13118v1 | link |
2023-12-20 | PGN: A perturbation generation network against deep reinforcement learning | Xiangjuan Li et.al. | 2312.12904v1 | null |
2023-12-18 | The Ultimate Combo: Boosting Adversarial Example Transferability by Composing Data Augmentations | Zebin Yun et.al. | 2312.11309v1 | null |
2023-12-18 | The Pros and Cons of Adversarial Robustness | Yacine Izza et.al. | 2312.10911v1 | null |
2023-12-16 | Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off | Yu-An Liu et.al. | 2312.10329v1 | null |
2023-12-15 | LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer | Yuxin Cao et.al. | 2312.09935v1 | link |
2023-12-15 | Verification-Friendly Deep Neural Networks | Anahita Baninajjar et.al. | 2312.09748v1 | null |
2023-12-15 | Towards Transferable Targeted 3D Adversarial Attack in the Physical World | Yao Huang et.al. | 2312.09558v1 | null |
2023-12-15 | Embodied Adversarial Attack: A Dynamic Robust Physical Attack in Autonomous Driving | Yitong Sun et.al. | 2312.09554v1 | null |
2023-12-15 | SlowTrack: Increasing the Latency of Camera-based Perception in Autonomous Driving Using Adversarial Examples | Chen Ma et.al. | 2312.09520v1 | null |
2023-12-13 | Defenses in Adversarial Machine Learning: A Survey | Baoyuan Wu et.al. | 2312.08890v1 | null |
2023-12-12 | May the Noise be with you: Adversarial Training without Adversarial Examples | Ayoub Arous et.al. | 2312.08877v1 | null |
2023-12-13 | Accelerating the Global Aggregation of Local Explanations | Alon Mor et.al. | 2312.07991v1 | null |
2023-12-13 | Robust Few-Shot Named Entity Recognition with Boundary Discrimination and Correlation Purification | Xiaojun Xue et.al. | 2312.07961v1 | link |
2023-12-13 | Radio Signal Classification by Adversarially Robust Quantum Machine Learning | Yanqiu Wu et.al. | 2312.07821v1 | null |
2023-12-12 | Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval | Qiwei Tian et.al. | 2312.07364v1 | null |
2023-12-12 | SSTA: Salient Spatially Transformed Attack | Renyang Liu et.al. | 2312.07258v1 | null |
2023-12-12 | DTA: Distribution Transform-based Attack for Query-Limited Scenario | Renyang Liu et.al. | 2312.07245v1 | null |
2023-12-12 | Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training | Qian Li et.al. | 2312.07067v1 | null |
2023-12-11 | Towards Transferable Adversarial Attacks with Centralized Perturbation | Shangbo Wu et.al. | 2312.06199v1 | null |
2023-12-09 | Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation | Shiji Zhao et.al. | 2312.05508v1 | null |
2023-12-09 | Poisoning | Ege Erdogan et.al. | 2312.05502v1 | null |
2023-12-08 | MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness | Xiaoyun Xu et.al. | 2312.04960v1 | link |
2023-12-08 | SA-Attack: Improving Adversarial Transferability of Vision-Language Pre-training Models via Self-Augmentation | Bangyan He et.al. | 2312.04913v1 | null |
2023-12-08 | HC-Ref: Hierarchical Constrained Refinement for Robust Adversarial Training of GNNs | Xiaobing Pei et.al. | 2312.04879v1 | null |
2023-12-07 | OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization | Dongchen Han et.al. | 2312.04403v1 | null |
2023-12-07 | Class Incremental Learning for Adversarial Robustness | Seungju Cho et.al. | 2312.03289v2 | null |
2023-12-06 | A Simple Framework to Enhance the Adversarial Robustness of Deep Learning-based Intrusion Detection System | Xinwei Yuan et.al. | 2312.03245v1 | null |
2023-12-05 | ScAR: Scaling Adversarial Robustness for LiDAR Object Detection | Xiaohu Lu et.al. | 2312.03085v1 | link |
2023-12-05 | Generating Visually Realistic Adversarial Patch | Xiaosen Wang et.al. | 2312.03030v1 | null |
2023-12-04 | Singular Regularization with Information Bottleneck Improves Model's Adversarial Robustness | Guanlin Li et.al. | 2312.02237v1 | null |
2023-12-03 | QuantAttack: Exploiting Dynamic Quantization to Attack Vision Transformers | Amit Baras et.al. | 2312.02220v1 | null |
2023-12-03 | TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation | Xiaojun Jia et.al. | 2312.02207v1 | null |
2023-12-04 | InstructTA: Instruction-Tuned Targeted Attack for Large Vision-Language Models | Xunguang Wang et.al. | 2312.01886v1 | null |
2023-12-04 | Adversarial Medical Image with Hierarchical Feature Hiding | Qingsong Yao et.al. | 2312.01679v1 | link |
2023-11-30 | Universal Backdoor Attacks | Benjamin Schneider et.al. | 2312.00157v1 | null |
2023-11-28 | Rethinking Mixup for Improving the Adversarial Transferability | Xiaosen Wang et.al. | 2311.17087v1 | null |
2023-11-28 | Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-trained Model | AprilPyone MaungMaung et.al. | 2311.16577v1 | null |
2023-11-27 | RetouchUAA: Unconstrained Adversarial Attack via Image Retouching | Mengda Xie et.al. | 2311.16478v1 | null |
2023-11-28 | CLAP: Contrastive Learning with Augmented Prompts for Robustness on Pretrained Vision-Language Models | Yichao Cai et.al. | 2311.16445v1 | null |
2023-11-28 | Adversarial Doodles: Interpretable and Human-drawable Attacks Provide Describable Insights | Ryoya Nara et.al. | 2311.15994v2 | null |
2023-11-27 | Instruct2Attack: Language-Guided Semantic Adversarial Attacks | Jiang Liu et.al. | 2311.15551v1 | null |
2023-11-26 | Having Second Thoughts? Let's hear it | Jung H. Lee et.al. | 2311.15356v1 | null |
2023-11-23 | When Side-Channel Attacks Break the Black-Box Property of Embedded Artificial Intelligence | Benoit Coqueret et.al. | 2311.14005v1 | null |
2023-11-23 | Adversarial defense based on distribution transfer | Jiahao Chen et.al. | 2311.13841v1 | null |
2023-11-22 | A Somewhat Robust Image Watermark against Diffusion-based Editing Models | Mingtian Tan et.al. | 2311.13713v1 | null |
2023-11-22 | Transfer Attacks and Defenses for Large Language Models on Coding Tasks | Chi Zhang et.al. | 2311.13445v1 | null |
2023-11-22 | A Survey of Adversarial CAPTCHAs on its History, Classification and Generation | Zisheng Xu et.al. | 2311.13233v1 | null |
2023-11-21 | SD-NAE: Generating Natural Adversarial Examples with Stable Diffusion | Yueqian Lin et.al. | 2311.12981v1 | null |
2023-11-18 | Boost Adversarial Transferability by Uniform Scale and Mix Mask Method | Tao Wang et.al. | 2311.12051v1 | null |
2023-11-20 | Generating Valid and Natural Adversarial Examples with Large Language Models | Zimu Wang et.al. | 2311.11861v1 | null |
2023-11-18 | Improving Adversarial Transferability by Stable Diffusion | Jiayang Liu et.al. | 2311.11017v1 | null |
2023-11-17 | Breaking Boundaries: Balancing Performance and Robustness in Deep Wireless Traffic Forecasting | Romain Ilbert et.al. | 2311.09790v2 | null |
2023-11-15 | Jailbreaking GPT-4V via Self-Adversarial Attacks with System Prompts | Yuanwei Wu et.al. | 2311.09127v1 | null |
2023-11-14 | DALA: A Distribution-Aware LoRA-Based Adversarial Attack against Pre-trained Language Models | Yibo Wang et.al. | 2311.08598v1 | null |
2023-11-14 | Physical Adversarial Examples for Multi-Camera Systems | Ana Răduţoiu et.al. | 2311.08539v1 | null |
2023-11-14 | On The Relationship Between Universal Adversarial Attacks And Sparse Representations | Dana Weitzner et.al. | 2311.08265v1 | link |
2023-11-14 | Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning | Shashank Kotyan et.al. | 2311.07928v1 | null |
2023-11-17 | Parrot-Trained Adversarial Examples: Pushing the Practicality of Black-Box Audio Attacks against Speaker Recognition Models | Rui Duan et.al. | 2311.07780v2 | null |
2023-11-13 | An Extensive Study on Adversarial Attack against Pre-trained Models of Code | Xiaohu Du et.al. | 2311.07553v1 | link |
2023-11-10 | Flatness-aware Adversarial Attack | Mingyuan Fan et.al. | 2311.06423v1 | null |
2023-11-08 | Constrained Adaptive Attacks: Realistic Evaluation of Adversarial Examples and Robust Training of Deep Neural Networks for Tabular Data | Thibault Simonetto et.al. | 2311.04503v1 | null |
2023-11-07 | Unveiling Safety Vulnerabilities of Large Language Models | George Kour et.al. | 2311.04124v1 | null |
2023-11-06 | Measuring Adversarial Datasets | Yuanchen Bai et.al. | 2311.03566v1 | null |
2023-11-02 | Adversary ML Resilience in Autonomous Driving Through Human Centered Perception Mechanisms | Aakriti Shah et.al. | 2311.01478v1 | null |
2023-11-01 | Adversarial Examples in the Physical World: A Survey | Jiakai Wang et.al. | 2311.01473v1 | null |
2023-11-02 | Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models | Andy Zhou et.al. | 2311.01441v1 | link |
2023-11-02 | On the Lipschitz constant of random neural networks | Paul Geuchen et.al. | 2311.01356v1 | null |
2023-11-02 | Towards Evaluating Transfer-based Attacks Systematically, Practically, and Fairly | Qizhang Li et.al. | 2311.01323v1 | null |
2023-11-02 | Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game | Sam Toyer et.al. | 2311.01011v1 | null |
2023-11-01 | NEO-KD: Knowledge-Distillation-Based Adversarial Training for Robust Multi-Exit Neural Networks | Seokil Ham et.al. | 2311.00428v1 | null |
2023-10-31 | Robust Safety Classifier for Large Language Models: Adversarial Prompt Shield | Jinhwa Kim et.al. | 2311.00172v1 | null |
2023-11-01 | LFAA: Crafting Transferable Targeted Adversarial Examples with Low-Frequency Perturbations | Kunyu Wang et.al. | 2310.20175v2 | null |
2023-10-29 | Adversarial Examples Are Not Real Features | Ang Li et.al. | 2310.18936v1 | link |
2023-10-28 | Assessing and Improving Syntactic Adversarial Robustness of Pre-trained Models for Code Translation | Guang Yang et.al. | 2310.18587v1 | link |
2023-11-02 | Understanding and Improving Ensemble Adversarial Defense | Yian Deng et.al. | 2310.18477v2 | link |
2023-10-26 | A Survey on Transferability of Adversarial Examples across Deep Neural Networks | Jindong Gu et.al. | 2310.17626v1 | link |
2023-10-26 | Instability of computer vision models is a necessary result of the task itself | Oliver Turnbull et.al. | 2310.17559v1 | null |
2023-10-25 | Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks | Aradhana Sinha et.al. | 2310.16955v1 | null |
2023-10-24 | Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks | Xiaojun Jia et.al. | 2310.15444v1 | null |
2023-10-23 | Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval | Xu Yuan et.al. | 2310.14637v1 | link |
2023-10-23 | F$^2$AT: Feature-Focusing Adversarial Training via Disentanglement of Natural and Perturbed Patterns | Yaguan Qian et.al. | 2310.14561v1 | null |
2023-10-24 | Diffusion-Based Adversarial Purification for Speaker Verification | Yibo Bai et.al. | 2310.14270v2 | null |
2023-10-22 | CT-GAT: Cross-Task Generative Adversarial Attack based on Transferability | Minxuan Lv et.al. | 2310.14265v1 | link |
2023-10-21 | Adversarial Image Generation by Spatial Transformation in Perceptual Colorspaces | Ayberk Aydin et.al. | 2310.13950v1 | link |
2023-10-20 | An LLM can Fool Itself: A Prompt-Based Adversarial Attack | Xilie Xu et.al. | 2310.13345v1 | null |
2023-10-23 | Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting | Zecheng Tang et.al. | 2310.13321v2 | link |
2023-10-19 | Generating Robust Adversarial Examples against Online Social Networks (OSNs) | Jun Liu et.al. | 2310.12708v1 | link |
2023-10-19 | Recoverable Privacy-Preserving Image Classification through Noise-like Adversarial Examples | Jun Liu et.al. | 2310.12707v1 | link |
2023-10-19 | Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks | Xiaodong Yu et.al. | 2310.12516v1 | null |
2023-10-18 | Exploring Decision-based Black-box Attacks on Face Forgery Detection | Zhaoyu Chen et.al. | 2310.12017v1 | null |
2023-10-18 | IRAD: Implicit Representation-driven Image Resampling against Adversarial Attacks | Yue Cao et.al. | 2310.11890v1 | null |
2023-10-18 | Revisiting Transferable Adversarial Image Examples: Attack Categorization, Evaluation Guidelines, and New Insights | Zhengyu Zhao et.al. | 2310.11850v1 | link |
2023-10-17 | The Efficacy of Transformer-based Adversarial Attacks in Security Domains | Kunyang Li et.al. | 2310.11597v1 | null |
2023-10-16 | Black-box Targeted Adversarial Attack on Segment Anything (SAM) | Sheng Zheng et.al. | 2310.10010v1 | null |
2023-10-15 | Towards Deep Learning Models Resistant to Transfer-based Adversarial Attacks via Data-centric Robust Learning | Yulong Yang et.al. | 2310.09891v1 | null |
2023-10-15 | AFLOW: Developing Adversarial Examples under Extremely Noise-limited Settings | Renyang Liu et.al. | 2310.09795v1 | null |
2023-10-15 | SCME: A Self-Contrastive Method for Data-free and Query-Limited Model Extraction Attack | Renyang Liu et.al. | 2310.09792v1 | null |
2023-10-13 | Is Certifying | Ravi Mangal et.al. | 2310.09361v1 | null |
2023-10-18 | Attacks Meet Interpretability (AmI) Evaluation and Findings | Qian Ma et.al. | 2310.08808v2 | null |
2023-10-12 | Concealed Electronic Countermeasures of Radar Signal with Adversarial Examples | Ruinan Ma et.al. | 2310.08292v1 | null |
2023-10-12 | Samples on Thin Ice: Re-Evaluating Adversarial Pruning of Neural Networks | Giorgio Piras et.al. | 2310.08073v1 | null |
2023-10-11 | Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models | Renyang Liu et.al. | 2310.07492v1 | null |
2023-10-14 | An Adversarial Example for Direct Logit Attribution: Memory Management in gelu-4l | James Dao et.al. | 2310.07325v2 | null |
2023-10-12 | An Initial Investigation of Neural Replay Simulator for Over-the-Air Adversarial Perturbations to Automatic Speaker Verification | Jiaqi Li et.al. | 2310.05354v2 | null |
2023-10-09 | GReAT: A Graph Regularized Adversarial Training Method | Samet Bayram et.al. | 2310.05336v1 | null |
2023-10-08 | Do Large Language Models Know about Facts? | Xuming Hu et.al. | 2310.05177v1 | null |
2023-10-10 | BRAINTEASER: Lateral Thinking Puzzles for Large Language Models | Yifan Jiang et.al. | 2310.05057v2 | null |
2023-10-06 | Generating Less Certain Adversarial Examples Improves Robust Generalization | Minxing Zhang et.al. | 2310.04539v1 | link |
2023-10-06 | Assessing Robustness via Score-Based Adversarial Image Generation | Marcel Kollovieh et.al. | 2310.04285v1 | null |
2023-10-05 | OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks | Ofir Bar Tal et.al. | 2310.03707v1 | null |
2023-10-05 | Adversarial Machine Learning for Social Good: Reframing the Adversary as an Ally | Shawqi Al-Maliki et.al. | 2310.03614v1 | null |
2023-10-05 | Robust Representation Learning via Asymmetric Negative Contrast and Reverse Attention | Nuoyan Zhou et.al. | 2310.03358v1 | link |
2023-10-05 | An Integrated Algorithm for Robust and Imperceptible Audio Adversarial Examples | Armin Ettenhofer et.al. | 2310.03349v1 | null |
2023-10-07 | Untargeted White-box Adversarial Attack with Heuristic Defence Methods in Real-time Deep Learning based Network Intrusion Detection System | Khushnaseeb Roshan et.al. | 2310.03334v2 | null |
2023-10-04 | Misusing Tools in Large Language Models With Visual Adversarial Examples | Xiaohan Fu et.al. | 2310.03185v1 | null |
2023-10-03 | Splitting the Difference on Adversarial Training | Matan Levi et.al. | 2310.02480v1 | link |
2023-10-04 | LLM Lies: Hallucinations are not Bugs, but Features as Adversarial Examples | Jia-Yu Yao et.al. | 2310.01469v2 | link |
2023-10-02 | Fooling the Textual Fooler via Randomizing Latent Representations | Duy C. Hoang et.al. | 2310.01452v1 | null |
2023-10-01 | A Survey of Robustness and Safety of 2D and 3D Deep Learning Models Against Adversarial Attacks | Yanjie Li et.al. | 2310.00633v1 | null |
2023-10-01 | Understanding the Robustness of Randomized Feature Defense Against Query-Based Adversarial Attacks | Quang H. Nguyen et.al. | 2310.00567v1 | null |
2023-09-30 | Human-Producible Adversarial Examples | David Khachaturov et.al. | 2310.00438v1 | null |
2023-09-30 | Refutation of Shapley Values for XAI -- Additional Evidence | Xuanxiang Huang et.al. | 2310.00416v1 | null |
2023-09-29 | On Continuity of Robust and Accurate Classifiers | Ramin Barati et.al. | 2309.17048v1 | null |
2023-09-28 | Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness | Ambar Pal et.al. | 2309.16096v1 | null |
2023-10-04 | On Computational Entanglement and Its Interpretation in Adversarial Machine Learning | YenLung Lai et.al. | 2309.15669v2 | null |
2023-09-26 | Structure Invariant Transformation for better Adversarial Transferability | Xiaosen Wang et.al. | 2309.14700v1 | link |
2023-09-26 | DifAttack: Query-Efficient Black-Box Attack via Disentangled Feature Space | Liu Jun et.al. | 2309.14585v1 | link |
2023-09-25 | Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Ping Li et.al. | 2309.13857v1 | null |
2023-09-23 | RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias | Hao Cheng et.al. | 2309.13245v1 | null |
2023-09-22 | Improving Machine Learning Robustness via Adversarial Training | Long Dang et.al. | 2309.12593v1 | null |
2023-09-21 | HANS, are you clever? Clever Hans Effect Analysis of Neural Systems | Leonardo Ranaldi et.al. | 2309.12481v1 | null |
2023-09-21 | How Robust is Google's Bard to Adversarial Image Attacks? | Yinpeng Dong et.al. | 2309.11751v1 | link |
2023-09-20 | When to Trust AI: Advances and Challenges for Certification of Neural Networks | Marta Kwiatkowska et.al. | 2309.11196v1 | null |
2023-09-20 | PRAT: PRofiling Adversarial aTtacks | Rahul Ambati et.al. | 2309.11111v1 | null |
2023-09-21 | What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples | Shakila Mahjabin Tonni et.al. | 2309.10916v2 | link |
2023-09-19 | Adversarial Attacks Against Uncertainty Quantification | Emanuele Ledda et.al. | 2309.10586v1 | null |
2023-09-19 | Language Guided Adversarial Purification | Himanshu Singh et.al. | 2309.10348v1 | null |
2023-09-19 | Transferable Adversarial Attack on Image Tampering Localization | Yuqi Wang et.al. | 2309.10243v1 | null |
2023-09-18 | MAD: Meta Adversarial Defense Benchmark | X. Peng et.al. | 2309.09776v1 | null |
2023-09-18 | Stealthy Physical Masked Face Recognition Attack via Adversarial Style Optimization | Huihui Gong et.al. | 2309.09480v1 | null |
2023-09-18 | Reducing Adversarial Training Cost with Gradient Approximation | Huihui Gong et.al. | 2309.09464v1 | null |
2023-09-16 | Context-aware Adversarial Attack on Named Entity Recognition | Shuguang Chen et.al. | 2309.08999v1 | null |
2023-09-16 | Inverse classification with logistic and softmax classifiers: efficient optimization | Miguel Á. Carreira-Perpiñán et.al. | 2309.08945v1 | null |
2023-09-15 | Adversarial Attacks on Tables with Entity Swap | Aneta Koleva et.al. | 2309.08650v1 | null |
2023-09-14 | Unleashing the Adversarial Facet of Software Debloating | Do-Men Su et.al. | 2309.08058v1 | null |
2023-09-13 | Mitigating Adversarial Attacks in Federated Learning with Trusted Execution Environments | Simon Queyrut et.al. | 2309.07197v1 | link |
2023-09-13 | Hardening RGB-D Object Recognition Systems against Adversarial Patch Attacks | Yang Zheng et.al. | 2309.07106v1 | null |
2023-09-13 | APICom: Automatic API Completion via Prompt Learning and Adversarial Training-based Data Augmentation | Yafeng Gu et.al. | 2309.07026v1 | null |
2023-09-13 | PhantomSound: Black-Box, Query-Efficient Audio Adversarial Attack via Split-Second Phoneme Injection | Hanqing Guo et.al. | 2309.06960v1 | null |
2023-09-12 | Using Reed-Muller Codes for Classification with Rejection and Recovery | Daniel Fentham et.al. | 2309.06359v1 | link |
2023-09-12 | Certified Robust Models with Slack Control and Large Lipschitz Constants | Max Losch et.al. | 2309.06166v1 | link |
2023-09-11 | Diffusion-based Adversarial Purification for Robust Deep MRI Reconstruction | Ismail Alkhouri et.al. | 2309.05794v1 | link |
2023-09-09 | Exploring Robust Features for Improving Adversarial Robustness | Hong Wang et.al. | 2309.04650v1 | null |
2023-09-07 | How adversarial attacks can disrupt seemingly stable accurate classifiers | Oliver J. Sutton et.al. | 2309.03665v1 | null |
2023-09-05 | The Adversarial Implications of Variable-Time Inference | Dudi Biton et.al. | 2309.02159v1 | link |
2023-09-06 | Efficient Query-Based Attack against ML-Based Android Malware Detection under Zero Knowledge Setting | Ping He et.al. | 2309.01866v2 | null |
2023-09-04 | Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings | AprilPyone MaungMaung et.al. | 2309.01620v1 | null |
2023-09-04 | Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF | Leheng Li et.al. | 2309.01351v1 | null |
2023-09-02 | Towards Certified Probabilistic Robustness with High Accuracy | Ruihan Zhang et.al. | 2309.00879v1 | null |
2023-09-01 | Curating Naturally Adversarial Datasets for Trustworthy AI in Healthcare | Sydney Pugh et.al. | 2309.00543v1 | null |
2023-09-01 | Image Hijacking: Adversarial Images can Control Generative Models at Runtime | Luke Bailey et.al. | 2309.00236v1 | null |
2023-08-31 | Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff | Satoshi Suzuki et.al. | 2308.16454v1 | null |
2023-08-29 | Adaptive Attack Detection in Text Classification: Leveraging Space Exploration Features for Text Sentiment Classification | Atefeh Mahdavi et.al. | 2308.15663v1 | null |
2023-08-29 | 3D Adversarial Augmentations for Robust Out-of-Domain Predictions | Alexander Lehner et.al. | 2308.15479v1 | null |
2023-08-29 | Imperceptible Adversarial Attack on Deep Neural Networks from Image Boundary | Fahad Alrasheedi et.al. | 2308.15344v1 | null |
2023-08-29 | A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation | Sahar Sadrizadeh et.al. | 2308.15246v1 | null |
2023-08-24 | Evaluating the Vulnerabilities in ML systems in terms of adversarial attacks | John Harshith et.al. | 2308.12918v1 | null |
2023-08-23 | On-Manifold Projected Gradient Descent | Aaron Mahler et.al. | 2308.12279v1 | null |
2023-08-23 | Does Physical Adversarial Example Really Matter to Autonomous Driving? Towards System-Level Effect of Adversarial Object Evasion Attack | Ningfei Wang et.al. | 2308.11894v1 | null |
2023-08-23 | SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks | Yue Gao et.al. | 2308.11845v1 | null |
2023-08-21 | Boosting Adversarial Attack with Similar Target | Shuo Zhang et.al. | 2308.10743v1 | link |
2023-08-21 | Improving the Transferability of Adversarial Examples with Arbitrary Style Transfer | Zhijin Ge et.al. | 2308.10601v1 | link |
2023-08-22 | Boosting Adversarial Transferability by Block Shuffle and Rotation | Kunyu Wang et.al. | 2308.10299v2 | null |
2023-08-15 | SEDA: Self-Ensembling ViT with Defensive Distillation and Adversarial Training for robust Chest X-rays Classification | Raza Imam et.al. | 2308.07874v1 | link |
2023-08-15 | Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models | Yugeng Liu et.al. | 2308.07847v1 | null |
2023-08-15 | Backpropagation Path Search On Adversarial Transferability | Zhuoer Xu et.al. | 2308.07625v1 | null |
2023-08-14 | White-Box Adversarial Attacks on Deep Learning-Based Radio Frequency Fingerprint Identification | Jie Ma et.al. | 2308.07433v1 | null |
2023-08-14 | AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Ziqi Zhou et.al. | 2308.07026v1 | link |
2023-08-13 | SoK: Realistic Adversarial Attacks and Defenses for Intelligent Network Intrusion Detection | João Vitorino et.al. | 2308.06819v1 | null |
2023-08-11 | Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregation | Xuannan Liu et.al. | 2308.06015v1 | link |
2023-08-08 | Pelta: Shielding Transformers to Mitigate Evasion Attacks in Federated Learning | Simon Queyrut et.al. | 2308.04373v1 | null |
2023-08-04 | Multi-attacks: Many images | Stanislav Fort et.al. | 2308.03792v1 | link |
2023-08-06 | CGBA: Curvature-aware Geometric Black-box Attack | Md Farhamdur Reza et.al. | 2308.03163v1 | link |
2023-08-05 | An Adaptive Model Ensemble Adversarial Attack for Boosting Adversarial Transferability | Bin Chen et.al. | 2308.02897v1 | link |
2023-08-01 | Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning | Kaijie Zhu et.al. | 2308.02533v1 | link |
2023-08-04 | AdvFAS: A robust face anti-spoofing framework against adversarial examples | Jiawei Chen et.al. | 2308.02116v1 | null |
2023-08-03 | URET: Universal Robustness Evaluation Toolkit (for Evasion) | Kevin Eykholt et.al. | 2308.01840v1 | link |
2023-08-03 | Hard Adversarial Example Mining for Improving Robust Fairness | Chenhao Lin et.al. | 2308.01823v1 | null |
2023-08-03 | Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time | Xinfeng Li et.al. | 2308.01040v2 | null |
2023-08-01 | Dynamic ensemble selection based on Deep Neural Network Uncertainty Estimation for Adversarial Robustness | Ruoxi Qin et.al. | 2308.00346v1 | null |
2023-08-01 | LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack | Hai Zhu et.al. | 2308.00319v1 | null |
2023-07-31 | Transferable Attack for Semantic Segmentation | Mengqi He et.al. | 2307.16572v1 | link |
2023-07-31 | Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples | Qiufan Ji et.al. | 2307.16361v1 | link |
2023-07-30 | Theoretically Principled Trade-off for Stateful Defenses against Query-Based Black-Box Attacks | Ashish Hooda et.al. | 2307.16331v1 | null |
2023-07-31 | R-LPIPS: An Adversarially Robust Perceptual Similarity Metric | Sara Ghazanfari et.al. | 2307.15157v2 | link |
2023-07-27 | FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Buse G. A. Tekgul et.al. | 2307.14751v1 | null |
2023-07-26 | Defending Adversarial Patches via Joint Region Localizing and Inpainting | Junwen Chen et.al. | 2307.14242v1 | null |
2023-07-26 | Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models | Dong Lu et.al. | 2307.14061v1 | null |
2023-07-26 | Enhanced Security against Adversarial Examples Using a Random Ensemble of Encrypted Vision Transformer Models | Ryota Iijima et.al. | 2307.13985v1 | null |
2023-07-27 | Why Don't You Clean Your Glasses? Perception Attacks with Dynamic Optical Perturbations | Yi Han et.al. | 2307.13131v2 | null |
2023-07-24 | Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation | Neel Bhandari et.al. | 2307.12520v1 | link |
2023-07-24 | AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models | Xuelong Dai et.al. | 2307.12499v1 | null |
2023-07-24 | Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training | Gege Qi et.al. | 2307.12498v1 | null |
2023-07-23 | Towards Generic and Controllable Attacks Against Object Detection | Guopeng Li et.al. | 2307.12342v1 | link |
2023-07-23 | Downstream-agnostic Adversarial Examples | Ziqi Zhou et.al. | 2307.12280v1 | null |
2023-07-21 | Unveiling Vulnerabilities in Interpretable Deep Learning Systems with Query-Efficient Black-box Attacks | Eldor Abdukhamidov et.al. | 2307.11906v1 | null |
2023-07-21 | Fast Adaptive Test-Time Defense with Robust Features | Anurag Singh et.al. | 2307.11672v1 | null |
2023-07-21 | Improving Transferability of Adversarial Examples via Bayesian Attacks | Qizhang Li et.al. | 2307.11334v1 | null |
2023-07-20 | Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples | Shaokui Wei et.al. | 2307.10562v1 | null |
2023-07-14 | Adversarial Training Over Long-Tailed Distribution | Guanlin Li et.al. | 2307.10205v1 | link |
2023-07-17 | Analyzing the Impact of Adversarial Examples on Explainable Machine Learning | Prathyusha Devabhakthini et.al. | 2307.08327v1 | null |
2023-07-18 | On the Robustness of Split Learning against Adversarial Attacks | Mingyuan Fan et.al. | 2307.07916v2 | null |
2023-07-19 | Why Does Little Robustness Help? Understanding Adversarial Transferability From Surrogate Training | Yechao Zhang et.al. | 2307.07873v2 | null |
2023-07-14 | Structured Pruning of Neural Networks for Constraints Learning | Matteo Cacciola et.al. | 2307.07457v1 | null |
2023-07-18 | Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning | Byung-Kwan Lee et.al. | 2307.07250v2 | link |
2023-07-14 | Vulnerability-Aware Instance Reweighting For Adversarial Training | Olukorede Fakorede et.al. | 2307.07167v1 | null |
2023-07-16 | Multi-objective Evolutionary Search of Variable-length Composite Semantic Perturbations | Jialiang Sun et.al. | 2307.06548v2 | null |
2023-07-13 | Microbial Genetic Algorithm-based Black-box Attack against Interpretable Deep Learning Systems | Eldor Abdukhamidov et.al. | 2307.06496v1 | null |
2023-07-11 | ATWM: Defense against adversarial malware based on adversarial training | Kun Li et.al. | 2307.05095v1 | null |
2023-07-10 | Practical Trustworthiness Model for DNN in Dedicated 6G Application | Anouar Nechi et.al. | 2307.04677v1 | null |
2023-07-09 | GNP Attack: Transferable Adversarial Examples via Gradient Norm Penalty | Tao Wu et.al. | 2307.04099v1 | null |
2023-07-06 | Quantification of Uncertainty with Adversarial Models | Kajetan Schweighofer et.al. | 2307.03217v1 | link |
2023-07-06 | NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic | Zi'ou Zheng et.al. | 2307.02849v1 | null |
2023-07-06 | Sampling-based Fast Gradient Rescaling Method for Highly Transferable Adversarial Attacks | Xu Han et.al. | 2307.02828v1 | null |
2023-07-05 | Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality | Peter Lorenz et.al. | 2307.02347v1 | link |
2023-07-04 | SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification | Junjie Wu et.al. | 2307.01488v1 | null |
2023-07-03 | Interpretability and Transparency-Driven Detection and Transformation of Textual Adversarial Examples (IT-DT) | Bushra Sabir et.al. | 2307.01225v1 | null |
2023-07-01 | Adversarial Attacks and Defenses on 3D Point Cloud Classification: A Survey | Hanieh Naderi et.al. | 2307.00309v1 | null |
2023-07-01 | Common Knowledge Learning for Generating Transferable Adversarial Examples | Ruijie Yang et.al. | 2307.00274v1 | null |
2023-07-05 | Defense against Adversarial Cloud Attack on Remote Sensing Salient Object Detection | Huiming Sun et.al. | 2306.17431v2 | null |
2023-06-29 | Defending Black-box Classifiers by Bayesian Boundary Correction | He Wang et.al. | 2306.16979v1 | null |
2023-06-29 | Towards Optimal Randomized Strategies in Adversarial Example Game | Jiahao Xie et.al. | 2306.16738v1 | null |
2023-06-28 | Does Saliency-Based Training bring Robustness for Deep Neural Networks in Image Classification? | Ali Karkehabadi et.al. | 2306.16581v1 | null |
2023-06-28 | Mitigating the Accuracy-Robustness Trade-off via Multi-Teacher Adversarial Distillation | Shiji Zhao et.al. | 2306.16170v1 | link |
2023-06-28 | Enrollment-stage Backdoor Attacks on Speaker Recognition Systems via Adversarial Ultrasound | Xinfeng Li et.al. | 2306.16022v1 | null |
2023-06-28 | Boosting Adversarial Transferability with Learnable Patch-wise Masks | Xingxing Wei et.al. | 2306.15931v1 | null |
2023-06-26 | Are aligned neural networks adversarially aligned? | Nicholas Carlini et.al. | 2306.15447v1 | null |
2023-06-26 | 3D-Aware Adversarial Makeup Generation for Facial Privacy Protection | Yueming Lyu et.al. | 2306.14640v1 | null |
2023-06-25 | RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations | Yilun Zhao et.al. | 2306.14321v1 | link |
2023-06-25 | On Evaluating the Adversarial Robustness of Semantic Segmentation Models | Levente Halmosi et.al. | 2306.14217v1 | null |
2023-06-25 | Robust Spatiotemporal Traffic Forecasting with Reinforced Dynamic Adversarial Training | Fan Liu et.al. | 2306.14126v1 | link |
2023-06-24 | Machine Learning needs its own Randomness Standard: Randomised Smoothing and PRNG-based attacks | Pranav Dahiya et.al. | 2306.14043v1 | null |
2023-06-24 | Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks | Zeming Wei et.al. | 2306.14040v1 | null |
2023-06-24 | Boosting Model Inversion Attacks with Adversarial Examples | Shuai Zhou et.al. | 2306.13965v1 | null |
2023-06-23 | Creating Valid Adversarial Examples of Malware | Matouš Kozák et.al. | 2306.13587v1 | link |
2023-06-22 | Document Image Cleaning using Budget-Aware Black-Box Approximation | Ganesh Tata et.al. | 2306.13236v1 | link |
2023-06-22 | Visual Adversarial Examples Jailbreak Large Language Models | Xiangyu Qi et.al. | 2306.13213v1 | link |
2023-06-22 | Anticipatory Thinking Challenges in Open Worlds: Risk Management | Adam Amos-Binks et.al. | 2306.13157v1 | null |
2023-06-22 | Adversarial Resilience in Sequential Prediction via Abstention | Surbhi Goel et.al. | 2306.13119v1 | null |
2023-06-22 | Towards quantum enhanced adversarial robustness in machine learning | Maxwell T. West et.al. | 2306.12688v1 | null |
2023-06-22 | Rethinking the Backward Propagation for Adversarial Transferability | Xiaosen Wang et.al. | 2306.12685v1 | null |
2023-06-21 | Evaluating Adversarial Robustness of Convolution-based Human Motion Prediction | Chengxu Duan et.al. | 2306.11990v1 | null |
2023-06-21 | Universal adversarial perturbations for multiple classification tasks with quantum classifiers | Yun-Zhong Qiu et.al. | 2306.11974v1 | null |
2023-06-20 | Reversible Adversarial Examples with Beam Search Attack and Grayscale Invariance | Haodong Zhang et.al. | 2306.11322v1 | null |
2023-06-17 | Edge Learning for 6G-enabled Internet of Things: A Comprehensive Survey of Vulnerabilities, Datasets, and Defenses | Mohamed Amine Ferrag et.al. | 2306.10309v1 | null |
2023-06-16 | Query-Free Evasion Attacks Against Machine Learning-Based Malware Detectors with Generative Adversarial Networks | Daniel Gibert et.al. | 2306.09925v1 | null |
2023-06-14 | Augment then Smooth: Reconciling Differential Privacy with Certified Robustness | Jiapeng Wu et.al. | 2306.08656v1 | null |
2023-06-14 | Reliable Evaluation of Adversarial Transferability | Wenqian Yu et.al. | 2306.08565v1 | null |
2023-06-14 | A Relaxed Optimization Approach for Adversarial Attacks against Neural Machine Translation Models | Sahar Sadrizadeh et.al. | 2306.08492v1 | null |
2023-06-11 | Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework | Minglei Yin et.al. | 2306.07992v1 | null |
2023-06-13 | Area is all you need: repeatable elements make stronger adversarial attacks | Dillon Niederhut et.al. | 2306.07768v1 | null |
2023-06-13 | Generative Watermarking Against Unauthorized Subject-Driven Image Synthesis | Yihan Ma et.al. | 2306.07754v1 | null |
2023-06-13 | Theoretical Foundations of Adversarially Robust Learning | Omar Montasser et.al. | 2306.07723v1 | null |
2023-06-13 | I See Dead People: Gray-Box Adversarial Attack on Image-To-Text Models | Raz Lapid et.al. | 2306.07591v1 | null |
2023-06-12 | AROID: Improving Adversarial Robustness through Online Instance-wise Data Augmentation | Lin Li et.al. | 2306.07197v1 | null |
2023-06-12 | When Vision Fails: Text Attacks Against ViT and OCR | Nicholas Boucher et.al. | 2306.07033v1 | link |
2023-06-08 | Boosting Adversarial Transferability by Achieving Flat Local Maxima | Zhijin Ge et.al. | 2306.05225v1 | null |
2023-06-08 | Expanding Scope: Adapting English Adversarial Attacks to Chinese | Hanyu Liu et.al. | 2306.04874v1 | null |
2023-06-07 | PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts | Xiangjue Dong et.al. | 2306.04535v1 | link |
2023-06-07 | Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs | Ines Reinig et.al. | 2306.04523v1 | link |
2023-06-13 | Extracting Cloud-based Model with Prior Knowledge | Shiqian Zhao et.al. | 2306.04192v4 | null |
2023-06-07 | Optimal Transport Model Distributional Robustness | Van-Anh Nguyen et.al. | 2306.04178v1 | null |
2023-06-06 | Revisiting the Trade-off between Accuracy and Robustness via Weight Distribution of Filters | Xingxing Wei et.al. | 2306.03430v1 | null |
2023-06-05 | Evading Black-box Classifiers Without Breaking Eggs | Edoardo Debenedetti et.al. | 2306.02895v1 | link |
2023-06-05 | Evaluating robustness of support vector machines with the Lagrangian dual approach | Yuting Liu et.al. | 2306.02639v1 | null |
2023-06-03 | Towards Black-box Adversarial Example Detection: A Data Reconstruction-based Method | Yifei Gao et.al. | 2306.02021v1 | null |
2023-06-02 | Adversarial Attack Based on Prediction-Correction | Chen Wan et.al. | 2306.01809v1 | null |
2023-06-02 | Why Clean Generalization and Robust Overfitting Both Happen in Adversarial Training | Binghui Li et.al. | 2306.01271v1 | null |
2023-06-01 | Reconstruction Distortion of Learned Image Compression with Imperceptible Perturbations | Yang Sui et.al. | 2306.01125v1 | null |
2023-06-01 | Constructing Semantics-Aware Adversarial Examples with Probabilistic Perspective | Andi Zhang et.al. | 2306.00353v1 | null |
2023-05-29 | Explainability in Simplicial Map Neural Networks | Eduardo Paluzo-Hidalgo et.al. | 2306.00010v1 | null |
2023-05-30 | Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation | Josef Jon et.al. | 2305.19330v1 | null |
2023-05-29 | NaturalFinger: Generating Natural Fingerprint with Generative Adversarial Networks | Kang Yang et.al. | 2305.17868v1 | null |
2023-05-28 | Amplification trojan network: Attack deep neural networks by amplifying their inherent weakness | Zhanhao Hu et.al. | 2305.17688v1 | null |
2023-05-28 | Choose your Data Wisely: A Framework for Semantic Counterfactuals | Edmund Dervakos et.al. | 2305.17667v1 | null |
2023-05-26 | Leveraging characteristics of the output probability distribution for identifying adversarial audio examples | Matías P. Pizarro B. et.al. | 2305.17000v1 | null |
2023-05-26 | On Evaluating Adversarial Robustness of Large Vision-Language Models | Yunqing Zhao et.al. | 2305.16934v1 | link |
2023-05-25 | IDEA: Invariant Causal Defense for Graph Adversarial Robustness | Shuchang Tao et.al. | 2305.15792v1 | null |
2023-05-24 | How do humans perceive adversarial text? A reality check on the validity and naturalness of word-based adversarial attacks | Salijona Dyrmishi et.al. | 2305.15587v1 | null |
2023-05-24 | Fantastic DNN Classifiers and How to Identify them without Data | Nathaniel Dean et.al. | 2305.15563v1 | link |
2023-05-24 | Introducing Competition to Boost the Transferability of Targeted Adversarial Examples through Clean Feature Mixup | Junyoung Byun et.al. | 2305.14846v1 | link |
2023-05-24 | Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models | Natalie Shapira et.al. | 2305.14763v1 | null |
2023-05-23 | Enhancing Accuracy and Robustness through Adversarial Training in Class Incremental Continual Learning | Minchan Kwon et.al. | 2305.13678v1 | null |
2023-05-28 | DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection | Jiang Liu et.al. | 2305.13625v2 | null |
2023-05-22 | Latent Magic: An Investigation into Adversarial Examples Crafted in the Semantic Latent Space | BoYang Zheng et.al. | 2305.12906v1 | null |
2023-05-22 | Uncertainty-based Detection of Adversarial Attacks in Semantic Segmentation | Kira Maag et.al. | 2305.12825v1 | link |
2023-05-22 | Mist: Towards Improved Adversarial Examples for Diffusion Models | Chumeng Liang et.al. | 2305.12683v1 | null |
2023-05-18 | Explaining V1 Properties with a Biologically Constrained Deep Learning Architecture | Galen Pogoncheff et.al. | 2305.11275v1 | null |
2023-05-18 | How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses | Joana C. Costa et.al. | 2305.10862v1 | null |
2023-05-18 | Towards an Accurate and Secure Detector against Adversarial Perturbations | Chao Wang et.al. | 2305.10856v1 | null |
2023-05-18 | Content-based Unrestricted Adversarial Attack | Zhaoyu Chen et.al. | 2305.10665v1 | null |
2023-05-17 | Releasing Inequality Phenomena in | Junxi Chen et.al. | 2305.09305v2 | null |
2023-05-15 | Attacking Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2305.08840v1 | null |
2023-05-14 | Diffusion Models for Imperceptible and Transferable Adversarial Attack | Jianqi Chen et.al. | 2305.08192v1 | link |
2023-05-11 | Inter-frame Accelerate Attack against Video Interpolation Models | Junpei Liao et.al. | 2305.06540v1 | null |
2023-05-11 | Randomized Smoothing with Masked Inference for Adversarially Robust Text Classifications | Han Cheol Moon et.al. | 2305.06522v1 | link |
2023-05-20 | RNNS: Representation Nearest Neighbor Search Black-Box Attack on Code Models | Jie Zhang et.al. | 2305.05896v2 | null |
2023-05-10 | Quantization Aware Attack: Enhancing the Transferability of Adversarial Attacks across Target Models with Different Quantization Bitwidths | Yulong Yang et.al. | 2305.05875v1 | null |
2023-05-09 | Attack Named Entity Recognition by Entity Boundary Interference | Yifei Yang et.al. | 2305.05253v1 | null |
2023-05-08 | Toward Adversarial Training on Contextualized Language Representation | Hongqiu Wu et.al. | 2305.04557v1 | link |
2023-05-08 | Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization | Zhaoxia Yin et.al. | 2305.04436v1 | null |
2023-05-06 | Reactive Perturbation Defocusing for Textual Adversarial Defense | Heng Yang et.al. | 2305.04067v1 | null |
2023-05-11 | Beyond the Model: Data Pre-processing Attack to Deep Learning Models in Android Apps | Ye Sang et.al. | 2305.03963v2 | null |
2023-05-03 | New Adversarial Image Detection Based on Sentiment Analysis | Yulong Wang et.al. | 2305.03173v1 | link |
2023-05-04 | Madvex: Instrumentation-based Adversarial Attacks on Machine Learning Malware Detection | Nils Loose et.al. | 2305.02559v1 | null |
2023-05-05 | Boosting Adversarial Transferability via Fusing Logits of Top-1 Decomposed Feature | Juanjuan Weng et.al. | 2305.01361v2 | null |
2023-05-08 | Attack-SAM: Towards Attacking Segment Anything Model With Adversarial Examples | Chenshuang Zhang et.al. | 2305.00866v2 | null |
2023-05-02 | Revisiting Robustness in Graph Machine Learning | Lukas Gosch et.al. | 2305.00851v2 | null |
2023-04-28 | Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models | Yu-An Liu et.al. | 2304.14867v1 | null |
2023-04-26 | Improving Adversarial Transferability by Intermediate-level Perturbation Decay | Qizhang Li et.al. | 2304.13410v1 | null |
2023-04-26 | Generating Adversarial Examples with Task Oriented Multi-Objective Optimization | Anh Bui et.al. | 2304.13229v1 | null |
2023-04-23 | Evading DeepFake Detectors via Adversarial Statistical Consistency | Yang Hou et.al. | 2304.11670v1 | null |
2023-04-23 | StyLess: Boosting the Transferability of Adversarial Examples | Kaisheng Liang et.al. | 2304.11579v1 | link |
2023-04-20 | Can Perturbations Help Reduce Investment Risks? Risk-Aware Stock Recommendation via Split Variational Adversarial Training | Jiezhu Cheng et.al. | 2304.11043v1 | null |
2023-04-24 | Using Z3 for Formal Modeling and Verification of FNN Global Robustness | Yihao Zhang et.al. | 2304.10558v2 | link |
2023-04-20 | Diversifying the High-level Features for better Adversarial Transferability | Zhiyuan Wang et.al. | 2304.10136v1 | null |
2023-04-20 | Towards the Universal Defense for Query-Based Audio Adversarial Attacks | Feng Guo et.al. | 2304.10088v1 | null |
2023-04-18 | In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT | Xinyue Shen et.al. | 2304.08979v1 | null |
2023-04-18 | Towards the Transferable Audio Adversarial Attack via Ensemble Methods | Feng Guo et.al. | 2304.08811v1 | null |
2023-04-19 | Masked Language Model Based Textual Adversarial Example Detection | Xiaomei Zhang et.al. | 2304.08767v2 | link |
2023-04-16 | JoB-VS: Joint Brain-Vessel Segmentation in TOF-MRA Images | Natalia Valderrama et.al. | 2304.07744v1 | link |
2023-04-14 | Combining Generators of Adversarial Malware Examples to Increase Evasion Rate | Matouš Kozák et.al. | 2304.07360v1 | null |
2023-04-14 | Interpretability is a Kind of Safety: An Interpreter-based Ensemble for Adversary Defense | Jingyuan Wang et.al. | 2304.06919v1 | null |
2023-04-14 | Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model | Dingcheng Yang et.al. | 2304.06908v1 | link |
2023-04-13 | False Claims against Model Ownership Resolution | Jian Liu et.al. | 2304.06607v1 | null |
2023-04-13 | Adversarial Examples from Dimensional Invariance | Benjamin L. Badger et.al. | 2304.06575v1 | null |
2023-04-12 | Generative Adversarial Networks-Driven Cyber Threat Intelligence Detection Framework for Securing Internet of Things | Mohamed Amine Ferrag et.al. | 2304.05644v1 | null |
2023-04-11 | Boosting Cross-task Transferability of Adversarial Patches with Visual Relations | Tony Ma et.al. | 2304.05402v1 | null |
2023-04-11 | Simultaneous Adversarial Attacks On Multiple Face Recognition System Components | Inderjeet Singh et.al. | 2304.05048v1 | null |
2023-04-10 | Certifiable Black-Box Attack: Ensuring Provably Successful Attack for Adversarial Examples | Hanbin Hong et.al. | 2304.04343v1 | null |
2023-04-08 | RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks | Alberto Marchisio et.al. | 2304.03973v1 | null |
2023-04-10 | Robust Neural Architecture Search | Xunyu Zhu et.al. | 2304.02845v2 | null |
2023-04-05 | Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability | Martin Gubri et.al. | 2304.02688v1 | link |
2023-04-05 | How to choose your best allies for a transferable attack? | Thibault Maho et.al. | 2304.02312v1 | link |
2023-04-03 | Model-Agnostic Reachability Analysis on Deep Neural Networks | Chi Zhang et.al. | 2304.00813v1 | null |
2023-04-06 | Improving Fast Adversarial Training with Prior-Guided Knowledge | Xiaojun Jia et.al. | 2304.00202v2 | null |
2023-03-25 | AdvCheck: Characterizing Adversarial Examples via Local Gradient Checking | Ruoxi Chen et.al. | 2303.18131v1 | null |
2023-03-29 | Beyond Empirical Risk Minimization: Local Structure Preserving Regularization for Improving Adversarial Robustness | Wei Wei et.al. | 2303.16861v1 | null |
2023-03-29 | Latent Feature Relation Consistency for Adversarial Robustness | Xingbin Liu et.al. | 2303.16697v1 | link |
2023-03-28 | TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations | Qi Gege et.al. | 2303.15940v1 | null |
2023-03-27 | EMShepherd: Detecting Adversarial Samples via Side-channel Leakage | Ruyi Ding et.al. | 2303.15571v1 | null |
2023-03-27 | Personalized Federated Learning on Long-Tailed Data via Adversarial Feature Augmentation | Yang Lu et.al. | 2303.15168v1 | link |
2023-03-27 | Improving the Transferability of Adversarial Examples via Direction Tuning | Xiangyuan Yang et.al. | 2303.15109v1 | null |

Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-28 | Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Weiqi Li et.al. | 2503.22679v1 | link |
2025-03-28 | Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions | Mohammad Almansoori et.al. | 2503.22678v1 | null |
2025-03-28 | DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness | Ruining Li et.al. | 2503.22677v1 | null |
2025-03-28 | TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting | Boyang et.al. | 2503.22676v1 | null |
2025-03-28 | Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation | Jiakai Tang et.al. | 2503.22675v1 | null |
2025-03-28 | QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? | Belinda Z. Li et.al. | 2503.22674v1 | null |
2025-03-28 | ActionStudio: A Lightweight Framework for Data and Training of Action Models | Jianguo Zhang et.al. | 2503.22673v1 | link |
2025-03-28 | Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers | Francesca Pezzuti et.al. | 2503.22672v1 | link |
2025-03-28 | Non-Archimedean Hilbert geometry and degenerations of real Hilbert geometries | Xenia Flamm et.al. | 2503.22671v1 | null |
2025-03-28 | Light Tree Covers, Routing, and Path-Reporting Oracles via Spanning Tree Covers in Doubling Graphs | Hsien-Chih Chang et.al. | 2503.22669v1 | null |
2025-03-27 | Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Abdelrahman Shaker et.al. | 2503.21782v1 | link |
2025-03-27 | VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models | Chi-Pin Huang et.al. | 2503.21781v1 | null |
2025-03-27 | Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation | Reza Qorbani et.al. | 2503.21780v1 | link |
2025-03-27 | X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction | Weihao Yu et.al. | 2503.21779v1 | null |
2025-03-27 | Test-Time Visual In-Context Tuning | Jiahao Xie et.al. | 2503.21777v1 | link |
2025-03-27 | Video-R1: Reinforcing Video Reasoning in MLLMs | Kaituo Feng et.al. | 2503.21776v1 | link |
2025-03-27 | StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion | Ziyu Guo et.al. | 2503.21775v1 | null |
2025-03-27 | Optimal Stepsize for Diffusion Sampling | Jianning Pei et.al. | 2503.21774v1 | link |
2025-03-27 | Simulating quantum circuits with restricted quantum computers | Christophe Piveteau et.al. | 2503.21773v1 | null |
2025-03-27 | LOCORE: Image Re-ranking with Long-Context Sequence Modeling | Zilin Xiao et.al. | 2503.21772v1 | null |
2025-03-26 | Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark | Sondos Mahmoud Bsharat et.al. | 2503.20786v1 | null |
2025-03-26 | Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency | Tianqi Liu et.al. | 2503.20785v1 | null |
2025-03-26 | FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks | Jinwei Li et.al. | 2503.20784v1 | null |
2025-03-26 | Understanding R1-Zero-Like Training: A Critical Perspective | Zichen Liu et.al. | 2503.20783v1 | null |
2025-03-26 | Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising | Yan-Bo Lin et.al. | 2503.20782v1 | null |
2025-03-26 | BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation | Yulu Pan et.al. | 2503.20781v1 | null |
2025-03-26 | PGC: Physics-Based Gaussian Cloth from a Single Pose | Michelle Guo et.al. | 2503.20779v1 | null |
2025-03-26 | Detectability of the chiral gravitational wave background from audible axions with the LISA-Taiji network | Hong Su et.al. | 2503.20778v1 | null |
2025-03-26 | Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields | Shijie Zhou et.al. | 2503.20776v1 | null |
2025-03-26 | PUREPath-B: A Tessellated Bayesian Model for Recovering CMB B-modes over Large Angular Scales of the Sky | Vipin Sudevan et.al. | 2503.20774v1 | null |
2025-03-25 | A New Hope for Obscured AGN: The PRIMA-NewAthena Alliance | Luigi Barchiesi et.al. | 2503.19915v1 | null |
2025-03-25 | Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models | Sangwon Beak et.al. | 2503.19914v1 | null |
2025-03-25 | PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model | Mingju Gao et.al. | 2503.19913v1 | null |
2025-03-25 | SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Xiang Xu et.al. | 2503.19912v1 | null |
2025-03-25 | Real-time all-optical signal equalisation with silicon photonic recurrent neural networks | Ruben Van Assche et.al. | 2503.19911v1 | null |
2025-03-25 | CoLLM: A Large Language Model for Composed Image Retrieval | Chuong Huynh et.al. | 2503.19910v1 | link |
2025-03-25 | In the Magma chamber: Update and challenges in ground-truth vulnerabilities revival for automatic input generator comparison | Timothée Riom et.al. | 2503.19909v1 | null |
2025-03-25 | FullDiT: Multi-Task Video Generative Foundation Model with Full Attention | Xuan Ju et.al. | 2503.19907v1 | null |
2025-03-26 | AvatarArtist: Open-Domain 4D Avatarization | Hongyu Liu et.al. | 2503.19906v2 | null |
2025-03-25 | Helmet streamer influence on the evolution of magnetic flux ropes | M. Cécere et.al. | 2503.19905v1 | null |
2025-03-24 | Target-Aware Video Diffusion Models | Taeksoo Kim et.al. | 2503.18950v1 | null |
2025-03-24 | Equivariant Image Modeling | Ruixiao Dong et.al. | 2503.18948v1 | link |
2025-03-24 | Tuning-Free Amodal Segmentation via the Occlusion-Free Bias of Inpainting Models | Jae Joong Lee et.al. | 2503.18947v1 | null |
2025-03-25 | Aether: Geometric-Aware Unified World Modeling | Aether Team et.al. | 2503.18945v2 | null |
2025-03-24 | DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Karim Abou Zeid et.al. | 2503.18944v1 | link |
2025-03-24 | SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding | Mingze Xu et.al. | 2503.18943v1 | null |
2025-03-24 | Video-T1: Test-Time Scaling for Video Generation | Fangfu Liu et.al. | 2503.18942v1 | null |
2025-03-24 | Exploring Training and Inference Scaling Laws in Generative Retrieval | Hongru Cai et.al. | 2503.18941v1 | null |
2025-03-24 | Training-free Diffusion Acceleration with Bottleneck Sampling | Ye Tian et.al. | 2503.18940v1 | null |
2025-03-24 | AdaWorld: Learning Adaptable World Models with Latent Actions | Shenyuan Gao et.al. | 2503.18938v1 | null |
2025-03-21 | On the road to the radius valley: distinguishing between gas dwarfs and water worlds with young transiting exoplanets | James G. Rogers et.al. | 2503.17364v1 | null |
2025-03-21 | Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique | Yansi Li et.al. | 2503.17363v1 | null |
2025-03-21 | Criteria for unbiased estimation: applications to noise-agnostic sensing and learnability of quantum channel | Hyukgun Kwon et.al. | 2503.17362v1 | null |
2025-03-21 | Gumbel-Softmax Flow Matching with Straight-Through Guidance for Controllable Biological Sequence Generation | Sophia Tang et.al. | 2503.17361v1 | null |
2025-03-21 | Position: Interactive Generative Video as Next-Generation Game Engine | Jiwen Yu et.al. | 2503.17359v1 | null |
2025-03-21 | Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image | Jerred Chen et.al. | 2503.17358v1 | null |
2025-03-21 | Filtered Rayleigh-Ritz is all you need | Ryan Abbott et.al. | 2503.17357v1 | null |
2025-03-21 | Fast Convex Optimization with Quantum Gradient Methods | Brandon Augustino et.al. | 2503.17356v1 | null |
2025-03-21 | HCAST: Human-Calibrated Autonomy Software Tasks | David Rein et.al. | 2503.17354v1 | null |
2025-03-21 | NdLinear Is All You Need for Representation Learning | Alex Reneau et.al. | 2503.17353v1 | null |
2025-03-20 | Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation | Yuqing Wang et.al. | 2503.16430v1 | null |
2025-03-20 | Sonata: Self-Supervised Learning of Reliable Point Representations | Xiaoyang Wu et.al. | 2503.16429v1 | link |
2025-03-20 | XAttention: Block Sparse Attention with Antidiagonal Scoring | Ruyi Xu et.al. | 2503.16428v1 | null |
2025-03-20 | On the Holographic Dual of a Symmetry Operator at Finite Temperature | Jonathan J. Heckman et.al. | 2503.16427v1 | null |
2025-03-20 | DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Keyan Chen et.al. | 2503.16426v1 | link |
2025-03-20 | Tokenize Image as a Set | Zigang Geng et.al. | 2503.16425v1 | link |
2025-03-20 | GAEA: A Geolocation Aware Conversational Model | Ron Campos et.al. | 2503.16423v1 | null |
2025-03-20 | MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance | Quanhao Li et.al. | 2503.16421v1 | null |
2025-03-20 | SynCity: Training-Free Generation of 3D Worlds | Paul Engstler et.al. | 2503.16420v1 | null |
2025-03-20 | Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models | Yang Sui et.al. | 2503.16419v1 | null |
2025-03-19 | More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns | Kushagra Gupta et.al. | 2503.15486v1 | null |
2025-03-19 | TULIP: Towards Unified Language-Image Pretraining | Zineng Tang et.al. | 2503.15485v1 | null |
2025-03-19 | Value Profiles for Encoding Human Variation | Taylor Sorensen et.al. | 2503.15484v1 | null |
2025-03-19 | Emergent coding phases and hardware-tailored quantum codes | Gaurav Gyawali et.al. | 2503.15483v1 | null |
2025-03-19 | Natural Quantization of Neural Networks | Richard Barney et.al. | 2503.15482v1 | null |
2025-03-19 | Learning to Play Piano in the Real World | Yves-Simon Zeulner et.al. | 2503.15481v1 | null |
2025-03-19 | The Cauchy problem for nonlinear dispersive models of long internal waves in the presence of the Coriolis force | Ricardo Freire et.al. | 2503.15480v1 | null |
2025-03-19 | Deep Mantle-Atmosphere Coupling and Carbonaceous Bombardment: Options for Biomolecule Formation on an Oxidized Early Earth | Klaus Paschek et.al. | 2503.15479v1 | null |
2025-03-19 | SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Yifei Zhou et.al. | 2503.15478v1 | null |
2025-03-19 | What Makes a Reward Model a Good Teacher? An Optimization Perspective | Noam Razin et.al. | 2503.15477v1 | null |
2025-03-18 | MusicInfuser: Making Video Diffusion Listen and Dance | Susung Hong et.al. | 2503.14505v1 | null |
2025-03-18 | Aligning Multimodal LLM with Human Preference: A Survey | Tao Yu et.al. | 2503.14504v1 | null |
2025-03-18 | The Power of Context: How Multimodality Improves Image Super-Resolution | Kangfu Mei et.al. | 2503.14503v1 | null |
2025-03-19 | Advances in 4D Generation: A Survey | Qiaowei Miao et.al. | 2503.14501v2 | null |
2025-03-18 | Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | Gihan Jayatilaka et.al. | 2503.14500v1 | null |
2025-03-18 | Measuring AI Ability to Complete Long Tasks | Thomas Kwa et.al. | 2503.14499v1 | null |
2025-03-18 | Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Ayesha Ishaq et.al. | 2503.14498v1 | null |
2025-03-18 | Strong local uniqueness for the vacant set of random interlacements | Subhajit Goswami et.al. | 2503.14497v1 | null |
2025-03-18 | Temporal Consistency for LLM Reasoning Process Error Identification | Jiacheng Guo et.al. | 2503.14495v1 | null |
2025-03-18 | Deeply Supervised Flow-Based Generative Models | Inkyu Shin et.al. | 2503.14494v1 | null |
2025-03-17 | MetaScale: Test-Time Scaling with Evolving Meta-Thoughts | Qin Liu et.al. | 2503.13447v1 | null |
2025-03-17 | MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation | Zhenyu Wu et.al. | 2503.13446v1 | null |
2025-03-17 | Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance | Noah Y. Siegel et.al. | 2503.13445v1 | null |
2025-03-17 | VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning | Ye Liu et.al. | 2503.13444v1 | null |
2025-03-17 | DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models | Haoyang Li et.al. | 2503.13443v1 | null |
2025-03-17 | Humanoid Policy ~ Human Policy | Ri-Zhao Qiu et.al. | 2503.13441v1 | null |
2025-03-18 | MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Yingyue Li et.al. | 2503.13440v2 | link |
2025-03-17 | Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images | Tianhao Wu et.al. | 2503.13439v1 | null |
2025-03-17 | Deep Belief Markov Models for POMDP Inference | Giacomo Arcieri et.al. | 2503.13438v1 | null |
2025-03-17 | Unified Autoregressive Visual Generation and Understanding with Continuous Tokens | Lijie Fan et.al. | 2503.13436v1 | null |
2025-03-14 | Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation | Hiroyasu Akada et.al. | 2503.11652v1 | null |
2025-03-14 | VGGT: Visual Geometry Grounded Transformer | Jianyuan Wang et.al. | 2503.11651v1 | null |
2025-03-14 | Centaur: Robust End-to-End Autonomous Driving with Test-Time Training | Chonghao Sima et.al. | 2503.11650v1 | null |
2025-03-14 | Scalable Video Conferencing Using SDN Principles | Oliver Michel et.al. | 2503.11649v1 | null |
2025-03-14 | ReCamMaster: Camera-Controlled Generative Rendering from A Single Video | Jianhong Bai et.al. | 2503.11647v1 | null |
2025-03-14 | Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning | Siyuan Huang et.al. | 2503.11646v1 | null |
2025-03-14 | Mechanical Sensors for Ultraheavy Dark Matter Searches via Long-range Forces | Juehang Qin et.al. | 2503.11645v1 | null |
2025-03-14 | The waves-in-space Purcell effect for superconducting qubits | Param Patel et.al. | 2503.11644v1 | null |
2025-03-14 | From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets | P. Campeti et.al. | 2503.11643v1 | null |
2025-03-14 | Ladder Operator Block-Encoding | William A. Simon et.al. | 2503.11641v1 | null |
2025-03-13 | GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Rongyao Fang et.al. | 2503.10639v1 | link |
2025-03-13 | Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective | Xiaoming Zhao et.al. | 2503.10638v1 | null |
2025-03-14 | Distilling Diversity and Control in Diffusion Models | Rohit Gandikota et.al. | 2503.10637v2 | null |
2025-03-14 | The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation | Ho Kei Cheng et.al. | 2503.10636v2 | link |
2025-03-13 | A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 | Zhaoyi Li et.al. | 2503.10635v1 | link |
2025-03-13 | Charting and Navigating Hugging Face's Model Atlas | Eliahu Horwitz et.al. | 2503.10633v1 | null |
2025-03-13 | Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? | Subhajit Maity et.al. | 2503.10632v1 | null |
2025-03-13 | HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model | Jiaming Liu et.al. | 2503.10631v1 | null |
2025-03-13 | UniGoal: Towards Universal Zero-shot Goal-oriented Navigation | Hang Yin et.al. | 2503.10630v1 | null |
2025-03-13 | Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology | Hashmat Shadab Malik et.al. | 2503.10629v1 | link |
2025-03-12 | Odd-parity altermagnetism through sublattice currents: From Haldane-Hubbard model to general bipartite lattices | Yu-Ping Lin et.al. | 2503.09602v1 | null |
2025-03-13 | RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling | Itay Chachy et.al. | 2503.09601v2 | null |
2025-03-12 | MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System | Jihao Zhao et.al. | 2503.09600v1 | null |
2025-03-12 | Hints of Primordial Magnetic Fields at Recombination and Implications for the Hubble Tension | Karsten Jedamzik et.al. | 2503.09599v1 | null |
2025-03-12 | How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Ruohao Guo et.al. | 2503.09598v1 | null |
2025-03-12 | PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop | Chenyu Li et.al. | 2503.09595v1 | null |
2025-03-12 | SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Katrin Renz et.al. | 2503.09594v1 | null |
2025-03-12 | A Monte Carlo approach for finding optimally controlled quantum gates with differential geometry | Adonai Hilário da Silva et.al. | 2503.09593v1 | null |
2025-03-12 | Parsing the Language of Expression: Enhancing Symbolic Regression with Domain-Aware Symbolic Priors | Sikai Huang et.al. | 2503.09592v1 | null |
2025-03-13 | BIMBA: Selective-Scan Compression for Long-Range Video Question Answering | Md Mohaiminul Islam et.al. | 2503.09590v2 | null |
2025-03-11 | QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension | Yongdong Luo et.al. | 2503.08689v1 | null |
2025-03-11 | Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs | Ariba Khan et.al. | 2503.08688v1 | null |
2025-03-11 | OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models | Jialv Zou et.al. | 2503.08686v1 | null |
2025-03-11 | "Principal Components" Enable A New Language of Images | Xin Wen et.al. | 2503.08685v1 | null |
2025-03-11 | Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents | Haoyu Wang et.al. | 2503.08684v1 | null |
2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683v1 | null |
2025-03-11 | Self-Taught Self-Correction for Small Language Models | Viktor Moskvoretskii et.al. | 2503.08681v1 | null |
2025-03-11 | Chain-of-Thought Reasoning In The Wild Is Not Always Faithful | Iván Arcuschin et.al. | 2503.08679v1 | null |
2025-03-11 | GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing | Yuanhao Wang et.al. | 2503.08678v1 | null |
2025-03-12 | OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting | Yongsheng Yu et.al. | 2503.08677v2 | null |
2025-03-10 | Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru | Dunant Cusipuma et.al. | 2503.07587v1 | null |
2025-03-10 | Slow-fast systems with stochastic resetting | Paul C Bressloff et.al. | 2503.07585v1 | null |
2025-03-10 | Talking to GDELT Through Knowledge Graphs | Audun Myers et.al. | 2503.07584v1 | null |
2025-03-10 | Towards construction of superintegrable basis in matrix models | Azheev Batukhan et.al. | 2503.07583v1 | null |
2025-03-10 | Complexity Analysis of Environmental Time Series | Holger Lange et.al. | 2503.07582v1 | null |
2025-03-10 | Neural Combinatorial Optimization via Preference Optimization | Zijun Liao et.al. | 2503.07580v1 | null |
2025-03-10 | Phase Diagram of the Non-Reciprocal Cahn-Hilliard Model and the Effects of Symmetry | Martin Kjøllesdal Johnsrud et.al. | 2503.07579v1 | null |
2025-03-10 | Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation | Tianyu Chen et.al. | 2503.07578v1 | null |
2025-03-10 | Analyzing Symmetries of Swarms of Mobile Robots Using Equivariant Dynamical Systems | Raphael Gerlach et.al. | 2503.07576v1 | null |
2025-03-10 | VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models | Jen-tse Huang et.al. | 2503.07575v1 | null |
2025-03-10 | Securing External Deeper-than-black-box GPAI Evaluations | Alejandro Tlaie et.al. | 2503.07496v1 | null |
2025-03-10 | Finite deformations induce friction hysteresis in normal wavy contacts | M. Ceglie et.al. | 2503.07495v1 | null |
2025-03-10 | Composition effect in the thermo-mechanical behavior of glasses, and its modelization | Rene Alvarez-Donado et.al. | 2503.07494v1 | null |
2025-03-10 | V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Guiwei Zhang et.al. | 2503.07493v1 | null |
2025-03-10 | First generation 4H-SiC LGAD production and its performance evaluation | Radek Novotný et.al. | 2503.07490v1 | null |
2025-03-10 | High-order persistence of resonant caustics in perturbed circular billiards | Comlan Edmond Koudjinan et.al. | 2503.07488v1 | null |
2025-03-10 | LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? | Bangyan Li et.al. | 2503.07487v1 | null |
2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485v1 | null |
2025-03-10 | Euclid: Early Release Observations -- The Intracluster Light of Abell 2390 | A. Ellien et.al. | 2503.07484v1 | null |
2025-03-10 | Efficient Membership Inference Attacks by Bayesian Neural Network | Zhenlong Liu et.al. | 2503.07482v1 | null |
2025-03-07 | Fast and memory efficient strong simulation of noisy adaptive linear optical circuits | Timothée Goubault de Brugière et.al. | 2503.05699v1 | null |
2025-03-07 | Quantum State Designs from Minimally Random Quantum Circuits | Jonathon Riddell et.al. | 2503.05698v1 | null |
2025-03-07 | Multi-Fidelity Policy Gradient Algorithms | Xinjie Liu et.al. | 2503.05696v1 | null |
2025-03-07 | On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations | Vittorio Bilò et.al. | 2503.05695v1 | null |
2025-03-07 | The discovery and characterization of Earth-crossing asteroid 2024 YR$_4$ | Bryce T. Bolin et.al. | 2503.05694v1 | null |
2025-03-07 | Dynamics of disordered quantum systems with two- and three-dimensional tensor networks | Joseph Tindall et.al. | 2503.05693v1 | null |
2025-03-07 | Reionization and the Hubble Constant: Correlations in the Cosmic Microwave Background | Itamar J. Allali et.al. | 2503.05691v1 | null |
2025-03-10 | GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Zebin Xing et.al. | 2503.05689v2 | null |
2025-03-07 | Quantum gas microscopy of three-flavor Hubbard systems | Jirayu Mongkolkiattichai et.al. | 2503.05687v1 | null |
2025-03-07 | First order non-instantaneous corrections in collisional kinetic alignment models | Laura Kanzler et.al. | 2503.05686v1 | null |
2025-03-06 | Double Narrow-Line Signatures of Dark Matter Decay and New Constraints from XRISM Observations | Wen Yin et.al. | 2503.04726v1 | null |
2025-03-06 | L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling | Zhuo Chen et.al. | 2503.04725v1 | null |
2025-03-06 | LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM | Sambal Shikhar et.al. | 2503.04724v1 | null |
2025-03-07 | Shifting Long-Context LLMs Research from Input to Output | Yuhao Wu et.al. | 2503.04723v2 | null |
2025-03-06 | Enough Coin Flips Can Make LLMs Act Bayesian | Ritwik Gupta et.al. | 2503.04722v1 | null |
2025-03-06 | Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Guan-Ting Lin et.al. | 2503.04721v1 | null |
2025-03-06 | FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Yue Gao et.al. | 2503.04720v1 | null |
2025-03-06 | Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation | David T. Hoffmann et.al. | 2503.04718v1 | null |
2025-03-06 | MIGHTEE: exploring the relationship between spectral index, redshift and radio luminosity | Siddhant Pinjarkar et.al. | 2503.04717v1 | null |
2025-03-06 | Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Houyi Li et.al. | 2503.04715v1 | null |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751v1 | link |
2025-03-05 | The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems | Richard Ren et.al. | 2503.03750v1 | null |
2025-03-05 | Quantum effects on pyrochlore higher-rank U(1) spin liquids: pinch-line singularities, spin nematics, and connections to oxide materials | Lasse Gresista et.al. | 2503.03749v1 | null |
2025-03-05 | Searching for continuous gravitational waves from highly deformed compact objects with DECIGO | Andrew L. Miller et.al. | 2503.03748v1 | null |
2025-03-05 | PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning | Ryozo Masukawa et.al. | 2503.03747v1 | null |
2025-03-05 | Process-based Self-Rewarding Language Models | Shimao Zhang et.al. | 2503.03746v1 | null |
2025-03-05 | Constrained Gaussian Wasserstein Optimal Transport with Commutative Covariance Matrices | Jun Chen et.al. | 2503.03744v1 | null |
2025-03-05 | CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning | Yuqi Zhou et.al. | 2503.03743v1 | null |
2025-03-05 | MaNGA AGN dwarf galaxies (MAD) -- III. The role of mergers and environment in AGN activity in dwarf galaxies | A. Eróstegui et.al. | 2503.03742v1 | null |
2025-03-05 | Comparison of Experimental and Theoretical Mechanical Jitter in a THz Communication Link | Ethan Abele et.al. | 2503.03740v1 | null |
2025-03-04 | ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models | Qinyu Zhao et.al. | 2503.02883v1 | null |
2025-03-04 | Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation | Han Xue et.al. | 2503.02881v1 | null |
2025-03-04 | A New | Purba Mukherjee et.al. | 2503.02880v1 | null |
2025-03-04 | Wikipedia in the Era of LLMs: Evolution and Risks | Siming Huang et.al. | 2503.02879v1 | null |
2025-03-04 | Language Models can Self-Improve at State-Value Estimation for Better Search | Ethan Mendes et.al. | 2503.02878v1 | null |
2025-03-04 | Weak-to-Strong Generalization Even in Random Feature Networks, Provably | Marko Medvedev et.al. | 2503.02877v1 | null |
2025-03-04 | SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models | Dmitry Nechaev et.al. | 2503.02876v1 | null |
2025-03-04 | The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models | Ke Ji et.al. | 2503.02875v1 | null |
2025-03-04 | Prompting Generative AI with Interaction-Augmented Instructions | Leixian Shen et.al. | 2503.02874v1 | null |
2025-03-05 | Multiaccuracy and Multicalibration via Proxy Groups | Beepul Bharti et.al. | 2503.02870v2 | null |
2025-02-28 | LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Komal Kumar et.al. | 2502.21321v1 | null |
2025-02-28 | Topological Quantum Dark Matter via Global Anomaly Cancellation | Juven Wang et.al. | 2502.21319v1 | null |
2025-02-28 | How far can we go with ImageNet for Text-to-Image generation? | L. Degeorge et.al. | 2502.21318v1 | null |
2025-02-28 | Assessing zero-shot generalisation behaviour in graph-neural-network interatomic potentials | Chiheb Ben Mahmoud et.al. | 2502.21317v1 | null |
2025-02-28 | Doping dependence of 2-spinon excitations in the doped 1D cuprate Ba$_2$CuO$_{3+δ}$ | Jiarui Li et.al. | 2502.21316v1 | null |
2025-02-28 | Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos | Zhiyu Tan et.al. | 2502.21314v1 | null |
2025-02-28 | Unsupervised Parameter Efficient Source-free Post-pretraining | Abhishek Jha et.al. | 2502.21313v1 | null |
2025-02-28 | AutoComb: Automated Comb Sign Detector for 3D CTE Scans | Shashwat Gupta et.al. | 2502.21311v1 | null |
2025-02-28 | FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Yihong Dong et.al. | 2502.21309v1 | null |
2025-02-28 | Duality Theory for Bounded Lattices: A Comparative Study | Guram Bezhanishvili et.al. | 2502.21307v1 | null |
2025-02-27 | A physically motivated galaxy size definition across different state-of-the-art hydrodynamical simulations | Elena Arjona-Galvez et.al. | 2502.20398v1 | null |
2025-02-27 | Mechanics on flag manifolds | Andrew Kuzovchikov et.al. | 2502.20397v1 | null |
2025-02-27 | Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids | Toru Lin et.al. | 2502.20396v1 | null |
2025-02-27 | R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts | Zhongyang Li et.al. | 2502.20395v1 | null |
2025-02-27 | Superconductivity in doped planar Dirac insulators: A renormalization group study | Sk Asrap Murshed et.al. | 2502.20394v1 | null |
2025-02-27 | Walking the Web of Concept-Class Relationships in Incrementally Trained Interpretable Models | Susmit Agrawal et.al. | 2502.20393v1 | null |
2025-02-27 | Scalable Signature Kernel Computations for Long Time Series via Local Neumann Series Expansions | Matthew Tamayo-Rios et.al. | 2502.20392v1 | null |
2025-02-27 | Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation | Siddhant Haldar et.al. | 2502.20391v1 | null |
2025-02-27 | InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions | Sirui Xu et.al. | 2502.20390v1 | null |
2025-02-27 | LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding | Ang Cao et.al. | 2502.20389v1 | null |
2025-02-26 | Work and heat exchanged during sudden quenches of strongly coupled quantum systems | Zohreh Davoudi et.al. | 2502.19418v1 | null |
2025-02-26 | Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models | Lucy Xiaoyang Shi et.al. | 2502.19417v1 | null |
2025-02-26 | Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing | Akshat Gupta et.al. | 2502.19416v1 | null |
2025-02-26 | Seimei KOOLS-IFU mapping of the gas and dust distributions in Galactic PNe: the origin and evolution of DdDm1 | Masaaki Otsuka et.al. | 2502.19415v1 | null |
2025-02-26 | Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation | Shiven Sinha et.al. | 2502.19414v1 | null |
2025-02-26 | Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs | Christoph Schuhmann et.al. | 2502.19413v1 | null |
2025-02-26 | The Mighty ToRR: A Benchmark for Table Reasoning and Robustness | Shir Ashury-Tahan et.al. | 2502.19412v1 | null |
2025-02-26 | Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs | Dayu Yang et.al. | 2502.19411v1 | null |
2025-02-26 | Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices | Xinru Wang et.al. | 2502.19410v1 | null |
2025-02-26 | ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models | Danae Sánchez Villegas et.al. | 2502.19409v1 | null |
2025-02-25 | Allocating Variance to Maximize Expectation | Renato Purita Paes Leme et.al. | 2502.18463v1 | null |
2025-02-25 | Scalable Equilibrium Sampling with Sequential Boltzmann Generators | Charlie B. Tan et.al. | 2502.18462v1 | null |
2025-02-25 | K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs | Ziheng Ouyang et.al. | 2502.18461v1 | null |
2025-02-25 | DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers | Xueguang Ma et.al. | 2502.18460v1 | null |
2025-02-25 | Spectral modelling of Cygnus A between 110 and 250 MHz. Impact on the LOFAR 21-cm signal power spectrum | E. Ceccotti et.al. | 2502.18459v1 | null |
2025-02-25 | LLM-Based Design Pattern Detection | Christian Schindler et.al. | 2502.18458v1 | null |
2025-02-25 | Towards a Composite Framework for Simultaneous Exploration of New Physics in Background and Perturbed Universes | Shibendu Gupta Choudhury et.al. | 2502.18457v1 | null |
2025-02-25 | Assessing the Maturity of Cybersecurity Education in Virginia and the Impact of State Level Investment | Patrick Mero et.al. | 2502.18456v1 | null |
2025-02-25 | Evaluating the Effectiveness of Small Language Models in Detecting Refactoring Bugs | Rohit Gheyi et.al. | 2502.18454v1 | null |
2025-02-25 | Shift orbifolds, decompactification limits, and lattices | Dan Israel et.al. | 2502.18453v1 | null |
2025-02-24 | Fractal Generative Models | Tianhong Li et.al. | 2502.17437v1 | link |
2025-02-24 | Towards Hierarchical Rectified Flow | Yichi Zhang et.al. | 2502.17436v1 | link |
2025-02-24 | GCC: Generative Color Constancy via Diffusing a Color Checker | Chen-Wei Chang et.al. | 2502.17435v1 | null |
2025-02-24 | V-HOP: Visuo-Haptic 6D Object Pose Tracking | Hongyu Li et.al. | 2502.17434v1 | null |
2025-02-24 | Dynamical phases of short-term memory mechanisms in RNNs | Bariscan Kurtkaya et.al. | 2502.17433v1 | null |
2025-02-24 | FACTR: Force-Attending Curriculum Training for Contact-Rich Policy Learning | Jason Jingzhou Liu et.al. | 2502.17432v1 | null |
2025-02-24 | Joint Beamforming and 3D Location Optimization for Multi-User Holographic UAV Communications | Chandan Kumar Sheemar et.al. | 2502.17428v1 | null |
2025-02-24 | Mind the gap: addressing data gaps and assessing noise mismodeling in LISA | Ollie Burke et.al. | 2502.17426v1 | null |
2025-02-24 | Introducing Visual Perception Token into Multimodal Large Language Model | Runpeng Yu et.al. | 2502.17425v1 | link |
2025-02-24 | Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs | Jan Betley et.al. | 2502.17424v1 | link |
2025-02-21 | ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval | Guanqi Zhan et.al. | 2502.15682v1 | null |
2025-02-21 | One-step Diffusion Models with | Yilun Xu et.al. | 2502.15681v1 | null |
2025-02-21 | Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Jaydeep Borkar et.al. | 2502.15680v1 | null |
2025-02-21 | BOSS: Benchmark for Observation Space Shift in Long-Horizon Task | Yue Yang et.al. | 2502.15679v1 | null |
2025-02-21 | Testing the limits of fine-tuning to improve reasoning in vision language models | Luca M. Schulze Buschoff et.al. | 2502.15678v1 | null |
2025-02-21 | FLEKE: Federated Locate-then-Edit Knowledge Editing | Zongkai Zhao et.al. | 2502.15677v1 | null |
2025-02-21 | AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Zhining Zhang et.al. | 2502.15676v1 | null |
2025-02-21 | On rational connectedness and parametrization of finite Galois extensions | Daniel Krashen et.al. | 2502.15674v1 | null |
2025-02-21 | Blow-up rate of solution to generalised Blasius equation | Guillaume Blanc et.al. | 2502.15673v1 | null |
2025-02-21 | VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Florent Bartoccioni et.al. | 2502.15672v1 | link |
2025-02-20 | Emergence of Fermi's Golden Rule in the Probing of a Quantum Many-Body System | Jianyi Chen et.al. | 2502.14867v1 | null |
2025-02-20 | LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang et.al. | 2502.14866v1 | null |
2025-02-20 | Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts | Sara Ghaboura et.al. | 2502.14865v1 | null |
2025-02-20 | Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework | Yuming Yang et.al. | 2502.14864v1 | null |
2025-02-20 | The Fourier coefficients of the holomorphic multiplicative chaos in the limit of large frequency | Joseph Najnudel et.al. | 2502.14863v1 | null |
2025-02-20 | Interpretable Text Embeddings and Text Similarity Explanation: A Primer | Juri Opitz et.al. | 2502.14862v1 | null |
2025-02-20 | Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Shuyue Stella Li et.al. | 2502.14860v1 | null |
2025-02-20 | Taming Recoil Effect in Cavity-Assisted Quantum Interconnects | Seigo Kikura et.al. | 2502.14859v1 | null |
2025-02-20 | The | Tongmu He et.al. | 2502.14858v1 | null |
2025-02-20 | FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling | Weilin Zhao et.al. | 2502.14856v1 | null |
2025-02-19 | FlexTok: Resampling Images into 1D Token Sequences of Flexible Length | Roman Bachmann et.al. | 2502.13967v1 | null |
2025-02-20 | Where's the Bug? Attention Probing for Scalable Fault Localization | Adam Stein et.al. | 2502.13966v2 | null |
2025-02-19 | Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Michael Luo et.al. | 2502.13965v1 | null |
2025-02-19 | A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects | Arjun Gupta et.al. | 2502.13964v1 | null |
2025-02-19 | MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads | Weihao Liu et.al. | 2502.13963v1 | null |
2025-02-19 | Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering | William Jurayj et.al. | 2502.13962v1 | null |
2025-02-19 | The Computational Advantage of Depth: Learning High-Dimensional Hierarchical Functions with Gradient Descent | Yatin Dandi et.al. | 2502.13961v1 | null |
2025-02-19 | Extended | Pietro Borchia et.al. | 2502.13960v1 | null |
2025-02-19 | LIDDIA: Language-based Intelligent Drug Discovery Agent | Reza Averly et.al. | 2502.13959v1 | null |
2025-02-19 | Local and Non-local Entanglement Witnesses of Fermi Liquid | Yiming Wang et.al. | 2502.13958v1 | null |
2025-02-18 | Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization | Shuo Xing et.al. | 2502.13146v1 | null |
2025-02-18 | Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation | Bencheng Liao et.al. | 2502.13145v1 | null |
2025-02-18 | SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation | Zekun Qi et.al. | 2502.13143v1 | null |
2025-02-18 | Pre-training Auto-regressive Robotic Models with 4D Representations | Dantong Niu et.al. | 2502.13142v1 | null |
2025-02-18 | UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models | Huawei Lin et.al. | 2502.13141v1 | null |
2025-02-18 | Towards Quantum Tensor Decomposition in Biomedical Applications | Myson Burch et.al. | 2502.13140v1 | null |
2025-02-18 | AIDE: AI-Driven Exploration in the Space of Code | Zhengyao Jiang et.al. | 2502.13138v1 | null |
2025-02-18 | Theorem Prover as a Judge for Synthetic Data Generation | Joshua Ong Jun Leang et.al. | 2502.13137v1 | null |
2025-02-18 | Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions | Taedong Yun et.al. | 2502.13135v1 | null |
2025-02-18 | RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations | Jingxiao Chen et.al. | 2502.13134v1 | null |
2025-02-17 | Sampling the full hierarchical population posterior distribution in gravitational-wave astronomy | Michele Mancarella et.al. | 2502.12156v1 | null |
2025-02-17 | Observation of a zero-energy excitation mode in the open Dicke model | Anton Bolian et.al. | 2502.12155v1 | null |
2025-02-17 | Diffusion Models without Classifier-free Guidance | Zhicong Tang et.al. | 2502.12154v1 | link |
2025-02-17 | Gravitational waves from the Axiverse | Saurav Das et.al. | 2502.12153v1 | null |
2025-02-17 | Learning Getting-Up Policies for Real-World Humanoid Robots | Xialin He et.al. | 2502.12152v1 | null |
2025-02-17 | Idiosyncrasies in Large Language Models | Mingjie Sun et.al. | 2502.12150v1 | null |
2025-02-17 | HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation | Ling Yang et.al. | 2502.12148v1 | null |
2025-02-17 | Learning Smooth and Expressive Interatomic Potentials for Physical Property Prediction | Xiang Fu et.al. | 2502.12147v1 | null |
2025-02-17 | Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening | Ye Tian et.al. | 2502.12146v1 | null |
2025-02-17 | Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control | Jinyan Su et.al. | 2502.12145v1 | null |
2025-02-14 | MM-RLHF: The Next Step Forward in Multimodal LLM Alignment | Yi-Fan Zhang et.al. | 2502.10391v1 | null |
2025-02-14 | (How) Can Transformers Predict Pseudo-Random Numbers? | Tao Tao et.al. | 2502.10390v1 | null |
2025-02-14 | Region-Adaptive Sampling for Diffusion Transformers | Ziming Liu et.al. | 2502.10389v1 | null |
2025-02-14 | Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction | WonJin Yoon et.al. | 2502.10388v1 | null |
2025-02-14 | Unconventional Transport in a System with a Tower of Quantum Many-Body Scars | Gianluca Morettini et.al. | 2502.10387v1 | null |
2025-02-14 | Simplifying DINO via Coding Rate Regularization | Ziyang Wu et.al. | 2502.10385v1 | null |
2025-02-14 | Scaling limit and tail bounds for a random walk model of SOS level lines | Milind Hegde et.al. | 2502.10384v1 | null |
2025-02-14 | Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data | Corinna Cortes et.al. | 2502.10381v1 | null |
2025-02-14 | Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models | Jiexin Ding et.al. | 2502.10378v1 | null |
2025-02-14 | ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences | Liyuan Zhu et.al. | 2502.10377v1 | null |
2025-02-13 | Theoretical Benefit and Limitation of Diffusion Language Model | Guhao Feng et.al. | 2502.09622v1 | null |
2025-02-13 | MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency | Dongzhi Jiang et.al. | 2502.09621v1 | null |
2025-02-13 | Exploring the Potential of Encoder-free Architectures in 3D LMMs | Yiwen Tang et.al. | 2502.09620v1 | null |
2025-02-13 | Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights | Jonathan Kahana et.al. | 2502.09619v1 | null |
2025-02-13 | Variational Rectified Flow Matching | Pengsheng Guo et.al. | 2502.09616v1 | null |
2025-02-13 | RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets | Isabella Liu et.al. | 2502.09615v1 | null |
2025-02-13 | DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References | Xueyi Liu et.al. | 2502.09614v1 | null |
2025-02-13 | Latent Radiance Fields with 3D-aware 2D Representations | Chaoyi Zhou et.al. | 2502.09613v1 | null |
2025-02-13 | Superspin Renormalization and Slow Relaxation in Random Spin Systems | Yi J. Zhao et.al. | 2502.09612v1 | null |
2025-02-13 | Designing a Conditional Prior Distribution for Flow-Based Generative Models | Noam Issachar et.al. | 2502.09611v1 | null |
2025-02-12 | Poly-Autoregressive Prediction for Modeling Interactions | Neerja Thakkar et.al. | 2502.08646v1 | null |
2025-02-13 | Re$^3$Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation | Xiaoshen Han et.al. | 2502.08645v2 | null |
2025-02-13 | Rhythmic sharing: A bio-inspired paradigm for zero-shot adaptation and learning in neural networks | Hoony Kang et.al. | 2502.08644v2 | null |
2025-02-12 | A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards | Shivansh Patel et.al. | 2502.08643v1 | null |
2025-02-12 | SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation | Ellie Arar et.al. | 2502.08642v1 | null |
2025-02-12 | Constructing optimal Wannier functions via potential theory: isolated single band for matrix models | Hanwen Zhang et.al. | 2502.08641v1 | null |
2025-02-12 | Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs | Mantas Mazeika et.al. | 2502.08640v1 | null |
2025-02-12 | CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation | Qinghe Wang et.al. | 2502.08639v1 | null |
2025-02-12 | Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Andrianos Michail et.al. | 2502.08638v1 | null |
2025-02-13 | PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Xingrui Wang et.al. | 2502.08636v2 | null |
2025-02-11 | Pippo: High-Resolution Multi-View Humans from a Single Image | Yash Kant et.al. | 2502.07785v1 | null |
2025-02-11 | MatSwap: Light-aware material transfers in images | Ivan Lopes et.al. | 2502.07784v1 | null |
2025-02-11 | Curvature Tuning: Provable Training-free Model Steering From a Single Parameter | Leyang Hu et.al. | 2502.07783v1 | null |
2025-02-11 | A Flag Decomposition for Hierarchical Datasets | Nathan Mankovich et.al. | 2502.07782v1 | null |
2025-02-11 | Detectability of dark matter subhalo impacts in Milky Way stellar streams | Junyang Lu et.al. | 2502.07781v1 | null |
2025-02-11 | DarwinLM: Evolutionary Structured Pruning of Large Language Models | Shengkun Tang et.al. | 2502.07780v1 | null |
2025-02-11 | Quantum-driven Zero Trust Framework with Dynamic Anomaly Detection in 7G Technology: A Neural Network Approach | Shakil Ahmed et.al. | 2502.07779v1 | null |
2025-02-11 | Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection | Anirudh Sundara Rajan et.al. | 2502.07778v1 | null |
2025-02-11 | Feasibility study of multiplexing analog signals from SiPMs for a single layer monolithic PET detector design | Shiv K. Subedi et.al. | 2502.07777v1 | null |
2025-02-11 | Auditing Prompt Caching in Language Model APIs | Chenchen Gu et.al. | 2502.07776v1 | null |
2025-02-10 | EVEv2: Improved Baselines for Encoder-Free Vision-Language Models | Haiwen Diao et.al. | 2502.06788v1 | null |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787v1 | null |
2025-02-10 | Matryoshka Quantization | Pranav Nair et.al. | 2502.06786v1 | null |
2025-02-10 | DeepCrossAttention: Supercharging Transformer Residual Connections | Mike Heddes et.al. | 2502.06785v1 | null |
2025-02-10 | RelGNN: Composite Message Passing for Relational Deep Learning | Tianlang Chen et.al. | 2502.06784v1 | null |
2025-02-10 | Bunch-Davies initial conditions and non-perturbative inflationary dynamics in Numerical Relativity | Yoann L. Launay et.al. | 2502.06783v1 | null |
2025-02-10 | Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT | Dongyang Liu et.al. | 2502.06782v1 | null |
2025-02-10 | Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning | Chengqi Lyu et.al. | 2502.06781v1 | null |
2025-02-10 | KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification | Yue Zhu et.al. | 2502.06779v1 | null |
2025-02-10 | ALMACAL XIII. Evolution of the CO luminosity function and the molecular gas mass density out to | Victoria Bollo et.al. | 2502.06778v1 | null |
2025-02-07 | FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation | Shilong Zhang et.al. | 2502.05179v1 | null |
2025-02-07 | QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation | Yue Zhao et.al. | 2502.05178v1 | null |
2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy | Yunhang Shen et.al. | 2502.05177v1 | null |
2025-02-07 | AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting | Chung-Ho Wu et.al. | 2502.05176v1 | null |
2025-02-07 | Fillerbuster: Multi-View Scene Completion for Casual Captures | Ethan Weber et.al. | 2502.05175v1 | null |
2025-02-07 | MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison | Kaijie Zhu et.al. | 2502.05174v1 | null |
2025-02-07 | Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient | Jan Ludziejewski et.al. | 2502.05172v1 | null |
2025-02-07 | Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach | Jonas Geiping et.al. | 2502.05171v1 | null |
2025-02-07 | Observation of a dynamic magneto-chiral instability in photoexcited tellurium | Yijing Huang et.al. | 2502.05170v1 | null |
2025-02-07 | NoLiMa: Long-Context Evaluation Beyond Literal Matching | Ali Modarressi et.al. | 2502.05167v1 | null |
2025-02-06 | Geometrical frustration, power law tunneling and non-local gauge fields from scattered light | Pavel P. Popov et.al. | 2502.04330v1 | null |
2025-02-06 | SMART: Advancing Scalable Map Priors for Driving Topology Reasoning | Junjie Ye et.al. | 2502.04329v1 | null |
2025-02-06 | Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment | Zuyan Liu et.al. | 2502.04328v1 | null |
2025-02-06 | WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs | Jack Hong et.al. | 2502.04326v1 | null |
2025-02-06 | Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness | Karolina Rudnicka et.al. | 2502.04324v1 | null |
2025-02-06 | The Uniformly Rotated Mondrian Kernel | Calvin Osborne et.al. | 2502.04323v1 | link |
2025-02-06 | Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions | Yik Siu Chan et.al. | 2502.04322v1 | null |
2025-02-06 | ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features | Alec Helbling et.al. | 2502.04320v1 | null |
2025-02-06 | On the origin of the | Andrea Cattaneo et.al. | 2502.04319v1 | null |
2025-02-06 | sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | Eyvaz Najafli et.al. | 2502.04318v1 | null |
2025-02-05 | On the origin of mid-infrared colors in | Raniere de Menezes et.al. | 2502.03466v1 | null |
2025-02-05 | Seeing World Dynamics in a Nutshell | Qiuhong Shen et.al. | 2502.03465v1 | null |
2025-02-05 | Improving the trivial bound for | Robert J. Lemke Oliver et.al. | 2502.03464v1 | null |
2025-02-05 | Cosmic Calipers: Precise and Accurate Neutron Star Radius Measurements with Next-Generation Gravitational Wave Detectors | Sanika Khadkikar et.al. | 2502.03463v1 | null |
2025-02-05 | Efficient Lindblad synthesis for noise model construction | Moein Malekakhlagh et.al. | 2502.03462v1 | null |
2025-02-05 | Do Large Language Model Benchmarks Test Reliability? | Joshua Vendrow et.al. | 2502.03461v1 | null |
2025-02-05 | Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Boyao Wang et.al. | 2502.03460v1 | null |
2025-02-05 | SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living | Arkaprava Sinha et.al. | 2502.03459v1 | null |
2025-02-06 | Clustering of the extreme: A theoretical description of weak lensing critical points power spectra in the mildly nonlinear regime | Zhengyangguang Gong et.al. | 2502.03457v2 | null |
2025-02-05 | DESI Strong Lens Foundry I: HST Observations and Modeling with GIGA-Lens | X. Huang et.al. | 2502.03455v1 | null |
2025-02-04 | Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling | Xiaowen Qiu et.al. | 2502.02590v1 | null |
2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589v1 | null |
2025-02-04 | Calibrated Multi-Preference Optimization for Aligning Diffusion Models | Kyungmin Lee et.al. | 2502.02588v1 | null |
2025-02-04 | New perspective on the multiple population phenomenon in Galactic globular clusters from a wide-field photometric survey | S. Jang et.al. | 2502.02585v1 | null |
2025-02-04 | QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search | Zongyu Lin et.al. | 2502.02584v1 | null |
2025-02-04 | DIISC -- VI (COS-DIISC): UV Metal Absorption Relative to the H I disk of Galaxies | Brad Koplitz et.al. | 2502.02583v1 | null |
2025-02-04 | Open Materials Generation with Stochastic Interpolants | Philipp Hoellmer et.al. | 2502.02582v1 | null |
2025-02-04 | Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism | Yuhao Qing et.al. | 2502.02581v1 | null |
2025-02-05 | Minimax-Optimal Dimension-Reduced Clustering for High-Dimensional Nonspherical Mixtures | Chengzhu Huang et.al. | 2502.02580v2 | null |
2025-02-04 | A new proof of superadditivity and of the density conjecture for Activated Random Walks on the line | Nicolas Forien et.al. | 2502.02579v1 | null |
2025-01-31 | Low-Rank Adapting Models for Sparse Autoencoders | Matthew Chen et.al. | 2501.19406v1 | link |
2025-01-31 | Spin oscillations of neutrinos scattered by the supermassive black hole in the galactic center | Mridupawan Deka et.al. | 2501.19404v1 | null |
2025-01-31 | Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach | Yingdan Shi et.al. | 2501.19403v1 | null |
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400v1 | link |
2025-01-31 | Scalable-Softmax Is Superior for Attention | Ken M. Nakanishi et.al. | 2501.19399v1 | null |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398v1 | link |
2025-01-31 | Fixed-Population Causal Inference for Models of Equilibrium | Konrad Menzel et.al. | 2501.19394v1 | null |
2025-02-03 | s1: Simple test-time scaling | Niklas Muennighoff et.al. | 2501.19393v2 | link |
2025-01-31 | Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models | Alina Shutova et.al. | 2501.19392v1 | null |
2025-01-31 | Perceptive Mixed-Integer Footstep Control for Underactuated Bipedal Walking on Rough Terrain | Brian Acosta et.al. | 2501.19391v1 | null |
2025-01-30 | Boosting galaxy clustering analyses with non-perturbative modelling of redshift-space distortions | Alexander Eggemeier et.al. | 2501.18597v1 | null |
2025-01-30 | DeltaLLM: Compress LLMs with Low-Rank Deltas between Shared Weights | Liana Mikaelyan et.al. | 2501.18596v1 | null |
2025-01-30 | Foundational Models for 3D Point Clouds: A Survey and Outlook | Vishal Thengane et.al. | 2501.18594v1 | null |
2025-01-30 | Diffusion Autoencoders are Scalable Image Tokenizers | Yinbo Chen et.al. | 2501.18593v1 | null |
2025-01-30 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592v1 | link |
2025-01-30 | Non-Hermitian catalysis of density-wave orders on Euclidean and hyperbolic lattices | Christopher A. Leong et.al. | 2501.18591v1 | null |
2025-01-30 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590v1 | null |
2025-01-30 | Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching | David Chuan-En Lin et.al. | 2501.18588v1 | null |
2025-01-30 | Entropy functionals and equilibrium states in mixed quantum-classical dynamics | Cesare Tronci et.al. | 2501.18587v1 | null |
2025-01-30 | Towards more accurate | Logan Roberts et.al. | 2501.18586v1 | null |
2025-01-29 | Direct Search signal of two-component Dark Matter | Subhaditya Bhattacharya et.al. | 2501.17862v1 | null |
2025-01-29 | Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations | Zijie Liu et.al. | 2501.17860v1 | null |
2025-01-29 | rEGGression: an Interactive and Agnostic Tool for the Exploration of Symbolic Regression Models | Fabricio Olivetti de Franca et.al. | 2501.17859v1 | null |
2025-01-29 | Improving Your Model Ranking on Chatbot Arena by Vote Rigging | Rui Min et.al. | 2501.17858v1 | link |
2025-01-29 | Planetesimal formation in a pressure bump induced by infall | Haichen Zhao et.al. | 2501.17857v1 | null |
2025-01-29 | GRACE: Generalizing Robot-Assisted Caregiving with User Functionality Embeddings | Ziang Liu et.al. | 2501.17855v1 | null |
2025-01-29 | Enriched Immersed Finite Element and Isogeometric Analysis -- Algorithms and Data Structures | Nils Wunsch et.al. | 2501.17853v1 | null |
2025-01-29 | Holographic Fluctuation-Dissipation Relations in Finite Density Systems | Shivam K. Sharma et.al. | 2501.17852v1 | null |
2025-01-29 | UGSim: Autonomous Buoyancy-Driven Underwater Glider Simulator with LQR Control Strategy and Recursive Guidance System | Zhizun Xu et.al. | 2501.17851v1 | null |
2025-01-29 | Twisted torus knots with Horadam parameters | Brandy Doleshal et.al. | 2501.17850v1 | null |
2025-01-28 | Nonlinear fitting of undersampled discrete datasets in astronomy | Igor Chilingarian et.al. | 2501.17163v1 | null |
2025-01-28 | CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation | Nikolai Kalischek et.al. | 2501.17162v1 | null |
2025-01-28 | SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training | Tianzhe Chu et.al. | 2501.17161v1 | null |
2025-01-28 | A Hybrid Deep Learning CNN Model for Enhanced COVID-19 Detection from Computed Tomography (CT) Scan Images | Suresh Babu Nettur et.al. | 2501.17160v1 | null |
2025-01-28 | IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait | Han Yang et.al. | 2501.17159v1 | null |
2025-01-28 | Keck and Gemini characterization of | Bryce T. Bolin et.al. | 2501.17156v1 | null |
2025-01-28 | Rational points and rational moduli spaces | Shijie Fan et.al. | 2501.17155v1 | null |
2025-01-28 | Phase-field modeling of radiation-induced composition redistribution: An application to additively manufactured austenitic Fe-Cr-Ni | Sourabh Bhagwan Kadambi et.al. | 2501.17154v1 | null |
2025-01-28 | Line-of-sight effects on double source plane lenses | Daniel Johnson et.al. | 2501.17153v1 | null |
2025-01-28 | Three-Dimensional Diffusion-Weighted Multi-Slab MRI With Slice Profile Compensation Using Deep Energy Model | Reza Ghorbani et.al. | 2501.17152v1 | null |
2025-01-27 | RelightVid: Temporal-Consistent Diffusion Model for Video Relighting | Ye Fang et.al. | 2501.16330v1 | null |
2025-01-27 | sDREAMER: Self-distilled Mixture-of-Modality-Experts Transformer for Automatic Sleep Staging | Jingyuan Chen et.al. | 2501.16329v1 | null |
2025-01-27 | LUCY: Linguistic Understanding and Control Yielding Early Stage of Her | Heting Gao et.al. | 2501.16327v1 | link |
2025-01-27 | Tailored Forecasting from Short Time Series via Meta-learning | Declan A. Norton et.al. | 2501.16325v1 | null |
2025-01-27 | Efficient evaluation of real-time path integrals | Job Feldbrugge et.al. | 2501.16323v1 | null |
2025-01-27 | Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture | Yikun Hou et.al. | 2501.16322v1 | null |
2025-01-27 | The SEA algorithm for endomorphisms of supersingular elliptic curves | Travis Morrison et.al. | 2501.16321v1 | null |
2025-01-27 | Adaptive Iterative Compression for High-Resolution Files: an Approach Focused on Preserving Visual Quality in Cinematic Workflows | Leonardo Melo et.al. | 2501.16319v1 | null |
2025-01-27 | A MARVEL-ous study of how well galaxy shapes reflect Dark Matter halo shapes in Cold Dark Matter Simulations | Blake Keith et.al. | 2501.16317v1 | null |
2025-01-27 | The Fundamental Theorem of Weak Optimal Transport | Mathias Beiglböck et.al. | 2501.16316v1 | null |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729v1 | link |
2025-01-24 | Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection | Zehong Yan et.al. | 2501.14728v1 | null |
2025-01-24 | Estimation-theoretic analysis of lensless imaging | Leyla A. Kabuli et.al. | 2501.14727v1 | null |
2025-01-24 | Relightable Full-Body Gaussian Codec Avatars | Shaofei Wang et.al. | 2501.14726v1 | null |
2025-01-24 | CodeMonkeys: Scaling Test-Time Compute for Software Engineering | Ryan Ehrlich et.al. | 2501.14723v1 | null |
2025-01-24 | Dualities between 2+1d fusion surface models from braided fusion categories | Luisa Eck et.al. | 2501.14722v1 | null |
2025-01-24 | Communication-Based Distributed Control of Large-Scale District Heating Networks | Audrey Blizard et.al. | 2501.14720v1 | null |
2025-01-24 | Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? | Ipek Baris Schlicht et.al. | 2501.14719v1 | null |
2025-01-24 | Gland Segmentation Using SAM With Cancer Grade as a Prompt | Yijie Zhu et.al. | 2501.14718v1 | null |
2025-01-24 | Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models | Naihao Deng et.al. | 2501.14717v1 | null |
2025-01-23 | Exponentially slow thermalization in 1D fragmented dynamics | Cheng Wang et.al. | 2501.13930v1 | null |
2025-01-23 | Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass | Jianing Yang et.al. | 2501.13928v1 | null |
2025-01-23 | CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation | Guofeng Cui et.al. | 2501.13927v1 | null |
2025-01-23 | Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step | Ziyu Guo et.al. | 2501.13926v1 | link |
2025-01-23 | GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing | Akashah Shabbir et.al. | 2501.13925v1 | link |
2025-01-23 | Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization | Hao Dong et.al. | 2501.13924v1 | link |
2025-01-23 | Hamiltonian Simulation via Stochastic Zassenhaus Expansions | Joseph Peetz et.al. | 2501.13922v1 | null |
2025-01-23 | The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities | Chan-Jan Hsu et.al. | 2501.13921v1 | null |
2025-01-23 | IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jiayi Lei et.al. | 2501.13920v1 | null |
2025-01-23 | Temporal Preference Optimization for Long-Form Video Understanding | Rui Li et.al. | 2501.13919v1 | null |
2025-01-22 | Accelerate High-Quality Diffusion Models with Inner Loop Feedback | Matthew Gwilliam et.al. | 2501.13107v1 | null |
2025-01-22 | VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding | Boqiang Zhang et.al. | 2501.13106v1 | link |
2025-01-22 | Neural Radiance Fields for the Real World: A Survey | Wenhui Xiao et.al. | 2501.13104v1 | null |
2025-01-22 | Achievability of Covert Quantum Communication | Evan J. D. Anderson et.al. | 2501.13103v1 | null |
2025-01-22 | Simulating quantum circuits with arbitrary local noise using Pauli Propagation | Armando Angrisani et.al. | 2501.13101v1 | null |
2025-01-22 | Which Sensor to Observe? Timely Tracking of a Joint Markov Source with Model Predictive Control | Ismail Cosandal et.al. | 2501.13099v1 | null |
2025-01-22 | Sunny.jl: A Julia Package for Spin Dynamics | David Dahlbom et.al. | 2501.13095v1 | null |
2025-01-22 | Robust Representation Consistency Model via Contrastive Denoising | Jiachen Lei et.al. | 2501.13094v1 | link |
2025-01-22 | Volume preserving mean curvature flow of round surfaces in asymptotically flat spaces | Carlo Sinestrari et.al. | 2501.13091v1 | null |
2025-01-22 | Stability, periodic orbits and KAM tori in the dynamics of the three fixed centers problem | Edward A. Turner et.al. | 2501.13089v1 | null |
2025-01-21 | Towards Affordance-Aware Articulation Synthesis for Rigged Objects | Yu-Chu Yu et.al. | 2501.12393v1 | null |
2025-01-21 | Learning segmentation from point trajectories | Laurynas Karazija et.al. | 2501.12392v1 | null |
2025-01-21 | Physics of Skill Learning | Ziming Liu et.al. | 2501.12391v1 | link |
2025-01-22 | GPS as a Control Signal for Image Generation | Chao Feng et.al. | 2501.12390v2 | null |
2025-01-21 | Taming Teacher Forcing for Masked Autoregressive Video Generation | Deyu Zhou et.al. | 2501.12389v1 | null |
2025-01-21 | Continuous 3D Perception Model with Persistent State | Qianqian Wang et.al. | 2501.12387v1 | null |
2025-01-22 | InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Yi Wang et.al. | 2501.12386v2 | link |
2025-01-21 | Audio Texture Manipulation by Exemplar-Based Analogy | Kan Jen Cheng et.al. | 2501.12385v1 | null |
2025-01-21 | CCESAR: Coastline Classification-Extraction From SAR Images Using CNN-U-Net Combination | Vidhu Arora et.al. | 2501.12384v1 | null |
2025-01-21 | Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks | Greg Olmschenk et.al. | 2501.12383v1 | null |
2025-01-17 | FaceXBench: Evaluating Multimodal LLMs on Face Understanding | Kartik Narayan et.al. | 2501.10360v1 | link |
2025-01-17 | Parton distributions confront LHC Run II data: a quantitative appraisal | Amedeo Chiefa et.al. | 2501.10359v1 | null |
2025-01-17 | Zero-Shot Monocular Scene Flow Estimation in the Wild | Yiqing Liang et.al. | 2501.10357v1 | null |
2025-01-17 | Exploring the Standard Model and Beyond from the Evidence of CE$\nu$NS with Reactor Antineutrinos in CONUS+ | M. Alpízar-Venegas et.al. | 2501.10355v1 | null |
2025-01-17 | I Zw 1 and H0557-385: The Dusty Tori of Two High Eddington AGNs Observed in the MATISSE LM-Bands | Farin Drewes et.al. | 2501.10352v1 | null |
2025-01-17 | Purcell-Enhanced, Directional Light-Matter Interaction in a Waveguide-Coupled Nanocavity | Nicholas J. Martin et.al. | 2501.10351v1 | null |
2025-01-17 | Resolving discrepancies in bang-time predictions for ICF experiments on the NIF: Insights from the Build-A-Hohlraum Campaign | G. F. Swadling et.al. | 2501.10350v1 | null |
2025-01-17 | Photonic chiral state transfer near the Liouvillian exceptional point | Huixia Gao et.al. | 2501.10349v1 | null |
2025-01-17 | Credit Risk Identification in Supply Chains Using Generative Adversarial Networks | Zizhou Zhang et.al. | 2501.10348v1 | null |
2025-01-17 | ColNet: Collaborative Optimization in Decentralized Federated Multi-task Learning Systems | Chao Feng et.al. | 2501.10347v1 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757v1 | null |
2025-01-16 | SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces | Sumit Chaturvedi et.al. | 2501.09756v1 | null |
2025-01-16 | Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Philippe Hansen-Estruch et.al. | 2501.09755v1 | null |
2025-01-16 | Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues | Youngjoon Jang et.al. | 2501.09754v1 | null |
2025-01-16 | SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification | Yuexi Du et.al. | 2501.09753v1 | link |
2025-01-16 | A vertical slice frontogenesis test case for compressible nonhydrostatic dynamical cores of atmospheric models | Hiroe Yamazaki et.al. | 2501.09752v1 | null |
2025-01-16 | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | Zekun Xi et.al. | 2501.09751v1 | null |
2025-01-16 | Enhancing Lexicon-Based Text Embeddings with Large Language Models | Yibin Lei et.al. | 2501.09749v1 | null |
2025-01-16 | PyPLUTO: a data analysis Python package for the PLUTO code | Giancarlo Mattia et.al. | 2501.09748v1 | null |
2025-01-16 | FAST: Efficient Action Tokenization for Vision-Language-Action Models | Karl Pertsch et.al. | 2501.09747v1 | null |
2025-01-15 | Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion | Jingyuan Chen et.al. | 2501.09019v1 | null |
2025-01-15 | Improving the stellar age determination through joint modeling of binarity and asteroseismology - Grid modeling of the seismic red-giant binary KIC 9163796 | D. H. Grossmann et.al. | 2501.09018v1 | null |
2025-01-15 | An Ensemble Information Filter: Retrieving Markov-information from the SPDE discretisation | Berent Ånund Strømnes Lunde et.al. | 2501.09016v1 | null |
2025-01-15 | Family-wise Error Rate Control with E-values | Will Hartog et.al. | 2501.09015v1 | null |
2025-01-15 | How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias | Tosin Fadahunsi et.al. | 2501.09014v1 | link |
2025-01-15 | Multimodal LLMs Can Reason about Aesthetics in Zero-Shot | Ruixiang Jiang et.al. | 2501.09012v1 | link |
2025-01-15 | Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians | Ishan Amin et.al. | 2501.09009v1 | null |
2025-01-15 | SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation | Aditya Bhat et.al. | 2501.09008v1 | null |
2025-01-15 | Improving Stability Estimates in Adversarial Explainable AI through Alternate Search Methods | Christopher Burger et.al. | 2501.09006v1 | null |
2025-01-15 | Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails | Shaona Ghosh et.al. | 2501.09004v1 | null |
2025-01-14 | DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Hyeonwoo Kim et.al. | 2501.08333v1 | null |
2025-01-14 | MangaNinja: Line Art Colorization with Precise Reference Following | Zhiheng Liu et.al. | 2501.08332v1 | null |
2025-01-14 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331v1 | link |
2025-01-14 | Gradient Equilibrium in Online Learning: Theory and Applications | Anastasios N. Angelopoulos et.al. | 2501.08330v1 | link |
2025-01-14 | Predicting 4D Hand Trajectory from Monocular Videos | Yufei Ye et.al. | 2501.08329v1 | null |
2025-01-14 | PokerBench: Training Large Language Models to become Professional Poker Players | Richard Zhuang et.al. | 2501.08328v1 | link |
2025-01-14 | Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks | Miran Heo et.al. | 2501.08326v1 | null |
2025-01-14 | GameFactory: Creating New Games with Generative Interactive Videos | Jiwen Yu et.al. | 2501.08325v1 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324v1 | null |
2025-01-14 | Exploring Robustness of Multilingual LLMs on Real-World Noisy Data | Amirhossein Aliakbarzadeh et.al. | 2501.08322v1 | link |
2025-01-13 | The Guitar's Magnetic Field Revealed by Starlight Polarization | Jack T. Dinsmore et.al. | 2501.07577v1 | null |
2025-01-13 | Dataset Distillation via Committee Voting | Jiacheng Cui et.al. | 2501.07575v1 | link |
2025-01-13 | UnCommon Objects in 3D | Xingchen Liu et.al. | 2501.07574v1 | link |
2025-01-13 | A generalized Lalanne--Kreweras involution for rectangular and staircase tableaux | Sergi Elizalde et.al. | 2501.07573v1 | null |
2025-01-13 | WebWalker: Benchmarking LLMs in Web Traversal | Jialong Wu et.al. | 2501.07572v1 | link |
2025-01-13 | Digital Twin for Smart Societies: A Catalyst for Inclusive and Accessible Healthcare | Joshit Mohanty et.al. | 2501.07570v1 | null |
2025-01-13 | A reference framework for extremely metal-poor OB star studies: calibrations for stellar parameters and intrinsic colours | Marta Lorenzo et.al. | 2501.07569v1 | null |
2025-01-13 | From Fiber to Fabric: Designing the Mechanics of Machine Knitting | Cosima du Pasquier et.al. | 2501.07567v1 | null |
2025-01-14 | E2ESlack: An End-to-End Graph-Based Framework for Pre-Routing Slack Prediction | Saurabh Bodhe et.al. | 2501.07564v2 | null |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563v1 | null |
2025-01-10 | Multi-subject Open-set Personalization in Video Generation | Tsai-Shien Chen et.al. | 2501.06187v1 | null |
2025-01-10 | LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Omkar Thawakar et.al. | 2501.06186v1 | link |
2025-01-10 | QPEs as Lense-Thirring precession of super-Eddington flows | M. Middleton et.al. | 2501.06185v1 | null |
2025-01-10 | PEACE: Empowering Geologic Map Holistic Understanding with MLLMs | Yangyu Huang et.al. | 2501.06184v1 | null |
2025-01-10 | Theory of Irreversibility in Quantum Many-Body Systems | Takato Yoshimura et.al. | 2501.06183v1 | null |
2025-01-10 | Algebraic solutions for | Alex E. Bernardini et.al. | 2501.06182v1 | null |
2025-01-10 | Probabilistic Forecasts of Load, Solar and Wind for Electricity Price Forecasting | Bartosz Uniejewski et.al. | 2501.06180v1 | null |
2025-01-10 | Precoding Design for Limited-Feedback MISO Systems via Character-Polynomial Codes | Siva Aditya Gooty et.al. | 2501.06178v1 | null |
2025-01-10 | Existence, uniqueness and asymptotic stability of invariant measures for the stochastic Allen-Cahn-Navier-Stokes system with singular potential | Andrea Di Primio et.al. | 2501.06174v1 | null |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173v1 | null |
2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453v1 | null |
2025-01-09 | ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding | Xingyu Fu et.al. | 2501.05452v1 | null |
2025-01-09 | Interplay between altermagnetism and topological superconductivity in an unconventional superconducting platform | Pritam Chatterjee et.al. | 2501.05451v1 | null |
2025-01-09 | Decentralized Diffusion Models | David McAllister et.al. | 2501.05450v1 | null |
2025-01-09 | Explainable AI-Enhanced Deep Learning for Pumpkin Leaf Disease Detection: A Comparative Analysis of CNN Architectures | Md. Arafat Alam Khandaker et.al. | 2501.05449v1 | null |
2025-01-09 | Fortuity in the D1-D5 system | Chi-Ming Chang et.al. | 2501.05448v1 | null |
2025-01-09 | Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Yifan Yu et.al. | 2501.05446v1 | link |
2025-01-09 | Consistent Flow Distillation for Text-to-3D Generation | Runjie Yan et.al. | 2501.05445v1 | null |
2025-01-09 | Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark | Yunzhuo Hao et.al. | 2501.05444v1 | null |
2025-01-09 | A survey of textual cyber abuse detection using cutting-edge language models and large language models | Jose A. Diaz-Garcia et.al. | 2501.05443v1 | null |
2025-01-08 | Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures | Ziyuan Huang et.al. | 2501.04700v1 | null |
2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699v1 | null |
2025-01-08 | ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | Yuzhou Huang et.al. | 2501.04698v1 | null |
2025-01-08 | Grokking at the Edge of Numerical Stability | Lucas Prieto et.al. | 2501.04697v1 | link |
2025-01-08 | Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation | Ulindu De Silva et.al. | 2501.04696v1 | link |
2025-01-08 | Re-ranking the Context for Multimodal Retrieval Augmented Generation | Matin Mortaheb et.al. | 2501.04695v1 | null |
2025-01-08 | EpiCoder: Encompassing Diversity and Complexity in Code Generation | Yaoxiang Wang et.al. | 2501.04694v1 | null |
2025-01-08 | Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding | Joshua Jones et.al. | 2501.04693v1 | null |
2025-01-08 | Non-Markovian dynamics of BIC generation via single-photon scattering | Giuseppe Magnifico et.al. | 2501.04691v1 | null |
2025-01-08 | Comparative Analysis of Quantum and Classical Support Vector Classifiers for Software Bug Prediction: An Exploratory Study | Md Nadim et.al. | 2501.04690v1 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005v1 | null |
2025-01-07 | LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes | Xiang Xu et.al. | 2501.04004v1 | link |
2025-01-07 | Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Shaoyuan Xie et.al. | 2501.04003v1 | null |
2025-01-07 | Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Haobo Yuan et.al. | 2501.04001v1 | null |
2025-01-07 | A Survey on Federated Learning in Human Sensing | Mohan Li et.al. | 2501.04000v1 | null |
2025-01-07 | WAPTS: A Weighted Allocation Probability Adjusted Thompson Sampling Algorithm for High-Dimensional and Sparse Experiment Settings | Haochen Song et.al. | 2501.03999v1 | null |
2025-01-07 | Tracing the Winds: A Uniform Interpretation of Helium Escape in Exoplanets from Archival Spectroscopic Observations | Patrick McCreery et.al. | 2501.03998v1 | null |
2025-01-07 | Two-fluid mobility model from coupled hydrodynamic equations for simulating laser-driven semiconductor switches | Qile Wu et.al. | 2501.03997v1 | null |
2025-01-07 | Modular Features of Superstring Scattering Amplitudes: Generalised Eisenstein Series and Theta Lifts | Daniele Dorigoni et.al. | 2501.03996v1 | null |
2025-01-07 | RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance | Matin Mortaheb et.al. | 2501.03995v1 | null |
2025-01-06 | Gaussian Masked Autoencoders | Jathushan Rajasegaran et.al. | 2501.03229v1 | null |
2025-01-07 | LightGNN: Simple Graph Neural Network for Recommendation | Guoxuan Chen et.al. | 2501.03228v2 | null |
2025-01-06 | When Should Selfish Miners Double-Spend? | Mustafa Doger et.al. | 2501.03227v1 | null |
2025-01-06 | BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Beichen Zhang et.al. | 2501.03226v1 | link |
2025-01-06 | Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Yuhui Zhang et.al. | 2501.03225v1 | link |
2025-01-06 | Testing Approximate Stationarity Concepts for Piecewise Affine Functions | Lai Tian et.al. | 2501.03224v1 | null |
2025-01-06 | Rate-My-LoRA: Efficient and Adaptive Federated Model Tuning for Cardiac MRI Segmentation | Xiaoxiao He et.al. | 2501.03223v1 | null |
2025-01-06 | RW-Net: Enhancing Few-Shot Point Cloud Classification with a Wavelet Transform Projection-based Network | Haosheng Zhang et.al. | 2501.03221v1 | null |
2025-01-06 | ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking | Tingyang Zhang et.al. | 2501.03220v1 | null |
2025-01-06 | Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction | Rui Qian et.al. | 2501.03218v1 | link |
2025-01-03 | VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction | Chaoyou Fu et.al. | 2501.01957v1 | link |
2025-01-03 | Metadata Conditioning Accelerates Language Model Pre-training | Tianyu Gao et.al. | 2501.01956v1 | null |
2025-01-03 | Positive determinacy of h-Shuhan matrices with | Weicai Wu et.al. | 2501.01955v1 | null |
2025-01-03 | Grid-level impacts of renewable energy on thermal generation: efficiency, emissions and flexibility | Dhruv Suri et.al. | 2501.01954v1 | null |
2025-01-03 | Semigroups of holomorphic functions; rectifiability and Lipschitz properties of the orbits | Dimitrios Betsakos et.al. | 2501.01952v1 | null |
2025-01-03 | MADGEN -- Mass-Spec attends to De Novo Molecular generation | Yinkai Wang et.al. | 2501.01950v1 | null |
2025-01-03 | VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment | Wenyan Cong et.al. | 2501.01949v1 | null |
2025-01-03 | A New Approach to the Analysis of Cosmological Parameters in Multifield Cosmology | Katerina Bolshakova et.al. | 2501.01948v1 | null |
2025-01-03 | A uniform action of the dihedral group $Z_2\times D_3$ on Littlewood--Richardson coefficients | Olga Azenhas et.al. | 2501.01947v1 | null |
2025-01-03 | Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Weizhi Zhang et.al. | 2501.01945v1 | null |
2025-01-03 | GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models | Zhangyang Qi et.al. | 2501.01428v2 | null |
2025-01-02 | VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Yuanpeng Tu et.al. | 2501.01427v1 | null |
2025-01-02 | Unifying Specialized Visual Encoders for Video Language Models | Jihoon Chung et.al. | 2501.01426v1 | null |
2025-01-03 | Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions | Xincheng Shuai et.al. | 2501.01425v2 | null |
2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424v1 | null |
2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423v1 | link |
2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422v1 | null |
2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420v1 | null |
2025-01-02 | Riemann-Hilbert problems, Fredholm determinants, explicit combinatorial expansions, and connection formulas for the general | Pavlo Gavrylenko et.al. | 2501.01419v1 | null |
2025-01-02 | The Bayesian Global Sky Model (B-GSM): Validation of a Data Driven Bayesian Simultaneous Component Separation and Calibration Algorithm for EoR Foreground Modelling | George Carter et.al. | 2501.01417v1 | null |
2024-12-30 | PERSE: Personalized 3D Generative Avatars from A Single Portrait | Hyunsoo Cha et.al. | 2412.21206v1 | null |
2024-12-30 | Action-Agnostic Point-Level Supervision for Temporal Action Detection | Shuhei M. Yoshida et.al. | 2412.21205v1 | link |
2024-12-30 | Branes Screening Quarks and Defect Operators | Andreas Karch et.al. | 2412.21204v1 | null |
2024-12-30 | SoS Certificates for Sparse Singular Values and Their Applications: Robust Statistics, Subspace Distortion, and More | Ilias Diakonikolas et.al. | 2412.21203v1 | null |
2024-12-30 | Two-component Dark Matter and low scale Thermal Leptogenesis | Subhaditya Bhattacharya et.al. | 2412.21202v1 | null |
2024-12-30 | Vector-like quark doublets, weak-basis invariants and CP violation | F. Albergaria et.al. | 2412.21201v1 | null |
2024-12-30 | Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Purbesh Mitra et.al. | 2412.21200v1 | link |
2024-12-31 | HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation | Zhaojian Yu et.al. | 2412.21199v2 | link |
2024-12-30 | Topological Responses of the Standard Model Gauge Group | Zheyan Wan et.al. | 2412.21196v1 | null |
2024-12-30 | Rough differential equations for volatility | Ofelia Bonesini et.al. | 2412.21192v1 | null |
2024-12-27 | Computing Direct Sum Decompositions | Devlin Mallory et.al. | 2412.19799v1 | null |
2024-12-27 | Non-Scaling Topological Defects and Gravitational Waves in Higgs Portal | Wen Yin et.al. | 2412.19798v1 | null |
2024-12-27 | Streamlined Krylov construction and classification of ergodic Floquet systems | Nikita Kolganov et.al. | 2412.19797v1 | null |
2024-12-27 | Generalized Grade-of-Membership Estimation for High-dimensional Locally Dependent Data | Ling Chen et.al. | 2412.19796v1 | null |
2024-12-27 | g-factor theory of Si/SiGe quantum dots: spin-valley and giant renormalization effects | Benjamin D. Woods et.al. | 2412.19795v1 | null |
2024-12-27 | MVTamperBench: Evaluating Robustness of Vision-Language Models | Amit Agarwal et.al. | 2412.19794v1 | null |
2024-12-27 | InfAlign: Inference-aware language model alignment | Ananth Balashankar et.al. | 2412.19792v1 | null |
2024-12-27 | Bottom-up robust modeling for the foraging behavior of Physarum polycephalum | Damiano Reginato et.al. | 2412.19790v1 | null |
2024-12-27 | Data-driven analysis of anomalous transport and three-wave-coupling effects in E x B plasma discharges | Borja Bayón-Buján et.al. | 2412.19789v1 | null |
2024-12-27 | Transmon qutrit-based simulation of spin-1 AKLT systems | Keerthi Kumaran et.al. | 2412.19786v1 | null |
2024-12-24 | Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Jinhui Yi et.al. | 2412.18609v1 | link |
2024-12-24 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608v1 | null |
2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607v1 | null |
2024-12-24 | Lattice T-duality from non-invertible symmetries in quantum spin chains | Salvatore D. Pace et.al. | 2412.18606v1 | null |
2024-12-24 | Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models | Zehan Wang et.al. | 2412.18605v1 | null |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604v1 | null |
2024-12-24 | Long-Form Speech Generation with Spoken Language Models | Se Jin Park et.al. | 2412.18603v1 | link |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601v1 | link |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600v1 | null |
2024-12-24 | Double Spending Analysis of Nakamoto Consensus for Time-Varying Mining Rates with Ruin Theory | Mustafa Doger et.al. | 2412.18599v1 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812v1 | null |
2024-12-23 | ChatGarment: Garment Estimation, Generation and Editing via Large Language Models | Siyuan Bian et.al. | 2412.17811v1 | null |
2024-12-24 | Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders | Rui Chen et.al. | 2412.17808v2 | null |
2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806v1 | null |
2024-12-23 | Large Motion Video Autoencoding with Cross-modal Video VAE | Yazhou Xing et.al. | 2412.17805v1 | null |
2024-12-23 | GauSim: Registering Elastic Objects into Digital World by Gaussian Simulator | Yidi Shao et.al. | 2412.17804v1 | null |
2024-12-23 | Examining Imbalance Effects on Performance and Demographic Fairness of Clinical Language Models | Precious Jones et.al. | 2412.17803v1 | null |
2024-12-23 | Feebly-Interacting Peccei-Quinn Model | Wen Yin et.al. | 2412.17802v1 | null |
2024-12-23 | Probing the magnetic origin of the pseudogap using a Fermi-Hubbard quantum simulator | Thomas Chalopin et.al. | 2412.17801v1 | null |
2024-12-23 | Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection | Yitong Chen et.al. | 2412.17800v1 | link |
2024-12-20 | HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding | Chenxin Tao et.al. | 2412.16158v1 | null |
2024-12-20 | Stochastic Analysis of Entanglement-assisted Quantum Communication Channels | Karim S. Elsayed et.al. | 2412.16157v1 | null |
2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156v1 | link |
2024-12-20 | Can Generative Video Models Help Pose Estimation? | Ruojin Cai et.al. | 2412.16155v1 | null |
2024-12-20 | MotiF: Making Text Count in Image Animation with Motion Focal Loss | Shijie Wang et.al. | 2412.16153v1 | null |
2024-12-20 | A vector logic for extensional formal semantics | Daniel Quigley et.al. | 2412.16152v1 | null |
2024-12-20 | Shape Shifters: Does Body Shape Change the Perception of Small-Scale Crowd Motions? | Bharat Vyas et.al. | 2412.16151v1 | null |
2024-12-20 | The Classical Super-Phaserotation Infrared Triangle | Sangmin Choi et.al. | 2412.16149v1 | null |
2024-12-20 | Frequency Is What You Need: Word-frequency Masking Benefits Vision-Language Model Pre-training | Mingliang Liang et.al. | 2412.16148v1 | null |
2024-12-20 | SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild | Jannik Elsäßer et.al. | 2412.16147v1 | null |
2024-12-19 | UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency | Enis Simsar et.al. | 2412.15216v1 | null |
2024-12-19 | EnvGS: Modeling View-Dependent Appearance with Environment Gaussian | Tao Xie et.al. | 2412.15215v1 | null |
2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214v1 | null |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213v1 | null |
2024-12-19 | Scaling 4D Representations | João Carreira et.al. | 2412.15212v1 | null |
2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211v1 | null |
2024-12-19 | PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation | Muntasir Wahed et.al. | 2412.15209v1 | null |
2024-12-19 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208v1 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206v1 | link |
2024-12-19 | FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching | Sucheng Ren et.al. | 2412.15205v1 | link |
2024-12-18 | AniDoc: Animation Creation Made Easier | Yihao Meng et.al. | 2412.14173v1 | null |
2024-12-18 | Learning from Massive Human Videos for Universal Humanoid Pose Control | Jiageng Mao et.al. | 2412.14172v1 | null |
2024-12-18 | Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces | Jihan Yang et.al. | 2412.14171v1 | link |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170v2 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169v1 | link |
2024-12-18 | FashionComposer: Compositional Fashion Image Generation | Sihui Ji et.al. | 2412.14168v1 | null |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167v1 | null |
2024-12-18 | MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data | Hanwen Jiang et.al. | 2412.14166v1 | null |
2024-12-18 | On symmetry-resolved generalized entropies | Fei Yan et.al. | 2412.14165v1 | null |
2024-12-18 | MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Shengbang Tong et.al. | 2412.14164v1 | null |
2024-12-17 | ExBody2: Advanced Expressive Humanoid Whole-Body Control | Mazeyu Ji et.al. | 2412.13196v1 | null |
2024-12-17 | CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models | Gaoyang Zhang et.al. | 2412.13195v1 | link |
2024-12-17 | Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents | Yifei Zhou et.al. | 2412.13194v1 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193v1 | link |
2024-12-17 | How to Falsify String Theory at a Collider | Matthew Baumgart et.al. | 2412.13192v1 | null |
2024-12-17 | MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Maham Tanveer et.al. | 2412.13190v1 | null |
2024-12-17 | Binary properties of the globular cluster 47 Tuc (NGC 104). A dearth of short-period binaries | Johanna Müller-Horn et.al. | 2412.13189v1 | null |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188v1 | null |
2024-12-17 | HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Chen Bao et.al. | 2412.13187v1 | null |
2024-12-17 | Efficiently measuring | Daniel K. Mark et.al. | 2412.13186v1 | null |
2024-12-16 | MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization | Bhavya Sukhija et.al. | 2412.12098v1 | null |
2024-12-16 | There is more to the de Sitter horizon than just the area | Willy Fischler et.al. | 2412.12097v1 | null |
2024-12-16 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095v1 | link |
2024-12-16 | SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Guoxuan Chen et.al. | 2412.12094v1 | null |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093v1 | null |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091v1 | null |
2024-12-16 | Geometry of 3-dimensional del Pezzo fibrations in positive characteristic | Fabio Bernasconi et.al. | 2412.12090v1 | null |
2024-12-16 | Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation | Eliot Xing et.al. | 2412.12089v1 | null |
2024-12-16 | Nonlinear Reduced-Order Modeling of Compressible Flow Fields Using Deep Learning and Manifold Learning | Bilal Mufti et.al. | 2412.12088v1 | null |
2024-12-16 | Instruction-based Image Manipulation by Watching How Things Move | Mingdeng Cao et.al. | 2412.12087v1 | null |
2024-12-13 | GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Sicheng Zuo et.al. | 2412.10373v1 | link |
2024-12-13 | UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities | Muhammad Uzair Khattak et.al. | 2412.10372v1 | link |
2024-12-13 | Computational Explorations of Total Variation Distance | Arnab Bhattacharyya et.al. | 2412.10370v1 | null |
2024-12-13 | A Grounded Typology of Word Classes | Coleman Haley et.al. | 2412.10369v1 | null |
2024-12-13 | Black holes and gravitational waves from phase transitions in realistic models | Marek Lewicki et.al. | 2412.10366v1 | null |
2024-12-13 | Real-Time Simulation of Asymmetry Generation in Fermion-Bubble Collisions | Marcela Carena et.al. | 2412.10365v1 | null |
2024-12-13 | OP-LoRA: The Blessing of Dimensionality | Piotr Teterwak et.al. | 2412.10362v1 | null |
2024-12-13 | Apollo: An Exploration of Video Understanding in Large Multimodal Models | Orr Zohar et.al. | 2412.10360v1 | null |
2024-12-13 | Modeling | Lyne Moser et.al. | 2412.10359v1 | null |
2024-12-13 | Critical Point Criteria and Dynamically Monogenic Polynomials | Joachim König et.al. | 2412.10358v1 | null |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627v1 | link |
2024-12-12 | FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Haonan Qiu et.al. | 2412.09626v1 | null |
2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625v1 | null |
2024-12-12 | GenEx: Generating an Explorable World | Taiming Lu et.al. | 2412.09624v1 | null |
2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623v1 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622v1 | null |
2024-12-12 | Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos | Linyi Jin et.al. | 2412.09621v1 | null |
2024-12-12 | Learning Camera Movement Control from Real-World Drone Videos | Yunzhong Hou et.al. | 2412.09620v1 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619v1 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618v1 | null |
2024-12-11 | SegFace: Face Segmentation of Long-Tail Classes | Kartik Narayan et.al. | 2412.08647v1 | link |
2024-12-11 | StreamChat: Chatting with Streaming Video | Jihao Liu et.al. | 2412.08646v1 | null |
2024-12-11 | ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation | Daniel Winter et.al. | 2412.08645v1 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643v1 | link |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642v1 | null |
2024-12-11 | 3D Mesh Editing using Masked LRMs | Will Gao et.al. | 2412.08641v1 | null |
2024-12-11 | Fast Prompt Alignment for Text-to-Image Generation | Khalil Mrini et.al. | 2412.08639v1 | link |
2024-12-11 | An Improved Precision Calculation of the | Graham Van Goffrier et.al. | 2412.08638v1 | null |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637v1 | link |
2024-12-11 | Deformation Openness of Big Fundamental Groups and Applications | Ya Deng et.al. | 2412.08636v1 | null |
2024-12-10 | Video Motion Transfer with Diffusion Transformers | Alexander Pondaven et.al. | 2412.07776v1 | link |
2024-12-10 | Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu et.al. | 2412.07775v1 | null |
2024-12-10 | UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics | Xi Chen et.al. | 2412.07774v1 | null |
2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772v1 | null |
2024-12-10 | PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition | Kartik Narayan et.al. | 2412.07771v1 | null |
2024-12-10 | From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos | Matthew Wallingford et.al. | 2412.07770v1 | null |
2024-12-10 | BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities | Sahal Shaji Mullappilly et.al. | 2412.07769v1 | null |
2024-12-11 | Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | Zetong Yang et.al. | 2412.07768v2 | null |
2024-12-10 | Learning Visual Generative Priors without Text | Shuailei Ma et.al. | 2412.07767v1 | null |
2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766v1 | null |
2024-12-10 | [MASK] is All You Need | Vincent Tao Hu et.al. | 2412.06787v2 | link |
2024-12-09 | Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis | M. Hamza Mughal et.al. | 2412.06786v1 | null |
2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785v1 | link |
2024-12-09 | P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies | Mara Levy et.al. | 2412.06784v1 | null |
2024-12-09 | CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction | Zhefei Gong et.al. | 2412.06782v1 | null |
2024-12-09 | Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation | Nicolas Dufour et.al. | 2412.06781v1 | null |
2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780v1 | null |
2024-12-09 | AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation | Guanxing Lu et.al. | 2412.06779v1 | null |
2024-12-09 | Dark Matter Freeze-In during Warm Inflation and the Seesaw Mechanism | Rayff de Souza et.al. | 2412.06778v1 | null |
2024-12-09 | Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Yi-Lun Lee et.al. | 2412.06775v1 | link |
2024-12-06 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280v1 | link |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279v1 | null |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278v1 | null |
2024-12-06 | Text to Blind Motion | Hee Jae Kim et.al. | 2412.05277v1 | null |
2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276v1 | link |
2024-12-06 | MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models | Tuna Han Salih Meral et.al. | 2412.05275v1 | null |
2024-12-06 | Real-space chirality from crystalline topological defects in the Kitaev spin liquid | Fay Borhani et.al. | 2412.05272v1 | null |
2024-12-06 | Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Zhe Chen et.al. | 2412.05271v1 | null |
2024-12-06 | APOLLO: SGD-like Memory, AdamW-level Performance | Hanqing Zhu et.al. | 2412.05270v1 | null |
2024-12-06 | Chimera: Accurate retrosynthesis prediction by ensembling models with diverse inductive biases | Krzysztof Maziarz et.al. | 2412.05269v1 | null |
2024-12-05 | Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Luca Bartolomei et.al. | 2412.04472v1 | link |
2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471v1 | null |
2024-12-05 | Turbo3D: Ultra-fast Text-to-3D Generation | Hanzhe Hu et.al. | 2412.04470v1 | null |
2024-12-05 | QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Sharath Girish et.al. | 2412.04469v1 | null |
2024-12-05 | NVILA: Efficient Frontier Visual Language Models | Zhijian Liu et.al. | 2412.04468v1 | null |
2024-12-05 | VisionZip: Longer is Better but Not Necessary in Vision Language Models | Senqiao Yang et.al. | 2412.04467v1 | link |
2024-12-05 | User-item fairness tradeoffs in recommendations | Sophie Greenwood et.al. | 2412.04466v1 | link |
2024-12-05 | UnZipLoRA: Separating Content and Style from a Single Image | Chang Liu et.al. | 2412.04465v1 | null |
2024-12-05 | DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction | Ben Kaye et.al. | 2412.04464v1 | null |
2024-12-05 | 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion | Chaoyang Wang et.al. | 2412.04462v1 | null |
2024-12-04 | Navigation World Models | Amir Bar et.al. | 2412.03572v1 | null |
2024-12-04 | Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation | Bingjie Song et.al. | 2412.03571v1 | null |
2024-12-04 | Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis | Qitao Zhao et.al. | 2412.03570v1 | null |
2024-12-04 | Critical behavior of the Schwinger model via gauge-invariant VUMPS | Hirotsugu Fujii et.al. | 2412.03569v1 | null |
2024-12-04 | The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control | Ruili Feng et.al. | 2412.03568v1 | null |
2024-12-04 | Streaming Detection of Queried Event Start | Cristobal Eyzaguirre et.al. | 2412.03567v1 | null |
2024-12-04 | FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes | Lue Fan et.al. | 2412.03566v1 | null |
2024-12-04 | Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning | Wujian Peng et.al. | 2412.03565v1 | null |
2024-12-04 | Improving Perturbation Theory with the Sum-of-Squares: Third Order | M. B. Hastings et.al. | 2412.03564v1 | null |
2024-12-04 | From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Xinyi Mou et.al. | 2412.03563v1 | null |
2024-12-03 | Motion Prompting: Controlling Video Generation with Motion Trajectories | Daniel Geng et.al. | 2412.02700v1 | null |
2024-12-03 | UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping | Wenbo Wang et.al. | 2412.02699v1 | null |
2024-12-03 | Scaling BERT Models for Turkish Automatic Punctuation and Capitalization Correction | Abdulkader Saoud et.al. | 2412.02698v1 | null |
2024-12-03 | Stage IV CMB forecasts for warm inflation | F. B. M. dos Santos et.al. | 2412.02696v1 | null |
2024-12-03 | An ADHD Diagnostic Interface Based on EEG Spectrograms and Deep Learning Techniques | Medha Pappula et.al. | 2412.02695v1 | null |
2024-12-03 | Increased Surface Temperatures of Habitable White Dwarf Worlds Relative to Main-Sequence Exoplanets | Aomawa L. Shields et.al. | 2412.02694v1 | null |
2024-12-03 | Diffusion-based Visual Anagram as Multi-task Learning | Zhiyuan Xu et.al. | 2412.02693v1 | link |
2024-12-03 | Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Fengyuan Shi et.al. | 2412.02692v1 | null |
2024-12-04 | Chow-Lam Recovery | Elizabeth Pratt et.al. | 2412.02691v2 | null |
2024-12-03 | FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation | Kefan Chen et.al. | 2412.02690v1 | null |
2024-12-02 | T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Shukang Yin et.al. | 2411.19951v2 | link |
2024-11-29 | AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos | Yuze He et.al. | 2411.19950v1 | null |
2024-11-29 | Efficient short-wave infrared upconversion by self-sensitized holmium-doped nanoparticles | Rakesh Arul et.al. | 2411.19949v1 | null |
2024-11-29 | Operator Valued Flow Equation Approach to the Bosonic Lattice Polaron: Dispersion Renormalization Beyond the Fröhlich Paradigm | Jan-Philipp Christ et.al. | 2411.19947v1 | null |
2024-11-29 | DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Zhiqiang Shen et.al. | 2411.19946v1 | link |
2024-12-02 | Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability | Zicheng Lin et.al. | 2411.19943v2 | null |
2024-11-29 | Free-form Generation Enhances Challenging Clothed Human Modeling | Hang Ye et.al. | 2411.19942v1 | null |
2024-11-29 | Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark | Joseph Heyward et.al. | 2411.19941v1 | null |
2024-11-29 | Direct local parametrization of nuclear state densities using the back-shifted Bethe formula | C. Özen et.al. | 2411.19940v1 | null |
2024-11-29 | VLSBench: Unveiling Visual Leakage in Multimodal Safety | Xuhao Hu et.al. | 2411.19939v1 | null |
2024-11-27 | Textured Gaussians for Enhanced 3D Scene Appearance Modeling | Brian Chao et.al. | 2411.18625v1 | null |
2024-11-27 | GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data | Wentao Wang et.al. | 2411.18624v1 | null |
2024-11-27 | Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation | Yueru Jia et.al. | 2411.18623v1 | null |
2024-11-27 | Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | Aoran Shen et.al. | 2411.18622v1 | null |
2024-11-27 | Yukawa-Lorentz Symmetry of Tilted Non-Hermitian Dirac Semimetals at Quantum Criticality | Sergio Pino-Alarcón et.al. | 2411.18621v1 | null |
2024-11-27 | Cross-modal Information Flow in Multimodal Large Language Models | Zhi Zhang et.al. | 2411.18620v1 | null |
2024-11-27 | Anatomy of the Real Higgs Triplet Model | Saiyad Ashanujjaman et.al. | 2411.18618v1 | null |
2024-11-27 | Online versus Offline Adversaries in Property Testing | Esty Kelman et.al. | 2411.18617v1 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616v1 | null |
2024-11-27 | Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective | Zhi Zhang et.al. | 2411.18615v1 | null |
2024-11-26 | Mock modularity of Calabi-Yau threefolds | Sergey Alexandrov et.al. | 2411.17699v1 | null |
2024-11-26 | Video-Guided Foley Sound Generation with Multimodal Controls | Ziyang Chen et.al. | 2411.17698v1 | null |
2024-11-27 | StableAnimator: High-Quality Identity-Preserving Human Image Animation | Shuyuan Tu et.al. | 2411.17697v2 | link |
2024-11-26 | ScribbleLight: Single Image Indoor Relighting with Scribbles | Jun Myeong Choi et.al. | 2411.17696v1 | null |
2024-11-26 | Ensemble reliability and the signal-to-noise paradox in large-ensemble subseasonal forecasts | Christopher David Roberts et.al. | 2411.17694v1 | null |
2024-11-26 | Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats | Jiaxin Wen et.al. | 2411.17693v1 | null |
2024-11-27 | Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens | Xu Ouyang et.al. | 2411.17691v2 | null |
2024-11-26 | Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis | Akshita Gupta et.al. | 2411.17690v1 | null |
2024-11-26 | Cornering in the Water: An Investigation of Dolphin Swimming Performance | Mingkai Xia et.al. | 2411.17688v1 | null |
2024-11-26 | GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2411.17687v1 | null |
2024-11-25 | The impact of resistivity on the variability of black hole accretion flows | Antonios Nathanail et.al. | 2411.16684v1 | null |
2024-11-25 | Generative Omnimatte: Learning to Decompose Video into Layers | Yao-Chih Lee et.al. | 2411.16683v1 | null |
2024-11-25 | Contrasting and comparing the efficacy of non-pharmaceutical interventions on air-borne and vector-borne diseases | Bibandhan Poudyal et.al. | 2411.16682v1 | null |
2024-11-25 | Factorized Visual Tokenization and Generation | Zechen Bai et.al. | 2411.16681v1 | null |
2024-11-25 | Quark: Real-time, High-resolution, and General Neural View Synthesis | John Flynn et.al. | 2411.16680v1 | null |
2024-11-25 | Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts? | Sohee Yang et.al. | 2411.16679v1 | null |
2024-11-25 | Modified recombination and the Hubble tension | Seyed Hamidreza Mirpoorian et.al. | 2411.16678v1 | null |
2024-11-25 | A Sound Horizon-Free Measurement of | E. A. Zaborowski et.al. | 2411.16677v1 | null |
2024-11-25 | Interpolation for degree 2 Veroneses of odd dimension | Ray Shang et.al. | 2411.16672v1 | null |
2024-11-25 | Winning opinion: Following Your Friends' Advice or That of Their Friends? | Francisco J. Muñoz et.al. | 2411.16671v1 | null |
2024-11-22 | Emulating Recombination with Neural Networks using Universal Differential Equations | Ben Pennell et.al. | 2411.15140v1 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139v1 | link |
2024-11-22 | Material Anything: Generating Materials for Any 3D Object via Diffusion | Xin Huang et.al. | 2411.15138v1 | null |
2024-11-22 | On Approximability of Satisfiable | Amey Bhangale et.al. | 2411.15133v1 | null |
2024-11-22 | Efficient Eigenstate Preparation in an Integrable Model with Hilbert Space Fragmentation | Roberto Ruiz et.al. | 2411.15132v1 | null |
2024-11-22 | WildLMa: Long Horizon Loco-Manipulation in the Wild | Ri-Zhao Qiu et.al. | 2411.15131v1 | null |
2024-11-22 | Learning-based Trajectory Tracking for Bird-inspired Flapping-Wing Robots | Jiaze Cai et.al. | 2411.15130v1 | null |
2024-11-22 | Measuring Bullshit in the Language Games played by ChatGPT | Alessandro Trevisan et.al. | 2411.15129v1 | null |
2024-11-22 | Health AI Developer Foundations | Atilla P. Kiraly et.al. | 2411.15128v1 | null |
2024-11-22 | PRIMUS: Pretraining IMU Encoders with Multimodal Self-Supervision | Arnav M. Das et.al. | 2411.15127v1 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432v1 | link |
2024-11-21 | On Optimal Testing of Linearity | Vipul Arora et.al. | 2411.14431v1 | null |
2024-11-21 | Stable Flow: Vital Layers for Training-Free Image Editing | Omri Avrahami et.al. | 2411.14430v1 | null |
2024-11-21 | Transformer-based Heuristic for Advanced Air Mobility Planning | Jun Xiang et.al. | 2411.14427v1 | null |
2024-11-21 | Quantum States Imaging of Magnetic Field Contours based on Autler-Townes Effect in Yb Atoms | Tanaporn Na Narong et.al. | 2411.14426v1 | null |
2024-11-21 | Whack-a-Chip: The Futility of Hardware-Centric Export Controls | Ritwik Gupta et.al. | 2411.14425v1 | null |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423v1 | null |
2024-11-21 | Bootstrapping the Chiral-Gravitational Anomaly | Zi-Yu Dong et.al. | 2411.14422v1 | null |
2024-11-21 | From RNNs to Foundation Models: An Empirical Study on Commercial Building Energy Consumption | Shourya Bose et.al. | 2411.14421v1 | null |
2024-11-21 | Combining summary statistics with simulation-based inference for the 21 cm signal from the Epoch of Reionization | Benoit Semelin et.al. | 2411.14419v1 | null |
2024-11-20 | AI-generated Image Detection: Passive or Watermark? | Moyang Guo et.al. | 2411.13553v1 | null |
2024-11-20 | REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents | Rui Tian et.al. | 2411.13552v1 | link |
2024-11-20 | Find Any Part in 3D | Ziqi Ma et.al. | 2411.13550v1 | null |
2024-11-20 | Generating 3D-Consistent Videos from Unposed Internet Photos | Gene Chou et.al. | 2411.13549v1 | null |
2024-11-20 | SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs | Shirley Kokane et.al. | 2411.13547v1 | null |
2024-11-20 | Promoting User Data Autonomy During the Dissolution of a Monopolistic Firm | Rushabh Solanki et.al. | 2411.13546v1 | null |
2024-11-20 | Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning | Andy Li et.al. | 2411.13545v1 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543v1 | null |
2024-11-20 | The Rényi Outlier Test | Ryan Christ et.al. | 2411.13542v1 | null |
2024-11-20 | Living dangerously with decoupled first/second generation scalars: SUSY prospects at the LHC | Howard Baer et.al. | 2411.13541v1 | null |
2024-11-19 | RR Lyrae Stars in Intermediate-age Magellanic Clusters: Membership Probabilities and Delay Time Distribution | Bolivia Cuevas-Otahola et.al. | 2411.12741v1 | null |
2024-11-19 | Improving the solver for the Balitsky-Kovchegov evolution equation with Automatic Differentiation | Florian Cougoulic et.al. | 2411.12739v1 | null |
2024-11-19 | ACING: Actor-Critic for Instruction Learning in Black-Box Large Language Models | Salma Kharrat et.al. | 2411.12736v1 | link |
2024-11-19 | The More the Merrier: On Evolving Five-valued Spectra Boolean Functions | Claude Carlet et.al. | 2411.12735v1 | null |
2024-11-19 | Soft Robotic Dynamic In-Hand Pen Spinning | Yunchao Yao et.al. | 2411.12734v1 | null |
2024-11-19 | Benchmarking Positional Encodings for GNNs and Graph Transformers | Florian Grötschla et.al. | 2411.12732v1 | link |
2024-11-19 | Testing classical properties from quantum data | Matthias C. Caro et.al. | 2411.12730v1 | null |
2024-11-19 | Precise study of triply charmed baryons ( | Navdeep Singh Dhindsa et.al. | 2411.12729v1 | null |
2024-11-19 | Information Theory of Meaningful Communication | Doron Sivan et.al. | 2411.12728v1 | null |
2024-11-19 | Reinforcement Learning, Collusion, and the Folk Theorem | Galit Askenazi-Golan et.al. | 2411.12725v1 | null |
2024-11-18 | High-precision black hole scattering with Calabi-Yau manifolds | Mathias Driesse et.al. | 2411.11846v1 | null |
2024-11-18 | UniHands: Unifying Various Wild-Collected Keypoints for Personalized Hand Reconstruction | Menghe Zhang et.al. | 2411.11845v1 | null |
2024-11-18 | Generative World Explorer | Taiming Lu et.al. | 2411.11844v1 | null |
2024-11-18 | Bi-Mamba: Towards Accurate 1-Bit State Space Models | Shengkun Tang et.al. | 2411.11843v1 | null |
2024-11-18 | On Halin's end-degree Conjecture and | Gabriel Fernandes et.al. | 2411.11841v1 | null |
2024-11-18 | Mass Transfer in Eccentric Orbits with Self-consistent Stellar Evolution | Kyle Akira Rocha et.al. | 2411.11840v1 | null |
2024-11-18 | RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator | Xinhai Li et.al. | 2411.11839v1 | null |
2024-11-18 | Pairwise Markov Chains for Volatility Forecasting | Elie Azeraf et.al. | 2411.11838v1 | null |
2024-11-18 | The JWST EXCELS survey: tracing the chemical enrichment pathways of high-redshift star-forming galaxies with O, Ar and Ne abundances | T. M. Stanton et.al. | 2411.11837v1 | null |
2024-11-18 | Describe Now: User-Driven Audio Description for Blind and Low Vision Individuals | Maryam Cheema et.al. | 2411.11835v1 | null |
2024-11-15 | Ultrafast optical control of charge orders in kagome metals | Yu-Ping Lin et.al. | 2411.10447v1 | null |
2024-11-15 | VeriGraph: Scene Graphs for Execution Verifiable Robot Planning | Daniel Ekpo et.al. | 2411.10446v1 | null |
2024-11-15 | Inverse Melting of Polar Order in a Ferroelectric Oxide | Yang Zhang et.al. | 2411.10445v1 | null |
2024-11-15 | Balancing Passenger Transport and Power Distribution: A Distributed Dispatch Policy for Shared Autonomous Electric Vehicles | Jake Robbennolt et.al. | 2411.10444v1 | null |
2024-11-15 | Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization | Weiyun Wang et.al. | 2411.10442v1 | null |
2024-11-15 | Unlocking multiphoton emission from a single-photon source through mean-field engineering | Sang Kyu Kim et.al. | 2411.10441v1 | null |
2024-11-15 | LLaVA-o1: Let Vision Language Models Reason Step-by-Step | Guowei Xu et.al. | 2411.10440v1 | null |
2024-11-15 | MARS: Unleashing the Power of Variance Reduction for Training Large Models | Huizhuo Yuan et.al. | 2411.10438v1 | null |
2024-11-15 | Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Yuhan Fu et.al. | 2411.10436v1 | null |
2024-11-15 | The Spatial Complexity of Optical Computing and How to Reduce It | Yandong Li et.al. | 2411.10435v1 | null |
2024-11-14 | MagicQuill: An Intelligent Interactive Image Editing System | Zichen Liu et.al. | 2411.09703v1 | null |
2024-11-14 | On the Surprising Effectiveness of Attention Transfer for Vision Transformers | Alexander C. Li et.al. | 2411.09702v1 | null |
2024-11-14 | Post-Newtonian expansion of energy and angular momentum fluxes: inclined spherical orbits about a Kerr black hole | Jezreel C. Castillo et.al. | 2411.09700v1 | null |
2024-11-14 | Cubic Dirac Semimetals: General Theory and Application to Rare-Earth Magnets | Shouvik Sur et.al. | 2411.09699v1 | null |
2024-11-14 | A Universal Circuit Set Using the | Liyuan Chen et.al. | 2411.09697v1 | null |
2024-11-14 | Petz-Rényi relative entropy in QFT from modular theory | Markus B. Fröb et.al. | 2411.09696v1 | null |
2024-11-14 | A physical basis for cosmological correlators from cuts | Shounak De et.al. | 2411.09695v1 | null |
2024-11-14 | A Bayesian Optimization Approach to Machine Translation Reranking | Julius Cheng et.al. | 2411.09694v1 | null |
2024-11-14 | CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants | Albert J. Zhai et.al. | 2411.09693v1 | null |
2024-11-14 | Reggeization in Color | Anjie Gao et.al. | 2411.09692v1 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879v1 | null |
2024-11-13 | A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources | Yasin Abdulkadir et.al. | 2411.08876v1 | null |
2024-11-13 | A consistency relation for induced gravitational wave anisotropies | Julián Rey et.al. | 2411.08873v1 | null |
2024-11-13 | Large Wireless Model (LWM): A Foundation Model for Wireless Channels | Sadjad Alikhani et.al. | 2411.08872v1 | null |
2024-11-13 | The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Daniel P. Jeong et.al. | 2411.08870v1 | null |
2024-11-13 | Equivalence between the second order steady state for spin-Boson model and its quantum mean force Gibbs state | Prem Kumar et.al. | 2411.08869v1 | null |
2024-11-13 | CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Wissam Antoun et.al. | 2411.08868v1 | null |
2024-11-13 | Unsupervised Parameter-free Outlier Detection using HDBSCAN* Outlier Profiles | Kushankur Ghosh et.al. | 2411.08867v1 | null |
2024-11-13 | Local Operator Algebras of Charged States in Gauge Theory and Gravity | Pietro Antonio Grassi et.al. | 2411.08865v1 | null |
2024-11-13 | Isotropic Correlation Models for the Cross-Section of Equity Returns | Graham L. Giller et.al. | 2411.08864v1 | null |
2024-11-12 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034v1 | null |
2024-11-12 | GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation | Yushi Lan et.al. | 2411.08033v1 | null |
2024-11-12 | | Antonio Racioppi et.al. | 2411.08031v1 | null |
2024-11-12 | Integrable fishnet circuits and Brownian solitons | Žiga Krajnik et.al. | 2411.08030v1 | null |
2024-11-12 | The discovery and characterization of minimoon 2024 PT$_5$ | Bryce T. Bolin et.al. | 2411.08029v1 | null |
2024-11-12 | Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data | Juanhui Li et.al. | 2411.08028v1 | null |
2024-11-12 | LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models | Anoop Cherian et.al. | 2411.08027v1 | null |
2024-11-12 | Incentive Design with Spillovers | Krishna Dasaratha et.al. | 2411.08026v1 | null |
2024-11-12 | Leonardo vindicated: Pythagorean trees for minimal reconstruction of the natural branching structures | Dymitr Ruta et.al. | 2411.08024v1 | null |
2024-11-12 | How neutron star properties disfavor a nuclear chiral density wave | Orestis Papadopoulos et.al. | 2411.08023v1 | null |
2024-11-11 | A novel approach to understanding the link between supermassive black holes and host galaxies | Gabriel Sasseville et.al. | 2411.07242v1 | null |
2024-11-11 | A necessary and sufficient condition for | Daniel McGinnis et.al. | 2411.07241v1 | null |
2024-11-11 | UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts | Bo Yang et.al. | 2411.07240v1 | null |
2024-11-11 | DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning | Zecheng Zhang et.al. | 2411.07239v1 | null |
2024-11-11 | OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Sumeth Yuenyong et.al. | 2411.07238v1 | null |
2024-11-11 | Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations | Chaitanya Malaviya et.al. | 2411.07237v1 | null |
2024-11-11 | Floquet Topological Dissipative Kerr Solitons and Incommensurate Frequency Combs | Seyed Danial Hashemi et.al. | 2411.07236v1 | null |
2024-11-11 | Score-based generative diffusion with "active" correlated noise sources | Alexandra Lamtyugina et.al. | 2411.07233v1 | null |
2024-11-12 | Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Yoad Tewel et.al. | 2411.07232v2 | null |
2024-11-11 | Watermark Anything with Localized Messages | Tom Sander et.al. | 2411.07231v1 | null |
2024-11-08 | Recycled Attention: Efficient inference for long-context language models | Fangyuan Xu et.al. | 2411.05787v1 | null |
2024-11-08 | Phases of decodability in the surface code with unitary errors | Yimu Bao et.al. | 2411.05785v1 | null |
2024-11-08 | Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles | Jonas Kiemel et.al. | 2411.05784v1 | null |
2024-11-08 | ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles | Kayo Yin et.al. | 2411.05783v1 | null |
2024-11-08 | Gender Inequalities in Content Collaborations: Asymmetric Creator Synergy and Symmetric Audience Biases | Mingyue Zha et.al. | 2411.05782v1 | null |
2024-11-08 | Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua et.al. | 2411.05781v1 | null |
2024-11-08 | GazeSearch: Radiology Findings Search Benchmark | Trong Thang Pham et.al. | 2411.05780v1 | null |
2024-11-08 | Curriculum Learning for Few-Shot Domain Adaptation in CT-based Airway Tree Segmentation | Maxime Jacovella et.al. | 2411.05779v1 | null |
2024-11-08 | LLMs as Method Actors: A Model for Prompt Engineering and Architecture | Colin Doyle et.al. | 2411.05778v1 | null |
2024-11-08 | Quantitative Assessment of Intersectional Empathetic Bias and Understanding | Vojtech Formanek et.al. | 2411.05777v1 | null |
2024-11-07 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007v1 | link |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006v1 | null |
2024-11-07 | Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models | Shuhong Zheng et.al. | 2411.05005v1 | null |
2024-11-07 | Long-range entanglement from spontaneous non-onsite symmetry breaking | Zhehao Zhang et.al. | 2411.05004v1 | null |
2024-11-07 | ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning | David Junhao Zhang et.al. | 2411.05003v1 | null |
2024-11-07 | Extracting Axion String Network Parameters from Simulated CMB Birefringence Maps using Convolutional Neural Networks | Ray Hagimoto et.al. | 2411.05002v1 | null |
2024-11-07 | Analyzing The Language of Visual Tokens | David M. Chan et.al. | 2411.05001v1 | null |
2024-11-07 | Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? | Jonathan Roberts et.al. | 2411.05000v1 | null |
2024-11-07 | DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation | Peiqi Liu et.al. | 2411.04999v1 | null |
2024-11-07 | LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation | Weiquan Huang et.al. | 2411.04997v1 | link |
2024-11-06 | Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Jeongsoo Park et.al. | 2411.04125v1 | null |
2024-11-07 | The monoid representation of upho posets and total positivity | Ziyao Fu et.al. | 2411.04123v2 | null |
2024-11-06 | Second order cone relaxations for quantum Max Cut | Felix Huber et.al. | 2411.04120v1 | null |
2024-11-06 | Marcinkiewicz-Zygmund inequalities in quasi-Banach function spaces | Yurii Kolomoitsev et.al. | 2411.04119v1 | null |
2024-11-06 | Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P. Jeong et.al. | 2411.04118v1 | null |
2024-11-06 | Poisson genericity in numeration systems with exponentially mixing probabilities | Nicolás Álvarez et.al. | 2411.04116v1 | null |
2024-11-06 | Condensing Against Online Adversaries | Eshan Chattopadhyay et.al. | 2411.04115v1 | null |
2024-11-06 | Age of Gossip With Time-Varying Topologies | Arunabh Srivastava et.al. | 2411.04114v1 | null |
2024-11-06 | Fed-EC: Bandwidth-Efficient Clustering-Based Federated Learning For Autonomous Visual Robot Navigation | Shreya Gummadi et.al. | 2411.04112v1 | null |
2024-11-06 | Self-Consistency Preference Optimization | Archiki Prasad et.al. | 2411.04109v1 | null |
2024-11-05 | MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning | Ziliang Gan et.al. | 2411.03314v1 | null |
2024-11-06 | Classification Done Right for Vision-Language Pre-Training | Zilong Huang et.al. | 2411.03313v2 | link |
2024-11-05 | Inference Optimal VLMs Need Only One Visual Token but Larger Models | Kevin Y. Li et.al. | 2411.03312v1 | link |
2024-11-05 | Minkowski ideals and rings | Geir Agnarsson et.al. | 2411.03310v1 | null |
2024-11-05 | LLMs for Domain Generation Algorithm Detection | Reynier Leyva La O et.al. | 2411.03307v1 | null |
2024-11-05 | Spontaneous Flows and Quantum Analogies in Heterogeneous Active Nematic Films | Alexander J. H. Houston et.al. | 2411.03306v1 | null |
2024-11-05 | Quantum One-Time Protection of any Randomized Algorithm | Sam Gunn et.al. | 2411.03305v1 | null |
2024-11-05 | Bayesian Controlled FDR Variable Selection via Knockoffs | Lorenzo Focardi-Olmi et.al. | 2411.03304v1 | null |
2024-11-05 | Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor | Anish Bhattacharya et.al. | 2411.03303v1 | null |
2024-11-05 | Low-Overhead Entangling Gates from Generalised Dehn Twists | Ryan Tiew et.al. | 2411.03302v1 | null |
2024-11-04 | Adaptive Caching for Faster Video Generation with Diffusion Transformers | Kumara Kahatapitiya et.al. | 2411.02397v1 | null |
2024-11-04 | Fusion of Tree-induced Regressions for Clinico-genomic Data | Jeroen M. Goedhart et.al. | 2411.02396v1 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395v1 | link |
2024-11-04 | AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Hao-Yu Hsu et.al. | 2411.02394v1 | null |
2024-11-04 | Adaptive Length Image Tokenization via Recurrent Allocation | Shivam Duggal et.al. | 2411.02393v1 | link |
2024-11-04 | Attacking Vision-Language Computer Agents via Pop-ups | Yanzhe Zhang et.al. | 2411.02391v1 | null |
2024-11-04 | Multidimensional coherent spectroscopy of correlated lattice systems | Jiyu Chen et.al. | 2411.02389v1 | null |
2024-11-04 | Reachability in One-Dimensional Pushdown Vector Addition Systems is Decidable | Clotilde Bizière et.al. | 2411.02386v1 | null |
2024-11-04 | How Far is Video Generation from World Model: A Physical Law Perspective | Bingyi Kang et.al. | 2411.02385v1 | null |
2024-11-04 | LDPC stabilizer codes as gapped quantum phases: stability under graph-local perturbations | Wojciech De Roeck et.al. | 2411.02384v1 | null |
2024-10-31 | Error Threshold of SYK Codes from Strong-to-Weak Parity Symmetry Breaking | Jaewon Kim et.al. | 2410.24225v1 | null |
2024-10-31 | What is the origin of the JWST SMBHs? | John Ellis et.al. | 2410.24224v1 | null |
2024-10-31 | URAvatar: Universal Relightable Gaussian Codec Avatars | Junxuan Li et.al. | 2410.24223v1 | null |
2024-10-31 | Robust Gaussian Processes via Relevance Pursuit | Sebastian Ament et.al. | 2410.24222v1 | null |
2024-10-31 | EgoMimic: Scaling Imitation Learning via Egocentric Video | Simar Kareer et.al. | 2410.24221v1 | link |
2024-10-31 | Bridging Geometric States via Geometric Diffusion Bridge | Shengjie Luo et.al. | 2410.24220v1 | null |
2024-10-31 | Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Penghui Ruan et.al. | 2410.24219v1 | link |
2024-10-31 | Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi et.al. | 2410.24218v1 | link |
2024-10-31 | Diagnosing electronic phases of matter using photonic correlation functions | Gautam Nambiar et.al. | 2410.24215v1 | null |
2024-10-31 | Learning Video Representations without Natural Videos | Xueyang Yu et.al. | 2410.24213v1 | null |
2024-10-30 | Computing the bridge length: the key ingredient in a continuous isometry classification of periodic point sets | Jonathan McManus et.al. | 2410.23288v1 | null |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287v1 | null |
2024-10-30 | Proof of nonintegrability of the spin-$1$ bilinear-biquadratic chain model | HaRu K. Park et.al. | 2410.23286v1 | null |
2024-10-30 | Provable acceleration for diffusion models under minimal assumptions | Gen Li et.al. | 2410.23285v1 | null |
2024-10-30 | Exact overlaps for "all" integrable matrix product states of rational spin chains | Tamas Gombor et.al. | 2410.23282v1 | null |
2024-10-30 | Slow Relaxation in a Glassy Quantum Circuit | Richard D. Barney et.al. | 2410.23281v1 | null |
2024-10-30 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280v1 | null |
2024-10-30 | A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization | Bin Wu et.al. | 2410.23279v1 | null |
2024-10-30 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277v1 | null |
2024-10-30 | Conditional Forecasting of Margin Calls using Dynamic Graph Neural Networks | Matteo Citterio et.al. | 2410.23275v1 | null |
2024-10-29 | Absolute Dimensions of the Interferometric Binary HD 174881: A Test of Stellar Evolution Models for Evolved Stars | Guillermo Torres et.al. | 2410.22334v1 | null |
2024-10-29 | Hypothesis tests and model parameter estimation on data sets with missing correlation information | Lukas Koch et.al. | 2410.22333v1 | null |
2024-10-29 | Local Policies Enable Zero-shot Long-horizon Manipulation | Murtaza Dalal et.al. | 2410.22332v1 | null |
2024-10-29 | Task Vectors are Cross-Modal | Grace Luo et.al. | 2410.22330v1 | null |
2024-10-29 | Wavelength modulation laser spectroscopy of N$_2$O at 17 $\mu$m | Y. Wang et.al. | 2410.22328v1 | null |
2024-10-29 | Observation of a Bilayer Superfluid with Interlayer Coherence | Erik Rydow et.al. | 2410.22326v1 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323v1 | null |
2024-10-29 | Superintegrability in the interaction of two particles with spin | O. Ogulcan Tuncer et.al. | 2410.22321v1 | null |
2024-10-30 | Nanoscale Connectomics Annotation Standards Framework | Nicole K. Guittari et.al. | 2410.22320v2 | null |
2024-10-29 | A wiggling filamentary jet at the origin of the blazar multi-wavelength behaviour | C. M. Raiteri et.al. | 2410.22319v1 | null |
2024-10-28 | Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context | Manuel Benavent-Lledo et.al. | 2410.21275v1 | link |
2024-10-28 | On Inductive Biases That Enable Generalization of Diffusion Transformers | Jie An et.al. | 2410.21273v1 | null |
2024-10-28 | Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Yaniv Nikankin et.al. | 2410.21272v1 | null |
2024-10-28 | EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | Shih-Yang Liu et.al. | 2410.21271v1 | null |
2024-10-28 | Strategic Electric Distribution Network Sensing via Spectral Bandits | Samuel Talkington et.al. | 2410.21270v1 | null |
2024-10-28 | Pseudochaotic Many-Body Dynamics as a Pseudorandom State Generator | Wonjun Lee et.al. | 2410.21268v1 | null |
2024-10-28 | Modular Duality in Deep Learning | Jeremy Bernstein et.al. | 2410.21265v1 | null |
2024-10-28 | LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior | Hanyu Wang et.al. | 2410.21264v1 | null |
2024-10-28 | Adaptive Transfer Clustering: A Unified Framework | Yuqi Gu et.al. | 2410.21263v1 | null |
2024-10-28 | BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference | Changwoo Lee et.al. | 2410.21262v1 | link |
2024-10-25 | Model merging with SVD to tie the Knots | George Stoica et.al. | 2410.19735v1 | link |
2024-10-25 | The Influence of Lepton Portal on the WIMP-pFIMP framework | Jayita Lahiri et.al. | 2410.19734v1 | null |
2024-10-25 | The Potential and Value of AI Chatbot in Personalized Cognitive Training | Zilong Wang et.al. | 2410.19733v1 | null |
2024-10-25 | Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models | Yucheng Zhou et.al. | 2410.19732v1 | null |
2024-10-25 | Counting Ability of Large Language Models and Impact of Tokenization | Xiang Zhang et.al. | 2410.19730v1 | null |
2024-10-25 | cymyc -- Calabi-Yau Metrics, Yukawas, and Curvature | Per Berglund et.al. | 2410.19728v1 | null |
2024-10-25 | FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Nicole Cho et.al. | 2410.19727v1 | null |
2024-10-25 | Boundary choices and one-loop Complex Universe | Manishankar Ailiga et.al. | 2410.19724v1 | null |
2024-10-25 | Sparse Decomposition of Graph Neural Networks | Yaochen Hu et.al. | 2410.19723v1 | null |
2024-10-25 | Temporal Convolution-based Hybrid Model Approach with Representation Learning for Real-Time Acoustic Anomaly Detection | Sahan Dissanayaka et.al. | 2410.19722v1 | null |
2024-10-24 | Very massive stars and Nitrogen-emitting galaxies | Jorick S. Vink et.al. | 2410.18980v1 | null |
2024-10-24 | PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views | Xin Fei et.al. | 2410.18979v1 | link |
2024-10-24 | Framer: Interactive Frame Interpolation | Wen Wang et.al. | 2410.18978v1 | null |
2024-10-24 | MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms | Ling-Hao Chen et.al. | 2410.18977v1 | null |
2024-10-24 | CAMEL-Bench: A Comprehensive Arabic LMM Benchmark | Sara Ghaboura et.al. | 2410.18976v1 | link |
2024-10-24 | Unbounded: A Generative Infinite Game of Character Life Simulation | Jialu Li et.al. | 2410.18975v1 | null |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974v1 | link |
2024-10-24 | Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques | David Ortiz-Perez et.al. | 2410.18972v1 | null |
2024-10-24 | Detection of Undeclared EV Charging Events in a Green Energy Certification Scheme | Luca Domenico Loiacono et.al. | 2410.18971v1 | null |
2024-10-24 | ConceptDrift: Uncovering Biases through the Lens of Foundational Models | Cristian Daniel Păduraru et.al. | 2410.18970v1 | null |
2024-10-23 | DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes | Hengwei Bian et.al. | 2410.18084v1 | null |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083v1 | null |
2024-10-23 | Prioritized Generative Replay | Renhao Wang et.al. | 2410.18082v1 | null |
2024-10-23 | Gauge-invariant perturbations of relativistic non-perfect fluids in spherical spacetime | David Díaz-Guerra et.al. | 2410.18081v1 | null |
2024-10-23 | The physical properties of Cluster Chains | Laura Posch et.al. | 2410.18080v1 | null |
2024-10-23 | FreeVS: Generative View Synthesis on Free Driving Trajectory | Qitai Wang et.al. | 2410.18079v1 | null |
2024-10-23 | ALTA: Compiler-Based Analysis of Transformers | Peter Shaw et.al. | 2410.18077v1 | link |
2024-10-23 | Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration | Max Wilcoxson et.al. | 2410.18076v1 | null |
2024-10-23 | ProFL: Performative Robust Optimal Federated Learning | Xue Zheng et.al. | 2410.18075v1 | null |
2024-10-23 | UnCLe: Unsupervised Continual Learning of Depth Completion | Suchisrit Gangopadhyay et.al. | 2410.18074v1 | null |
2024-10-22 | Cosmic Ray Mediated Thermal Fronts in the Warm-Hot Circumgalactic Medium | Hanjue Zhu et.al. | 2410.17252v1 | null |
2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251v1 | null |
2024-10-22 | JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation | Shota Onohara et.al. | 2410.17250v1 | null |
2024-10-22 | SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes | Cheng-De Fan et.al. | 2410.17249v1 | null |
2024-10-22 | HyperspectralViTs: Fast and Accurate methane detection on-board satellites | Vít Růžička et.al. | 2410.17248v1 | null |
2024-10-22 | PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction | Long Xing et.al. | 2410.17247v1 | link |
2024-10-22 | Learning Precise, Contact-Rich Manipulation through Uncalibrated Tactile Skins | Venkatesh Pattabiraman et.al. | 2410.17246v1 | null |
2024-10-22 | Towards Reliable Evaluation of Behavior Steering Interventions in LLMs | Itamar Pres et.al. | 2410.17245v1 | null |
2024-10-22 | Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss | Zesen Cheng et.al. | 2410.17243v1 | link |
2024-10-22 | LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Haian Jin et.al. | 2410.17242v1 | null |
2024-10-21 | Multiparticle scalar dark matter with | Subhaditya Bhattacharya et.al. | 2410.16275v1 | null |
2024-10-21 | Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations | Giorgio Mentasti et.al. | 2410.16274v1 | null |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272v1 | null |
2024-10-21 | Reflection-Bench: probing AI intelligence with reflection | Lingyu Li et.al. | 2410.16270v1 | link |
2024-10-21 | SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree | Shuangrui Ding et.al. | 2410.16268v1 | link |
2024-10-21 | xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs | Michael S. Ryoo et.al. | 2410.16267v1 | null |
2024-10-21 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Xi Liu et.al. | 2410.16266v1 | null |
2024-10-21 | Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation | Haomu Yuan et.al. | 2410.16265v1 | null |
2024-10-21 | Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution | Farid Thaalba et.al. | 2410.16264v1 | null |
2024-10-21 | Surface acoustic waves Brillouin photonics on a silicon nitride chip | Yvan Klaver et.al. | 2410.16263v1 | null |
2024-10-18 | Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts | German Gritsai et.al. | 2410.14677v1 | null |
2024-10-18 | SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment | Qin Liu et.al. | 2410.14676v1 | null |
2024-10-18 | Enhancing Large Language Models' Situated Faithfulness to External Contexts | Yukun Huang et.al. | 2410.14675v1 | link |
2024-10-18 | Effects of waveform systematics on inferences of neutron star population properties and the nuclear equation of state | Anjali B. Yelikar et.al. | 2410.14674v1 | null |
2024-10-18 | Self-supervised contrastive learning performs non-linear system identification | Rodrigo González Laiz et.al. | 2410.14673v1 | link |
2024-10-18 | BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Shaozhe Hao et.al. | 2410.14672v1 | link |
2024-10-18 | Rapid Dust Formation in the Early Universe | Danial Langeroodi et.al. | 2410.14671v1 | null |
2024-10-18 | Decomposing The Dark Matter of Sparse Autoencoders | Joshua Engels et.al. | 2410.14670v1 | link |
2024-10-18 | NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Baiqi Li et.al. | 2410.14669v1 | null |
2024-10-18 | MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps | Xiongtao Zhou et.al. | 2410.14668v1 | link |
2024-10-17 | UniDrive: Towards Universal Driving Perception Across Camera Configurations | Ye Li et.al. | 2410.13864v1 | link |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863v1 | null |
2024-10-17 | DepthSplat: Connecting Gaussian Splatting and Depth | Haofei Xu et.al. | 2410.13862v1 | link |
2024-10-17 | PUMA: Empowering Unified MLLM with Multi-granular Visual Generation | Rongyao Fang et.al. | 2410.13861v1 | link |
2024-10-17 | VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Runsen Xu et.al. | 2410.13860v1 | link |
2024-10-17 | $γ-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models | Yaxin Luo et.al. | 2410.13859v1 | null |
2024-10-17 | Monte Carlo Study of Critical Fermi Surface with Spatial Disorder Interactions | Tu Hong et.al. | 2410.13858v1 | null |
2024-10-17 | How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs | Guhao Feng et.al. | 2410.13857v1 | null |
2024-10-17 | A Fourier analysis framework for approximate classical simulations of quantum circuits | Cristina Cirstoiu et.al. | 2410.13856v1 | null |
2024-10-17 | Diffusing States and Matching Scores: A New Framework for Imitation Learning | Runzhe Wu et.al. | 2410.13855v1 | link |
2024-10-16 | Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media | Ross Deans Kristensen-McLachlan et.al. | 2410.12791v1 | null |
2024-10-16 | Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models | Ce Zhang et.al. | 2410.12790v1 | null |
2024-10-16 | Altermagnetic Instabilities from Quantum Geometry | Niclas Heinsdorf et.al. | 2410.12789v1 | null |
2024-10-16 | Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception | Jihao Zhao et.al. | 2410.12788v1 | null |
2024-10-16 | The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio | Sicong Leng et.al. | 2410.12787v1 | null |
2024-10-16 | Metal Price Spike Prediction via a Neurosymbolic Ensemble Approach | Nathaniel Lee et.al. | 2410.12785v1 | null |
2024-10-16 | JudgeBench: A Benchmark for Evaluating LLM-based Judges | Sijun Tan et.al. | 2410.12784v1 | null |
2024-10-16 | Context-Scaling versus Task-Scaling in In-Context Learning | Amirhesam Abedsoltan et.al. | 2410.12783v1 | null |
2024-10-16 | In-Context Learning Enables Robot Action Prediction in LLMs | Yida Yin et.al. | 2410.12782v1 | null |
2024-10-16 | Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats | Chen Ziwen et.al. | 2410.12781v1 | null |
2024-10-15 | MoH: Multi-Head Attention as Mixture-of-Head Attention | Peng Jin et.al. | 2410.11842v1 | link |
2024-10-15 | GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Fei Tang et.al. | 2410.11841v1 | null |
2024-10-15 | A Hitchhiker's Guide to Scaling Law Estimation | Leshem Choshen et.al. | 2410.11840v1 | null |
2024-10-15 | Molecular Quantum Control Algorithm Design by Reinforcement Learning | Anastasia Pipi et.al. | 2410.11839v1 | null |
2024-10-15 | High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion | Junhwa Hur et.al. | 2410.11838v1 | null |
2024-10-15 | Minimal models for minimal BCOV theories | Surya Raghavendran et.al. | 2410.11837v1 | null |
2024-10-15 | On the Effectiveness of Dataset Alignment for Fake Image Detection | Anirudh Sundara Rajan et.al. | 2410.11835v1 | null |
2024-10-15 | Contrastive Touch-to-Touch Pretraining | Samanta Rodriguez et.al. | 2410.11834v1 | null |
2024-10-15 | CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Nikita Karaev et.al. | 2410.11831v1 | null |
2024-10-15 | Compact object populations over cosmic time I. BOSSA: a Binary Object environment-Sensitive Sampling Algorithm | L. M. de Sá et.al. | 2410.11830v1 | null |
2024-10-14 | Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Jingzhi Bao et.al. | 2410.10821v1 | null |
2024-10-14 | High-resolution transmission spectroscopy of the hot-Saturn HD 149026b | Federico Biassoni et.al. | 2410.10820v1 | null |
2024-10-14 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Guangxuan Xiao et.al. | 2410.10819v1 | link |
2024-10-15 | TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Mu Cai et.al. | 2410.10818v2 | null |
2024-10-14 | When Does Perceptual Alignment Benefit Vision Representations? | Shobhita Sundaram et.al. | 2410.10817v1 | null |
2024-10-14 | LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tianwei Xiong et.al. | 2410.10816v1 | link |
2024-10-14 | Depth Any Video with Scalable Synthetic Data | Honghui Yang et.al. | 2410.10815v1 | null |
2024-10-14 | Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Ziyue Li et.al. | 2410.10814v1 | null |
2024-10-14 | LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Di Wu et.al. | 2410.10813v1 | link |
2024-10-14 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Haotian Tang et.al. | 2410.10812v1 | link |
2024-10-11 | Horizon causality from holographic scattering in asymptotically dS$_3$ | Victor Franken et.al. | 2410.09050v1 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049v1 | link |
2024-10-11 | Towards Trustworthy LLMs for Code: A Data-Centric Synergistic Auditing Framework | Chong Wang et.al. | 2410.09048v1 | null |
2024-10-11 | Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models | Qin Liu et.al. | 2410.09047v1 | null |
2024-10-11 | Linear Convergence of Diffusion Models Under the Manifold Hypothesis | Peter Potaptchik et.al. | 2410.09046v1 | null |
2024-10-11 | MiRAGeNews: Multimodal Realistic AI-Generated News Detection | Runsheng Huang et.al. | 2410.09045v1 | null |
2024-10-11 | Systematic construction of stabilizer codes via gauging abelian boundary symmetries | Bram Vancraeynest-De Cuiper et.al. | 2410.09044v1 | null |
2024-10-11 | Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI | Muhammet Anil Yagiz et.al. | 2410.09043v1 | null |
2024-10-11 | Hidden under a warm blanket: If planets existed in protostellar disks, they would hardly produce observable substructures | P. Nazari et.al. | 2410.09042v1 | null |
2024-10-11 | Inferring birth versus death dynamics for ecological interactions in stochastic heterogeneous populations | Erin Beckman et.al. | 2410.09041v1 | null |
2024-10-10 | LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts | Anh-Quan Cao et.al. | 2410.08211v1 | null |
2024-10-10 | PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Botao Ren et.al. | 2410.08210v1 | null |
2024-10-10 | Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision | Shengcao Cao et.al. | 2410.08209v1 | null |
2024-10-11 | SPA: 3D Spatial-Awareness Enables Effective Embodied Representation | Haoyi Zhu et.al. | 2410.08208v2 | link |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207v1 | null |
2024-10-10 | Interactive4D: Interactive 4D LiDAR Segmentation | Ilya Fradlin et.al. | 2410.08206v1 | null |
2024-10-10 | Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training | Gen Luo et.al. | 2410.08202v1 | null |
2024-10-10 | Efficient Dictionary Learning with Switch Sparse Autoencoders | Anish Mudide et.al. | 2410.08201v1 | link |
2024-10-10 | Adam Exploits | Shuo Xie et.al. | 2410.08198v1 | null |
2024-10-10 | From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Changle Qu et.al. | 2410.08197v1 | link |
2024-10-09 | MM-Ego: Towards Building Egocentric Multimodal LLMs | Hanrong Ye et.al. | 2410.07177v1 | null |
2024-10-09 | Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models | Fei Wang et.al. | 2410.07176v1 | null |
2024-10-09 | Simulating realistic self-interacting dark matter models including small and large-angle scattering | Cenanda Arido et.al. | 2410.07175v1 | null |
2024-10-09 | Neural Circuit Architectural Priors for Quadruped Locomotion | Nikhil X. Bhattasali et.al. | 2410.07174v1 | null |
2024-10-09 | Do better language models have crisper vision? | Jona Ruthardt et.al. | 2410.07173v1 | null |
2024-10-09 | Glider: Global and Local Instruction-Driven Expert Router | Pingzhi Li et.al. | 2410.07172v1 | null |
2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171v1 | link |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170v1 | null |
2024-10-09 | Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Cheol Jun Cho et.al. | 2410.07168v1 | null |
2024-10-09 | Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate | Qidong Huang et.al. | 2410.07167v1 | link |
2024-10-07 | Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia | Mohammad Fahes et.al. | 2410.05270v1 | link |
2024-10-07 | Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models | Fei Wang et.al. | 2410.05269v1 | null |
2024-10-07 | Anomalous continuous symmetries and quantum topology of Goldstone modes | Naren Manjunath et.al. | 2410.05268v1 | null |
2024-10-07 | Grounding Partially-Defined Events in Multimodal Data | Kate Sanders et.al. | 2410.05267v1 | null |
2024-10-07 | Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers | Andrew F. Luo et.al. | 2410.05266v1 | null |
2024-10-07 | PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs | Mengzhao Chen et.al. | 2410.05265v1 | link |
2024-10-07 | Generalization of Modular Spread Complexity for Non-Hermitian Density Matrices | Aneek Jana et.al. | 2410.05264v1 | null |
2024-10-07 | Regression Conformal Prediction under Bias | Matt Y. Cheung et.al. | 2410.05263v1 | link |
2024-10-07 | TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles | Qingchen Yu et.al. | 2410.05262v1 | link |
2024-10-07 | TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens | Ya-Qi Yu et.al. | 2410.05261v1 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665v1 | null |
2024-10-04 | Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models | Zhuochun Li et.al. | 2410.03663v1 | null |
2024-10-04 | System 2 reasoning capabilities are nigh | Scott C. Lowe et.al. | 2410.03662v1 | null |
2024-10-04 | Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models | Tinghui Zhu et.al. | 2410.03659v1 | null |
2024-10-04 | RAFT: Realistic Attacks to Fool Text Detectors | James Wang et.al. | 2410.03658v1 | null |
2024-10-04 | A low-dimensional model for adaptive networks of spiking neurons | Bastian Pietras et.al. | 2410.03657v1 | null |
2024-10-04 | Fault tolerance of metric basis can be expensive | Martin Knor et.al. | 2410.03656v1 | null |
2024-10-04 | Geometric Representation Condition Improves Equivariant Molecule Generation | Zian Li et.al. | 2410.03655v1 | null |
2024-10-04 | Learning Humanoid Locomotion over Challenging Terrain | Ilija Radosavovic et.al. | 2410.03654v1 | null |
2024-10-04 | On the distribution of the error terms in the divisor and circle problems | Youness Lamzouri et.al. | 2410.03652v1 | null |
2024-10-03 | Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos | Jianrui Zhang et.al. | 2410.02763v1 | null |
2024-10-03 | Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | Nick Jiang et.al. | 2410.02762v1 | link |
2024-10-03 | FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models | Zhipei Xu et.al. | 2410.02761v1 | null |
2024-10-03 | Erasing Conceptual Knowledge from Language Models | Rohit Gandikota et.al. | 2410.02760v1 | link |
2024-10-03 | Forecasting Smog Clouds With Deep Learning | Valentijn Oldenburg et.al. | 2410.02759v1 | null |
2024-10-03 | Pseudoentanglement from tensor networks | Zihan Cheng et.al. | 2410.02758v1 | null |
2024-10-03 | Loong: Generating Minute-level Long Videos with Autoregressive Language Models | Yuqing Wang et.al. | 2410.02757v1 | null |
2024-10-03 | CorPipe at CRAC 2024: Predicting Zero Mentions from Raw Text | Milan Straka et.al. | 2410.02756v1 | null |
2024-10-03 | SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost | Jifan Zhang et.al. | 2410.02755v1 | null |
2024-10-03 | Finite-element methods for noncollinear magnetism and spin-orbit coupling in real-space pseudopotential density functional theory | Nikhil Kodali et.al. | 2410.02754v1 | null |
2024-10-02 | A Catalog of Pulsar X-ray Filaments | Jack T. Dinsmore et.al. | 2410.01807v1 | null |
2024-10-02 | Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking | Mattia Segu et.al. | 2410.01806v1 | null |
2024-10-02 | Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads | Yuxiang Huang et.al. | 2410.01805v1 | link |
2024-10-02 | On the expressiveness and spectral bias of KANs | Yixuan Wang et.al. | 2410.01803v1 | null |
2024-10-02 | PROXI: Challenging the GNNs for Link Prediction | Astrit Tola et.al. | 2410.01802v1 | link |
2024-10-02 | FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images | Cheng Zhang et.al. | 2410.01801v1 | null |
2024-10-02 | The Newtonian limit of orthonormal frames in metric theories of gravity | Philip K. Schwartz et.al. | 2410.01800v1 | null |
2024-10-02 | Efficient | Alex W. Neal Riasanovsky et.al. | 2410.01799v1 | null |
2024-10-02 | Statistical mechanics of the flexural Ising model in | Abigail Plummer et.al. | 2410.01797v1 | null |
2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796v1 | null |
2024-09-30 | Continuously Improving Mobile Manipulation with Autonomous Real-World RL | Russell Mendonca et.al. | 2409.20568v1 | null |
2024-09-30 | Doping a fractional quantum anomalous Hall insulator | Zhengyan Darius Shi et.al. | 2409.20567v1 | null |
2024-09-30 | MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning | Haotian Zhang et.al. | 2409.20566v1 | null |
2024-09-30 | Ranking Over Scoring: Towards Reliable and Robust Automated Evaluation of LLM-Generated Medical Explanatory Arguments | Iker De la Iglesia et.al. | 2409.20565v1 | null |
2024-09-30 | DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Jeff Tan et.al. | 2409.20563v1 | null |
2024-09-30 | SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes | Tianchang Shen et.al. | 2409.20562v1 | null |
2024-09-30 | Covariant Quantum Error-Correcting Codes with Metrological Entanglement Advantage | Cheng-Ju Lin et.al. | 2409.20561v1 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560v1 | null |
2024-09-30 | Supervised Multi-Modal Fission Learning | Lingchao Mao et.al. | 2409.20559v1 | null |
2024-09-30 | Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection | Yubin Wang et.al. | 2409.20558v1 | null |
2024-09-27 | PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Shaowei Liu et.al. | 2409.18964v1 | link |
2024-09-27 | Exploring Token Pruning in Vision State Space Models | Zheng Zhan et.al. | 2409.18962v1 | null |
2024-09-27 | ProMerge: Prompt and Merge for Unsupervised Instance Segmentation | Dylan Li et.al. | 2409.18961v1 | null |
2024-09-27 | | Gen Li et.al. | 2409.18959v1 | null |
2024-09-27 | Generalized HMC using Nambu mechanics for lattice QCD | Erik Lundstrum et.al. | 2409.18958v1 | null |
2024-09-27 | LML: Language Model Learning a Dataset for Data-Augmented Prediction | Praneeth Vadlapati et.al. | 2409.18957v1 | link |
2024-09-27 | Tree height and the asymptotic mean of the Colijn-Plazzotta rank of unlabeled binary rooted trees | Luc Devroye et.al. | 2409.18956v1 | null |
2024-09-27 | Radiative cooling induced coherent maser emission in relativistic plasmas | Pablo J. Bilbao et.al. | 2409.18955v1 | null |
2024-09-27 | RepairBench: Leaderboard of Frontier Models for Program Repair | André Silva et.al. | 2409.18952v1 | null |
2024-09-27 | Spectral Wavelet Dropout: Regularization in the Wavelet Domain | Rinor Cakaj et.al. | 2409.18951v1 | null |
2024-09-26 | Two-dopant origin of competing stripe and pair formation in Hubbard and | Tizian Blatz et.al. | 2409.18131v1 | null |
2024-09-26 | Bridging 4D QFTs and 2D VOAs via 3D high-temperature EFTs | Arash Arabi Ardehali et.al. | 2409.18130v1 | null |
2024-09-26 | TOI-5005 b: A super-Neptune in the savanna near the ridge | A. Castro-González et.al. | 2409.18129v1 | null |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128v1 | link |
2024-09-26 | EgoLM: Multi-Modal Language Model of Egocentric Motions | Fangzhou Hong et.al. | 2409.18127v1 | null |
2024-09-26 | LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness | Chenming Zhu et.al. | 2409.18125v1 | null |
2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124v1 | null |
2024-09-26 | Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction | Justin Kerr et.al. | 2409.18121v1 | null |
2024-09-26 | EvMAPPER: High Altitude Orthomapping with Event Cameras | Fernando Cladera et.al. | 2409.18120v1 | null |
2024-09-26 | Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography | Yuexi Du et.al. | 2409.18119v1 | null |
2024-09-25 | Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models | Matt Deitke et.al. | 2409.17146v1 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145v1 | null |
2024-09-25 | Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization | Francisco Aguilera-Martínez et.al. | 2409.17144v1 | null |
2024-09-25 | Attention Prompting on Image for Large Vision-Language Models | Runpeng Yu et.al. | 2409.17143v1 | link |
2024-09-25 | Visualizing Dynamics of Charges and Strings in (2+1)D Lattice Gauge Theories | Tyler A. Cochran et.al. | 2409.17142v1 | null |
2024-09-25 | FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression | Fazal Mittu et.al. | 2409.17141v1 | link |
2024-09-25 | Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents | Junting Lu et.al. | 2409.17140v1 | null |
2024-09-25 | Learning with Dynamics: Autonomous Regulation of UAV Based Communication Networks with Dynamic UAV Crew | Ran Zhang et.al. | 2409.17139v1 | null |
2024-09-25 | Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action | Xin Chen et.al. | 2409.17138v1 | null |
2024-09-25 | PACE: marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Yao Ni et.al. | 2409.17137v1 | null |
2024-09-24 | Self-Supervised Any-Point Tracking by Contrastive Random Walks | Ayush Shrivastava et.al. | 2409.16288v1 | link |
2024-09-24 | Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking | Xi Wang et.al. | 2409.16287v1 | null |
2024-09-24 | Age of Gossip in Networks with Multiple Views of a Source | Kian J. Khojastepour et.al. | 2409.16285v1 | null |
2024-09-24 | Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation | Homanga Bharadhwaj et.al. | 2409.16283v1 | null |
2024-09-24 | An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement | Pin-Jui Ku et.al. | 2409.16282v1 | null |
2024-09-24 | Heavy $K^*$ mesons with open charm from $KD^{(*)}D^*$ interactions | Xiu-Lei Ren et.al. | 2409.16281v1 | null |
2024-09-24 | MonoFormer: One Transformer for Both Diffusion and Autoregression | Chuyang Zhao et.al. | 2409.16280v1 | null |
2024-09-24 | Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation | Yong Xien Chng et.al. | 2409.16278v1 | null |
2024-09-24 | Bayesian Variable Selection and Sparse Estimation for High-Dimensional Graphical Models | Anwesha Chakravarti et.al. | 2409.16276v1 | null |
2024-09-24 | Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph | Utkarsh A. Mishra et.al. | 2409.16275v1 | null |
2024-09-20 | Gender Representation and Bias in Indian Civil Service Mock Interviews | Somonnoy Banerjee et.al. | 2409.12194v3 | null |
2024-09-18 | Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen et.al. | 2409.12193v1 | link |
2024-09-18 | DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control | Zichen Jeff Cui et.al. | 2409.12192v1 | null |
2024-09-18 | Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution | Peng Wang et.al. | 2409.12191v1 | link |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189v1 | null |
2024-09-18 | SPECTER: An Instrument Concept for CMB Spectral Distortion Measurements with Enhanced Sensitivity | Alina Sabyr et.al. | 2409.12188v1 | null |
2024-09-18 | Exoplanet accretion monitoring spectroscopic survey (ENTROPY) I. Evidence for magnetospheric accretion in the young isolated planetary-mass object 2MASS J11151597+1937266 | Gayathri Viswanath et.al. | 2409.12187v1 | null |
2024-09-18 | Qwen2.5-Coder Technical Report | Binyuan Hui et.al. | 2409.12186v1 | link |
2024-09-18 | To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | Zayne Sprague et.al. | 2409.12183v1 | null |
2024-09-23 | A Controlled Study on Long Context Extension and Generalization in LLMs | Yi Lu et.al. | 2409.12181v2 | link |
2024-09-17 | Non-Universality from Conserved Superoperators in Unitary Circuits | Marco Lastres et.al. | 2409.11407v1 | null |
2024-09-17 | Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion | Zhenwei Wang et.al. | 2409.11406v1 | null |
2024-09-17 | Black-box Stealthy GPS Attacks on Unmanned Aerial Vehicles | Amir Khazraei et.al. | 2409.11405v1 | null |
2024-09-17 | AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs | Basel Mousi et.al. | 2409.11404v1 | null |
2024-09-17 | UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta et.al. | 2409.11403v1 | null |
2024-09-17 | NVLM: Open Frontier-Class Multimodal LLMs | Wenliang Dai et.al. | 2409.11402v1 | null |
2024-09-17 | Teaching dark matter simulations to speak the halo language | Shivam Pandey et.al. | 2409.11401v1 | null |
2024-09-17 | Systematic analysis of Parity-Violating modes | Hong-Ming Zhu et.al. | 2409.11400v1 | null |
2024-09-17 | | Ya Deng et.al. | 2409.11399v1 | null |
2024-09-17 | The dynamics of spherically symmetric black holes in scalar-Gauss-Bonnet gravity with a Ricci coupling | Farid Thaalba et.al. | 2409.11398v1 | null |
2024-09-16 | The VIRUS-dE Survey I: Stars in dwarf elliptical galaxies - 3D dynamics and radially resolved stellar initial mass functions | Mathias Lipka et.al. | 2409.10518v1 | null |
2024-09-16 | RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval | Di Liu et.al. | 2409.10516v1 | null |
2024-09-16 | An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Hitesh Tulsiani et.al. | 2409.10515v1 | null |
2024-09-16 | Constraints on axions from patchy screening of the cosmic microwave background | Samuel Goldstein et.al. | 2409.10514v1 | null |
2024-09-16 | KPZ equation from ASEP plus general speed-change drift | Kevin Yang et.al. | 2409.10513v1 | null |
2024-09-16 | Weak Superimposed Codes of Improved Asymptotic Rate and Their Randomized Construction | Yu Tsunoda et.al. | 2409.10511v1 | null |
2024-09-16 | General-relativistic resistive-magnetohydrodynamics simulations of self-consistent magnetized rotating neutron stars | Patrick Chi-Kit Cheong et.al. | 2409.10508v1 | null |
2024-09-16 | Beth-Uhlenbeck equation for the thermodynamics of fluctuations in a generalised 2+1D Gross-Neveu model | Biplab Mahato et.al. | 2409.10507v1 | link |
2024-09-16 | Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models | Momoko Shiraishi et.al. | 2409.10506v1 | null |
2024-09-16 | Machine Learning Optimization of non-Kasha Behavior and of Transient Dynamics in Model Retinal Isomerization | Davinder Singh et.al. | 2409.10505v1 | null |
2024-09-13 | Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging | Jaime Parra Raad et.al. | 2409.09031v1 | null |
2024-09-13 | Agents in Software Engineering: Survey, Landscape, and Vision | Yanxian Huang et.al. | 2409.09030v1 | link |
2024-09-13 | Learning Theory Informed Priors for Bayesian Inference: A Case Study with Early Dark Energy | Michael W. Toomey et.al. | 2409.09029v1 | null |
2024-09-13 | Boson sampling with self-generation of squeezing via interaction of photons and atoms | Sergey V. Tarasov et.al. | 2409.09027v1 | null |
2024-09-13 | Towards Leveraging Contrastively Pretrained Neural Audio Embeddings for Recommender Tasks | Florian Grötschla et.al. | 2409.09026v1 | null |
2024-09-13 | Primordial Stochastic Gravitational Wave Backgrounds from a Sharp Feature in Three-field Inflation II: The Inflationary Era | Vikas Aragam et.al. | 2409.09023v1 | null |
2024-09-13 | New insights into the analytic structure of correlation functions via kinetic theory | Robbe Brants et.al. | 2409.09022v1 | null |
2024-09-13 | INN-PAR: Invertible Neural Network for PPG to ABP Reconstruction | Soumitra Kundu et.al. | 2409.09021v1 | null |
2024-09-13 | Nonequilibrium Phenomenology of Identified Particle Spectra in Heavy-Ion Collisions at LHC Energies | Oleksandr Vitiuk et.al. | 2409.09019v1 | null |
2024-09-13 | An Efficient and Streaming Audio Visual Active Speaker Detection System | Arnav Kundu et.al. | 2409.09018v1 | null |
2024-09-12 | Revisiting primordial neutrino asymmetries, spectral distortions and cosmological constraints with full neutrino transport | Yuan-Zhen Li et.al. | 2409.08280v1 | null |
2024-09-12 | The Role of the Curvaton Post-Planck | Gongjun Choi et.al. | 2409.08279v1 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278v1 | null |
2024-09-12 | AnySkin: Plug-and-play Skin Sensing for Robotic Touch | Raunaq Bhirangi et.al. | 2409.08276v1 | null |
2024-09-12 | Crown-Like Structures in Breast Adipose Tissue: Finding a 'Needle-in-a-Haystack' using Artificial Intelligence and Collaborative Active Learning on the Web | Praphulla MS Bhawsar et.al. | 2409.08275v1 | null |
2024-09-12 | Hand-Object Interaction Pretraining from Videos | Himanshu Gaurav Singh et.al. | 2409.08273v1 | null |
2024-09-12 | Click2Mask: Local Editing with Dynamic Mask Generation | Omer Regev et.al. | 2409.08272v1 | null |
2024-09-12 | DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer | Runjia Li et.al. | 2409.08271v1 | null |
2024-09-12 | Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation | Samanta Rodriguez et.al. | 2409.08269v1 | null |
2024-09-12 | Generalized Komar charges and Smarr formulas for black holes and boson stars | Romina Ballesteros et.al. | 2409.08268v1 | null |
2024-09-11 | Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs | Sadra Safadoust et.al. | 2409.07456v1 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454v1 | null |
2024-09-11 | "My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays | Shengxin Hong et.al. | 2409.07453v1 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452v1 | link |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451v1 | null |
2024-09-11 | VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos | Yan-Bo Lin et.al. | 2409.07450v1 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447v1 | null |
2024-09-11 | Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning | Zhi-Hong Qi et.al. | 2409.07446v1 | link |
2024-09-11 | Echoes of Privacy: Uncovering the Profiling Practices of Voice Assistants | Tina Khezresmaeilzadeh et.al. | 2409.07444v1 | null |
2024-09-11 | Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering | Dafei Qin et.al. | 2409.07441v1 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704v1 | link |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703v1 | null |
2024-09-10 | Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Kairui Ding et.al. | 2409.06702v1 | null |
2024-09-10 | Intravalley spin-polarized superconductivity in rhombohedral tetralayer graphene | Yang-Zhi Chou et.al. | 2409.06701v1 | null |
2024-09-10 | A study on Deep Convolutional Neural Networks, Transfer Learning and Ensemble Model for Breast Cancer Detection | Md Taimur Ahad et.al. | 2409.06699v1 | null |
2024-09-10 | The Operational Meaning of Total Energy of Isolated Systems in General Relativity | Abhay Ashtekar et.al. | 2409.06698v1 | null |
2024-09-10 | Slow Rotation for the Super-Puff Planet Kepler-51d | Caleb Lammers et.al. | 2409.06697v1 | null |
2024-09-10 | Cooptimizing Safety and Performance with a Control-Constrained Formulation | Hao Wang et.al. | 2409.06696v1 | null |
2024-09-10 | DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images | Taslim Murad et.al. | 2409.06694v1 | null |
2024-09-10 | Geometric-Averaged Preference Optimization for Soft Preference Labels | Hiroki Furuta et.al. | 2409.06691v1 | null |
2024-09-09 | Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering | Benjamin Attal et.al. | 2409.05867v1 | null |
2024-09-09 | Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments | Haritheja Etukuru et.al. | 2409.05865v1 | null |
2024-09-09 | Neural MP: A Generalist Neural Motion Planner | Murtaza Dalal et.al. | 2409.05864v1 | null |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863v1 | null |
2024-09-10 | Evaluating Multiview Object Consistency in Humans and Image Models | Tyler Bonnen et.al. | 2409.05862v2 | link |
2024-09-09 | Nonlinear Gravitational Radiation Reaction: Failed Tail, Memories & Squares | Rafael A. Porto et.al. | 2409.05860v1 | null |
2024-09-09 | Asymptotically conformal CFL quark matter within a nonlocal chiral quark model | Oleksii Ivanytskyi et.al. | 2409.05859v1 | null |
2024-09-09 | Largest eigenvalue of positive mean Gaussian matrices | Arijit Chakrabarty et.al. | 2409.05858v1 | null |
2024-09-09 | Nonabelian Anyon Condensation in 2+1d topological orders: A String-Net Model Realization | Yu Zhao et.al. | 2409.05852v1 | null |
2024-09-09 | QCD-sourced tachyonic phase transition in a supercooled Universe | Daniel Schmitt et.al. | 2409.05851v1 | null |
2024-09-06 | Ab initio quantum dynamics as a scalable solution to the exoplanet opacity challenge: A case study of CO$_2$ in hydrogen atmosphere | Laurent Wiesenfeld et.al. | 2409.04439v1 | null |
2024-09-06 | Memory burden effect mimics reheating signatures on SGWB from ultra-low mass PBH domination | Nilanjandev Bhaumik et.al. | 2409.04436v1 | null |
2024-09-06 | Holographic Air-quality Monitor (HAM) | Nicholas Bravo-Frank et.al. | 2409.04435v1 | null |
2024-09-06 | Accelerating Training with Neuron Interaction and Nowcasting Networks | Boris Knyazev et.al. | 2409.04434v1 | link |
2024-09-06 | Constrained local Hamiltonians: quantum generalizations of Vertex Cover | Ojas Parekh et.al. | 2409.04433v1 | null |
2024-09-06 | Theory, Analysis, and Best Practices for Sigmoid Self-Attention | Jason Ramapuram et.al. | 2409.04431v1 | null |
2024-09-06 | Highly efficient path-integral molecular dynamics simulations with GPUMD using neuroevolution potentials: Case studies on thermal properties of materials | Penghua Ying et.al. | 2409.04430v1 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429v1 | null |
2024-09-06 | Hybrid Spiking Neural Networks for Low-Power Intra-Cortical Brain-Machine Interfaces | Alexandru Vasilache et.al. | 2409.04428v1 | null |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424v1 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757v1 | link |
2024-09-05 | Spectra of adjacency and Laplacian matrices of Erdős-Rényi hypergraphs | Soumendu Sundar Mukherjee et.al. | 2409.03756v1 | null |
2024-09-05 | DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation | Wenliang Zhao et.al. | 2409.03755v1 | link |
2024-09-05 | Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution | Marga Don et.al. | 2409.03754v1 | link |
2024-09-05 | Attention Heads of Large Language Models: A Survey | Zifan Zheng et.al. | 2409.03752v1 | link |
2024-09-05 | Pion electroproduction measurements in the nucleon resonance region | R. Li et.al. | 2409.03750v1 | null |
2024-09-05 | A neural processing approach to quantum state discrimination | Saeed A. Khan et.al. | 2409.03748v1 | null |
2024-09-05 | Hybrid Oscillator-Qubit Quantum Processors: Simulating Fermions, Bosons, and Gauge Fields | Eleanor Crane et.al. | 2409.03747v1 | null |
2024-09-05 | Orbital Support and Evolution of CX/OX Structures in Boxy/Peanut Bars | Behzad Tahmasebzadeh et.al. | 2409.03746v1 | null |
2024-09-05 | ArtiFade: Learning to Generate High-quality Subject from Blemished Images | Shuya Yang et.al. | 2409.03745v1 | null |
2024-09-04 | Learning Density Functionals from Noisy Quantum Data | Emiel Koridon et.al. | 2409.02921v1 | null |
2024-09-04 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) | Yao Mu et.al. | 2409.02920v1 | null |
2024-09-04 | HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu et.al. | 2409.02919v1 | link |
2024-09-04 | SpecMon: Modular Black-Box Runtime Monitoring of Security Protocols | Kevin Morio et.al. | 2409.02918v1 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917v1 | link |
2024-09-04 | Latent Watermarking of Audio Generative Models | Robin San Roman et.al. | 2409.02915v1 | null |
2024-09-04 | Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu et.al. | 2409.02914v1 | null |
2024-09-04 | On Baire property of spaces of compact-valued measurable functions | Alexander V. Osipov et.al. | 2409.02913v1 | null |
2024-09-04 | Design of a Standard-Compliant Real-Time Neural Receiver for 5G NR | Reinhard Wiesmayr et.al. | 2409.02912v1 | null |
2024-09-04 | Bulk Spectra of Truncated Sample Covariance Matrices | Subhroshekhar Ghosh et.al. | 2409.02911v1 | null |
2024-08-30 | Undulators are ALP Factories | Wen Yin et.al. | 2408.17451v1 | null |
2024-08-30 | Signatures of topology in generic transport measurements for Rarita-Schwinger-Weyl semimetals | Ipsita Mandal et.al. | 2408.17447v1 | null |
2024-08-30 | The picasso gas model: Painting intracluster gas on gravity-only simulations | F. Kéruzoré et.al. | 2408.17445v1 | null |
2024-08-30 | Bridging Episodes and Semantics: A Novel Framework for Long-Form Video Understanding | Gueter Josmy Faure et.al. | 2408.17443v1 | link |
2024-08-30 | From free idempotent monoids to free multiplicatively idempotent rigs | Morgan Rogers et.al. | 2408.17440v1 | null |
2024-08-30 | Magnetising galaxies with cold inflows | Nicolas Ledos et.al. | 2408.17438v1 | null |
2024-08-30 | SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists | Raoyuan Zhao et.al. | 2408.17437v1 | link |
2024-08-30 | Imprinting New Physics by using Angular profiles of the FCNC process | Hira Waseem et.al. | 2408.17436v1 | null |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433v1 | null |
2024-08-30 | SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection | Ismail Rasim Ulgen et.al. | 2408.17432v1 | null |
2024-08-29 | 3D Whole-body Grasp Synthesis with Directional Controllability | Georgios Paschalidis et.al. | 2408.16770v1 | null |
2024-08-29 | PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning | Noor Hussein et.al. | 2408.16769v1 | link |
2024-08-29 | SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners | Ziyu Guo et.al. | 2408.16768v1 | link |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767v1 | null |
2024-08-29 | CSGO: Content-Style Composition in Text-to-Image Generation | Peng Xing et.al. | 2408.16766v1 | null |
2024-08-29 | A Score-Based Density Formula, with Applications in Diffusion Generative Models | Gen Li et.al. | 2408.16765v1 | null |
2024-08-29 | Finite Sample Valid Inference via Calibrated Bootstrap | Yiran Jiang et.al. | 2408.16763v1 | null |
2024-08-29 | UV-free Texture Generation with Denoising and Geodesic Heat Diffusions | Simone Foti et.al. | 2408.16762v1 | link |
2024-08-29 | Nonlocal Moments in the Chern Bands of Twisted Bilayer Graphene | Patrick J. Ledwith et.al. | 2408.16761v1 | null |
2024-08-29 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760v1 | null |
2024-08-28 | Q-MRS: A Deep Learning Framework for Quantitative Magnetic Resonance Spectra Analysis | Christopher J. Wu et.al. | 2408.15999v1 | null |
2024-08-28 | Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | Min Shi et.al. | 2408.15998v1 | link |
2024-08-28 | Mamba or Transformer for Time Series Forecasting? Mixture of Universals (MoU) Is All You Need | Sijia Peng et.al. | 2408.15997v1 | link |
2024-08-29 | Spatio-Temporal Context Prompting for Zero-Shot Action Detection | Wei-Jhe Huang et.al. | 2408.15996v2 | null |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995v1 | null |
2024-08-28 | Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration | Xu Zhang et.al. | 2408.15994v1 | null |
2024-08-28 | ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution | Sungduk Yu et.al. | 2408.15993v1 | null |
2024-08-28 | CoGen: Learning from Feedback with Coupled Comprehension and Generation | Mustafa Omer Gul et.al. | 2408.15992v1 | link |
2024-08-28 | Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation | Shengyuan Zhang et.al. | 2408.15991v1 | link |
2024-08-28 | A Control Theoretic Approach to Simultaneously Estimate Average Value of Time and Determine Dynamic Price for High-occupancy Toll Lanes | Xuting Wang et.al. | 2408.15990v1 | null |
2024-08-27 | Photometric Redshifts Probability Density Estimation from Recurrent Neural Networks in the DECam Local Volume Exploration Survey Data Release 2 | G. Teixeira et.al. | 2408.15243v1 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242v1 | link |
2024-08-27 | GenRec: Unifying Video Generation and Recognition with Diffusion Models | Zejia Weng et.al. | 2408.15241v1 | null |
2024-08-27 | Generative Verifiers: Reward Modeling as Next-Token Prediction | Lunjun Zhang et.al. | 2408.15240v1 | null |
2024-08-27 | Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation | Xiaojuan Wang et.al. | 2408.15239v1 | null |
2024-08-27 | Weak mixing and sparse equidistribution | Max Auer et.al. | 2408.15238v1 | null |
2024-08-27 | The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Junxiong Wang et.al. | 2408.15237v1 | link |
2024-08-27 | MADNESS Deblender: Maximum A posteriori with Deep NEural networks for Source Separation | Biswajit Biswas et.al. | 2408.15236v1 | null |
2024-08-27 | Chebotarov continua, Jenkins-Strebel differentials and related problems: a numerical approach | Marco Bertola et.al. | 2408.15234v1 | null |
2024-08-27 | Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang et.al. | 2408.15232v1 | null |
2024-08-26 | Phases and phase transitions in a dimerized spin-$\mathbf{\frac{1}{2}}$ XXZ chain | Harsh Nigam et.al. | 2408.14474v1 | null |
2024-08-26 | Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning | Xinyang Gu et.al. | 2408.14472v1 | null |
2024-08-26 | A Practitioner's Guide to Continual Multimodal Pretraining | Karsten Roth et.al. | 2408.14471v1 | link |
2024-08-27 | Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models | Aradhye Agarwal et.al. | 2408.14470v2 | link |
2024-08-26 | Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Qirui Chen et.al. | 2408.14469v1 | null |
2024-08-26 | K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences | Zhikai Li et.al. | 2408.14468v1 | null |
2024-08-26 | Explicit Inductive Inference using Large Language Models | Tianyang Liu et.al. | 2408.14467v1 | null |
2024-08-26 | Bayesian functional data analysis in astronomy | Thomas Loredo et.al. | 2408.14466v1 | null |
2024-08-26 | On the Effects of Modeling on the Sim-to-Real Transfer Gap in Twinning the POWDER Platform | Maxwell McManus et.al. | 2408.14465v1 | null |
2024-08-26 | Eclipse mapping study of the eclipsing binary KIC 3858884 with hybrid | A. Bókon et.al. | 2408.14464v1 | null |
2024-08-23 | MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? | Yi-Fan Zhang et.al. | 2408.13257v1 | null |
2024-08-23 | How Diffusion Models Learn to Factorize and Compose | Qiyao Liang et.al. | 2408.13256v1 | null |
2024-08-23 | Ensemble Modeling of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder | Marie Huynh et.al. | 2408.13255v1 | null |
2024-08-23 | Vertex correction to nuclear matrix elements of double-$β$ decays | Jun Terasaki et.al. | 2408.13254v1 | null |
2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D'Cruz et.al. | 2408.13253v1 | null |
2024-08-23 | LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation | Shuai Yang et.al. | 2408.13252v1 | null |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251v1 | null |
2024-08-23 | Isolation and characterization of atomically thin mica phyllosilicates | Kristine L. Haley et.al. | 2408.13249v1 | null |
2024-08-23 | Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption | Sakhinana Sagar Srinivas et.al. | 2408.13248v1 | null |
2024-08-23 | Properties and applications of the Bicomplex Miller-Ross function | Snehasis Bera et.al. | 2408.13246v1 | null |
2024-08-22 | DreamCinema: Cinematic Transfer with Free Camera and 3D Character | Weiliang Chen et.al. | 2408.12601v1 | null |
2024-08-22 | Controllable Text Generation for Large Language Models: A Survey | Xun Liang et.al. | 2408.12599v1 | link |
2024-08-22 | ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction | Ziyu Tang et.al. | 2408.12598v1 | null |
2024-08-22 | Quantum Sabotage Complexity | Arjan Cornelissen et.al. | 2408.12595v1 | null |
2024-08-23 | Non-Homophilic Graph Pre-Training and Prompt Learning | Xingtong Yu et.al. | 2408.12594v2 | null |
2024-08-22 | Automating Deformable Gasket Assembly | Simeon Adebola et.al. | 2408.12593v1 | null |
2024-08-22 | xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Can Qin et.al. | 2408.12590v1 | null |
2024-08-22 | Real-Time Video Generation with Pyramid Attention Broadcast | Xuanlei Zhao et.al. | 2408.12588v1 | link |
2024-08-22 | Reconstructing the Inflaton Potential: Primordial Black Holes and Gravitational Waves in Slow Roll and Ultra Slow Roll Single Field Inflation | Gabriele Autieri et.al. | 2408.12587v1 | null |
2024-08-22 | Diagnosing the pattern effect in the atmosphere-ocean coupled system through linear response theory | Fabrizio Falasca et.al. | 2408.12585v1 | null |
2024-08-21 | Extended quantum anomalous Hall effect in moiré structures: phase transitions and transport | Adarsh S. Patri et.al. | 2408.11818v1 | null |
2024-08-21 | GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models | Jonathan Roberts et.al. | 2408.11817v1 | null |
2024-08-21 | Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction | Anthony GX-Chen et.al. | 2408.11816v1 | null |
2024-08-21 | Great Memory, Shallow Reasoning: Limits of $k$NN-LMs | Shangyi Geng et.al. | 2408.11815v1 | link |
2024-08-21 | SynPlay: Importing Real-world Diversity for a Synthetic Human Dataset | Jinsub Yim et.al. | 2408.11814v1 | null |
2024-08-21 | SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs | Yuanyang Yin et.al. | 2408.11813v1 | null |
2024-08-21 | Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation | Ria Doshi et.al. | 2408.11812v1 | null |
2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811v1 | null |
2024-08-21 | Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models | Chun-Yen Shih et.al. | 2408.11810v1 | null |
2024-08-21 | Distance Correlation in Multiple Biased Sampling Models | Yuwei Ke et.al. | 2408.11808v1 | null |
2024-08-20 | Proper splittings and projectivity for good moduli spaces | Dori Bejleri et.al. | 2408.11057v1 | null |
2024-08-20 | Detection of the large-scale tidal field with galaxy multiplet alignment in the DESI Y1 spectroscopic survey | Claire Lamman et.al. | 2408.11056v1 | null |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055v1 | link |
2024-08-20 | NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency | Valentinos Pariza et.al. | 2408.11054v1 | null |
2024-08-20 | Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks | Nathaniel Pinckney et.al. | 2408.11053v1 | null |
2024-08-20 | FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Yunzhe Xu et.al. | 2408.11051v1 | link |
2024-08-21 | MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Jian Chen et.al. | 2408.11049v2 | null |
2024-08-20 | RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands | Yi Zhao et.al. | 2408.11048v1 | null |
2024-08-20 | Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders | Yuan Xin et.al. | 2408.11046v1 | null |
2024-08-20 | Representation Theory of Solitons | Clay Cordova et.al. | 2408.11045v1 | null |
2024-08-19 | The Resonant Remains of Broken Chains from Major and Minor Mergers | Rixin Li et.al. | 2408.10206v1 | null |
2024-08-19 | Criticality Leveraged Adversarial Training (CLAT) for Boosted Performance via Parameter Efficiency | Bhavna Gopal et.al. | 2408.10204v1 | null |
2024-08-19 | SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP | Yusuke Hirota et.al. | 2408.10202v1 | null |
2024-08-19 | MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Minghua Liu et.al. | 2408.10198v1 | null |
2024-08-19 | Demystifying the Communication Characteristics for Distributed Transformer Models | Quentin Anthony et.al. | 2408.10197v1 | null |
2024-08-19 | Some model theory of quadratic geometries | Charlotte Kestner et.al. | 2408.10196v1 | null |
2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195v1 | null |
2024-08-19 | Krylov Complexity as a Probe for Chaos | Mohsen Alishahiha et.al. | 2408.10194v1 | null |
2024-08-19 | Area under the ROC Curve has the Most Consistent Evaluation for Binary Classification | Jing Li et.al. | 2408.10193v1 | null |
2024-08-19 | Superconductivity and spin canting in spin-orbit proximitized rhombohedral trilayer graphene | Caitlin L. Patterson et.al. | 2408.10190v1 | null |
2024-08-16 | Accelerating Giant Impact Simulations with Machine Learning | Caleb Lammers et.al. | 2408.08873v1 | link |
2024-08-16 | xGen-MM (BLIP-3): A Family of Open Large Multimodal Models | Le Xue et.al. | 2408.08872v1 | null |
2024-08-16 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation | Xinyu Xiong et.al. | 2408.08870v1 | link |
2024-08-19 | PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars | Sumanth Prabhu et.al. | 2408.08869v2 | null |
2024-08-16 | A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs | H. Brendan McMahan et.al. | 2408.08868v1 | null |
2024-08-16 | Quantum Annealing for Enhanced Feature Selection in Single-Cell RNA Sequencing Data Analysis | Selim Romero et.al. | 2408.08867v1 | null |
2024-08-16 | High-Frequency Options Trading \| With Portfolio Optimization | Sid Bhatia et.al. | 2408.08866v1 | |
2024-08-16 | Aerodynamic equilibria and flight stability of plates at intermediate Reynolds numbers | Olivia Pomerenk et.al. | 2408.08864v1 | null |
2024-08-16 | Instability and Evolution of Shocked Clouds Formed by Orthogonal Collisions between Magnetized Filamentary Molecular Clouds | Raiga Kashiwagi et.al. | 2408.08863v1 | null |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862v1 | null |
2024-08-15 | Can Large Language Models Understand Symbolic Graphics Programs? | Zeju Qiu et.al. | 2408.08313v1 | null |
2024-08-15 | HyperTaxel: Hyper-Resolution for Taxel-Based Tactile Signals Through Contrastive Learning | Hongyu Li et.al. | 2408.08312v1 | null |
2024-08-15 | ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li et.al. | 2408.08310v1 | null |
2024-08-15 | Lumos Extrema | Upamanyu Moitra et.al. | 2408.08308v1 | null |
2024-08-15 | Understanding the Local Geometry of Generative Model Manifolds | Ahmed Imtiaz Humayun et.al. | 2408.08307v1 | null |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306v1 | null |
2024-08-15 | Towards Flexible Visual Relationship Segmentation | Fangrui Zhu et.al. | 2408.08305v1 | null |
2024-08-15 | Simple Macroeconomic Forecast Distributions for the G7 Economies | Friederike Becker et.al. | 2408.08304v1 | null |
2024-08-15 | DIISC-IV: DIISCovery of Anomalously Low Metallicity H II Regions in NGC 99: Indirect Evidence of Gas Inflows | Alejandro J. Olvera et.al. | 2408.08303v1 | null |
2024-08-15 | Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors | Usman Syed et.al. | 2408.08302v1 | null |
2024-08-14 | Knowledge Distillation with Refined Logits | Wujie Sun et.al. | 2408.07703v1 | link |
2024-08-14 | The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models | Karime Maamari et.al. | 2408.07702v1 | null |
2024-08-14 | Profile Likelihoods in Cosmology: When, Why and How illustrated with $Λ$CDM, Massive Neutrinos and Dark Energy | Laura Herold et.al. | 2408.07700v1 | null |
2024-08-14 | Field-level Emulation of Cosmic Structure Formation with Cosmology and Redshift Dependence | Drew Jamieson et.al. | 2408.07699v1 | null |
2024-08-14 | Quantifying over Optimum Answer Sets | Giuseppe Mazzotta et.al. | 2408.07697v1 | null |
2024-08-14 | Model-Based Control of Water Treatment with Pumped Water Storage | Ryan Mauery et.al. | 2408.07696v1 | null |
2024-08-14 | Generalized Quandle Polynomials and Their Applications to Stuquandles, Stuck Links, and RNA Folding | Ekaterina Bondarenko et.al. | 2408.07695v1 | null |
2024-08-14 | End-to-end Semantic-centric Video-based Multimodal Affective Computing | Ronghao Lin et.al. | 2408.07694v1 | null |
2024-08-14 | Detecting Near-Duplicate Face Images | Sudipta Banerjee et.al. | 2408.07689v1 | link |
2024-08-14 | Finite Dimensional Projections of HJB Equations in the Wasserstein Space | Andrzej Święch et.al. | 2408.07688v1 | null |
2024-08-13 | Approaches for enhancing extrapolability in process-based and data-driven models in hydrology | Haiyang Shi et.al. | 2408.07071v1 | null |
2024-08-13 | Large-kernel Convolutional Neural Networks for Wide Parameter-Space Searches of Continuous Gravitational Waves | Prasanna Mohan Joshi et.al. | 2408.07070v1 | null |
2024-08-13 | Atomic fluorescence collection into planar photonic devices | Orion Smedley et.al. | 2408.07068v1 | null |
2024-08-13 | Conformal prediction after efficiency-oriented model selection | Ruiting Liang et.al. | 2408.07066v1 | null |
2024-08-13 | Fingerspelling within Sign Language Translation | Garrett Tanzer et.al. | 2408.07065v1 | null |
2024-08-13 | On Networks and their Applications: Stability of Gene Regulatory Networks and Gene Function Prediction using Autoencoders | Hamza Coban et.al. | 2408.07064v1 | null |
2024-08-13 | Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents | Kexun Zhang et.al. | 2408.07060v1 | null |
2024-08-13 | Model Counting in the Wild | Arijit Shaw et.al. | 2408.07059v1 | null |
2024-08-13 | Categorical Framework for Typed Extensional and Intensional Models in Formal Semantics | Daniel Quigley et.al. | 2408.07058v1 | null |
2024-08-13 | A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning | Prateek Yadav et.al. | 2408.07057v1 | null |
2024-08-12 | Measuring central charge on a universal quantum processor | Nazlı Uğur Köylüoğlu et.al. | 2408.06342v1 | null |
2024-08-12 | Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks | Lucas Félix et.al. | 2408.06341v1 | null |
2024-08-12 | Thermodynamical string fragmentation and QGP-like effects in jets | Robert Vertesi et.al. | 2408.06340v1 | null |
2024-08-12 | Non-Maxwellian Ion Distribution in the Equatorial and Auroral Electrojets | Rattanakorn Koontaweepunya et.al. | 2408.06339v1 | null |
2024-08-12 | LOLgorithm: Integrating Semantic, Syntactic and Contextual Elements for Humor Classification | Tanisha Khurana et.al. | 2408.06335v1 | null |
2024-08-12 | Entanglement and the density matrix renormalisation group in the generalised Landau paradigm | Laurens Lootens et.al. | 2408.06334v1 | null |
2024-08-12 | FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Yufei Huang et.al. | 2408.06333v1 | link |
2024-08-12 | Animate, or Inanimate, That is the Question for Large Language Models | Leonardo Ranaldi et.al. | 2408.06332v1 | null |
2024-08-12 | Time-dependent, spherically symmetric background in Kaluza-Klein compactified Horndeski theory and the speed of gravity waves | S. Mironov et.al. | 2408.06329v1 | null |
2024-08-12 | HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors | Hyungtae Lim et.al. | 2408.06328v1 | null |
2024-08-10 | Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions | Michele Miranda et.al. | 2408.05212v1 | null |
2024-08-09 | VITA: Towards Open-Source Interactive Omni Multimodal LLM | Chaoyou Fu et.al. | 2408.05211v1 | null |
2024-08-09 | Third-order corrections to the slow-roll expansion: calculation and constraints with Planck, ACT, SPT, and BICEP/Keck | Mario Ballardini et.al. | 2408.05210v1 | null |
2024-08-09 | What are the real implications for | Dhruv Suri et.al. | 2408.05209v1 | null |
2024-08-09 | Holographic thermal correlators and quasinormal modes from semiclassical Virasoro blocks | Hewei Frederic Jia et.al. | 2408.05208v1 | null |
2024-08-09 | Multi-Garment Customized Model Generation | Yichen Liu et.al. | 2408.05206v1 | null |
2024-08-09 | Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Ruicheng Feng et.al. | 2408.05205v1 | null |
2024-08-09 | Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners | Michael Vaccaro Jr et.al. | 2408.05204v1 | null |
2024-08-09 | | Renata Kallosh et.al. | 2408.05203v1 | null |
2024-08-09 | Twisted nanoporous graphene/graphene bilayers: electronic decoupling and chiral currents | Xabier Diaz de Cerio et.al. | 2408.05202v1 | null |
2024-08-08 | LiDAR-Event Stereo Fusion with Hallucinations | Luca Bartolomei et.al. | 2408.04633v1 | link |
2024-08-08 | Arctic-TILT. Business Document Understanding at Sub-Billion Scale | Łukasz Borchmann et.al. | 2408.04632v1 | null |
2024-08-08 | Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics | Ruining Li et.al. | 2408.04631v1 | null |
2024-08-08 | Higher derivative SVT theories from Kaluza-Klein reductions of Horndeski theory | S. Mironov et.al. | 2408.04626v1 | null |
2024-08-08 | Adaptive Sampling Bi-Fidelity Stochastic Trust Region Method for Derivative-Free Stochastic Optimization | Yunsoo Ha et.al. | 2408.04625v1 | null |
2024-08-08 | Axion production via trapped misalignment from Peccei-Quinn symmetry breaking | Luca Di Luzio et.al. | 2408.04623v1 | null |
2024-08-08 | On the interactions and equilibrium between Einstein-Maxwell-Dilaton black holes | Ulrich K. Beckering Vinckers et.al. | 2408.04621v1 | null |
2024-08-08 | Transformer Explainer: Interactive Learning of Text-Generative Models | Aeree Cho et.al. | 2408.04619v1 | null |
2024-08-08 | SSD Set System, Graph Decomposition and Hamiltonian Cycle | Kan Shota et.al. | 2408.04615v1 | null |
2024-08-08 | Better Alignment with Instruction Back-and-Forth Translation | Thao Nguyen et.al. | 2408.04614v1 | null |
2024-08-07 | How Well Can Vision Language Models See Image Details? | Chenhui Gou et.al. | 2408.03940v1 | null |
2024-08-07 | Zeros of | Bryce Kerr et.al. | 2408.03938v1 | null |
2024-08-07 | SLIM-RAFT: A Novel Fine-Tuning Approach to Improve Cross-Linguistic Performance for Mercosur Common Nomenclature | Vinícius Di Oliveira et.al. | 2408.03936v1 | null |
2024-08-07 | Dynamical patterns in active-passive particle mixtures with non-reciprocal interactions: Exact hydrodynamic analysis | James Mason et.al. | 2408.03932v1 | null |
2024-08-07 | Robust Estimation of Regression Models with Potentially Endogenous Outliers via a Modern Optimization Lens | Zhan Gao et.al. | 2408.03930v1 | null |
2024-08-07 | Chapter 10: Quantitative Models of Discounting | Christopher T. Franck et.al. | 2408.03929v1 | null |
2024-08-07 | Vacuum Energy in Non-Supersymmetric Quasi-Realistic Heterotic-String Vacua with Fixed Moduli | Eman Basaad et.al. | 2408.03928v1 | null |
2024-08-07 | Enhanced Cooper Pairing via Random Matrix Phonons in Superconducting Grains | Andrey Grankin et.al. | 2408.03927v1 | null |
2024-08-07 | New fairness criteria for truncated ballots in multi-winner ranked-choice elections | Adam Graham-Squire et.al. | 2408.03926v1 | null |
2024-08-07 | Exact and universal quantum Monte Carlo estimators for energy susceptibility and fidelity susceptibility | Nic Ezzell et.al. | 2408.03924v1 | null |
2024-08-06 | LLaVA-OneVision: Easy Visual Task Transfer | Bo Li et.al. | 2408.03326v1 | null |
2024-08-06 | CoverBench: A Challenging Benchmark for Complex Claim Verification | Alon Jacovi et.al. | 2408.03325v1 | null |
2024-08-06 | ClassiFIM: An Unsupervised Method To Detect Phase Transitions | Victor Kasatkin et.al. | 2408.03323v1 | null |
2024-08-06 | Segment Anything in Medical Images and Videos: Benchmark and Deployment | Jun Ma et.al. | 2408.03322v1 | null |
2024-08-06 | Chasing cosmic inflation: constraints for inflationary models and reheating insights | Mario Ballardini et.al. | 2408.03321v1 | null |
2024-08-06 | Hedge Fund Portfolio Construction Using PolyModel Theory and iTransformer | Siqiao Zhao et.al. | 2408.03320v1 | null |
2024-08-06 | Training LLMs to Recognize Hedges in Spontaneous Narratives | Amie J. Paige et.al. | 2408.03319v1 | null |
2024-08-06 | Robustness of electron charge shuttling: Architectures, pulses, charge defects and noise thresholds | Minjun Jeon et.al. | 2408.03315v1 | null |
2024-08-06 | Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters | Charlie Snell et.al. | 2408.03314v1 | null |
2024-08-06 | MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Xiaofeng Mao et.al. | 2408.03312v1 | null |
2024-08-05 | Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics | Shishira R Maiya et.al. | 2408.02672v1 | null |
2024-08-05 | Systematic bias due to mismodelling precessing binary black hole ringdown | Cheng Foo et.al. | 2408.02671v1 | null |
2024-08-05 | Searching for dark matter with a 1000 km baseline interferometer | Daniel Gavilan-Martin et.al. | 2408.02668v1 | null |
2024-08-05 | Evaluating and Utilizing Surrogate Outcomes in Covariate-Adjusted Response-Adaptive Designs | Wenxin Zhang et.al. | 2408.02667v1 | null |
2024-08-05 | Self-Taught Evaluators | Tianlu Wang et.al. | 2408.02666v1 | null |
2024-08-05 | Structure-preserving approximations of the Serre-Green-Naghdi equations in standard and hyperbolic form | Hendrik Ranocha et.al. | 2408.02665v1 | null |
2024-08-05 | Noninvertible Gauge Symmetry in (2+1)d Topological Orders: A String-Net Model Realization | Yu Zhao et.al. | 2408.02664v1 | null |
2024-08-05 | Integrating Model-Based Footstep Planning with Model-Free Reinforcement Learning for Dynamic Legged Locomotion | Ho Jae Lee et.al. | 2408.02662v1 | null |
2024-08-05 | Context-aware Mamba-based Reinforcement Learning for social robot navigation | Syed Muhammad Mustafa et.al. | 2408.02661v1 | null |
2024-08-05 | Wavenumber Calibration for an Imaging Refractometer | A. Rososhek et.al. | 2408.02660v1 | null |
2024-08-02 | Generalised Circuit Partitioning for Distributed Quantum Computing | Felix Burt et.al. | 2408.01424v1 | null |
2024-08-02 | Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting | Xiangyu Zhao et.al. | 2408.01423v1 | null |
2024-08-02 | Topological phases of the interacting SSH model: an analytical study | E. Di Salvo et.al. | 2408.01421v1 | null |
2024-08-02 | Mission Impossible: A Statistical Perspective on Jailbreaking LLMs | Jingtong Su et.al. | 2408.01420v1 | null |
2024-08-02 | DebateQA: Evaluating Question Answering on Debatable Knowledge | Rongwu Xu et.al. | 2408.01419v1 | null |
2024-08-02 | Holographic duals of symmetry broken phases | Andrea Antinucci et.al. | 2408.01418v1 | null |
2024-08-02 | Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs | Yilun Hua et.al. | 2408.01417v1 | null |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415v1 | null |
2024-08-02 | A Game Theoretic Analysis of High Occupancy Toll Lane Design | Zhanhao Zhang et.al. | 2408.01413v1 | null |
2024-08-02 | Spectroscopic survey of faint planetary-nebula nuclei VI. Seventeen hydrogen-rich central stars | Nicole Reindl et.al. | 2408.01411v1 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766v1 | null |
2024-08-01 | MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities | Weihao Yu et.al. | 2408.00765v1 | null |
2024-08-01 | AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Mengkang Hu et.al. | 2408.00764v1 | null |
2024-08-01 | Quantized electrical, thermal, and spin transports of non-Hermitian clean and dirty two-dimensional topological insulators and superconductors | Sanjib Kumar Das et.al. | 2408.00763v1 | null |
2024-08-01 | UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Xiangyu Fan et.al. | 2408.00762v1 | null |
2024-08-01 | Tamper-Resistant Safeguards for Open-Weight LLMs | Rishub Tamirisa et.al. | 2408.00761v1 | null |
2024-08-01 | Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Susung Hong et.al. | 2408.00760v1 | null |
2024-08-01 | Text-Guided Video Masked Autoencoder | David Fan et.al. | 2408.00759v1 | null |
2024-08-01 | Segment anything model 2: an application to 2D and 3D medical images | Haoyu Dong et.al. | 2408.00756v1 | null |
2024-08-01 | Thermal Conductivity Predictions with Foundation Atomistic Models | Balázs Póta et.al. | 2408.00755v1 | null |
2024-07-31 | Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey | Atsuyuki Miyai et.al. | 2407.21794v1 | null |
2024-07-31 | Non-equilibrium dynamics of symmetry-resolved entanglement and entanglement asymmetry: Exact asymptotics in Rule 54 | Katja Klobas et.al. | 2407.21793v1 | null |
2024-07-31 | Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress? | Richard Ren et.al. | 2407.21792v1 | null |
2024-07-31 | Deep Learning for Options Trading: An End-To-End Approach | Wee Ling Tan et.al. | 2407.21791v1 | null |
2024-07-31 | First measurement of the triaxiality of the inner dark matter halo of the Milky Way | Hanneke C. Woudenberg et.al. | 2407.21790v1 | null |
2024-07-31 | Vision-Language Model Based Handwriting Verification | Mihir Chauhan et.al. | 2407.21788v1 | null |
2024-07-31 | Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Bradley Brown et.al. | 2407.21787v1 | null |
2024-07-31 | Non-equilibrium dynamics of charged dual-unitary circuits | Alessandro Foligno et.al. | 2407.21786v1 | null |
2024-07-31 | The Llama 3 Herd of Models | Abhimanyu Dubey et.al. | 2407.21783v1 | null |
2024-07-31 | Tulip Agent -- Enabling LLM-Based Agents to Solve Tasks Using Large Tool Libraries | Felix Ocker et.al. | 2407.21778v1 | null |
2024-07-30 | A Multiwavelength Portrait of the 3C 220.3 Lensed System | Sóley Ó. Hyman et.al. | 2407.21020v1 | null |
2024-07-30 | Emergent dipole field theory in atomic ladders | Hernan B. Xavier et.al. | 2407.21019v1 | null |
2024-07-30 | ThinK: Thinner Key Cache by Query-Driven Pruning | Yuhui Xu et.al. | 2407.21018v1 | null |
2024-07-30 | Matting by Generation | Zhixiang Wang et.al. | 2407.21017v1 | null |
2024-07-30 | Add-SD: Rational Generation without Manual Reference | Lingfeng Yang et.al. | 2407.21016v1 | link |
2024-07-30 | Comparative Analyses of the Type D ASEP: Stochastic Fusion and Crystal Bases | Erik Brodsky et.al. | 2407.21015v1 | null |
2024-07-30 | Uplink Wave-Domain Combiner for Stacked Intelligent Metasurfaces Accounting for Hardware Limitations | Maryam Rezvani et.al. | 2407.21012v1 | null |
2024-07-30 | CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Yuexi Du et.al. | 2407.21011v1 | link |
2024-07-30 | Human-Data Interaction Framework: A Comprehensive Model for a Future Driven by Data and Humans | Ivan Durango et.al. | 2407.21010v1 | null |
2024-07-30 | AI-Assisted Generation of Difficult Math Questions | Vedant Shah et.al. | 2407.21009v1 | null |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232v1 | null |
2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229v1 | null |
2024-07-29 | FlexAttention for Efficient High-Resolution Vision-Language Models | Junyan Li et.al. | 2407.20228v1 | null |
2024-07-29 | Additive martingales of the branching Brownian motion | Louis Chataignier et.al. | 2407.20227v1 | null |
2024-07-29 | Models of random spanning trees | Eric Babson et.al. | 2407.20226v1 | null |
2024-07-29 | Can Editing LLMs Inject Harm? | Canyu Chen et.al. | 2407.20224v1 | null |
2024-07-29 | Imprinting spin patterns by local strain control in a van der Waals antiferromagnet | Zhuoliang Ni et.al. | 2407.20222v1 | null |
2024-07-29 | Ionospheric contributions to the excess power in high-redshift 21-cm power-spectrum observations with LOFAR | S. A. Brackenhoff et.al. | 2407.20220v1 | null |
2024-07-29 | Global Structure-from-Motion Revisited | Linfei Pan et.al. | 2407.20219v1 | link |
2024-07-30 | cDVAE: Multimodal Generative Conditional Diffusion Guided by Variational Autoencoder Latent Embedding for Virtual 6D Phase Space Diagnostics | Alexander Scheinker et.al. | 2407.20218v2 | null |
2024-07-26 | Floating No More: Object-Ground Reconstruction from a Single Image | Yunze Man et.al. | 2407.18914v1 | null |
2024-07-26 | SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments | Shu Ishida et.al. | 2407.18913v1 | null |
2024-07-26 | Validating the clustering predictions of empirical models with the FLAMINGO simulations | Sergio Contreras et.al. | 2407.18912v1 | null |
2024-07-26 | HRP: Human Affordances for Robotic Pre-Training | Mohan Kumar Srirama et.al. | 2407.18911v1 | null |
2024-07-29 | Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation | Weizhi Zhang et.al. | 2407.18910v2 | link |
2024-07-26 | Hybrid summary statistics: neural weak lensing inference beyond the power spectrum | T. Lucas Makinen et.al. | 2407.18909v1 | null |
2024-07-26 | Wolf: Captioning Everything with a World Summarization Framework | Boyi Li et.al. | 2407.18908v1 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907v1 | null |
2024-07-26 | The nph2ph-transform: applications to the statistical analysis of completed clinical trials | Sean M. Devlin et.al. | 2407.18905v1 | null |
2024-07-26 | Birational geometry of Fano varieties of lines on cubic fourfolds containing pairs of cubic scrolls | Corey Brooke et.al. | 2407.18904v1 | null |
2024-07-25 | Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis | Cristian-Alexandru Botocan et.al. | 2407.18251v1 | link |
2024-07-25 | Yukawa-Lorentz symmetry of interacting non-Hermitian birefringent Dirac fermions | Sk Asrap Murshed et.al. | 2407.18250v1 | null |
2024-07-25 | Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang et.al. | 2407.18248v1 | link |
2024-07-25 | RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Jingyi Lu et.al. | 2407.18247v1 | null |
2024-07-25 | Probing the early universe with future GW observatories | Suvashis Maity et.al. | 2407.18246v1 | null |
2024-07-25 | VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads | Orest Kupyn et.al. | 2407.18245v1 | null |
2024-07-25 | BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments | Yu-Yun Tseng et.al. | 2407.18243v1 | null |
2024-07-25 | LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Zhengbo Wang et.al. | 2407.18242v1 | link |
2024-07-25 | Numerical Literals in Link Prediction: A Critical Examination of Models and Datasets | Moritz Blum et.al. | 2407.18241v1 | link |
2024-07-25 | Dust and Power: Unravelling the merger -- AGN connection in the second half of the cosmic history | A. La Marca et.al. | 2407.18238v1 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470v1 | null |
2024-07-24 | I Could've Asked That: Reformulating Unanswerable Questions | Wenting Zhao et.al. | 2407.17469v1 | link |
2024-07-24 | WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Wenting Zhao et.al. | 2407.17468v1 | null |
2024-07-24 | CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu et.al. | 2407.17467v1 | null |
2024-07-24 | Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning | Shuang Qiu et.al. | 2407.17466v1 | null |
2024-07-24 | u-$μ$P: The Unit-Scaled Maximal Update Parametrization | Charlie Blake et.al. | 2407.17465v1 | null |
2024-07-24 | The HalfDome Multi-Survey Cosmological Simulations: N-body Simulations | Adrian E. Bayer et.al. | 2407.17462v1 | null |
2024-07-24 | SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning | Jianpeng Yao et.al. | 2407.17460v1 | null |
2024-07-24 | Hidden or Inferred: Fair Learning-To-Rank with Unknown Demographics | Oluseun Olulana et.al. | 2407.17459v1 | link |
2024-07-24 | Why Machines Can't Be Moral: Turing's Halting Problem and the Moral Limits of Artificial Intelligence | Massimo Passamonti et.al. | 2407.16890v1 | null |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698v1 | link |
2024-07-23 | AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking | Wenxuan Li et.al. | 2407.16697v1 | null |
2024-07-23 | PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Junyi Li et.al. | 2407.16696v1 | null |
2024-07-23 | Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack | Xiaoyue Xu et.al. | 2407.16695v1 | null |
2024-07-23 | Explanation Regularisation through the Lens of Attributions | Pedro Ferreira et.al. | 2407.16693v1 | null |
2024-07-23 | Lorentzian path-integral of Robin Universe | Manishankar Ailiga et.al. | 2407.16692v1 | null |
2024-07-23 | Automatic Equalization for Individual Instrument Tracks Using Convolutional Neural Networks | Florian Mockenhaupt et.al. | 2407.16691v1 | null |
2024-07-23 | DIISC-V: Variations in H$α$-to-FUV Star Formation Rate Ratios Across Star-forming Regions in Nearby Galaxies | Mansi Padave et.al. | 2407.16690v1 | null |
2024-07-23 | Robust Preference for Dynamical Dark Energy in DESI BAO and SN Measurements | William Giarè et.al. | 2407.16689v1 | null |
2024-07-23 | On the local cohomology of secant varieties | Sebastian Olano et.al. | 2407.16688v1 | null |
2024-07-22 | AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Junyu Xie et.al. | 2407.15850v1 | link |
2024-07-22 | LLMmap: Fingerprinting For Large Language Models | Dario Pasquini et.al. | 2407.15847v1 | null |
2024-07-22 | Reconstructing Training Data From Real World Models Trained with Transfer Learning | Yakir Oz et.al. | 2407.15845v1 | null |
2024-07-22 | CarFormer: Self-Driving with Learned Object-Centric Representations | Shadi Hamdan et.al. | 2407.15843v1 | null |
2024-07-22 | Artist: Aesthetically Controllable Text-Driven Stylization without Training | Ruixiang Jiang et.al. | 2407.15842v1 | null |
2024-07-22 | SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Mingze Xu et.al. | 2407.15841v1 | null |
2024-07-23 | QueST: Self-Supervised Skill Abstractions for Learning Continuous Control | Atharva Mete et.al. | 2407.15840v2 | null |
2024-07-22 | Importance Sampling-Guided Meta-Training for Intelligent Agents in Highly Interactive Environments | Mansur Arief et.al. | 2407.15839v1 | null |
2024-07-22 | MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Yangzhou Liu et.al. | 2407.15838v1 | null |
2024-07-22 | Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning | Yibing Wei et.al. | 2407.15837v1 | link |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509v1 | null |
2024-07-19 | Internal Consistency and Self-Feedback in Large Language Models: A Survey | Xun Liang et.al. | 2407.14507v1 | link |
2024-07-19 | On Pre-training of Multimodal Language Models Customized for Chart Understanding | Wan-Cyuan Fan et.al. | 2407.14506v1 | null |
2024-07-19 | T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Kaiyue Sun et.al. | 2407.14505v1 | null |
2024-07-19 | Nonlinear Schrödinger Network | Yiming Zhou et.al. | 2407.14504v1 | null |
2024-07-19 | Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification | Thomas Kwa et.al. | 2407.14503v1 | null |
2024-07-19 | M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi et.al. | 2407.14502v1 | null |
2024-07-19 | Indoor Air Quality Dataset with Activities of Daily Living in Low to Middle-income Communities | Prasenjit Karmakar et.al. | 2407.14501v1 | link |
2024-07-19 | Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery | Sukrut Rao et.al. | 2407.14499v1 | link |
2024-07-19 | Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation | Dongyang Wu et.al. | 2407.14498v1 | null |
2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772v1 | null |
2024-07-18 | Training-Free Model Merging for Multi-target Domain Adaptation | Wenyi Li et.al. | 2407.13771v1 | null |
2024-07-18 | Moiré Fractional Chern Insulators IV: Fluctuation-Driven Collapse of FCIs in Multi-Band Exact Diagonalization Calculations on Rhombohedral Graphene | Jiabin Yu et.al. | 2407.13770v1 | null |
2024-07-18 | Topological insulators on fractal lattices: A general principle of construction | Daniel J. Salib et.al. | 2407.13767v1 | null |
2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766v1 | null |
2024-07-18 | Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data | Charles Jin et.al. | 2407.13765v1 | null |
2024-07-18 | Shape of Motion: 4D Reconstruction from a Single Video | Qianqian Wang et.al. | 2407.13764v1 | null |
2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761v1 | null |
2024-07-18 | Neural Network Tire Force Modeling for Automated Drifting | Nicholas Drake Broadbent et.al. | 2407.13760v1 | null |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759v1 | null |
2024-07-17 | AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Zhaorun Chen et.al. | 2407.12784v1 | null |
2024-07-17 | SMooDi: Stylized Motion Diffusion Model | Lei Zhong et.al. | 2407.12783v1 | null |
2024-07-17 | Contrastive Adversarial Training for Unsupervised Domain Adaptation | Jiahong Chen et.al. | 2407.12782v1 | null |
2024-07-17 | VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Sherwin Bahmani et.al. | 2407.12781v1 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780v1 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777v1 | null |
2024-07-17 | Experimental Demonstration of a Quantum-Optimal Coronagraph Using Spatial Mode Sorters | Nico Deshler et.al. | 2407.12776v1 | null |
2024-07-17 | Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics | Kevin L. McKinney et.al. | 2407.12775v1 | null |
2024-07-17 | Market Definition: A Sensitivity Analysis | Paul S. Koh et.al. | 2407.12774v1 | null |
2024-07-17 | OMG-Net: A Deep Learning Framework Deploying Segment Anything to Detect Pan-Cancer Mitotic Figures from Haematoxylin and Eosin-Stained Slides | Zhuoyan Shen et.al. | 2407.12773v1 | null |
2024-07-16 | Quantizing Carrollian field theories | Jordan Cotler et.al. | 2407.11971v1 | null |
2024-07-16 | Does Refusal Training in LLMs Generalize to the Past Tense? | Maksym Andriushchenko et.al. | 2407.11969v1 | link |
2024-07-16 | mochi_class: Modelling Optimisation to Compute Horndeski In class | Matteo Cataneo et.al. | 2407.11968v1 | null |
2024-07-16 | Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale | Aymen Alsaadi et.al. | 2407.11967v1 | null |
2024-07-16 | Efficient Training with Denoised Neural Weights | Yifan Gong et.al. | 2407.11966v1 | null |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965v1 | null |
2024-07-16 | NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Mo Li et.al. | 2407.11963v1 | link |
2024-07-16 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962v1 | null |
2024-07-16 | Quantum and Classical Dynamics with Random Permutation Circuits | Bruno Bertini et.al. | 2407.11960v1 | null |
2024-07-16 | Faster Algorithms for Schatten-p Low Rank Approximation | Praneeth Kacham et.al. | 2407.11959v1 | null |
2024-07-15 | Age and metal gradients in massive quiescent galaxies at | Chloe M. Cheng et.al. | 2407.10974v1 | null |
2024-07-15 | Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion | Yongyuan Liang et.al. | 2407.10973v1 | null |
2024-07-15 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation | Bocheng Zou et.al. | 2407.10972v1 | link |
2024-07-15 | An ALMA CO(1-0) survey of the 2Jy sample: large and massive molecular disks in radio AGN host galaxies | C. Tadhunter et.al. | 2407.10970v1 | null |
2024-07-15 | Q-Sparse: All Large Language Models can be Fully Sparsely-Activated | Hongyu Wang et.al. | 2407.10969v1 | null |
2024-07-15 | Optimal reconstruction of the Hellings and Downs correlation | Bruce Allen et.al. | 2407.10968v1 | null |
2024-07-15 | BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | Haohong Lin et.al. | 2407.10967v1 | null |
2024-07-15 | Negative neutrino masses as a mirage of dark energy | Willem Elbers et.al. | 2407.10965v1 | null |
2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964v1 | link |
2024-07-15 | Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance | Ipsita Mandal et.al. | 2407.10963v1 | null |
2024-07-12 | Gauging The Diamond: Integrable Coset Models from Twistor Space | Lewis T. Cole et.al. | 2407.09479v1 | null |
2024-07-12 | Integer programs with nearly totally unimodular matrices: the cographic case | Manuel Aprile et.al. | 2407.09477v1 | null |
2024-07-12 | Intrinsically knotted graphs and connected domination | Gregory Li et.al. | 2407.09476v1 | null |
2024-07-12 | Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jinning Li et.al. | 2407.09475v1 | null |
2024-07-12 | A Primitive Model for Predicting Membrane Currents in Excitable Cells Based Only on Ion Diffusion Coefficients | Vivaan Patel et.al. | 2407.09474v1 | null |
2024-07-12 | Galaxy Mergers in the Epoch of Reionization I: A JWST Study of Pair Fractions, Merger Rates, and Stellar Mass Accretion Rates at | Qiao Duan et.al. | 2407.09472v1 | null |
2024-07-12 | A new approach to principal-agent problems with volatility control | Alessandro Chiusolo et.al. | 2407.09471v1 | null |
2024-07-12 | Scalar tidal response of a rotating BTZ black hole | Rajendra Prasad Bhatt et.al. | 2407.09470v1 | null |
2024-07-12 | Learning Coordinated Maneuver in Adversarial Environments | Zechen Hu et.al. | 2407.09469v1 | null |
2024-07-12 | Beyond Euclid: An Illustrated Guide to Modern Machine Learning with Geometric, Topological, and Algebraic Structures | Sophia Sanborn et.al. | 2407.09468v1 | null |
2024-07-11 | MAVIS: Mathematical Visual Instruction Tuning | Renrui Zhang et.al. | 2407.08739v1 | link |
2024-07-11 | An Equation of State for Turbulence in the Gross-Pitaevskii model | Gevorg Martirosyan et.al. | 2407.08738v1 | null |
2024-07-11 | Video Diffusion Alignment via Reward Gradients | Mihir Prabhudesai et.al. | 2407.08737v1 | link |
2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735v1 | null |
2024-07-11 | Transformer Circuit Faithfulness Metrics are not Robust | Joseph Miller et.al. | 2407.08734v1 | link |
2024-07-11 | Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Zihao Zhou et.al. | 2407.08733v1 | null |
2024-07-11 | Elevated | Doğa Veske et.al. | 2407.08732v1 | null |
2024-07-11 | Massive-ish Particles from Small-ish Scales: Non-Perturbative Techniques for Cosmological Collider Physics from Large-Scale Structure Surveys | Samuel Goldstein et.al. | 2407.08731v1 | null |
2024-07-11 | The Potential Impact of Noise Correlation in Next-generation Gravitational Wave Detectors | Isaac C. F. Wong et.al. | 2407.08728v1 | null |
2024-07-11 | Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data | Cherie Ho et.al. | 2407.08726v1 | null |
2024-07-10 | Pentagonal Photonic Crystal Mirrors: Scalable Lightsails with Enhanced Acceleration via Neural Topology Optimization | L. Norder et.al. | 2407.07896v1 | null |
2024-07-10 | LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models | Feng Li et.al. | 2407.07895v1 | link |
2024-07-10 | Quasinormal modes and gray-body factors of regular black holes in asymptotically safe gravity | Oleksandr Stashko et.al. | 2407.07892v1 | null |
2024-07-10 | A proposed crank for | Samuel Wilson et.al. | 2407.07891v1 | null |
2024-07-10 | Training on the Test Task Confounds Evaluation and Emergence | Ricardo Dominguez-Olmedo et.al. | 2407.07890v1 | link |
2024-07-10 | AdaptiGraph: Material-Adaptive Graph-Based Neural Dynamics for Robotic Manipulation | Kaifeng Zhang et.al. | 2407.07889v1 | null |
2024-07-10 | Self-similar Markov trees and scaling limits | Jean Bertoin et.al. | 2407.07888v1 | null |
2024-07-10 | Controllability problems of a neutral integro-differential equation with memory | Sumit Arora et.al. | 2407.07886v1 | null |
2024-07-10 | Learning In-Hand Translation Using Tactile Skin With Shear and Normal Force Sensing | Jessica Yin et.al. | 2407.07885v1 | null |
2024-07-10 | Non-generic components of the Emerton-Gee stack for | Kalyani Kansal et.al. | 2407.07883v1 | null |
2024-07-08 | Multi-Object Hallucination in Vision-Language Models | Xuweiyi Chen et.al. | 2407.06192v1 | null |
2024-07-08 | Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images | Zhangyang Qi et.al. | 2407.06191v1 | null |
2024-07-08 | 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu et.al. | 2407.06190v1 | link |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189v1 | null |
2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188v1 | null |
2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187v1 | null |
2024-07-08 | Stepping on the Edge: Curvature Aware Learning Rate Tuners | Vincent Roulet et.al. | 2407.06183v1 | null |
2024-07-08 | Left-Linear Rewriting in Adhesive Categories | Paolo Baldan et.al. | 2407.06181v1 | null |
2024-07-08 | Coherent Acoustic Phonons in Plasmonic Nanoparticles: Elastic Properties and Dissipation at Low Temperatures | Hilario D. Boggiano et.al. | 2407.06180v1 | null |
2024-07-08 | Non-uniqueness in the Leray-Hopf class for a dyadic Navier-Stokes model | Stan Palasek et.al. | 2407.06179v1 | null |
2024-07-05 | LaRa: Efficient Large-Baseline Radiance Fields | Anpei Chen et.al. | 2407.04699v1 | null |
2024-07-05 | VCoME: Verbal Video Composition with Multimodal Editing Effects | Weibo Gong et.al. | 2407.04697v1 | null |
2024-07-05 | Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Rudolf Laine et.al. | 2407.04694v1 | null |
2024-07-05 | ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Yuzhe Gu et.al. | 2407.04693v1 | null |
2024-07-05 | Eigen-decomposition of Covariance matrices: An application to the BAO Linear Point | Jaemyoung Jason Lee et.al. | 2407.04692v1 | null |
2024-07-05 | Exceptional Points and Braiding Topology in Non-Hermitian Systems with long-range coupling | S. M. Rafi-Ul-Islam et.al. | 2407.04691v1 | null |
2024-07-05 | Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks | Aaron Mueller et.al. | 2407.04690v1 | null |
2024-07-05 | Enhancing Vehicle Re-identification and Matching for Weaving Analysis | Mei Qiu et.al. | 2407.04688v1 | null |
2024-07-05 | Embracing Massive Medical Data | Yu-Cheng Chou et.al. | 2407.04687v1 | link |
2024-07-05 | Near-optimal hierarchical matrix approximation from matrix-vector products | Tyler Chen et.al. | 2407.04686v1 | null |
2024-07-03 | Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Max Zuo et.al. | 2407.03321v1 | link |
2024-07-03 | InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | Pan Zhang et.al. | 2407.03320v1 | link |
2024-07-03 | BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations | Zhantao Yang et.al. | 2407.03314v1 | null |
2024-07-03 | Synthesizing data products, mathematical models, and observational measurements for lake temperature forecasting | Maike F. Holthuijzen et.al. | 2407.03312v1 | null |
2024-07-03 | Universal Length Generalization with Turing Programs | Kaiying Hou et.al. | 2407.03310v1 | null |
2024-07-03 | Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method | Sijie Xu et.al. | 2407.03308v1 | link |
2024-07-03 | HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization | Yucheng Tang et.al. | 2407.03307v1 | null |
2024-07-03 | Macdonald polynomials for super-partitions | Dmitry Galakhov et.al. | 2407.03301v1 | null |
2024-07-03 | DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Yilun Xu et.al. | 2407.03300v1 | null |
2024-07-03 | Novel Pressure-Equilibrium and Kinetic-Energy Preserving fluxes for compressible flows based on the harmonic mean | Carlo De Michele et.al. | 2407.03299v1 | null |
2024-07-02 | MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Huiqiang Jiang et.al. | 2407.02490v1 | link |
2024-07-02 | Magic Insert: Style-Aware Drag-and-Drop | Nataniel Ruiz et.al. | 2407.02489v1 | null |
2024-07-02 | Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Ali Safaya et.al. | 2407.02486v1 | link |
2024-07-02 | RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Yue Yu et.al. | 2407.02485v1 | null |
2024-07-02 | Characterizing the Interpretability of Attention Maps in Digital Pathology | Tomé Albuquerque et.al. | 2407.02484v1 | null |
2024-07-02 | MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Binxu Li et.al. | 2407.02483v1 | null |
2024-07-02 | Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models | Fei Shen et.al. | 2407.02482v1 | null |
2024-07-02 | Analogs of the dual canonical bases for cluster algebras from Lie theory | Fan Qin et.al. | 2407.02480v1 | null |
2024-07-02 | Mixing effects on spectroscopy and partonic observables of mesons with logarithmic confining potential in a light-front quark model | Bhoomika Pandya et.al. | 2407.02479v1 | null |
2024-07-02 | Understanding Alignment in Multimodal LLMs: A Comprehensive Study | Elmira Amirloo et.al. | 2407.02477v1 | null |
2024-06-28 | Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Ankan Bhunia et.al. | 2406.20099v1 | link |
2024-06-28 | Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs | Sukmin Yun et.al. | 2406.20098v1 | link |
2024-06-28 | Predator-prey density-dependent branching processes | Cristina Gutiérrez et.al. | 2406.20097v1 | null |
2024-06-28 | LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Xiang Li et.al. | 2406.20095v1 | link |
2024-06-28 | Scaling Synthetic Data Creation with 1,000,000,000 Personas | Xin Chan et.al. | 2406.20094v1 | null |
2024-06-28 | | Bowy M. La Rivière et.al. | 2406.20093v1 | null |
2024-06-28 | LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression | Jieneng Chen et.al. | 2406.20092v1 | link |
2024-06-28 | Anomalous current fluctuations from Euler hydrodynamics | Takato Yoshimura et.al. | 2406.20091v1 | null |
2024-06-28 | Curbing PBHs with PTAs | A. J. Iovino et.al. | 2406.20089v1 | null |
2024-06-28 | Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints | Arnab Auddy et.al. | 2406.20088v1 | null |
2024-06-27 | SimLOB: Learning Representations of Limited Order Book for Financial Market Simulation | Yuanzhe Li et.al. | 2406.19396v1 | null |
2024-06-27 | Dataset Size Recovery from LoRA Weights | Mohammad Salama et.al. | 2406.19395v1 | null |
2024-06-27 | HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection | Liujuan Cao et.al. | 2406.19394v1 | link |
2024-06-27 | ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos | Jr-Jen Chen et.al. | 2406.19392v1 | link |
2024-06-27 | Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads | Ali Khaleghi Rahimian et.al. | 2406.19391v1 | link |
2024-06-27 | SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John Lambert et.al. | 2406.19390v1 | link |
2024-06-27 | OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding | Tao Zhang et.al. | 2406.19389v1 | null |
2024-06-27 | Taming Data and Transformers for Audio Generation | Moayed Haji-Ali et.al. | 2406.19388v1 | null |
2024-06-27 | Robust Hilbert space fragmentation in group-valued loop models | Alexey Khudorozhkov et.al. | 2406.19386v1 | null |
2024-06-27 | The Remarkable Robustness of LLMs: Stages of Inference? | Vedang Lad et.al. | 2406.19384v1 | link |
2024-06-26 | Towards Compositionality in Concept Learning | Adam Stein et.al. | 2406.18534v1 | link |
2024-06-26 | Symbolic Learning Enables Self-Evolving Agents | Wangchunshu Zhou et.al. | 2406.18532v1 | link |
2024-06-26 | A principled framework to assess information theoretical fitness of brain functional sub-circuits | Duy Duong-Tran et.al. | 2406.18531v1 | null |
2024-06-26 | MatchTime: Towards Automatic Soccer Game Commentary Generation | Jiayuan Rao et.al. | 2406.18530v1 | null |
2024-06-26 | Confident Natural Policy Gradient for Local Planning in | Tian Tian et.al. | 2406.18529v1 | null |
2024-06-26 | PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter et.al. | 2406.18528v1 | null |
2024-06-26 | Compact embeddings of Sobolev, Besov, and Triebel-Lizorkin spaces | Ryan Alvarado et.al. | 2406.18527v1 | null |
2024-06-26 | Digging its own Site: Linear Coordination Stabilizes a Pt1/Fe2O3 Single-Atom Catalyst | Ali Rafsanjani-Abbasi et.al. | 2406.18525v1 | null |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524v1 | null |
2024-06-26 | Integrability and renormalizability for the fully anisotropic | G. A. Kotousov et.al. | 2406.18523v1 | null |
2024-06-25 | Quantum hall transformer in a quantum point contact over the full range of transmission | Stuart N. Thomas et.al. | 2406.17778v1 | null |
2024-06-25 | Text-Animator: Controllable Visual Text Video Generation | Lin Liu et.al. | 2406.17777v1 | null |
2024-06-25 | Evidence of thermodynamics and magnetic monopole plasma formation by photon-magnon interaction in artificial spin ice | D. G. Duarte et.al. | 2406.17775v1 | null |
2024-06-25 | Spectrum and low-energy gap in triangular quantum spin liquid NaYbSe$_2$ | A. O. Scheie et.al. | 2406.17773v1 | null |
2024-06-25 | Violation of | Hoang Ky Nguyen et.al. | 2406.17771v1 | null |
2024-06-25 | MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Xiangyu Zhao et.al. | 2406.17770v1 | link |
2024-06-25 | EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data | Jesse Zhang et.al. | 2406.17768v1 | null |
2024-06-25 | Splitting Guarantees for Prophet Inequalities via Nonlinear Systems | Johannes Brustle et.al. | 2406.17767v1 | null |
2024-06-25 | Generalized anomalous Hall crystals in twisted bilayer-trilayer graphene | Ruiheng Su et.al. | 2406.17766v1 | null |
2024-06-25 | A dimension formula of closed affine Deligne-Lusztig varieties of parahoric level | Arghya Sadhukhan et.al. | 2406.17765v1 | null |
2024-06-24 | Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models | Jierun Chen et.al. | 2406.16866v1 | link |
2024-06-24 | Variational Monte Carlo Study of the Doped | Can Cui et.al. | 2406.16865v1 | null |
2024-06-24 | StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal | Chongjie Ye et.al. | 2406.16864v1 | null |
2024-06-24 | FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Haonan Qiu et.al. | 2406.16863v1 | link |
2024-06-24 | Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Junbang Liang et.al. | 2406.16862v1 | null |
2024-06-24 | Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Shengbang Tong et.al. | 2406.16860v1 | link |
2024-06-24 | EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li et.al. | 2406.16858v1 | null |
2024-06-24 | The Surface Signature and Rough Surfaces | Darrick Lee et.al. | 2406.16857v1 | null |
2024-06-24 | DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation | Yuang Peng et.al. | 2406.16855v1 | link |
2024-06-24 | Spectroscopy of Hubbard-Mott excitons and their ro-vibrational excitations | Annabelle Bohrdt et.al. | 2406.16854v1 | null |
2024-06-21 | A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick | Nishant Balepur et.al. | 2406.15352v1 | null |
2024-06-21 | Impact & Mitigation of Polarized Extragalactic Foregrounds on Bayesian Cosmic Microwave Background Lensing | Frank J. Qu et.al. | 2406.15351v1 | null |
2024-06-21 | Chiral Spin Liquid and Quantum Phase Transition in the Triangular Lattice Hofstadter-Hubbard Model | Stefan Divic et.al. | 2406.15348v1 | null |
2024-06-21 | A model-independent measurement of the expansion and growth rates from BOSS using the FreePower method | Adrian P. Schirra et.al. | 2406.15347v1 | null |
2024-06-21 | Privacy Preserved Blood Glucose Level Cross-Prediction: An Asynchronous Decentralized Federated Learning Approach | Chengzhe Piao et.al. | 2406.15346v1 | link |
2024-06-21 | Elucidating Galaxy Population Properties Using a Model-Free Analysis of Quadruply Imaged Quasar Lenses From Large Surveys | John Miller Jr et.al. | 2406.15344v1 | null |
2024-06-21 | Textured Exciton Insulators | Yves H. Kwan et.al. | 2406.15343v1 | null |
2024-06-21 | GenoTEX: A Benchmark for Evaluating LLM-Based Exploration of Gene Expression Data in Alignment with Bioinformaticians | Haoyang Liu et.al. | 2406.15341v1 | link |
2024-06-21 | Full-Scale Indexing and Semantic Annotation of CT Imaging: Boosting FAIRness | Hannes Ulrich et.al. | 2406.15340v1 | null |
2024-06-21 | Image Conductor: Precision Control for Interactive Video Synthesis | Yaowei Li et.al. | 2406.15339v1 | null |
2024-06-20 | Model Merging and Safety Alignment: One Bad Model Spoils the Bunch | Hasan Abed Al Kader Hammoud et.al. | 2406.14563v1 | null |
2024-06-20 | Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon et.al. | 2406.14562v1 | null |
2024-06-20 | How to Compute the Probability of a Word | Tiago Pimentel et.al. | 2406.14561v1 | null |
2024-06-20 | OH mid-infrared emission as a diagnostic of H$_2$O UV photodissociation. III. Application to planet-forming disks | Benoît Tabone et.al. | 2406.14560v1 | null |
2024-06-20 | Generalized upwind summation-by-parts operators and their application to nodal discontinuous Galerkin methods | Jan Glaubitz et.al. | 2406.14557v1 | null |
2024-06-21 | Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen et.al. | 2406.14556v2 | null |
2024-06-20 | A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models | Xincheng Shuai et.al. | 2406.14555v1 | link |
2024-06-20 | Neutrino mass bounds from DESI 2024 are relaxed by Planck PR4 and cosmological supernovae | Itamar J. Allali et.al. | 2406.14554v1 | null |
2024-06-20 | xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics | Daniil Larionov et.al. | 2406.14553v1 | null |
2024-06-20 | Exploring the no-hair theorem with LISA | Chantal Pitte et.al. | 2406.14552v1 | null |
2024-06-18 | Generalized entropy of photons in AdS | Sean Colin-Ellerin et.al. | 2406.12851v1 | null |
2024-06-18 | Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation | Ning-Hsu Wang et.al. | 2406.12849v1 | null |
2024-06-18 | ChangeViT: Unleashing Plain Vision Transformers for Change Detection | Duowang Zhu et.al. | 2406.12847v1 | link |
2024-06-18 | DrVideo: Document Retrieval Based Long Video Understanding | Ziyu Ma et.al. | 2406.12846v1 | null |
2024-06-18 | Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts | Haoxiang Wang et.al. | 2406.12845v1 | link |
2024-06-18 | Synergizing Foundation Models and Federated Learning: A Survey | Shenghui Li et.al. | 2406.12844v1 | null |
2024-06-18 | Demystifying Higher-Order Graph Neural Networks | Maciej Besta et.al. | 2406.12841v1 | null |
2024-06-18 | Towards an Automatic Framework for Solving Optimization Problems with Quantum Computers | Deborah Volpe et.al. | 2406.12840v1 | null |
2024-06-18 | Evaluating the design space of diffusion-based generative models | Yuqing Wang et.al. | 2406.12839v1 | null |
2024-06-18 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837v1 | link |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840v1 | null |
2024-06-17 | mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang et.al. | 2406.11839v1 | null |
2024-06-17 | Autoregressive Image Generation without Vector Quantization | Tianhong Li et.al. | 2406.11838v1 | null |
2024-06-17 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Lei Zhu et.al. | 2406.11837v1 | link |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836v1 | null |
2024-06-17 | MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs | Ziyu Liu et.al. | 2406.11833v1 | link |
2024-06-17 | Unveiling Encoder-Free Vision-Language Models | Haiwen Diao et.al. | 2406.11832v1 | link |
2024-06-17 | Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Bingqi Ma et.al. | 2406.11831v1 | null |
2024-06-17 | Language Modeling with Editable External Knowledge | Belinda Z. Li et.al. | 2406.11830v1 | link |
2024-06-17 | Learning sum of diverse features: computational hardness and efficient gradient-based training for ridge combinations | Kazusato Oko et.al. | 2406.11828v1 | null |
2024-06-14 | Quantifying Variance in Evaluation Benchmarks | Lovish Madaan et.al. | 2406.10229v1 | null |
2024-06-14 | VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models | Chenyu Zhou et.al. | 2406.10228v1 | null |
2024-06-14 | VideoGUI: A Benchmark for GUI Automation from Instructional Videos | Kevin Qinghong Lin et.al. | 2406.10227v1 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225v1 | null |
2024-06-14 | EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Julian Straub et.al. | 2406.10224v1 | null |
2024-06-14 | Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation | Nameer Hirschkind et.al. | 2406.10223v1 | null |
2024-06-14 | Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding | Ridouane Ghermi et.al. | 2406.10221v1 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219v1 | null |
2024-06-14 | Semantic Membership Inference Attack against Large Language Models | Hamid Mozaffari et.al. | 2406.10218v1 | null |
2024-06-14 | MINDS. A multi-instrument investigation into the molecule-rich JWST-MIRI spectrum of the DF Tau binary system | Sierra L. Grant et.al. | 2406.10217v1 | null |
2024-06-13 | VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Muhammad Maaz et.al. | 2406.09418v1 | link |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417v1 | null |
2024-06-13 | Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models | Qihao Liu et.al. | 2406.09416v1 | null |
2024-06-13 | An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels | Duy-Kien Nguyen et.al. | 2406.09415v1 | null |
2024-06-13 | Depth Anything V2 | Lihe Yang et.al. | 2406.09414v1 | null |
2024-06-13 | Interpreting the Weight Space of Customized Diffusion Models | Amil Dravid et.al. | 2406.09413v1 | link |
2024-06-13 | Explore the Limits of Omni-modal Pretraining at Scale | Yiyuan Zhang et.al. | 2406.09412v1 | link |
2024-06-13 | MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding | Fei Wang et.al. | 2406.09411v1 | null |
2024-06-13 | Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach | Yansheng Li et.al. | 2406.09410v1 | link |
2024-06-13 | Towards Evaluating the Robustness of Visual State Space Models | Hashmat Shadab Malik et.al. | 2406.09407v1 | link |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488v1 | null |
2024-06-13 | Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Yi-Fan Zhang et.al. | 2406.08487v2 | link |
2024-06-12 | On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models | Hashmat Shadab Malik et.al. | 2406.08486v1 | link |
2024-06-12 | Celestial Topology, Symmetry Theories, and Evidence for a Non-SUSY D3-Brane CFT | Jonathan J. Heckman et.al. | 2406.08485v1 | null |
2024-06-12 | Exploiting the diversity of modeling methods to probe systematic biases in strong lensing analyses | A. Galan et.al. | 2406.08484v1 | null |
2024-06-12 | Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang et.al. | 2406.08482v1 | null |
2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481v1 | link |
2024-06-12 | Linear equations with monomial constraints and decision problems in abelian-by-cyclic groups | Ruiwen Dong et.al. | 2406.08480v1 | null |
2024-06-12 | Real3D: Scaling Up Large Reconstruction Models with Real-World Images | Hanwen Jiang et.al. | 2406.08479v1 | null |
2024-06-12 | What If We Recaption Billions of Web Images with LLaMA-3? | Xianhang Li et.al. | 2406.08478v1 | null |
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550v1 | null |
2024-06-11 | A3VLM: Actionable Articulation-Aware Vision Language Model | Siyuan Huang et.al. | 2406.07549v1 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548v1 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547v1 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546v1 | null |
2024-06-11 | Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Aidar Myrzakhan et.al. | 2406.07545v1 | link |
2024-06-11 | Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man et.al. | 2406.07544v1 | null |
2024-06-11 | Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Chenyu Yang et.al. | 2406.07543v1 | link |
2024-06-11 | Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis | David Ortiz-Perez et.al. | 2406.07542v1 | link |
2024-06-11 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu et.al. | 2406.07541v1 | null |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527v1 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526v1 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525v1 | link |
2024-06-10 | Gas Fees on the Ethereum Blockchain: From Foundations to Derivatives Valuations | Bernhard K Meister et.al. | 2406.06524v1 | null |
2024-06-10 | NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing | Ting-Hsuan Chen et.al. | 2406.06523v1 | null |
2024-06-10 | Multiple SLEs for | Yu Feng et.al. | 2406.06522v1 | null |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521v1 | null |
2024-06-10 | Decentralized Personalized Federated Learning | Salma Kharrat et.al. | 2406.06520v1 | null |
2024-06-10 | UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | Shivani Upadhyay et.al. | 2406.06519v1 | link |
2024-06-10 | Data Augmentation for Multivariate Time Series Classification: An Experimental Study | Romain Ilbert et.al. | 2406.06518v1 | null |
2024-06-07 | 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs | Jianing Yang et.al. | 2406.05132v1 | null |
2024-06-07 | DVOS: Self-Supervised Dense-Pattern Video Object Segmentation | Keyhan Najafian et.al. | 2406.05131v1 | null |
2024-06-07 | An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Xiongtao Zhou et.al. | 2406.05130v1 | null |
2024-06-07 | Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Chin-Yun Yu et.al. | 2406.05128v1 | link |
2024-06-07 | Towards Semantic Equivalence of Tokenization in Multimodal LLM | Shengqiong Wu et.al. | 2406.05127v1 | null |
2024-06-07 | GR-Athena++: magnetohydrodynamical evolution with dynamical space-time | Boris Daszuta et.al. | 2406.05126v1 | null |
2024-06-07 | Third-order intrinsic alignment of SDSS BOSS LOWZ galaxies | Laila Linke et.al. | 2406.05122v1 | null |
2024-06-07 | Energy Propagation in Scattering Convolution Networks Can Be Arbitrarily Slow | Hartmut Führ et.al. | 2406.05121v1 | null |
2024-06-07 | Contextual fusion enhances robustness to image blurring | Shruti Joshi et.al. | 2406.05120v1 | null |
2024-06-07 | Compositional Curvature Bounds for Deep Neural Networks | Taha Entesari et.al. | 2406.05119v1 | null |
2024-06-06 | Stereo-Depth Fusion through Virtual Pattern Projection | Luca Bartolomei et.al. | 2406.04345v1 | link |
2024-06-06 | Verbalized Machine Learning: Revisiting Machine Learning with Language Models | Tim Z. Xiao et.al. | 2406.04344v1 | null |
2024-06-06 | Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Stanislaw Szymanowicz et.al. | 2406.04343v1 | null |
2024-06-06 | Learning 1D Causal Visual Representation with De-focus Attention Networks | Chenxin Tao et.al. | 2406.04342v1 | link |
2024-06-06 | Interpreting the Second-Order Effects of Neurons in CLIP | Yossi Gandelsman et.al. | 2406.04341v1 | null |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340v1 | link |
2024-06-06 | RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation | Jiaming Liu et.al. | 2406.04339v1 | null |
2024-06-07 | Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Fangfu Liu et.al. | 2406.04338v2 | null |
2024-06-06 | Coherent Zero-Shot Visual Instruction Generation | Quynh Phung et.al. | 2406.04337v1 | null |
2024-06-06 | Particles and their fluids in | P. P. Avelino et.al. | 2406.04335v1 | null |
2024-06-05 | Wings: Learning Multimodal LLMs without Text-only Forgetting | Yi-Kai Zhang et.al. | 2406.03496v1 | null |
2024-06-05 | Grokking Modular Polynomials | Darshil Doshi et.al. | 2406.03495v1 | null |
2024-06-05 | Stout smearing and Wilson flow in lattice perturbation theory | Maximilian Ammer et.al. | 2406.03493v1 | null |
2024-06-05 | The Logarithmic Memristor-Based Bayesian Machine | Clément Turck et.al. | 2406.03492v1 | null |
2024-06-05 | Detecting Phase Coherence of 2D Bose Gases via Noise Correlations | Shinichi Sunami et.al. | 2406.03491v1 | null |
2024-06-05 | Simultaneous retrieval of orbital phase resolved JWST/MIRI emission spectra of the hot Jupiter WASP-43b: evidence of water, ammonia and carbon monoxide | Jingxuan Yang et.al. | 2406.03490v1 | null |
2024-06-06 | Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training | Ao Sun et.al. | 2406.03488v2 | null |
2024-06-05 | Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Sanjana Ramprasad et.al. | 2406.03487v1 | null |
2024-06-05 | BIPED: Pedagogically Informed Tutoring System for ESL Education | Soonwoo Kwon et.al. | 2406.03486v1 | null |
2024-06-05 | Raman effects in Quantum Frequency Conversion using Bragg Scattering | Mathias Linde Holst Korsgaard et.al. | 2406.03484v1 | null |
2024-06-04 | Local control and mixed dimensions: Exploring high-temperature superconductivity in optical lattices | Henning Schlömer et.al. | 2406.02551v1 | null |
2024-06-04 | Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks | Tianyu He et.al. | 2406.02550v1 | link |
2024-06-04 | Dreamguider: Improved Training free Diffusion-based Conditional Generation | Nithin Gopalakrishnan Nair et.al. | 2406.02549v1 | null |
2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548v1 | link |
2024-06-04 | Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Alex Jinpeng Wang et.al. | 2406.02547v1 | link |
2024-06-05 | Dark photon limits from patchy dark screening of the cosmic microwave background | Fiona McCarthy et.al. | 2406.02546v2 | null |
2024-06-04 | Robust and highly scalable estimation of directional couplings from time-shifted signals | Luca Ambrogioni et.al. | 2406.02545v1 | null |
2024-06-04 | Asymmetry, Gap Opening and High Accretion Rate on DM Tau: A Hypothesis Based on Interaction of Magnetized Disk Wind with Planet | Yinhao Wu et.al. | 2406.02544v1 | null |
2024-06-04 | To Believe or Not to Believe Your LLM | Yasin Abbasi Yadkori et.al. | 2406.02543v1 | null |
2024-06-04 | Loki: Low-Rank Keys for Efficient Sparse Attention | Prajwal Singhania et.al. | 2406.02542v1 | null |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075v1 | null |
2024-05-31 | Toward Quantum Analogue Simulation of Many-Body Supersymmetry with Rydberg Atom Arrays | Hrushikesh Sable et.al. | 2405.21073v1 | null |
2024-05-31 | Fast inspirals and the treatment of orbital resonances | Philip Lynch et.al. | 2405.21072v1 | null |
2024-05-31 | A Multi-wavelength, Multi-epoch Monitoring Campaign of Accretion Variability in T Tauri Stars from the ODYSSEUS Survey. II. Photometric Light Curves | John Wendeborn et.al. | 2405.21071v1 | null |
2024-05-31 | Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights | Xin Wen et.al. | 2405.21070v1 | link |
2024-05-31 | Code Pretraining Improves Entity Tracking Abilities of Language Models | Najoung Kim et.al. | 2405.21068v1 | null |
2024-05-31 | Brightening and Fading in the Youngest Galactic Supernova Remnant G1.9+0.3: 13 years of monitoring with the Chandra X-ray Observatory | Kazimierz J. Borkowski et.al. | 2405.21067v1 | null |
2024-05-31 | Mixed Diffusion for 3D Indoor Scene Synthesis | Siyi Hu et.al. | 2405.21066v1 | null |
2024-05-31 | Recurrent neural networks: vanishing and exploding gradients are not the end of the story | Nicolas Zucchet et.al. | 2405.21064v1 | null |
2024-05-31 | Neural Network Verification with Branch-and-Bound for General Nonlinearities | Zhouxing Shi et.al. | 2405.21063v1 | null |
2024-05-30 | Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | Kailu Wu et.al. | 2405.20343v1 | null |
2024-05-30 | Evaluating Approximations of Count Distributions and Forecasts for Poisson-Lindley Integer Autoregressive Processes | Rachel D. Gidaro et.al. | 2405.20342v1 | null |
2024-05-30 | From Zero to Hero: Cold-Start Anomaly Detection | Tal Reiss et.al. | 2405.20341v1 | link |
2024-05-30 | MotionLLM: Understanding Human Behaviors from Human Motions and Videos | Ling-Hao Chen et.al. | 2405.20340v1 | null |
2024-05-30 | Visual Perception by Large Language Model's Weights | Feipeng Ma et.al. | 2405.20339v1 | null |
2024-05-30 | Mixed finite element methods for fourth order obstacle problems in linearised elasticity | Paolo Piersanti et.al. | 2405.20338v1 | null |
**2024-0 |