Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New submissions for Mon, 12 Jun 23 #372

Open
e-tornike opened this issue Jun 12, 2023 · 0 comments
Open

New submissions for Mon, 12 Jun 23 #372

e-tornike opened this issue Jun 12, 2023 · 0 comments

Comments

@e-tornike
Copy link
Owner

Keyword: contrastive

CLC: Cluster Assignment via Contrastive Representation Learning

Authors: Fei Ding, Dan Zhang, Yin Yang, Venkat Krovi, Feng Luo
Arxiv: https://arxiv.org/abs/2306.05439
TLDR: Clustering remains an important and challenging task of grouping samples into clusters without manual annotations. Recent works have achieved excellent results on small datasets by performing clustering on feature representations learned from self-supervised learning. However, for datasets with a large number of clusters, such as ImageNet, current methods still can not achieve high clustering performance. In this paper, we propose Contrastive Learning-based Clustering (CLC), which uses contrastive learning to directly learn cluster assignment
Repo: None

Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment

Authors: Zihui Xue, Kristen Grauman
Arxiv: https://arxiv.org/abs/2306.05526
TLDR: The egocentric and exocentric viewpoints of a human activity look dramatically different, yet invariant representations to link them are essential for many potential applications in robotics and augmented reality. Prior work is limited to learning view-invariant features from paired synchronized viewpoints. We relax that strong data assumption and propose to learn fine-grained action features that are invariant to the viewpoints by aligning egocar and exo videos in time, even when not captured simultaneously or in the
Repo: None

Assisting Language Learners: Automated Trans-Lingual Definition Generation via Contrastive Prompt Learning

Authors: Hengyuan Zhang, Dawei Li, Yanran Li, Chenming Shang, Chufan Shi, Yong Jiang
Arxiv: https://arxiv.org/abs/2306.06058
TLDR: The standard definition generation task requires to automatically produce mono-lingual definitions (e.g., English definitions for English words), but ignores that the generated definitions may also consist of unfamiliar words for language learners. In this work, we propose a novel task of Trans-Lingual Definition Generation (TLDG), which aims to generate definitions in another language, i.e., the native speaker's language. Initially, we explore the unsupervised manner of this task and build up
Repo: None

Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes

Authors: Chun Liu, Suqiang Ma, Zheng Li, Wei Yang, Zhigang Han
Arxiv: https://arxiv.org/abs/2306.06066
TLDR: Zero-shot classification of image scenes which can recognize the image scenes that are not seen in the training stage holds great promise of lowering the dependence on large numbers of labeled samples. To address the zero-shot image scene classification, the cross-modal feature alignment methods have been proposed in recent years. These methods mainly focus on matching the visual features of each image scene with their corresponding semantic descriptors in the latent space. Less attention has been paid to the contrastive relationships between different image
Repo: None

Keyword: data augmentation

LOST: A Mental Health Dataset of Low Self-esteem in Reddit Posts

Authors: Muskan Garg, Manas Gaur, Raxit Goswami, Sunghwan Sohn
Arxiv: https://arxiv.org/abs/2306.05596
TLDR: Low self-esteem and interpersonal needs (i.e., thwarted belongingness (TB) and perceived burdensomeness (PB)) have a major impact on depression and suicide attempts. Individuals seek social connectedness on social media to boost and alleviate their loneliness. Social media platforms allow people to express their thoughts, experiences, beliefs, and emotions. Prior studies on mental health from social media have focused on symptoms, causes, and disorders. Whereas an initial screening of social media content for interpersonal risk
Repo: None

Improving LiDAR 3D Object Detection via Range-based Point Cloud Density Optimization

Authors: Eduardo R. Corral-Soto, Alaap Grandhi, Yannis Y. He, Mrigank Rochan, Bingbing Liu
Arxiv: https://arxiv.org/abs/2306.05663
TLDR: In recent years, much progress has been made in LiDAR-based 3D object detection mainly due to advances in detector architecture designs and availability of large-scale LiDAS datasets. Existing 3D objects detectors tend to perform well on the point cloud regions closer to the LiDar sensor as opposed to on regions that are farther away. In this paper, we investigate this problem from the data perspective instead of detector architecture design. We observe that there is a learning bias in
Repo: None

Augmentation-aware Self-supervised Learning with Guided Projector

Authors: Marcin Przewięźlikowski, Mateusz Pyla, Bartosz Zieliński, Bartłomiej Twardowski, Jacek Tabor, Marek Śmieja
Arxiv: https://arxiv.org/abs/2306.06082
TLDR: Self-supervised learning (SSL) is a powerful technique for learning robust representations from unlabeled data. By learning to remain invariant to applied data augmentations, methods such as SimCLR and MoCo are able to reach quality on par with supervised approaches. However, this invariance may be harmful to solving some downstream tasks which depend on traits affected by augmentations used during pretraining, such as color. In this paper, we propose to foster sensitivity to such characteristics in
Repo: None

Keyword: knowledge graph

Bayesian Knowledge-driven Critiquing with Indirect Evidence

Authors: Armin Toroghi, Griffin Floto, Zhenwei Tang, Scott Sanner
Arxiv: https://arxiv.org/abs/2306.05636
TLDR: Conversational recommender systems (CRS) enhance the expressivity and personalization of recommendations through multiple turns of user-system interaction. Critiquing is a well-known paradigm for CRS that allows users to iteratively refine recommendations by providing feedback about attributes of recommended items. While existing critiquing methodologies utilize direct attributes of items to address user requests such as 'I prefer Western movies', the opportunity of incorporating richer contextual and side information about items stored in Knowledge Graphs
Repo: None

Keyword: legal

LexGPT 0.1: pre-trained GPT-J models with Pile of Law

Authors: Jieh-Sheng Lee
Arxiv: https://arxiv.org/abs/2306.05431
TLDR: This research aims to build generative language models specialized for the legal domain. The manuscript presents the development of LexGPT models based on GPT-J models and pre-trained with Pile of Law. The foundation model built in this manuscript is the initial step for the development, such as further training with reinforcement learning from human feedback. Another objective of this manuscript, is to assist legal professionals in utilizing language models through the ``No Code'' approach. By fine-tuning models
Repo: None

The Potential of Self-Regulation for Front-Running Prevention on DEXes

Authors: Lioba Heimbach, Eric Schertenleib, Roger Wattenhofer
Arxiv: https://arxiv.org/abs/2306.05756
TLDR: The transaction ordering dependency of the smart contracts building decentralized exchanges (DEXes) allow for predatory trading strategies. In particular, front-running attacks present a constant risk for traders on DEXes. Whereas legal regulation outlaws most front-forming practices in traditional finance, such measures are ineffective in preventing front-runs on DEEXes due to the absence of a central authority. While novel market designs hindering front-runners may emerge, it remains unclear whether the market's participants,
Repo: None

Efficient GNN Explanation via Learning Removal-based Attribution

Authors: Yao Rong, Guanchu Wang, Qizhang Feng, Ninghao Liu, Zirui Liu, Enkelejda Kasneci, Xia Hu
Arxiv: https://arxiv.org/abs/2306.05760
TLDR: As Graph Neural Networks (GNNs) have been widely used in real-world applications, model explanations are required not only by users but also by legal regulations. However, simultaneously achieving high fidelity and low computational costs in generating explanations has been a challenge for current methods. In this work, we propose a framework of GNN explanation named LeArn Removal-based Attribution (LARA) to address this problem. Specifically, we introduce removal-based attribution and demonstrate its substantiated link
Repo: None

Quantitative Ink Analysis: Estimating the Number of Inks in Documents through Hyperspectral Imaging

Authors: Aneeqa Abrar, Hamza Iqbal
Arxiv: https://arxiv.org/abs/2306.05784
TLDR: In the field of document forensics, ink analysis plays a crucial role in determining the authenticity of legal and historic documents and detecting forgery. Visual examination alone is insufficient for distinguishing visually similar inks, necessitating the use of advanced scientific techniques. This paper proposes an ink analysis technique based on hyperspectral imaging, which enables the examination of documents in hundreds of narrowly spaced spectral bands, revealing hidden details. The main objective of this study is to identify the number of distinct inks used
Repo: None

Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives

Authors: Rabee Qasem, Banan Tantour, Mohammed Maree
Arxiv: https://arxiv.org/abs/2306.05827
TLDR: With the ever-increasing utilization of natural language processing (NLP), we started to witness over the past few years a significant transformation in our interaction with legal texts. This technology has advanced the analysis and enhanced the understanding of complex legal terminology and contexts. The development of recent large language models (LLMs), particularly ChatGPT, has also introduced a revolutionary contribution to the way that legal texts can be processed and comprehended. In this paper, we present our work on a cooperative-
Repo: None

Learning Not to Spoof

Authors: David Byrd
Arxiv: https://arxiv.org/abs/2306.06087
TLDR: As intelligent trading agents based on reinforcement learning (RL) gain prevalence, it becomes more important to ensure that RL agents obey laws, regulations, and human behavioral expectations. There is substantial literature concerning the aversion of obvious catastrophes like crashing a helicopter or bankrupting a trading account, but little around the avoidance of subtle non-normative behavior for which there are examples, but no programmable definition. Such behavior may violate legal or regulatory, rather than physical or monetary, constraints. In
Repo: None

Keyword: legal text

Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives

Authors: Rabee Qasem, Banan Tantour, Mohammed Maree
Arxiv: https://arxiv.org/abs/2306.05827
TLDR: With the ever-increasing utilization of natural language processing (NLP), we started to witness over the past few years a significant transformation in our interaction with legal texts. This technology has advanced the analysis and enhanced the understanding of complex legal terminology and contexts. The development of recent large language models (LLMs), particularly ChatGPT, has also introduced a revolutionary contribution to the way that legal texts can be processed and comprehended. In this paper, we present our work on a cooperative-
Repo: None

Keyword: multi-task

PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance

Authors: Qianqian Xie, Weiguang Han, Xiao Zhang, Yanzhao Lai, Min Peng, Alejandro Lopez-Lira, Jimin Huang
Arxiv: https://arxiv.org/abs/2306.05443
TLDR: Although large language models (LLMs) has shown great performance on natural language processing (NLP) in the financial domain, there are no publicly available financial tailtored LLMs, instruction tuning datasets, and evaluation benchmarks, which is critical for continually pushing forward the open-source development of financial artificial intelligence (AI). This paper introduces PIXIU, a comprehensive framework including the first financial LLM based on fine-tuning LLaMA with instruction data, the first instruction data
Repo: None

Single-Stage Visual Relationship Learning using Conditional Queries

Authors: Alakh Desai, Tz-Ying Wu, Subarna Tripathi, Nuno Vasconcelos
Arxiv: https://arxiv.org/abs/2306.05689
TLDR: Research in scene graph generation (SGG) usually considers two-stage models, that is, detecting a set of entities, followed by combining them and labeling all possible relationships. While showing promising results, the pipeline structure induces large parameter and computation overhead, and typically hinders end-to-end optimizations. To address this, recent research attempts to train single-stage and multi-task models that are computationally efficient. With the advent of DETR, a set based detection model,
Repo: None

The Role of Diverse Replay for Generalisation in Reinforcement Learning

Authors: Max Weltevrede, Matthijs T.J. Spaan, Wendelin Böhmer
Arxiv: https://arxiv.org/abs/2306.05727
TLDR: In reinforcement learning (RL), key components of many algorithms are the exploration strategy and replay buffer. These strategies regulate what environment data is collected and trained on and have been extensively studied in the RL literature. In this paper, we investigate the impact of these components in the context of generalisation in multi-task RL. We investigate the hypothesis that collecting and training on more diverse data from the training environment will improve zero-shot generalisation to new environments/tasks. We motivate mathematically
Repo: None

Keyword: paraphrase

Towards End-to-end Speech-to-text Summarization

Authors: Raul Monteiro, Diogo Pernes
Arxiv: https://arxiv.org/abs/2306.05432
TLDR: Speech-to-text (S2T) summarization is a time-saving technique for filtering and keeping up with the broadcast news uploaded online on a daily basis. The rise of large language models from deep learning with impressive text generation capabilities has placed the research focus on summarization systems that produce paraphrased compact versions of the document content, also known as abstractive summaries. End-to the end (E2E) modelling of S2T abstractive summarization
Repo: None

FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering

Authors: Megha Chakraborty, Khusbu Pahwa, Anku Rani, Adarsh Mahor, Aditya Pakala, Arghya Sarkar, Harshit Dave, Ishan Paul, Janvita Reddy, Preethi Gurumurthy, Ritvik G, Samahriti Mukherjee, Shreyas Chatterjee, Kinjal Sensharma, Dwip Dalal, Suryavardan S, Shreyash Mishra, Parth Patwa, Aman Chadha, Amit Sheth, Amitava Das
Arxiv: https://arxiv.org/abs/2306.05523
TLDR: Combating disinformation is one of the burning societal crises -- about 67% of the American population believes that disinformation produces a lot of uncertainty, and 10% of them knowingly propagate disinformation. Evidence shows that disinformation can manipulate democratic processes and public opinion, causing disruption in the share market, panic and anxiety in society, and even death during crises. Therefore, disinformation should be identified promptly and, if possible, mitigated. With approximately 3.2 billion images and 720,000 hours of video shared
Repo: None

Emotion and Sentiment Guided Paraphrasing

Authors: Justin J. Xie, Ameeta Agrawal
Arxiv: https://arxiv.org/abs/2306.05556
TLDR: Paraphrase generation, a.k.a. paraphrasing, is a common and important task in natural language processing. Emotional paraphrased, which changes the emotion embodied in a piece of text while preserving its meaning, has many potential applications, including moderating online dialogues and preventing cyberbullying. We introduce a new task of fine-grained emotional paraphrase along emotion gradients, that is, altering the emotional intensities of the paraphrases in
Repo: None

Keyword: plagiarism

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

Authors: Jinyan Su, Terry Yue Zhuo, Di Wang, Preslav Nakov
Arxiv: https://arxiv.org/abs/2306.05540
TLDR: With the rapid progress of large language models (LLMs) and the huge amount of text they generated, it becomes more and more impractical to manually distinguish whether a text is machine-generated. Given the growing use of LLMs in social media and education, it prompts us to develop methods to detect machine- generated text, preventing malicious usage such as plagiarism, misinformation, and propaganda. Previous work has studied several zero-shot methods, which require no training data. These methods achieve good
Repo: None

Keyword: robustness

Is Attentional Channel Processing Design Required? Comprehensive Analysis Of Robustness Between Vision Transformers And Fully Attentional Networks

Authors: Abhishri Ajit Medewar, Swanand Ashokrao Kavitkar
Arxiv: https://arxiv.org/abs/2306.05495
TLDR: The robustness testing has been performed for standard CNN models and Vision Transformers, however there is a lack of comprehensive study between the robustness of traditional Vision Transformers without an extra attentional channel design and the latest fully attentional network(FAN) models. So in this paper, we use the ImageNet dataset to compare the primitives of fully attentionable network(fAN) model with traditional Vision Optimus to understand the role of an attentional pattern processing design using white box attacks and
Repo: None

Robust Active and Passive Beamforming for RIS-Assisted Full-Duplex Systems under Imperfect CSI

Authors: Li-Hsiang Shen, Chia-Jou Ku, Kai-Ten Feng
Arxiv: https://arxiv.org/abs/2306.05693
TLDR: The sixth-generation (6G) wireless technology recognizes the potential of reconfigurable intelligent surfaces (RIS) as an effective technique for intelligently manipulating channel paths through reflection to serve desired users. Full-duplex (FD) systems, enabling simultaneous transmission and reception from a base station (BS), offer the theoretical advantage of doubled spectrum efficiency. However, the presence of strong self-interference (SI) in FD systems significantly degrades performance, which can be mitigated by
Repo: None

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

Authors: Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Lianwen Jin, Fengjun Guo, Kai Ding
Arxiv: https://arxiv.org/abs/2306.05749
TLDR: Recently, there has been a growing interest in research concerning document image analysis and recognition in photographic scenarios. However, the lack of labeled datasets for this emerging challenge poses a significant obstacle, as manual annotation can be time-consuming and impractical. To tackle this issue, we present DocAligner, a novel method that streamlines the manual annotation process to a simple step of taking pictures. DocAligneder achieves this by establishing dense correspondence between photographic document images and their clean counterparts. It
Repo: None

Robust Probabilistic Temporal Logics

Authors: Martin Zimmermann
Arxiv: https://arxiv.org/abs/2306.05806
TLDR: We robustify PCTL and PCT L*, the most important specification languages for probabilistic systems, and show that robustness does not increase the complexity of the model-checking problems.
Repo: None

Extending Kernel PCA through Dualization: Sparsity, Robustness and Fast Algorithms

Authors: Francesco Tonin, Alex Lambert, Panagiotis Patrinos, Johan A. K. Suykens
Arxiv: https://arxiv.org/abs/2306.05815
TLDR: The goal of this paper is to revisit Kernel Principal Component Analysis (KPCA) through dualization of a difference of convex functions. This allows to naturally extend KPCA to multiple objective functions and leads to efficient gradient-based algorithms avoiding the expensive SVD of the Gram matrix. Particularly, we consider objective functions that can be written as Moreau envelopes, demonstrating how to promote robustness and sparsity within the same framework. The proposed method is evaluated on synthetic and real
Repo: None

Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?

Authors: Wissam Antoun, Virginie Mouilleron, Benoît Sagot, Djamé Seddah
Arxiv: https://arxiv.org/abs/2306.05871
TLDR: Recent advances in natural language processing (NLP) have led to the development of large language models (LLMs) such as ChatGPT. This paper proposes a methodology for developing and evaluating ChatgPT detectors for French text, with a focus on investigating their robustness on out-of-domain data and against common attack schemes. The proposed method involves translating an English dataset into French and training a classifier on the translated data. Results show that the detectors can effectively detect ChatG
Repo: None

Overcoming Adversarial Attacks for Human-in-the-Loop Applications

Authors: Ryan McCoppin, Marla Kennedy, Platon Lukyanenko, Sean Kennedy
Arxiv: https://arxiv.org/abs/2306.05952
TLDR: Including human analysis has the potential to positively affect the robustness of Deep Neural Networks and is relatively unexplored in the Adversarial Machine Learning literature. Neural network visual explanation maps have been shown to be prone to adversarial attacks. Further research is needed in order to select robust visualizations of explanations for the image analyst to evaluate a given model. These factors greatly impact Human-In-The-Loop (HITL) evaluation tools due to their reliance on adversarial images,
Repo: None

A Dynamical Graph Prior for Relational Inference

Authors: Liming Pan, Cheng Shi, Ivan Dokmanić
Arxiv: https://arxiv.org/abs/2306.06041
TLDR: Relational inference aims to identify interactions between parts of a dynamical system from the observed dynamics. Current state-of-the-art methods fit a graph neural network (GNN) on a learnable graph to the dynamics. They use one-step message-passing GNNs -- intuitively the right choice since non-locality of multi-step or spectral gNNs may confuse direct and indirect interactions. But the \textit{effective} interaction graph depends on
Repo: https://github.com/dadacheng/dygr

Gemtelligence: Accelerating Gemstone classification with Deep Learning

Authors: Tommaso Bendinelli, Luca Biggio, Daniel Nyfeler, Abhigyan Ghosh, Peter Tollan, Moritz Alexander Kirschmann, Olga Fink
Arxiv: https://arxiv.org/abs/2306.06069
TLDR: The value of luxury goods, particularly investment-grade gemstones, is greatly influenced by their origin and authenticity, sometimes resulting in differences worth millions of dollars. Traditionally, human experts have determined the origin and detected treatments on gemstones through visual inspections and a range of analytical methods. However, the interpretation of the data can be subjective and time-consuming, resulting in inconsistencies. In this study, we propose Gemtelligence, a novel approach based on deep learning that enables accurate and
Repo: None

Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering

Authors: Irina-Elena Veliche, Pascale Fung
Arxiv: https://arxiv.org/abs/2306.06083
TLDR: The challenge of fairness arises when Automatic Speech Recognition (ASR) systems do not perform equally well for all sub-groups of the population. In the past few years there have been many improvements in overall speech recognition quality, but without any particular focus on advancing Equality and Equity for all users of ASR systems. ASR fairness is therefore also a robustness issue. Meanwhile, data privacy also takes priority in production systems. In this paper, we present a privacy preserving approach to improve
Repo: None

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

Authors: Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee
Arxiv: https://arxiv.org/abs/2306.06094
TLDR: Recently, large language models (LLMs) have made significant advancements in natural language understanding and generation. However, their potential in computer vision remains largely unexplored. In this paper, we introduce a new, exploratory approach that enables LLMs to process images using the Scalable Vector Graphics (SVG) format. By leveraging the XML-based textual descriptions of SVG representations instead of raster images, we aim to bridge the gap between the visual and textual modalities, allowing LLMs
Repo: None

Keyword: summarization

Towards End-to-end Speech-to-text Summarization

Authors: Raul Monteiro, Diogo Pernes
Arxiv: https://arxiv.org/abs/2306.05432
TLDR: Speech-to-text (S2T) summarization is a time-saving technique for filtering and keeping up with the broadcast news uploaded online on a daily basis. The rise of large language models from deep learning with impressive text generation capabilities has placed the research focus on summarization systems that produce paraphrased compact versions of the document content, also known as abstractive summaries. End-to the end (E2E) modelling of S2T abstractive summarization
Repo: None

AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization

Authors: Guan Wang, Weihua Li, Edmund M-K. Lai, Quan Bai
Arxiv: https://arxiv.org/abs/2306.05537
TLDR: The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant information by generating short and salient content from long or multiple documents. Recent advances in pre-trained language models, such as ChatG
Repo: None

Privacy- and Utility-Preserving NLP with Anonymized Data: A case study of Pseudonymization

Authors: Oleksandr Yermilov, Vipul Raheja, Artem Chernodub
Arxiv: https://arxiv.org/abs/2306.05561
TLDR: This work investigates the effectiveness of different pseudonymization techniques, ranging from rule-based substitutions to using pre-trained Large Language Models (LLMs), on a variety of datasets and models used for two widely used NLP tasks: text classification and summarization. Our work provides crucial insights into the gaps between original and anonymized data (focusing on the pseudonymization technique) and model quality and fosters future research into higher-quality anonymization techniques to better balance the trade-offs
Repo: None

Embodied Executable Policy Learning with Language-based Scene Summarization

Authors: Jielin Qiu, Mengdi Xu, William Han, Seungwhan Moon, Ding Zhao
Arxiv: https://arxiv.org/abs/2306.05696
TLDR: Large Language models (LLMs) have shown remarkable success in assisting robot learning tasks, i.e., complex household planning. However, the performance of pretrained LLMs heavily relies on domain-specific templated text data, which may be infeasible in real-world robot learning task with image-based observations. Moreover, existing LLMs with text inputs lack the capability to evolve with non-expert interactions with environments. In this work, we introduce a novel learning paradigm
Repo: None

Keyword: text generation

Towards End-to-end Speech-to-text Summarization

Authors: Raul Monteiro, Diogo Pernes
Arxiv: https://arxiv.org/abs/2306.05432
TLDR: Speech-to-text (S2T) summarization is a time-saving technique for filtering and keeping up with the broadcast news uploaded online on a daily basis. The rise of large language models from deep learning with impressive text generation capabilities has placed the research focus on summarization systems that produce paraphrased compact versions of the document content, also known as abstractive summaries. End-to the end (E2E) modelling of S2T abstractive summarization
Repo: None

AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization

Authors: Guan Wang, Weihua Li, Edmund M-K. Lai, Quan Bai
Arxiv: https://arxiv.org/abs/2306.05537
TLDR: The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant information by generating short and salient content from long or multiple documents. Recent advances in pre-trained language models, such as ChatG
Repo: None

Emotion and Sentiment Guided Paraphrasing

Authors: Justin J. Xie, Ameeta Agrawal
Arxiv: https://arxiv.org/abs/2306.05556
TLDR: Paraphrase generation, a.k.a. paraphrasing, is a common and important task in natural language processing. Emotional paraphrased, which changes the emotion embodied in a piece of text while preserving its meaning, has many potential applications, including moderating online dialogues and preventing cyberbullying. We introduce a new task of fine-grained emotional paraphrase along emotion gradients, that is, altering the emotional intensities of the paraphrases in
Repo: None
@e-tornike e-tornike self-assigned this Jun 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment