Skip to content

A pairwise GUI dataset between Android phone and tablet. Accepted by NeurIPS2023.

Notifications You must be signed in to change notification settings

huhanGitHub/papt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

News

Our manuscript regarding the methodology for collecting pairwise GUIs has been formally accepted into the NeurIPS 2023 track on datasets and benchmarks. Paper

Papt

The current dataset is available at Papt. Papt is a pairwise GUI dataset between Android phones and tablets. A detailed introduction about this dataset is in the dataset directory. This dataset includes pairwise phone-tablet GUIs.

The 'dataset' directory contains pair examples and more introductions.

Data Source

We first crawl 6,456 tablet apps from Google Play. Then we match their corresponding phone apps by their app names and app developers. Finally, we collect 5,593 valid phone-tablet app pairs from 22 app categories.

Data Collection

First, we dynamically adjust the resolution of the device to match GUI pairs. image

Second, we compare the similarities of two GUIs from phones and tablets and match two GUIs with high similarity. image

The tools directory contains data collection tools. You may see the introduction in this directory. We open source two tools: Adjust resolution collector and GUI Explorer and similarity comparer.

Pair format

Currently, we have about 10,000 phone-tablet GUI pairs now.

image

This figure shows an example of pairwise GUI pages of the app 'Spotify' in our dataset. All GUI pairs in one phone-tablet app pair are placed in the same directory. Each pair consists of four elements:

(1) a screenshot of the GUI running on the phone ('phone_1676189565_MainActivity.png');

(2) the metadata data corresponding to the GUI screenshot on the phone ('phone_1676189565_MainActivity.xml');

(3) a screenshot of the GUI running on the tablet ('tablet_1676189565_MainActivity.png');

(4)the metadata data corresponding to the GUI screenshot on the tablet ('tablet_1676189565_MainActivity.xml').

The naming format for all files in the dataset is 'Device_Timestamp_Activity' Name. As shown in Figure, The filename tablet_1676189565_MainActivity.xml indicates that this file was obtained by the tablet and was collected with the timestamp '1676189565', this GUI belongs to 'MainActivity' and this file is a metadata file in XML format. We use timestamps and activity names to distinguish phone-tablet GUI pairs.

Source Paper

We describe all the details in our source paper. You can refer to this paper to get more details. If this dataset benefits your research, we hope you cite our paper.

@misc{hu2023pairwise,

  title={Pairwise GUI Dataset Construction Between Android Phones and Tablets},  
  
  author={Han Hu and Haolan Zhan and Yujin Huang and Di Liu},     
  
  journal={Advances in Neural Information Processing Systems},
  
  volume={36},
  
  year={2024},

}

About

A pairwise GUI dataset between Android phone and tablet. Accepted by NeurIPS2023.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published