This repository has been archived by the owner on Jul 11, 2023. It is now read-only.

Python package to read, transform, enrich and load news data. Patterns can be found in the Dow Jones Developer Portal.

dowjones/factiva-pipelines-python

Dow Jones Factiva Pipelines Library

Tools for processing and analysing data from Snapshots and Streams.

Installation

To install this library, run the following command.

$ pip install --upgrade factiva-pipelines

Using Library services

Reading a downloaded Snapshot folder and enriching its metadata requires just a few lines of code.

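from factiva.pipelines import snapshot_files as sf
from factiva.pipelines import metadata as fm

# Read the downloaded Snapshot folder into a pandas DataFrame
all_articles = sf.read_folder('./nag6oqitd2', only_stats=True)
# Expand country and industry codes into human-readable columns
all_articles = fm.expand_country_codes(all_articles)
all_articles = fm.expand_industry_codes(all_articles)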

The code above reads an entire folder from a Snapshot download into a pandas DataFrame. It then expands some metadata codes into new columns containing their human-readable text.
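To illustrate what "expanding" a code column means, here is a minimal sketch in plain pandas. It is not the library's implementation. The helper `expand_codes`, the lookup table `COUNTRY_NAMES`, the column names `an`, `region_codes` and `region_names`, and the comma-delimited code format are all assumptions made for this example.

```python
import pandas as pd

# Hypothetical lookup table for illustration only; the library uses its
# own code dictionaries.
COUNTRY_NAMES = {"usa": "United States", "uk": "United Kingdom", "fra": "France"}


def expand_codes(df, code_col, names, new_col):
    """Return a copy of df with an added column of human-readable names.

    Assumes code_col holds comma-delimited codes such as ",usa,uk,".
    Codes missing from the lookup table are kept unchanged.
    """
    def translate(cell):
        # Drop the empty strings produced by leading/trailing commas
        codes = [c for c in str(cell).split(",") if c]
        return [names.get(c, c) for c in codes]

    out = df.copy()
    out[new_col] = out[code_col].map(translate)
    return out


# Two made-up rows standing in for articles read from a Snapshot folder
articles = pd.DataFrame({
    "an": ["DOC1", "DOC2"],
    "region_codes": [",usa,uk,", ",fra,xx,"],
})
expanded = expand_codes(articles, "region_codes", COUNTRY_NAMES, "region_names")
print(expanded[["an", "region_names"]])
```

The sketch returns a copy instead of modifying the input, which matches the reassignment pattern used in the example above (`all_articles = fm.expand_country_codes(all_articles)`).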
