table-extractor

Extract normalized tables from CSV files, Excel spreadsheets, Word documents, and web pages.

A table is a list of rows, and each row is a list of values.
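
To make that shape concrete, here is a small sketch that models a table with plain Python lists. The `Row` and `Table` aliases and the `is_table` helper are hypothetical names used only for illustration; they are not part of the package.

```python
from typing import Any, List

# Hypothetical aliases for illustration only (not exported by table-extractor)
Row = List[Any]      # a row is a list of values
Table = List[Row]    # a table is a list of rows


def is_table(obj: Any) -> bool:
    """Return True if obj is a list whose items are all lists (rows)."""
    return isinstance(obj, list) and all(isinstance(row, list) for row in obj)


# A table with a header row followed by two data rows
movies: Table = [
    ["Name", "Rating"],
    ["The Shawshank Redemption", 9.2],
    ["The Godfather", 9.2],
]
```

Keeping tables as nested lists means they can be indexed, sliced, and serialized with nothing beyond the standard library.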

Installation

pip install table-extractor

Usage

from table_extractor import extract_tables

# Returns a list of tables; each table is a list of rows
tables = extract_tables("/tmp/top_5_movies.docx")
# [[["Name", "Rating"], ["The Shawshank Redemption", 9.2], ["The Godfather", 9.2], ["The Godfather: Part II", 9.2], ["The Dark Knight", 8.9], ["12 Angry Men", 8.9]]]
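
Because each table is just nested lists, the result can be post-processed with ordinary Python. The following is a minimal sketch, assuming the first row of a table is its header, as in the example output above. `table_to_records` is a hypothetical helper, not part of table-extractor, and the literal `tables` value stands in for a real `extract_tables` call.

```python
from typing import Any, Dict, List


def table_to_records(table: List[List[Any]]) -> List[Dict[str, Any]]:
    """Turn a table into a list of dicts keyed by the header row.

    Assumes the first row holds the column names (an assumption of this
    sketch, not a documented guarantee of the package).
    """
    if not table:
        return []
    header, *rows = table
    return [dict(zip(header, row)) for row in rows]


# Stand-in for the value returned by extract_tables(...)
tables = [
    [
        ["Name", "Rating"],
        ["The Shawshank Redemption", 9.2],
        ["The Dark Knight", 8.9],
    ]
]

records = table_to_records(tables[0])
```

Here `records` holds one dict per data row, which is often easier to filter or pass to other tools than positional lists.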

Testing

To test the package, run:

python3 -m unittest table_extractor.tests.test