any2rss: Create RSS feeds from broken websites

This program scrapes HTML and outputs RSS. It is made to work on very, very broken websites, unlike projects such as html2rss, which you should use instead if it works for you.

Unlike other approaches, which limit you to a simplified DSL for describing how to extract the data, this package gives you the full freedom of Python. any2rss manages downloading, caching, and parsing, then hands off to your code for the extraction. After that, any2rss handles sanitising the HTML and generating the actual RSS document.
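To make the division of labour concrete, here is a minimal sketch of the "extract, then generate RSS" idea using only the Python standard library. It is not the any2rss API: the names LinkExtractor and build_rss, the sample HTML, and the example.invalid URL are all made up for illustration, and downloading, caching, and sanitising are left out entirely.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class LinkExtractor(HTMLParser):
    """Hypothetical extractor: collect (href, text) for every <a> element."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only collect text while inside an <a> that has an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.items.append((self._href, "".join(self._text).strip()))
            self._href = None


def build_rss(title, link, items):
    """Hypothetical generator: build a bare-bones RSS 2.0 document as a string."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = title
    for href, text in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = text
        ET.SubElement(item, "link").text = href
    return ET.tostring(rss, encoding="unicode")


# Sloppy input: the <li> elements are never closed.
html = '<ul><li><a href="/post/1">First post</a><li><a href="/post/2">Second post</a></ul>'
parser = LinkExtractor()
parser.feed(html)
parser.close()
feed = build_rss("Example feed", "https://example.invalid/", parser.items)
```

The extraction step is ordinary Python, so anything a DSL cannot express (special cases, string surgery, fallbacks) can be written directly.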

Documentation

This project is in its very early stages and there is almost no documentation. Here is how to run the example:

$ # This assumes you have set up a venv and installed this package using pip.
$ any2rss -m any2rss.examples.the_whiteboard
<?xml version='1.0' encoding='UTF-8'?>
<rss version="2.0">
[...]
$

The idea is that you use a cron job to run any2rss and generate feeds on your self-hosted web server, which you can then poll using your RSS reader of choice.
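One possible way to wire this up is a small wrapper script that cron calls. The sketch below runs a command, captures its standard output, and swaps the result into place atomically, so an RSS reader polling mid-run never sees a half-written file. The function name write_feed and the destination path in the comment are assumptions for illustration; only the any2rss command line comes from the example above.

```python
import os
import subprocess
import tempfile


def write_feed(command, dest):
    """Run `command` and atomically replace `dest` with its standard output."""
    # check=True raises if the command fails, so a broken run never
    # overwrites the last good feed.
    result = subprocess.run(command, check=True, capture_output=True)
    directory = os.path.dirname(os.path.abspath(dest))
    # Create the temporary file in the same directory so os.replace stays
    # on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(result.stdout)
        # mkstemp creates the file as owner-only; make it readable by the web server.
        os.chmod(tmp_path, 0o644)
        os.replace(tmp_path, dest)
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise


# Example call (the destination path is hypothetical):
# write_feed(["any2rss", "-m", "any2rss.examples.the_whiteboard"],
#            "/var/www/feeds/the_whiteboard.xml")
```

Because a failed run raises before the file is replaced, the previous feed stays in place until the next successful run.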


VorpalBlade/any2rss
