Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Welsh Written Questiosn #186

Open
ajparsons opened this issue Sep 25, 2024 · 0 comments
Open

Add Welsh Written Questiosn #186

ajparsons opened this issue Sep 25, 2024 · 0 comments
Assignees

Comments

@ajparsons
Copy link
Contributor

We had a quick look at the Senedd as part of the initial project - but it’s not available through the same API as the debate transcripts. Instead, we’d need a new scraper.

There is a search page that can be specified to a date range, and to the Written Question type. However, it actually fetches the boxes with the information through javascript - the same arguments can be fed to the seeMore endpoint that will return a json with some html boxes with the information - using this is needed to page through multiple content automatically.

https://record.senedd.wales/Search/seeMore?type=7

The search page does not allow limiting to answered questions, and does not indicate if this has happened (the ‘for answer on xx/xx/xxxx’ text can’t be trusted). Either questions need to be rechecked until answered, or unanswered questions stashed to be checked again later. This is similar to what’s needed for the London Assembly (see PR).

Complication here of having to do both language versions - have not explored if there are examples where a question has been answered but not yet translated into both languages (my guess is there won’t be, as it’s not live, translation is probably part of the publication process0. . In practice, fetching the Welsh version after successfully retrieving a complete question and answer in English is probably good enough. They’re the same page with different text, so the same scraper would hopefully work aimed at the other page.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants