Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IFB events scraper not working #61

Open
njall opened this issue Oct 18, 2018 · 3 comments
Open

IFB events scraper not working #61

njall opened this issue Oct 18, 2018 · 3 comments
Assignees

Comments

@njall
Copy link
Contributor

njall commented Oct 18, 2018

No description provided.

@knirirr
Copy link
Contributor

knirirr commented Oct 29, 2018

I had a quick look at this. It's not working as it's not been finished - it seems the page at https://www.france-bioinformatique.fr/en/evenements_upcoming/ is malformed, for if I'm reading this correctly then the event divs aren't closed properly. Therefore, parsing it is quite difficult.

@njall
Copy link
Contributor Author

njall commented Dec 18, 2018

Could you take another look at this? They previously had had an SSL issue which was forcing a redirect preventing our RDF parser from working but I've received an e-mail saying they've fixed it.
If it will not parse due to malformed HTML, could you give me a few more details to send to them (e.g. where the bad brackets are)

knirirr added a commit that referenced this issue Dec 18, 2018
knirirr added a commit that referenced this issue Dec 18, 2018
@knirirr
Copy link
Contributor

knirirr commented Dec 18, 2018

Trying to parse the RDF results in the output below.
Any chance of them supplying their list as JSON, YAML or something like that?

ERROR https://www.france-bioinformatique.fr: 201:36: FATAL: Double hyphen within comment: <!--<div class="sous-menu sous-menu
307:36: FATAL: Double hyphen within comment: <!--<div class="sous-menu sous-menu
562:112: FATAL: AttValue: " or ' expected
562:112: FATAL: attributes construct error
562:112: FATAL: Couldn't find end of Start Tag a line 562
563:104: FATAL: AttValue: " or ' expected
563:104: FATAL: attributes construct error
563:104: FATAL: Couldn't find end of Start Tag div line 563
564:20: FATAL: AttValue: " or ' expected
564:20: FATAL: attributes construct error
564:20: FATAL: Couldn't find end of Start Tag img line 564
564:182: FATAL: Opening and ending tag mismatch: div line 562 and img
567:36: FATAL: AttValue: " or ' expected
567:36: FATAL: attributes construct error
567:36: FATAL: Couldn't find end of Start Tag img line 567
567:217: FATAL: Opening and ending tag mismatch: BR line 562 and img
568:18: FATAL: AttValue: " or ' expected
568:18: FATAL: attributes construct error
568:18: FATAL: Couldn't find end of Start Tag p line 568
569:39: FATAL: AttValue: " or ' expected
569:39: FATAL: attributes construct error
569:39: FATAL: Couldn't find end of Start Tag img line 569
569:246: FATAL: Opening and ending tag mismatch: div line 561 and img
575:20: FATAL: Opening and ending tag mismatch: BR line 573 and span
576:16: FATAL: Opening and ending tag mismatch: BR line 572 and p
577:17: FATAL: Opening and ending tag mismatch: BR line 571 and div
577:22: FATAL: Opening and ending tag mismatch: span line 570 and a
577:30: FATAL: AttValue: " or ' expected
577:30: FATAL: attributes construct error
577:30: FATAL: Couldn't find end of Start Tag a line 577
578:104: FATAL: AttValue: " or ' expected
578:104: FATAL: attributes construct error
578:104: FATAL: Couldn't find end of Start Tag div line 578
579:20: FATAL: AttValue: " or ' expected
579:20: FATAL: attributes construct error
579:20: FATAL: Couldn't find end of Start Tag img line 579
579:181: FATAL: Opening and ending tag mismatch: div line 558 and img
582:36: FATAL: AttValue: " or ' expected
582:36: FATAL: attributes construct error
582:36: FATAL: Couldn't find end of Start Tag img line 582
582:217: FATAL: Opening and ending tag mismatch: div line 557 and img
583:18: FATAL: AttValue: " or ' expected
583:18: FATAL: attributes construct error
583:18: FATAL: Couldn't find end of Start Tag p line 583
584:39: FATAL: AttValue: " or ' expected
584:39: FATAL: attributes construct error
584:39: FATAL: Couldn't find end of Start Tag img line 584
584:234: FATAL: Opening and ending tag mismatch: div line 551 and img
590:20: FATAL: Opening and ending tag mismatch: BR line 588 and span
591:16: FATAL: Opening and ending tag mismatch: BR line 587 and p
592:17: FATAL: Opening and ending tag mismatch: BR line 586 and div
592:22: FATAL: Opening and ending tag mismatch: span line 585 and a
592:84: FATAL: AttValue: " or ' expected
592:84: FATAL: attributes construct error
592:84: FATAL: Couldn't find end of Start Tag a line 592
593:104: FATAL: AttValue: " or ' expected
593:104: FATAL: attributes construct error
593:104: FATAL: Couldn't find end of Start Tag div line 593
594:20: FATAL: AttValue: " or ' expected
594:20: FATAL: attributes construct error
594:20: FATAL: Couldn't find end of Start Tag img line 594
594:182: FATAL: Opening and ending tag mismatch: div line 592 and img
597:36: FATAL: AttValue: " or ' expected
597:36: FATAL: attributes construct error
597:36: FATAL: Couldn't find end of Start Tag img line 597
597:217: FATAL: Opening and ending tag mismatch: div line 549 and img
598:18: FATAL: AttValue: " or ' expected
598:18: FATAL: attributes construct error
598:18: FATAL: Couldn't find end of Start Tag p line 598
599:39: FATAL: AttValue: " or ' expected
599:39: FATAL: attributes construct error
599:39: FATAL: Couldn't find end of Start Tag img line 599
599:240: FATAL: Opening and ending tag mismatch: div line 534 and img
605:20: FATAL: Opening and ending tag mismatch: BR line 603 and span
606:16: FATAL: Opening and ending tag mismatch: BR line 602 and p
607:17: FATAL: Opening and ending tag mismatch: BR line 601 and div
607:22: FATAL: Opening and ending tag mismatch: span line 600 and a
607:30: FATAL: AttValue: " or ' expected
607:30: FATAL: attributes construct error
607:30: FATAL: Couldn't find end of Start Tag a line 607
608:104: FATAL: AttValue: " or ' expected
608:104: FATAL: attributes construct error
608:104: FATAL: Couldn't find end of Start Tag div line 608
609:20: FATAL: AttValue: " or ' expected
609:20: FATAL: attributes construct error
609:20: FATAL: Couldn't find end of Start Tag img line 609
609:182: FATAL: Opening and ending tag mismatch: section line 533 and img
612:36: FATAL: AttValue: " or ' expected
612:36: FATAL: attributes construct error
612:36: FATAL: Couldn't find end of Start Tag img line 612
612:217: FATAL: Opening and ending tag mismatch: main line 528 and img
613:18: FATAL: AttValue: " or ' expected
613:18: FATAL: attributes construct error
613:18: FATAL: Couldn't find end of Start Tag p line 613
614:39: FATAL: AttValue: " or ' expected
614:39: FATAL: attributes construct error
614:39: FATAL: Couldn't find end of Start Tag img line 614
614:263: FATAL: Opening and ending tag mismatch: div line 526 and img
620:20: FATAL: Opening and ending tag mismatch: BR line 618 and span
621:16: FATAL: Opening and ending tag mismatch: BR line 617 and p
622:17: FATAL: Opening and ending tag mismatch: BR line 616 and div
622:22: FATAL: Opening and ending tag mismatch: span line 615 and a
622:30: FATAL: AttValue: " or ' expected
622:30: FATAL: attributes construct error
622:30: FATAL: Couldn't find end of Start Tag a line 622
623:104: FATAL: AttValue: " or ' expected
623:104: FATAL: attributes construct error
623:104: FATAL: Couldn't find end of Start Tag div line 623
624:20: FATAL: AttValue: " or ' expected
624:20: FATAL: attributes construct error
624:20: FATAL: Couldn't find end of Start Tag img line 624
624:182: FATAL: Opening and ending tag mismatch: body line 97 and img
627:36: FATAL: AttValue: " or ' expected
627:36: FATAL: attributes construct error
627:36: FATAL: Couldn't find end of Start Tag img line 627
627:217: FATAL: Opening and ending tag mismatch: html line 3 and img
628:12: FATAL: Extra content at the end of the document

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants