☂️ Handle remaining unimplemented content parsing features #921

Closed
14 tasks
PIG208 opened this issue Aug 30, 2024 · 4 comments
Labels
a-content Parsing and rendering Zulip HTML content, notably message contents

Comments

@PIG208
Member

PIG208 commented Aug 30, 2024

This is a follow-up to #190 and #917, to serve as an umbrella issue for unimplemented features to be handled.

Current features

Legacy features/behaviors in old messages

Short-lived experiments

In the public chat.zulip.org message history, each of these appears for only a brief period. It's not clear any of them went to Zulip Cloud or into a published Zulip Server release.


Full Output

Found unimplemented features in 33742 out of 1295577 public messages:

  • <span class="topic-mention">
    Oldest message: 1609467; newest message: 1925998 (15/33742)

  • <table>
    Oldest message: 33947; newest message: 1931762 (348/33742)

  • <div class="codehilite">
    Oldest message: 3444; newest message: 1845456 (10175/33742)

  • <img>
    Oldest message: 1792633; newest message: 1792683 (3/33742)

  • <span class="tex-error">
    Oldest message: 176408; newest message: 1768950 (25/33742)

  • <span class="topic-mention silent">
    Oldest message: 1609468; newest message: 1704065 (2/33742)

  • <div class="inline-preview-twitter">
    Oldest message: 29260; newest message: 1574645 (22176/33742)

  • <span class="katex-display">
    Oldest message: 202662; newest message: 1355972 (19/33742)

  • <span class="timestamp-error">
    Oldest message: 925908; newest message: 1267388 (10/33742)

  • <div class="message_inline_ref">
    Oldest message: 61290; newest message: 945000 (26/33742)

  • <img class="message_body_gravatar">
    Oldest message: 15312; newest message: 927237 (55/33742)

  • <span class="timestamp">
    Oldest message: 882554; newest message: 908075 (38/33742)

  • <p>
    Oldest message: 176412; newest message: 908053 (65/33742)

  • <div class="message_inline_image">
    Oldest message: 4324; newest message: 751747 (709/33742)

  • <span class="katex">
    Oldest message: 308073; newest message: 426840 (3/33742)

  • <div class="message_embed">
    Oldest message: 192764; newest message: 193181 (28/33742)

  • <span class="inline-subscribe">
    Oldest message: 4297; newest message: 97826 (45/33742)


These features are categorized to help us determine the priorities.

  • The current features should be supported before launch. There may also be bugs in the content parser that affect current features.
  • Most of the legacy features (such as inline-subscribe) are irrelevant enough that we probably just need to acknowledge them and render them as plain text.
  • The one-off experiments can likely be addressed by just removing the messages from CZO, without us handling them.

We may add more unimplemented features here as we find them. This survey covered all public messages from CZO (1295577 messages).

Also related:

@PIG208 added the a-content label Aug 30, 2024
@chrisbobbe

This comment was marked as resolved.

@PIG208

This comment was marked as resolved.

@gnprice
Member

gnprice commented Sep 26, 2024

Thanks @PIG208 for doing this survey, and for compiling all this data!

I've gone and filed issues for the remaining items that didn't already have them:

and added the link for one that did:

All of those are post-launch issues, as is #922.

It's encouraging to see that almost all the features this found were fairly obscure. In the list above, the issues that I think we'll want to handle before launching this app to replace the legacy app are three features we knew about already:

and one issue we didn't:

Now that I've filed those other sub-issues (in addition to #922 and #933 which you filed), I'll close this issue in favor of the specific sub-issues. This doesn't belong in the "Launch" milestone since we won't be completing it before launch… but if it's in "Post-launch" then we won't be seeing it on our dashboards, so we'll be relying on the sub-issues for tracking anyway.

@gnprice closed this as not planned Sep 26, 2024
@gnprice
Member

gnprice commented Oct 18, 2024

After looking through the #917 script's output myself, I realized that three classes we had attributed to the known issue of not yet supporting TeX:

are really a pair of other, low-priority issues which we hadn't known about (and I just filed):

I've edited the description to reflect that.

In fact #46 itself doesn't relate to this search for unimplemented content, because TeX nodes — apart from the cases covered by #1003 and #1004 — are already implemented by our parser, as part of #359. Our display of them isn't as rich as we'd like it to be (that's #46), but this survey doesn't have a way to know about that.

Projects
Status: Done