Skip to content
This repository has been archived by the owner on Sep 6, 2023. It is now read-only.

Replace pandoc_ast with scraper #67

Closed
shonfeder opened this issue Mar 11, 2021 · 0 comments · Fixed by #68
Closed

Replace pandoc_ast with scraper #67

shonfeder opened this issue Mar 11, 2021 · 0 comments · Fixed by #68
Milestone

Comments

@shonfeder
Copy link
Contributor

As documented on #65 and #33, pandoc_ast is not actively maintained to stay
compatible with the current pandoc API. In the midterm, we'll still want to
address #33 and drop the pandoc dependency, but for the short term we will:

  • Use pandoc to dump the markdown to HTML.
  • Use scraper to extract the logical
    units from the definition list.
  • Use html2md to render the logical
    unit content back to markdown for entry to the database.
@shonfeder shonfeder added this to the Release v0.1 milestone Mar 11, 2021
shonfeder added a commit that referenced this issue Mar 11, 2021
Closes #67

Instead of using pandoc_ast to ingest the JSON representation of
pandoc's AST, we are now using pandoc to convert the md to html, then
using scraper to extract the parts the we need, and html2md to return
render anything back to md if needed.

This is intended as a stopgap.
shonfeder added a commit that referenced this issue Mar 11, 2021
Closes #67

Instead of using pandoc_ast to ingest the JSON representation of
pandoc's AST, we are now using pandoc to convert the md to html, then
using scraper to extract the parts the we need, and html2md to return
render anything back to md if needed.

This is intended as a stopgap.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant