-
-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature request] submit pandoc commandline options by metadata and/or separate yaml file #2069
Comments
+++ Bernhard Weichel [Apr 11 15 08:54 ]:
That's a bit difficult, since we need to know the reader extensions in order to know whether (and also how) to parse the YAML metadata in the first place. |
isn't So, we could have a hierarchy (decreasing significance). The settings could be merged. In that sense it would not be possible to turn off
|
+++ Bernhard Weichel [Apr 11 15 23:02 ]:
I suppose they could be changed mid-parse. But then we'd face something like the Liar Paradox if you specified in the YAML metadata that |
Indeed, this could be a problem. Maybe we could resolve the paradox by an extra rule as you propose. We have options like:
As I think of it, the approach raises even more quesions:
|
Frankly, I don't think that mixing document parsing strategy and metadata is a good idea. That said, it may be a nice feature to be able to define exact syntax used in document itself. I suppose we could devise a simple syntax extension allowing to specify exact flavor used for given Markdown document, something akin to vim modeline. E.g, have first or last non-empty line in a document consist of comment like this, for example: <!-- pandoc-markdown: +compact_definition_lists -->
or
<!-- pandoc-markdown: markdown_gfm --> This would be fully compatible and would answer concerns about "Liar Paradox". Exact syntax is obviously debatable, that's just something I thought of from the top of my head. I don't think specifying writer options/flavor in a document is very useful, unless one is often performing Markdown-to-Markdown transformation. That is not a very common use-case, I think. One case (that I can think of) where md2md transformation would be immediately helpful is combining contributions from different authors, and even then output format is not per-document, but rather should be common for all documents. Being able to specify default Markdown flavor for both reader and writer (possibly separately) from a settings file could be a useful feature though. It would be a good idea to be able to both specify said settings file location on command line, and to have a default location for it (very much like templates are handled now). Exact format requires some discussion, as that would be nice to be able to set default reader and writer options, not just Markdown flavors in it, I think. Thoughts? |
Well, we have the choice of mixing parsing strategy into metadata or to have an entirely different syntax. For me markdown to markdown conversion is a regular usecase:
therefore I do this all all the time! |
@bwl21, in any case, I feel like specifying document flavor in a flavor-dependent block is just plain bad design. F.ex., gfm does not support yaml metadata blocks, so you can't specify that it's gfm and keep compatibility with gfm. Same goes for other flavors. It does not make sense to me. I can't think of a flavor that does not understand HTML comments though, so that's a pro for my idea IMO. Con is it being yet another syntax extension, when we have more than enough already. But at least this one's not disruptive (i.e. will be silently ignored by parsers not supporting it) |
Oh, and I assume that in your use-case of md2md transformations, output format is not per-document, but rather per-project at least, so using an external config should be more convenient anyway, unless I utterly misunderstand something. |
@liedakil raises some good points here. Maybe it would be worth implementing the modeline-like syntax, but I'm not sure. It adds further complexity. If the line is at the beginning of the document, then it's incompatible with pandoc title blocks. So, it would probably have to be at the end. And then it might interact badly with things like references (at least with the current setup, where pandoc-citeproc looks for an empty references header at the end of the document -- this will probably be changed soon). I have a simple solution to all these problems: Makefiles! Whenever I'm doing anything moderately complex, I just create a simple Makefile, like:
Then, typing |
Makefiles do nicely as a substitution for local config file, that much is true. However, ability to specify flavor in-document does add an extra feature: portability. For the sake of argument, I could send Markdown text to a colleague and not worry about him having trouble converting that. Right now, I'd have to also include part of my Makefile to be sure that everything goes smoothly. Another point is global config. F.ex., I try to avoid setex headings and simple tables at all times -- it would be handy to be able to disable those globally for writer. At the moment I use a shell alias, but that's cognitive overhead I certainly could live without. So, this proposal is very rational at its core. Details need some working out though. |
+++ Nikolay Yakimov [Apr 13 15 09:21 ]:
Yes, I see the point. Though, there are still many things that can go wrong: e.g. your colleague might use the wrong writer options. Attaching a Makefile is more failsafe.
This part could be cured with a global Makefile that you just include in all the others.
I agree, it's still worth thinking about. |
2015-04-13 19:36 GMT+03:00 John MacFarlane [email protected]:
Not for one-shot conversions though, and bigger projects usually convert |
I plan to solve the issue in my environment (https://github.com/bwl21/wortsammler) such that Wortsammler first reads the metadata of the input and adjust the pandoc command line. This corresponds to the makefile proposal. In addition to this I plan to implement the config hierarchy mentioned before. My proposal was indeed to handle the cases where plain pandoc is used "one shot conversions", respectively out of the box editor integrations (e.g. sublime) which act on one single file. We all agree that the most important part is the configuration of the reader. In my Opinion, metadata is information about data. And the markdown reader configuration is meta information about the current markdown file. So I still think that adding it to the metadata block is still a valid approach. This is, how I plan to represent this in Wortsammler's config file: :pandoc:
:system_command: ! 'pandoc_1.13.1 '
:markdown_intput_switches:
- -backtick_code_blocks
- +fenced_code_blocks
- +compact_definition_lists
:markdown_output_switches:
- -backtick_code_blocks
- +fenced_code_blocks
- -compact_definition_lists If by whatever reason another syntax shall be applied, then I feel that xml processing instruction would be the adequate solution, not xml comments. <?pandoc mardown-reader="-backtick_code_blocks+fenced_code_blocks+compact_definition_lists" ?> But yaml metadata appears much more adequate to me. |
Oh, by the way, pandoc ignores fields in YAML that end with an pandoc_opts_:
and write a small wrapper script that reads the YAML at the head of the |
@bwl21, all your proposals are only valid in context of always using parser that supports your proposed extensions. That is not always the case, esp. in the context of multiple authors. XML processing instructions will be parsed by most parsers verbatim, which is certainly not something I would want. HTML comments are mostly ignored, hence I suggest using them. I feel like that would be the least-disruptive option. |
I hereby close this issue as I fix it in my surrounding environment as described above. |
Btw. the |
I open this ticket according to the following post in the google group https://groups.google.com/forum/#!searchin/pandoc-discuss/definition$20list/pandoc-discuss/ZKytjti_h2Q/pcagNlG9SdsJ
Summing up:
--metadatafile <file>
In particular the change of definition list is a drastic one. The only way to deal with legacy documents is to enable compact_definition_list at any time. In a given environment there is not even a compatibility mode allowing to support both syntaxes on input and use pandoc as an implicit converter. One must chose the definition list format explicitly. The requested feature would make it easier to maintain legacy documents. For example I have documents which are adapted every other year and got the surprise ...
The metadata could be for example
The text was updated successfully, but these errors were encountered: