The goal of RmdConcord is to provide support for concordances in R Markdown files.
This is based on a suggestion and initial code from Heather Turner. Thanks!
Rmarkdown files (type .Rmd
) allow presentation articles to be generated from simple text documents. However, if there are errors, these can be difficult to track back to the originating markdown text because there are often intermediate stages: the R Markdown file is converted to plain Markdown, the plain Markdown is converted to HTML or LaTeX, the LaTeX is converted to PDF. A concordance that allows back-tracking of errors from the later steps is needed.
The main use for this in HTML output is to help deciphering HTML Tidy error messages. If you replace the original driver with RmdConcord::html_documentC
, R CMD check
should report locations in the original .Rmd
file.
With PDF output using patchDVI::pdf_documentC
(from patchDVI
version 1.11.0 or newer), Synctex output will be enabled, and it will be patched to refer to the .Rmd
file. This is helpful in previewers like the one in TeXworks that can link source to a preview. This package contains RmdConcord::pdf_documentC0
, which does part of the work (preparing the concordance records) but doesn’t do the patching to make previewers work with it.
Limitations
The Pandoc Commonmark driver is still in development, so some features supported in the rmarkdown
drivers will not be fully supported using the drivers from this package. At present I am using Pandoc 2.19.2 and I know of the following missing features:
Citations, e.g.
[@doe99]
-
Raw LaTeX will need to be “fenced”, e.g. entered as
```{=latex} LaTeX ```
(The
\pagebreak
and\newpage
macros receive special treatment, so they don’t need fencing as long as they are separated by blank lines from text above and below.)
If you notice others, please let me know, e.g. by posting an issue to the Github site.
These will not cause errors (Markdown doesn’t ever give errors!), but they won’t be handled properly. We suggest using the RmdConcord
and patchDVI
drivers during early development or to track down obscure bugs, but using the rmarkdown
drivers for regular production.
Installation
This version of RmdConcord
makes use of some functions that will be released in R 4.3.0. Those functions have been copied into RmdConcord
, so the package should be compatible with older versions of R.
You can install the development version of RmdConcord
from GitHub with:
# install.packages("devtools")
devtools::install_github("dmurdoch/RmdConcord")
Example
To embed concordances in an R Markdown HTML document, change the output YAML to patchDVI::html_documentC
. For a PDF document, use patchDVI::pdf_documentC
.
output: patchDVI::html_documentC
This is used in the Sample.Rmd
vignette, which contains an error on line 23.
library(RmdConcord)
example(processConcordance)
#>
#> prcssC> # This example works on the file inst/sample/Sample.Rmd,
#> prcssC> # which should be a copy of the vignette Sample.Rmd. This
#> prcssC> # is convenient because RStudio doesn't install vignettes by default.
#> prcssC>
#> prcssC> # First, see the results without concordances:
#> prcssC>
#> prcssC> library(RmdConcord)
#>
#> prcssC> dir <- tempdir()
#>
#> prcssC> intermediates <- tempfile()
#>
#> prcssC> infile <- system.file("sample/Sample.Rmd", package = "RmdConcord")
#>
#> prcssC> outfile1 <- file.path(dir, "html_vignette.html")
#>
#> prcssC> rmarkdown::render(infile,
#> prcssC+ intermediates_dir = intermediates,
#> prcssC+ output_file = outfile1,
#> prcssC+ quiet = TRUE)
#>
#> prcssC> tidy_validate(outfile1)
#> line col msg txt
#> [1,] "359" "4" "Error: <foobar> is not recognized!" "<p><foobar></p>"
#> [2,] "359" "4" "Warning: discarding unexpected <foobar>" "<p><foobar></p>"
#>
#> prcssC> # Next, see them with concordances by setting
#> prcssC> # the output format to use RmdConcord::html_documentC
#> prcssC> # which post-processes the document with processConcordance.
#> prcssC>
#> prcssC> dir <- tempdir()
#>
#> prcssC> outfile2 <- file.path(dir, "commonmark.html")
#>
#> prcssC> rmarkdown::render(infile,
#> prcssC+ intermediates_dir = intermediates,
#> prcssC+ output_file = outfile2,
#> prcssC+ output_format = html_documentC(),
#> prcssC+ quiet = TRUE)
#>
#> prcssC> tidy_validate(outfile2)
#> line col msg txt
#> [1,] "319" "1" "Error: <foobar> is not recognized!" "<foobar>"
#> [2,] "319" "1" "Warning: discarding unexpected <foobar>" "<foobar>"
#> srcFile srcLine
#> [1,] "Sample.Rmd" "23"
#> [2,] "Sample.Rmd" "23"
#>
#> prcssC> unlink(c(intermediates, outfile1, outfile2), recursive = TRUE)