-
-
Notifications
You must be signed in to change notification settings - Fork 520
/
03-documents.Rmd
1189 lines (839 loc) · 51.2 KB
/
03-documents.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
# (PART) Output Formats {-}
# Documents
The very original version of Markdown was invented mainly to write HTML content more easily. For example, you can write a bullet with `- text` instead of the verbose HTML code `<ul><li>text</li></ul>`, or a quote with `> text` instead of `<blockquote>text</blockquote>`.
The syntax of Markdown has been greatly extended by Pandoc. What is more, Pandoc makes it possible to convert a Markdown document to a large variety of output formats. In this chapter, we will introduce the features of various document output formats. In the next two chapters, we will document the presentation formats and other R Markdown extensions, respectively.
## HTML document
As we just mentioned before, Markdown was originally designed for HTML output, so it may not be surprising that the HTML format\index{HTML format} has the richest features among all output formats. We recommend that you read this full section before you learn other output formats, because other formats have several features in common with the HTML document format, and we will not repeat these features in the corresponding sections.
To create an HTML document from R Markdown, you specify the `html_document` output format in the YAML metadata of your document:
```yaml
---
title: Habits
author: John Doe
date: March 22, 2005
output: html_document
---
```
### Table of contents
You can add a table of contents (TOC) using the `toc` option and specify the depth of headers that it applies to using the `toc_depth` option. For example:
```yaml
---
title: "Habits"
output:
html_document:
toc: true
toc_depth: 2
---
```
If the table of contents depth is not explicitly specified, it defaults to 3 (meaning that all level 1, 2, and 3 headers will be included in the table of contents).
#### Floating TOC
You can specify the `toc_float` option to float the table of contents to the left of the main document content. The floating table of contents will always be visible even when the document is scrolled. For example:
```yaml
---
title: "Habits"
output:
html_document:
toc: true
toc_float: true
---
```
You may optionally specify a list of options for the `toc_float` parameter which control its behavior. These options include:
- `collapsed` (defaults to `TRUE`) controls whether the TOC appears with only the top-level (e.g., H2) headers. If collapsed initially, the TOC is automatically expanded inline when necessary.
- `smooth_scroll` (defaults to `TRUE`) controls whether page scrolls are animated when TOC items are navigated to via mouse clicks.
For example:
```yaml
---
title: "Habits"
output:
html_document:
toc: true
toc_float:
collapsed: false
smooth_scroll: false
---
```
### Section numbering
You can add section numbering to headers using the `number_sections` option:
```yaml
---
title: "Habits"
output:
html_document:
toc: true
number_sections: true
---
```
Note that if you do choose to use the `number_sections` option, you will likely also want to use `#` (H1) headers in your document as `##` (H2) headers will include a decimal point, because without H1 headers, you H2 headers will be numbered with `0.1`, `0.2`, and so on.
### Tabbed sections
You can organize content using tabs by applying the `.tabset` class attribute to headers within a document. This will cause all sub-headers of the header with the `.tabset` attribute to appear within tabs rather than as standalone sections. For example:
```markdown
## Quarterly Results {.tabset}
### By Product
(tab content)
### By Region
(tab content)
```
You can also specify two additional attributes to control the appearance and behavior of the tabs. The `.tabset-fade` attribute causes the tabs to fade in and out when switching between tabs. The `.tabset-pills` attribute causes the visual appearance of the tabs to be "pill" (see Figure \@ref(fig:tabset)) rather than traditional tabs. For example:
```markdown
## Quarterly Results {.tabset .tabset-fade .tabset-pills}
```
```{r tabset, echo=FALSE, fig.cap='Traditional tabs and pill tabs on an HTML page.', out.width='50%', fig.align='center'}
knitr::include_graphics('images/tabset.png', dpi = NA)
```
### Appearance and style
There are several options that control the appearance of HTML documents:
- `theme` specifies the Bootstrap theme to use for the page (themes are drawn from the [Bootswatch](https://bootswatch.com/3/) theme library). Valid themes include `r knitr::combine_words(rmarkdown:::themes())`. Pass `null` for no theme (in this case you can use the `css` parameter to add your own styles).
- `highlight` specifies the syntax highlighting style. Supported styles include `r knitr::combine_words(rmarkdown:::html_highlighters(), before='\x60')`. Pass `null` to prevent syntax highlighting.
- `smart` indicates whether to produce typographically correct output, converting straight quotes to curly quotes, `---` to em-dashes, `--` to en-dashes, and `...` to ellipses. Note that `smart` is enabled by default.
For example:
```yaml
---
title: "Habits"
output:
html_document:
theme: united
highlight: tango
---
```
#### Custom CSS
You can add your own CSS\index{CSS} to an HTML document using the `css` option:
```yaml
---
title: "Habits"
output:
html_document:
css: styles.css
---
```
If you want to provide all of the styles for the document from your own CSS you set the `theme` (and potentially `highlight`) to `null`:
```yaml
---
title: "Habits"
output:
html_document:
theme: null
highlight: null
css: styles.css
---
```
You can also target specific sections of documents with custom CSS by adding ids or classes to section headers within your document. For example the following section header:
```markdown
## Next Steps {#nextsteps .emphasized}
```
Would enable you to apply CSS to all of its content using either of the following CSS selectors:
```css
#nextsteps {
color: blue;
}
.emphasized {
font-size: 1.2em;
}
```
### Figure options
There are a number of options that affect the output of figures within HTML documents:
- `fig_width` and `fig_height` can be used to control the default figure width and height (7x5 is used by default).
- `fig_retina` specifies the scaling to perform for retina displays (defaults to 2, which currently works for all widely used retina displays). Set to `null` to prevent retina scaling.
- `fig_caption` controls whether figures are rendered with captions.
- `dev` controls the graphics device used to render figures (defaults to `png`).
For example:
```yaml
---
title: "Habits"
output:
html_document:
fig_width: 7
fig_height: 6
fig_caption: true
---
```
### Data frame printing
You can enhance the default display of data frames via the `df_print` option. Valid values are shown in Table \@ref(tab:df-print).
Table: (\#tab:df-print) The possible values of the `df_print` option for the `html_document` format.
| Option | Description |
|------------|-------------------------------------------|
| default | Call the `print.data.frame` generic method |
| kable | Use the `knitr::kable` function |
| tibble | Use the `tibble::print.tbl_df` function |
| paged | Use `rmarkdown::paged_table` to create a pageable table |
| A custom function | Use the function to create the table |
#### Paged printing
When the `df_print` option is set to `paged`, tables are printed as HTML tables with support for pagination over rows and columns. For instance (see Figure \@ref(fig:paged)):
````markdown
---
title: "Motor Trend Car Road Tests"
output:
html_document:
df_print: paged
---
`r ''````{r}
mtcars
```
````
```{r paged, echo=FALSE, fig.cap='A paged table in the HTML output document.', out.width='100%'}
knitr::include_graphics('images/paged.png', dpi = NA)
```
Table \@ref(tab:paged) shows the available options for paged tables.
Table: (\#tab:paged) The options for paged HTML tables.
| Option | Description |
|-----------------|-------------------------------------------------------|
| max.print | The number of rows to print. |
| rows.print | The number of rows to display. |
| cols.print | The number of columns to display. |
| cols.min.print | The minimum number of columns to display. |
| pages.print | The number of pages to display under page navigation. |
| paged.print | When set to `FALSE` turns off paged tables. |
| rownames.print | When set to `FALSE` turns off row names. |
These options are specified in each chunk like below:
````markdown
`r ''````{r cols.print=3, rows.print=3}
mtcars
```
````
#### Custom function {#custom-fun-dfprint}
The `df_print` option can also take an arbitrary function to create the table in the output document. This function must output in the correct format according to the output used.
For example,
```r
rmarkdown::html_document(df_print = knitr::kable)
```
is the equivalent to using the method `"kable"`
```r
rmarkdown::html_document(df_print = "kable")
```
To use a custom function in `df_print` within the YAML header, the tag `!expr` must be used so the R expression after it will be evaluated. See the `eval.expr` argument on the help page `?yaml::yaml.load` for details.
````markdown
---
title: "Motor Trend Car Road Tests"
output:
html_document:
df_print: !expr pander::pander
---
`r ''````{r}
mtcars
```
````
### Code folding
When the **knitr** chunk option `echo = TRUE` is specified (the default behavior), the R source code within chunks is included within the rendered document. In some cases, it may be appropriate to exclude code entirely (`echo = FALSE`) but in other cases you might want the code to be available but not visible by default.
The `code_folding: hide` option enables you to include R code but have it hidden by default. Users can then choose to show hidden R code chunks either individually or document wide. For example:
````markdown
---
title: "Habits"
output:
html_document:
code_folding: hide
---
````
You can specify `code_folding: show` to still show all R code by default but then allow users to hide the code if they wish.
### MathJax equations
By default, [MathJax](https://www.mathjax.org)\index{MathJax} scripts are included in HTML documents for rendering LaTeX and MathML equations. You can use the `mathjax` option to control how MathJax is included:
- Specify `"default"` to use an HTTPS URL from a CDN host (currently provided by RStudio).
- Specify `"local"` to use a local version of MathJax (which is copied into the output directory). Note that when using `"local"` you also need to set the `self_contained` option to `false`.
- Specify an alternate URL to load MathJax from another location.
- Specify `null` to exclude MathJax entirely.
For example, to use a local copy of MathJax:
```yaml
---
title: "Habits"
output:
html_document:
mathjax: local
self_contained: false
---
```
To use a self-hosted copy of MathJax:
```yaml
---
title: "Habits"
output:
html_document:
mathjax: "http://example.com/MathJax.js"
---
```
To exclude MathJax entirely:
```yaml
---
title: "Habits"
output:
html_document:
mathjax: null
---
```
### Document dependencies
By default, R Markdown produces standalone HTML files with no external dependencies, using `data:` URIs to incorporate the contents of linked scripts, stylesheets, images, and videos. This means you can share or publish the file just like you share Office documents or PDFs. If you would rather keep dependencies in external files, you can specify `self_contained: false`. For example:
```yaml
---
title: "Habits"
output:
html_document:
self_contained: false
---
```
Note that even for self-contained documents, MathJax is still loaded externally (this is necessary because of its big size). If you want to serve MathJax locally, you should specify `mathjax: local` and `self_contained: false`.
One common reason to keep dependencies external is for serving R Markdown documents from a website (external dependencies can be cached separately by browsers, leading to faster page load times). In the case of serving multiple R Markdown documents you may also want to consolidate dependent library files (e.g. Bootstrap, and MathJax, etc.) into a single directory shared by multiple documents. You can use the `lib_dir` option to do this. For example:
```yaml
---
title: "Habits"
output:
html_document:
self_contained: false
lib_dir: libs
---
```
### Advanced customization
#### Keeping Markdown
When **knitr** processes an R Markdown input file, it creates a Markdown (`*.md`) file that is subsequently transformed into HTML by Pandoc. If you want to keep a copy of the Markdown file after rendering, you can do so using the `keep_md` option:
```yaml
---
title: "Habits"
output:
html_document:
keep_md: true
---
```
#### Includes
You can do more advanced customization of output by including additional HTML content or by replacing the core Pandoc template entirely. To include content in the document header or before/after the document body, you use the `includes` option as follows:
```markdown
---
title: "Habits"
output:
html_document:
includes:
in_header: header.html
before_body: doc_prefix.html
after_body: doc_suffix.html
---
```
#### Custom templates
You can also replace the underlying Pandoc template\index{Pandoc template} using the `template` option:
```yaml
---
title: "Habits"
output:
html_document:
template: quarterly_report.html
---
```
Consult the documentation on [Pandoc templates](http://pandoc.org/MANUAL.html#templates) for additional details on templates. You can also study the [default HTML template `default.html5`](https://github.com/jgm/pandoc-templates/) as an example.
#### Markdown extensions {#markdown-extensions}
By default, R Markdown is defined as all Pandoc Markdown extensions with the following tweaks for backward compatibility with the old **markdown** package [@R-markdown]:
```
+autolink_bare_uris
+tex_math_single_backslash
```
You can enable or disable Markdown extensions using the `md_extensions` option (you preface an option with `-` to disable and `+` to enable it). For example:
```yaml
---
title: "Habits"
output:
html_document:
md_extensions: -autolink_bare_uris+hard_line_breaks
---
```
The above would disable the `autolink_bare_uris` extension, and enable the `hard_line_breaks` extension.
For more on available markdown extensions see the [Pandoc Markdown specification](http://pandoc.org/MANUAL.html#pandocs-markdown).
#### Pandoc arguments {#pandoc-arguments}
If there are Pandoc features that you want to use but lack equivalents in the YAML options described above, you can still use them by passing custom `pandoc_args`. For example:
```yaml
---
title: "Habits"
output:
html_document:
pandoc_args: [
"--title-prefix", "Foo",
"--id-prefix", "Bar"
]
---
```
Documentation on all available pandoc arguments can be found in the [Pandoc User Guide](http://pandoc.org/MANUAL.html#options).
### Shared options {#shared-options}
If you want to specify a set of default options to be shared by multiple documents within a directory, you can include a file named `_output.yml` within the directory. Note that no YAML delimiters (`---`) or the enclosing `output` field are used in this file. For example:
```yaml
html_document:
self_contained: false
theme: united
highlight: textmate
```
It should not be written as:
```yaml
---
output:
html_document:
self_contained: false
theme: united
highlight: textmate
---
```
All documents located in the same directory as `_output.yml` will inherit its options. Options defined explicitly within documents will override those specified in the shared options file.
### HTML fragments
If you want to create an HTML fragment rather than a full HTML document, you can use the `html_fragment` format. For example:
```yaml
---
output: html_fragment
---
```
Note that HTML fragments are not complete HTML documents. They do not contain the standard header content that HTML documents do (they only contain content in the `<body>` tags of normal HTML documents). They are intended for inclusion within other web pages or content management systems (like blogs). As such, they do not support features like themes or code highlighting (it is expected that the environment they are ultimately published within handles these things).
## Notebook
An R Notebook is an R Markdown document with chunks that can be executed independently and interactively, with output visible immediately beneath the input. See Figure \@ref(fig:notebook-demo) for an example.
```{r notebook-demo, echo=FALSE, fig.cap='An R Notebook example.', out.width='100%'}
knitr::include_graphics('images/notebook-demo.png', dpi = NA)
```
R Notebooks are an implementation of [Literate Programming](https://en.wikipedia.org/wiki/Literate_programming) that allows for direct interaction with R while producing a reproducible document with publication-quality output.
Any R Markdown document can be used as a notebook, and all R Notebooks can be rendered to other R Markdown document types. A notebook can therefore be thought of as a special execution mode for R Markdown documents. The immediacy of notebook mode makes it a good choice while authoring the R Markdown document and iterating on code. When you are ready to publish the document, you can share the notebook directly, or render it to a publication format with the `Knit` button.
### Using Notebooks
#### Creating a Notebook
You can create a new notebook in RStudio with the menu command `File -> New File -> R Notebook`, or by using the `html_notebook` output type in your document's YAML metadata.
```yaml
---
title: "My Notebook"
output: html_notebook
---
```
By default, RStudio enables inline output (Notebook mode) on all R Markdown documents, so you can interact with any R Markdown document as though it were a notebook. If you have a document with which you prefer to use the traditional console method of interaction, you can disable notebook mode by clicking the gear button in the editor toolbar, and choosing `Chunk Output in Console` (Figure \@ref(fig:notebook-console)).
```{r notebook-console, echo=FALSE, fig.cap='Send the R code chunk output to the console.', out.width='80%', fig.align='center'}
knitr::include_graphics('images/notebook-console.png', dpi = NA)
```
If you prefer to use the console by default for *all* your R Markdown documents (restoring the behavior in previous versions of RStudio), you can make `Chunk Output in Console` the default: `Tools -> Options -> R Markdown -> Show output inline for all R Markdown documents`.
#### Inserting chunks
Notebook chunks can be inserted quickly using the keyboard shortcut `Ctrl + Alt + I` (macOS: `Cmd + Option + I`), or via the `Insert` menu in the editor toolbar.
Because all of a chunk's output appears beneath the chunk (not alongside the statement which emitted the output, as it does in the rendered R Markdown output), it is often helpful to split chunks that produce multiple outputs into two or more chunks which each produce only one output. To do this, select the code to split into a new chunk (Figure \@ref(fig:notebook-split-before)), and use the same keyboard shortcut for inserting a new code chunk (Figure \@ref(fig:notebook-split-after)).
```{r notebook-split-before, echo=FALSE, fig.cap='Select the code to split into a new chunk.', out.width='90%', fig.align='center'}
knitr::include_graphics('images/notebook-split-before.png', dpi = NA)
```
```{r notebook-split-after, echo=FALSE, fig.cap='Insert a new chunk from the code selected before.', out.width='90%', fig.align='center'}
knitr::include_graphics('images/notebook-split-after.png', dpi = NA)
```
#### Executing code
Code in the notebook is executed with the same gestures you would use to execute code in an R Markdown document:
1. Use the green triangle button on the toolbar of a code chunk that has the tooltip "Run Current Chunk", or `Ctrl + Shift + Enter` (macOS: `Cmd + Shift + Enter`) to run the current chunk.
1. Press `Ctrl + Enter` (macOS: `Cmd + Enter`) to run just the current statement. Running a single statement is much like running an entire chunk consisting only of that statement.
1. There are other ways to run a batch of chunks if you click the menu `Run` on the editor toolbar, such as `Run All`, `Run All Chunks Above`, and `Run All Chunks Below`.
The primary difference is that when executing chunks in an R Markdown document, all the code is sent to the console at once, but in a notebook, only one line at a time is sent. This allows execution to stop if a line raises an error.
When you execute code in a notebook, an indicator will appear in the gutter to show you execution progress (Figure \@ref(fig:notebook-longrunning)). Lines of code that have been sent to R are marked with dark green; lines that have not yet been sent to R are marked with light green. If at least one chunk is waiting to be executed, you will see a progress meter appear in the editor's status bar, indicating the number of chunks remaining to be executed. You can click on this meter at any time to jump to the currently executing chunk. When a chunk is waiting to execute, the `Run` button in its toolbar will change to a "queued" icon. If you do not want the chunk to run, you can click on the icon to remove it from the execution queue.
```{r notebook-longrunning, echo=FALSE, fig.cap='The indicator in the gutter to show the execution progress of a code chunk in the notebook.', out.width='100%'}
knitr::include_graphics('images/notebook-longrunning.png', dpi = NA)
```
In general, when you execute code in a notebook chunk, it will do exactly the same thing as it would if that same code were typed into the console. There are however a few differences:
- **Output**: The most obvious difference is that most forms of output produced from a notebook chunk are shown in the chunk output rather than, for example, the RStudio Viewer or the Plots pane. Console output (including warnings and messages) appears both at the console *and* in the chunk output.
- **Working directory**: The current working directory inside a notebook chunk is always the directory containing the notebook `.Rmd` file. This makes it easier to use relative paths inside notebook chunks, and also matches the behavior when knitting, making it easier to write code that works identically both interactively and in a standalone render.
You'll get a warning if you try to change the working directory inside a notebook chunk, and the directory will revert back to the notebook's directory once the chunk is finished executing. You can suppress this warning by using the `warnings = FALSE` chunk option.
If it is necessary to execute notebook chunks in a different directory, you can change the working directory for **all** your chunks by using the **knitr** `root.dir` option. For instance, to execute all notebook chunks in the grandparent folder of the notebook:
```r
knitr::opts_knit$set(root.dir = normalizePath(".."))
```
This option is only effective when used inside the setup chunk. Also note that, as in **knitr**, the `root.dir` chunk option applies only to chunks; relative paths in Markdown are still relative to the notebook's parent folder.
- **Warnings**: Inside a notebook chunk, warnings are always displayed immediately rather than being held until the end, as in `options(warn = 1)`.
- **Plots**: Plots emitted from a chunk are rendered to match the width of the editor at the time the chunk was executed. The height of the plot is determined by the [golden ratio](https://en.wikipedia.org/wiki/Golden_ratio). The plot's display list is saved, too, and the plot is re-rendered to match the editor's width when the editor is resized.
You can use the `fig.width`, `fig.height`, and `fig.asp` chunk options to manually specify the size of rendered plots in the notebook; you can also use `knitr::opts_chunk$set(fig.width = ..., fig.height = ...)` in the setup chunk to to set a default rendered size. Note, however, specifying a chunk size manually suppresses the generation of the display list, so plots with manually specified sizes will be resized using simple image scaling when the notebook editor is resized.
To execute an inline R expression in the notebook, put your cursor inside the chunk and press `Ctrl + Enter` (macOS: `Cmd + Enter`). As in the execution of ordinary chunks, the content of the expression will be sent to the R console for evaluation. The results will appear in a small pop-up window next to the code (Figure \@ref(fig:notebook-inline-output)).
```{r notebook-inline-output, echo=FALSE, fig.cap='Output from an inline R expression in the notebook.', out.width='40%', fig.align='center'}
knitr::include_graphics('images/notebook-inline-output.png', dpi = NA)
```
In notebooks, inline R expressions can only produce text (not figures or other kinds of output). It is also important that inline R expressions executes quickly and do not have side-effects, as they are executed whenever you save the notebook.
Notebooks are typically self-contained. However, in some situations, it is preferable to re-use code from an R script as a notebook chunk, as in **knitr**'s [code externalization](https://yihui.name/knitr/demo/externalization/). This can be done by using `knitr::read_chunk()` in your notebook's setup chunk, along with a special `## ---- chunkname` annotation in the R file from which you intend to read code. Here is a minimal example with two files:
**example.Rmd**
````markdown
`r ''````{r setup}
knitr::read_chunk("example.R")
```
```{r chunk}
```
````
**example.R**
```r
## ---- chunk
1 + 1
```
When you execute the empty chunk in the notebook `example.Rmd`, code from the external file `example.R` will be inserted, and the results displayed inline, as though the chunk contained that code (Figure \@ref(fig:notebook-external-code)).
```{r notebook-external-code, echo=FALSE, fig.cap='Execute a code chunk read from an external R script.', out.width='70%', fig.align='center'}
knitr::include_graphics('images/notebook-external-code.png', dpi = NA)
```
#### Chunk output
When code is executed in the notebook, its output appears beneath the code chunk that produced it. You can clear an individual chunk's output by clicking the `X` button in the upper right corner of the output, or collapse it by clicking the chevron.
It is also possible to clear or collapse all of the output in the document at once using the `Collapse All Output` and `Clear All Output` menu items available on the gear menu in the editor toolbar (Figure \@ref(fig:notebook-console)).
If you want to fully reset the state of the notebook, the item `Restart R and Clear Output` on the `Run` menu on the editor toolbar will do the job.
Ordinary R Markdown documents are "knitted", but notebooks are "previewed". While the notebook preview looks similar to a rendered R Markdown document, the notebook preview _does not execute any of your R code chunks_. It simply shows you a rendered copy of the Markdown output of your document along with the most recent chunk output. This preview is generated automatically whenever you save the notebook (whether you are viewing it in RStudio or not); see the section beneath on the `*.nb.html` file for details.
When `html_notebook` is the topmost (default) format in your YAML metadata, you will see a `Preview` button in the editor toolbar. Clicking it will show you the notebook preview (Figure \@ref(fig:notebook-preview)).
```{r notebook-preview, echo=FALSE, fig.cap='Preview a notebook.', out.width='100%'}
knitr::include_graphics('images/notebook-preview.png', dpi = NA)
```
If you have configured R Markdown previewing to use the Viewer pane (as illustrated in Figure \@ref(fig:notebook-preview)), the preview will be automatically updated whenever you save your notebook.
When an error occurs while a notebook chunk is executing (Figure \@ref(fig:notebook-error)):
```{r notebook-error, echo=FALSE, fig.cap='Errors in a notebook.', out.width='90%', fig.align='center'}
knitr::include_graphics('images/notebook-error.png', dpi = NA)
```
1. Execution will stop; the remaining lines of that chunk (and any chunks that have not yet been run) will not be executed.
1. The editor will scroll to the error.
1. The line of code that caused the error will have a red indicator in the editor's gutter.
If you want your notebook to keep running after an error, you can suppress the first two behaviors by specifying `error = TRUE` in the chunk options.
In most cases, it should not be necessary to have the console open while using the notebook, as you can see all of the console output in the notebook itself. To preserve vertical space, the console will be automatically collapsed when you open a notebook or run a chunk in the notebook.
If you prefer not to have the console hidden when chunks are executed, uncheck the option from the menu `Tools -> Global Options -> R Markdown -> Hide console automatically when executing notebook chunks`.
### Saving and sharing
#### Notebook file
When a notebook `*.Rmd` file is saved, a `*.nb.html` file is created alongside it. This file is a self-contained HTML file which contains both a rendered copy of the notebook with all current chunk outputs (suitable for display on a website) and a copy of the `*.Rmd` file itself.
You can view the `*.nb.html` file in any ordinary web browser. It can also be opened in RStudio; when you open there (e.g., using `File -> Open File`), RStudio will do the following:
1. Extract the bundled `*.Rmd` file, and place it alongside the `*.nb.html` file.
2. Open the `*.Rmd` file in a new RStudio editor tab.
3. Extract the chunk outputs from the `*.nb.html` file, and place them appropriately in the editor.
Note that the `*.nb.html` file is only created for R Markdown documents that are notebooks (i.e., at least one of their output formats is `html_notebook`). It is possible to have an R Markdown document that includes inline chunk output beneath code chunks, but does not produce an `*.nb.html` file, when `html_notebook` is not specified as an output format for the R Markdown document.
#### Output storage
The document's chunk outputs are also stored in an internal RStudio folder beneath the project's `.Rproj.user` folder. If you work with a notebook but do not have a project open, the outputs are stored in the RStudio state folder in your home directory (the location of this folder varies between the [desktop](https://support.rstudio.com/hc/en-us/articles/200534577) and the [server](https://support.rstudio.com/hc/en-us/articles/218730228)).
#### Version control
One of the major advantages of R Notebooks compared to other notebook systems is that they are plain-text files and therefore work well with version control. We recommend checking in both the `*.Rmd` and `*.nb.html` files into version control, so that both your source code and output are available to collaborators. However, you can choose to include only the `*.Rmd` file (with a `.gitignore` that excludes `*.nb.html`) if you want each collaborator to work with their own private copies of the output.
### Notebook format
While RStudio provides a set of integrated tools for authoring R Notebooks, the notebook file format itself is decoupled from RStudio. The **rmarkdown** package provides several functions that can be used to read and write R Notebooks outside of RStudio.
In this section, we describe the internals of the notebook format. It is primarily intended for front-end applications using or embedding R, or other users who are interested in reading and writing documents using the R Notebook format. We recommend that beginners skip this section when reading this book or using notebooks for the first time.
R Notebooks are HTML documents with data written and encoded in such a way that:
1. The source Rmd document can be recovered, and
2. Chunk outputs can be recovered.
To generate an R Notebook, you can use `rmarkdown::render()` and specify the `html_notebook` output format in your document's YAML metadata. Documents rendered in this form will be generated with the `.nb.html` file extension, to indicate that they are HTML notebooks.
To ensure chunk outputs can be recovered, the elements of the R Markdown document are enclosed with HTML comments, providing more information on the output. For example, chunk output might be serialized in the form:
```html
<!-- rnb-chunk-begin -->
<!-- rnb-output-begin -->
<pre><code>Hello, World!</code></pre>
<!-- rnb-output-end -->
<!-- rnb-chunk-end -->
```
Because R Notebooks are just HTML documents, they can be opened and viewed in any web browser; in addition, hosting environments can be configured to recover and open the source Rmd document, and also recover and display chunk outputs as appropriate.
#### Generating R Notebooks with custom output
It is possible to render an HTML notebook with custom chunk outputs inserted in lieu of the result that would be generated by evaluating the associated R code. This can be useful for front-end editors that show the output of chunk execution inline, or for conversion programs from other notebook formats where output is already available from the source format. To facilitate this, one can provide a custom "output source" to `rmarkdown::render()`. Let's investigate with a simple example:
```{r comment='', class.output='markdown'}
rmd_stub = "examples/r-notebook-stub.Rmd"
cat(readLines(rmd_stub), sep = "\n")
```
Let's try to render this document with a custom output source, so that we can inject custom output for the single chunk within the document. The output source function will accept:
- `code`: The code within the current chunk.
- `context`: An environment containing active chunk options and other chunk information.
- `...`: Optional arguments reserved for future expansion.
In particular, the `context` elements `label` and `chunk.index` can be used to help identify which chunk is currently being rendered.
```{r tidy=FALSE}
output_source = function(code, context, ...) {
logo = file.path(R.home("doc"), "html", "logo.jpg")
if (context$label == "chunk-one") list(
rmarkdown::html_notebook_output_code("# R Code"),
paste("Custom output for chunk:", context$chunk.index),
rmarkdown::html_notebook_output_code("# R Logo"),
rmarkdown::html_notebook_output_img(logo)
)
}
```
We can pass our `output_source` along as part of the `output_options` list to `rmarkdown::render()`.
```{r tidy=FALSE}
output_file = rmarkdown::render(
rmd_stub,
output_options = list(output_source = output_source),
quiet = TRUE
)
```
We have now generated an R Notebook. Open [this document](https://rmarkdown.rstudio.com/notebook/r-notebook-stub.nb.html) in a web browser, and it will show that the `output_source` function has effectively side-stepped evaluation of code within that chunk, and instead returned the injected result.
#### Implementing output sources
In general, you can provide regular R output in your output source function, but **rmarkdown** also provides a number of endpoints for insertion of custom HTML content. These are documented within `?html_notebook_output`.
Using these functions ensures that you produce an R Notebook that can be opened in R frontends (e.g., RStudio).
#### Parsing R Notebooks
The `rmarkdown::parse_html_notebook()` function provides an interface for recovering and parsing an HTML notebook.
```{r comment=''}
parsed = rmarkdown::parse_html_notebook(output_file)
str(parsed, width = 60, strict.width = 'wrap')
```
This interface can be used to recover the original Rmd source, and also (with some more effort from the front-end) the ability to recover chunk outputs from the document itself.
```{r, include=FALSE}
rm(parsed)
```
## PDF document
To create a PDF document from R Markdown, you specify the `pdf_document` output format in the YAML metadata:
```yaml
---
title: "Habits"
author: John Doe
date: March 22, 2005
output: pdf_document
---
```
Within R Markdown documents that generate PDF output, you can use raw LaTeX\index{LaTeX}, and even define LaTeX macros. See Pandoc's documentation on the [raw_tex extension](https://pandoc.org/MANUAL.html#extension-raw_tex) for details.
Note that PDF output (including Beamer slides) requires an installation of LaTeX (see Chapter \@ref(installation)).
### Table of contents
You can add a table of contents using the `toc` option and specify the depth of headers that it applies to using the `toc_depth` option. For example:
```yaml
---
title: "Habits"
output:
pdf_document:
toc: true
toc_depth: 2
---
```
If the TOC depth is not explicitly specified, it defaults to 2 (meaning that all level 1 and 2 headers will be included in the TOC), while it defaults to 3 in `html_document`.
You can add section numbering to headers using the `number_sections` option:
```yaml
---
title: "Habits"
output:
pdf_document:
toc: true
number_sections: true
---
```
If you are familiar with LaTeX, `number_sections: true` means `\section{}`, and `number_sections: false` means `\section*{}` for sections in LaTeX (it also applies to other levels of "sections" such as `\chapter{}`, and `\subsection{}`).
### Figure options
There are a number of options that affect the output of figures within PDF documents:
- `fig_width` and `fig_height` can be used to control the default figure width and height (6.5x4.5 is used by default).
- `fig_crop` controls whether the `pdfcrop` utility, if available in your system, is automatically applied to PDF figures (this is `true` by default).
- If you are using [TinyTeX](https://yihui.name/tinytex/) as your LaTeX distribution, we recommend that you run `tinytex::tlmgr_install("pdfcrop")` to install the LaTeX package `pdfcrop`. You also have to make sure the system package [`ghostscript`](https://www.ghostscript.com) is available in your system for `pdfcrop` to work. For macOS users who have installed Homebrew, `ghostscript` can be installed via `brew install ghostscript`.
- If your graphics device is `postscript`, we recommend that you disable this feature (see more info in the **knitr** issue [#1365](https://github.com/yihui/knitr/issues/1365)).
- `fig_caption` controls whether figures are rendered with captions (this is `true` by default).
- `dev` controls the graphics device used to render figures (defaults to `pdf`).
For example:
```yaml
---
title: "Habits"
output:
pdf_document:
fig_width: 7
fig_height: 6
fig_caption: true
---
```
### Data frame printing
You can enhance the default display of data frames via the `df_print` option. Valid values are presented in Table \@ref(tab:df-print-pdf).
Table: (\#tab:df-print-pdf) The possible values of the `df_print` option for the `pdf_document` format.
| Option | Description |
|------------|-------------------------------------------|
| default | Call the `print.data.frame` generic method |
| kable | Use the `knitr::kable()` function |
| tibble | Use the `tibble::print.tbl_df()` function |
| A custom function | Use the function to create the table. See \@ref(custom-fun-dfprint) |
For example:
```yaml
---
title: "Habits"
output:
pdf_document:
df_print: kable
---
```
### Syntax highlighting
The `highlight` option specifies the syntax highlighting style. Its usage in `pdf_document` is the same as `html_document` (Section \@ref(appearance-and-style)). For example:
```yaml
---
title: "Habits"
output:
pdf_document:
highlight: tango
---
```
### LaTeX options
Many aspects of the LaTeX template used to create PDF documents can be customized using _top-level_ YAML metadata (note that these options do not appear underneath the `output` section, but rather appear at the top level along with `title`, `author`, and so on). For example:
```yaml
---
title: "Crop Analysis Q3 2013"
output: pdf_document
fontsize: 11pt
geometry: margin=1in
---
```
A few available metadata variables are displayed in Table \@ref(tab:latex-vars) (consult the Pandoc manual for [the full list](https://pandoc.org/MANUAL.html#variables-for-latex)):
Table: (\#tab:latex-vars) Available top-level YAML metadata variables for LaTeX output.
Variable|Description
------------------------|--------------------------------------------------------------------
lang|Document language code
fontsize|Font size (e.g., `10pt`, `11pt`, or `12pt`)
documentclass|LaTeX document class (e.g., `article`)
classoption|Options for documentclass (e.g., `oneside`)
geometry|Options for geometry class (e.g., `margin=1in`)
mainfont, sansfont, monofont, mathfont|Document fonts (works only with `xelatex` and `lualatex`)
linkcolor, urlcolor, citecolor|Color for internal, external, and citation links
### LaTeX packages for citations
By default, citations\index{citation} are processed through `pandoc-citeproc`, which works for all output formats. For PDF output, sometimes it is better to use LaTeX packages to process citations, such as `natbib` or `biblatex`. To use one of these packages, just set the option `citation_package` to be `natbib` or `biblatex`, e.g.
```yaml
---
output:
pdf_document:
citation_package: natbib
---
```
### Advanced customization
#### LaTeX engine
By default, PDF documents are rendered using `pdflatex`. You can specify an alternate engine using the `latex_engine` option. Available engines are `pdflatex`, `xelatex`, and `lualatex`. For example:
```yaml
---
title: "Habits"
output:
pdf_document:
latex_engine: xelatex
---
```
The main reasons you may want to use `xelatex` or `lualatex` are: (1) They support Unicode better; (2) It is easier to make use of system fonts. See some posts on Stack Overflow for more detailed explanations, e.g., https://tex.stackexchange.com/q/3393/9128 and https://tex.stackexchange.com/q/36/9128.
#### Keeping intermediate TeX
R Markdown documents are converted to PDF by first converting to a TeX file and then calling the LaTeX engine to convert to PDF. By default, this TeX file is removed, however if you want to keep it (e.g., for an article submission), you can specify the `keep_tex` option. For example:
```yaml
---
title: "Habits"
output:
pdf_document:
keep_tex: true
---
```
#### Includes
You can do more advanced customization of PDF output by including additional LaTeX directives and/or content or by replacing the core Pandoc template entirely. To include content in the document header or before/after the document body, you use the `includes` option as follows:
```yaml
---
title: "Habits"
output:
pdf_document:
includes:
in_header: preamble.tex
before_body: doc-prefix.tex
after_body: doc-suffix.tex
---
```
#### Custom templates
You can also replace the underlying Pandoc template using the `template` option:
```yaml
---
title: "Habits"
output:
pdf_document:
template: quarterly-report.tex
---
```
Consult the documentation on [Pandoc templates](https://pandoc.org/README.html#templates) for additional details on templates. You can also study the [default LaTeX template](https://github.com/jgm/pandoc-templates/blob/master/default.latex) as an example.
### Other features
Similar to HTML documents, you can enable or disable certain Markdown extensions for generating PDF documents. See Section \@ref(markdown-extensions) for details. You can also pass more custom Pandoc arguments through the `pandoc_args` option (Section \@ref(pandoc-arguments)), and define shared options in `_output.yml` (Section \@ref(shared-options)).
## Word document
To create a Word\index{Word} document from R Markdown, you specify the `word_document` output format in the YAML metadata of your document: