diff --git a/README.md b/README.md new file mode 100644 index 0000000..3acd556 --- /dev/null +++ b/README.md @@ -0,0 +1,43 @@ +# README + +John Little 2021-06-17 + + + +## A workshop case study on webscraping + + + + + +[![DOI](https://img.shields.io/badge/DOI-10.5281%2Fzenodo.4908874%20(Latest%20Version%20Release)-blue "DOI")](https://doi.org/10.5281/zenodo.4908874) + +[![ORCID](https://img.shields.io/badge/ORCID-0000--0002--3600--0972-A6CE39?logo=ORCID&logoColor=A6CE39 "ORCID")](https://orcid.org/0000-0002-3600-0972) + +[![Creative Commons CC BY-NC](https://img.shields.io/badge/Creative%20Commons-BY--NC-EF9421?logo=creative%20commons&logoColor=EF9421 "CC BY-NC")](https://creativecommons.org/licenses/by-nc-nd/4.0/) + +[![Launch Rstudio Binder](http://mybinder.org/badge_logo.svg "Launch RStudio Binder")](https://mybinder.org/v2/gh/libjohn/workshop_webscraping/HEAD?urlpath=rstudio) + +[![YouTube Playlist](https://img.shields.io/badge/YouTube-Workshop%20recording-f00?logo=youtube "Workshop Recording")](https://www.youtube.com/embed/8ISc8V9GDAg?si=iErWIj6F82aFWIV1) + + + +Using the `rvest` library to learn about web crawling and HTML parsing in R. + +- Introduce just enough HTML/CSS +- Introduce the `library(rvest)` package for harvesting websites/HTML +- Tidyverse iteration with `purrr::map` + +**Workshop Video**: + +**See Also**: [*What to know about law & ethics when archiving & mining data*](slides/What%20to%20know%20about%20law%20&%20ethics_Archiving%20&%20Mining_15%20Min_with%20notes.pdf) by Rachael Samberg, J.D., MLIS Timothy Vollmer, MIS & the UC Berkeley Office of Scholarly Communication Services youtube [playlists on navigating intellectual property, copyright, fair-use](https://www.youtube.com/channel/UCNUMwTyK0raTNNZVjhgB7KA/playlists). **Please note**, the Samberg/Vollmer slides are found in this github repo’s slides folder and are redistributed with permission from the slide authors. + +------------------------------------------------------------------------ + +## License + +John Little + +### CC BY-NC + +Creative Commons Attribution-NonCommercial