Skip to content

Commit 4bb149d

Browse files
committed
expand explanations
1 parent c5bf3ae commit 4bb149d

File tree

5 files changed

+37
-18
lines changed

5 files changed

+37
-18
lines changed

docs/index.html

+1-1
Original file line numberDiff line numberDiff line change
@@ -267,7 +267,7 @@ <h1>Course Materials</h1>
267267
<a href="./homework/06.html" class="title listing-title">String Processing and Data Wrangling in R and Python</a>
268268
</td>
269269
</tr>
270-
<tr data-index="1" data-categories="Slides" data-listing-date-sort="1727758800000" data-listing-file-modified-sort="1727646040362" data-listing-date-modified-sort="NaN" data-listing-reading-time-sort="1" data-listing-word-count-sort="113" data-listing-title-sort="Wrangling Data" data-listing-filename-sort="06-data-wrangling.qmd">
270+
<tr data-index="1" data-categories="Slides" data-listing-date-sort="1727758800000" data-listing-file-modified-sort="1727796896509" data-listing-date-modified-sort="NaN" data-listing-reading-time-sort="1" data-listing-word-count-sort="119" data-listing-title-sort="Wrangling Data" data-listing-filename-sort="06-data-wrangling.qmd">
271271
<td>
272272
<span class="listing-date">Oct 1, 2024</span>
273273
</td>

docs/search.json

+9-2
Original file line numberDiff line numberDiff line change
@@ -312,14 +312,14 @@
312312
"href": "slides/06-data-wrangling.html#organization",
313313
"title": "Wrangling Data",
314314
"section": "Organization",
315-
"text": "Organization\n\nHomework #5: make sure to put the link to your repo on Github into Canvas\nHomework #6:\nplease check that you can access the github classroom repo"
315+
"text": "Organization\nHomework #5:\n\nmake sure to put the link to your repo on Github into Canvas\nauthorization: usethis::create_github_token()\nwhich files to push? any original files!"
316316
},
317317
{
318318
"objectID": "slides/06-data-wrangling.html#todays-goals",
319319
"href": "slides/06-data-wrangling.html#todays-goals",
320320
"title": "Wrangling Data",
321321
"section": "Today’s Goals:",
322-
"text": "Today’s Goals:\n\nClone the repo for Homework 6\nMake sure that you are able to render index.qmd\nStart wrangling these poems!\n\nTry to get past checkpoints 1 and 2 for both python and R, so you can focus on the exploration on Thursday\n\nhelpful functions in R: separate_longer_delim, str_count (make sure to check ?boundary()),\nhelpful functions in python: str.split, str.count, explode"
322+
"text": "Today’s Goals:\n\nIn Canvas, follow the Github Classroom link to Homework 6\nClone the repo for Homework 6 to your local machine\nMake sure that you are able to render index.qmd\nStart wrangling these poems!\n\nTry to get past checkpoints 1 and 2 for both python and R, so you can focus on the exploration on Thursday"
323323
},
324324
{
325325
"objectID": "slides/06-data-wrangling.html#resources",
@@ -425,5 +425,12 @@
425425
"title": "String Processing and Data Wrangling in R and Python",
426426
"section": "Poetry Analysis",
427427
"text": "Poetry Analysis\nChoose some characteristic(s) of the poet’s style to explore graphically. If necessary, create a subset of the data with poems relevant to your question before you generate numerical summaries. You may want to clean up the data and remove lines which contain quotes (e.g. start with a &gt; character) or horizontal lines in reddit markdown (e.g. only have ----).\nSome ideas to get you started thinking:\n\nNumber of syllables per line may be used to infer rhyming scheme and/or poetry style. You can use the syllables python package. You may need to use a for-loop or program a custom function to use the syllables function on every entry in your data frame column.\nUse of non-alphabetic characters. Schnoodle often uses emoji and other text annotations to convey emotions and excitement - how often do these types of annotations appear in their poems?\nCommon characters and phrases. Sprog writes poems frequently about a character named Timmy (who often meets a horrible end). Do these poems have a common format/style/rhyme scheme?\nSprog often writes longer poems with multiple stanzas. Identify which poems have multiple stanzas (Hint, look for blank lines in a systematic pattern) and show the distribution of stanza length, stanza variation within poems, and number of stanzas in each poem.\nSchnoodle often misspells words intentionally (using e.g. fren instead of friend) to convey that they are writing using an animal’s voice. What proportion of words are misspelled in each poem? How much does this proportion vary? Based on the distribution, how likely is it, in your opinion, that Schnoodle misspells words more often when speaking as one type of animal than another? You can use one of the python packages discussed in this post to detect whether words are likely misspelled."
428+
},
429+
{
430+
"objectID": "slides/06-data-wrangling.html#helpful-functions",
431+
"href": "slides/06-data-wrangling.html#helpful-functions",
432+
"title": "Wrangling Data",
433+
"section": "Helpful functions",
434+
"text": "Helpful functions\n\nin R: separate_longer_delim, str_count (make sure to check ?boundary()),\nin python: str.split, str.count, explode"
428435
}
429436
]

docs/slides/06-data-wrangling.html

+11-6
Original file line numberDiff line numberDiff line change
@@ -348,23 +348,28 @@ <h2>This week’s big picture goal:</h2>
348348
</section>
349349
<section id="organization" class="slide level2">
350350
<h2>Organization</h2>
351+
<p>Homework #5:</p>
351352
<ul>
352-
<li><p>Homework #5: make sure to put the link to your repo on Github into Canvas</p></li>
353-
<li><p>Homework #6:</p>
354-
<p>please check that you can access the github classroom repo</p></li>
353+
<li><p>make sure to put the link to your repo on Github into Canvas</p></li>
354+
<li><p>authorization: <code>usethis::create_github_token()</code></p></li>
355+
<li><p>which files to push? any original files!</p></li>
355356
</ul>
356357
</section>
357358
<section id="todays-goals" class="slide level2">
358359
<h2>Today’s Goals:</h2>
359360
<ul>
360-
<li><p>Clone the repo for Homework 6</p></li>
361+
<li><p>In Canvas, follow the Github Classroom link to Homework 6</p></li>
362+
<li><p>Clone the repo for Homework 6 to your local machine</p></li>
361363
<li><p>Make sure that you are able to render <code>index.qmd</code></p></li>
362364
<li><p>Start wrangling these poems!</p></li>
363365
</ul>
364366
<p>Try to get past checkpoints 1 and 2 for both python and R, so you can focus on the exploration on Thursday</p>
367+
</section>
368+
<section id="helpful-functions" class="slide level2">
369+
<h2>Helpful functions</h2>
365370
<ul>
366-
<li><p>helpful functions in R: <code>separate_longer_delim</code>, <code>str_count</code> (make sure to check <code>?boundary()</code>),</p></li>
367-
<li><p>helpful functions in python: <code>str.split</code>, <code>str.count</code>, <code>explode</code></p></li>
371+
<li><p>in R: <code>separate_longer_delim</code>, <code>str_count</code> (make sure to check <code>?boundary()</code>),</p></li>
372+
<li><p>in python: <code>str.split</code>, <code>str.count</code>, <code>explode</code></p></li>
368373
</ul>
369374
</section>
370375
<section id="resources" class="slide level2">

docs/syllabus.pdf

0 Bytes
Binary file not shown.

slides/06-data-wrangling.qmd

+16-9
Original file line numberDiff line numberDiff line change
@@ -16,29 +16,36 @@ Working with text and practicing exploration skills
1616

1717
## Organization
1818

19-
- Homework #5:
20-
make sure to put the link to your repo on Github into Canvas
19+
Homework #5:
20+
21+
- make sure to put the link to your repo on Github into Canvas
22+
23+
- authorization: `usethis::create_github_token()`
24+
25+
- which files to push? any original files!
2126

22-
- Homework #6:
23-
24-
please check that you can access the github classroom repo
25-
2627

2728
---
2829

2930
## Today's Goals:
3031

31-
- Clone the repo for Homework 6
32+
- In Canvas, follow the Github Classroom link to Homework 6
33+
34+
- Clone the repo for Homework 6 to your local machine
3235

3336
- Make sure that you are able to render `index.qmd`
3437

3538
- Start wrangling these poems!
3639

3740
Try to get past checkpoints 1 and 2 for both python and R, so you can focus on the exploration on Thursday
3841

39-
- helpful functions in R: `separate_longer_delim`, `str_count` (make sure to check `?boundary()`),
42+
---
43+
44+
## Helpful functions
45+
46+
- in R: `separate_longer_delim`, `str_count` (make sure to check `?boundary()`),
4047

41-
- helpful functions in python: `str.split`, `str.count`, `explode`
48+
- in python: `str.split`, `str.count`, `explode`
4249

4350
----
4451

0 commit comments

Comments
 (0)