This script outines approaches to extract textual features of interest from the Congressional Record.
Full texts and metadata scraped using this script: https://judgelord.github.io/cr/scraper.html
Speeches parsed using this script: https://judgelord.github.io/cr/speakers.html
Testing with a sample of 1000 speeches for now
list.files(here::here("data", "txt"), recursive = T)
cr <-
tibble(file = str_c("data/txt/", cr),
d <-date = str_extract(cr, "[0-9]{4}-[0-9]{2}-[0-9]{2}") %>%
as.Date,
year = str_sub(cr, 1, 4),
icpsr = str_remove_all(cr, ".*-|.txt"))
kablebox(d)
file | date | year | icpsr |
---|---|---|---|
data/txt/2018/14871/CREC-2018-10-03-pt1-PgS6492-000004-14871.txt | 2018-10-03 | 2018 | 14871 |
data/txt/2018/14871/CREC-2018-10-11-pt1-PgS6809-2-000017-14871.txt | 2018-10-11 | 2018 | 14871 |
data/txt/2018/14921/CREC-2018-10-05-pt2-PgS6698-2-000009-14921.txt | 2018-10-05 | 2018 | 14921 |
data/txt/2018/14921/CREC-2018-10-05-pt2-PgS6698-2-000010-14921.txt | 2018-10-05 | 2018 | 14921 |
data/txt/2018/14921/CREC-2018-10-11-pt1-PgS6880-3-000018-14921.txt | 2018-10-11 | 2018 | 14921 |
data/txt/2018/14921/CREC-2018-10-11-pt1-PgS6880-3-000019-14921.txt | 2018-10-11 | 2018 | 14921 |
data/txt/2018/14921/CREC-2018-10-11-pt1-PgS6880-3-000020-14921.txt | 2018-10-11 | 2018 | 14921 |
data/txt/2018/15029/CREC-2018-11-13-pt1-PgH9508-4-000043-15029.txt | 2018-11-13 | 2018 | 15029 |
data/txt/2018/15029/CREC-2018-11-13-pt1-PgH9508-4-000044-15029.txt | 2018-11-13 | 2018 | 15029 |
data/txt/2018/15603/CREC-2018-11-09-pt1-PgE1501-2-000014-15603.txt | 2018-11-09 | 2018 | 15603 |
data/txt/2018/15603/CREC-2018-11-09-pt1-PgE1501-2-000015-15603.txt | 2018-11-09 | 2018 | 15603 |
data/txt/2018/20124/CREC-2018-10-23-pt1-PgE1441-3-000028-20124.txt | 2018-10-23 | 2018 | 20124 |
data/txt/2018/20124/CREC-2018-10-23-pt1-PgE1441-3-000029-20124.txt | 2018-10-23 | 2018 | 20124 |
data/txt/2018/20352/CREC-2018-11-14-pt1-PgH9525-000064-20352.txt | 2018-11-14 | 2018 | 20352 |
data/txt/2018/20352/CREC-2018-11-14-pt1-PgH9525-000065-20352.txt | 2018-11-14 | 2018 | 20352 |
data/txt/2018/20352/CREC-2018-11-14-pt1-PgH9525-000066-20352.txt | 2018-11-14 | 2018 | 20352 |
data/txt/2018/20352/CREC-2018-11-14-pt1-PgH9525-000067-20352.txt | 2018-11-14 | 2018 | 20352 |
data/txt/2018/20501/CREC-2018-10-16-pt1-PgE1417-000024-20501.txt | 2018-10-16 | 2018 | 20501 |
data/txt/2018/20501/CREC-2018-10-16-pt1-PgE1417-000025-20501.txt | 2018-10-16 | 2018 | 20501 |
data/txt/2018/20501/CREC-2018-11-09-pt1-PgE1506-2-000016-20501.txt | 2018-11-09 | 2018 | 20501 |
data/txt/2018/20501/CREC-2018-11-09-pt1-PgE1506-2-000017-20501.txt | 2018-11-09 | 2018 | 20501 |
data/txt/2018/20518/CREC-2018-11-02-pt1-PgE1485-2-000001-20518.txt | 2018-11-02 | 2018 | 20518 |
data/txt/2018/20518/CREC-2018-11-02-pt1-PgE1485-2-000002-20518.txt | 2018-11-02 | 2018 | 20518 |
data/txt/2018/20521/CREC-2018-11-13-pt1-PgH9474-000028-20521.txt | 2018-11-13 | 2018 | 20521 |
data/txt/2018/20521/CREC-2018-11-13-pt1-PgH9474-000029-20521.txt | 2018-11-13 | 2018 | 20521 |
data/txt/2018/20521/CREC-2018-11-13-pt1-PgH9507-000046-20521.txt | 2018-11-13 | 2018 | 20521 |
data/txt/2018/20528/CREC-2018-11-02-pt1-PgE1491-7-000005-20528.txt | 2018-11-02 | 2018 | 20528 |
data/txt/2018/20528/CREC-2018-11-02-pt1-PgE1491-7-000006-20528.txt | 2018-11-02 | 2018 | 20528 |
data/txt/2018/20705/CREC-2018-10-30-pt1-PgE1481-4-000034-20705.txt | 2018-10-30 | 2018 | 20705 |
data/txt/2018/20705/CREC-2018-10-30-pt1-PgE1481-4-000035-20705.txt | 2018-10-30 | 2018 | 20705 |
data/txt/2018/20717/CREC-2018-11-13-pt1-PgS6910-2-000054-20717.txt | 2018-11-13 | 2018 | 20717 |
data/txt/2018/20750/CREC-2018-10-26-pt1-PgH9449-13-000033-20750.txt | 2018-10-26 | 2018 | 20750 |
data/txt/2018/20955/CREC-2018-10-26-pt1-PgE1462-2-000031-20955.txt | 2018-10-26 | 2018 | 20955 |
data/txt/2018/20955/CREC-2018-10-26-pt1-PgE1462-2-000032-20955.txt | 2018-10-26 | 2018 | 20955 |
data/txt/2018/20958/CREC-2018-11-02-pt1-PgE1487-3-000036-20958.txt | 2018-11-02 | 2018 | 20958 |
data/txt/2018/20958/CREC-2018-11-02-pt1-PgE1487-3-000037-20958.txt | 2018-11-02 | 2018 | 20958 |
data/txt/2018/20959/CREC-2018-10-12-pt1-PgE1401-4-000021-20959.txt | 2018-10-12 | 2018 | 20959 |
data/txt/2018/20959/CREC-2018-10-12-pt1-PgE1401-4-000022-20959.txt | 2018-10-12 | 2018 | 20959 |
data/txt/2018/21109/CREC-2018-11-09-pt1-PgE1502-2-000041-21109.txt | 2018-11-09 | 2018 | 21109 |
data/txt/2018/21109/CREC-2018-11-09-pt1-PgE1502-2-000042-21109.txt | 2018-11-09 | 2018 | 21109 |
data/txt/2018/21111/CREC-2018-11-13-pt1-PgE1513-3-000021-21111.txt | 2018-11-13 | 2018 | 21111 |
data/txt/2018/21111/CREC-2018-11-13-pt1-PgE1513-3-000022-21111.txt | 2018-11-13 | 2018 | 21111 |
data/txt/2018/21111/CREC-2018-11-13-pt1-PgE1513-3-000023-21111.txt | 2018-11-13 | 2018 | 21111 |
data/txt/2018/21303/CREC-2018-11-13-pt1-PgE1509-000019-21303.txt | 2018-11-13 | 2018 | 21303 |
data/txt/2018/21303/CREC-2018-11-13-pt1-PgE1509-000020-21303.txt | 2018-11-13 | 2018 | 21303 |
data/txt/2018/21316/CREC-2018-11-06-pt1-PgE1495-2-000039-21316.txt | 2018-11-06 | 2018 | 21316 |
data/txt/2018/21316/CREC-2018-11-06-pt1-PgE1495-2-000040-21316.txt | 2018-11-06 | 2018 | 21316 |
data/txt/2018/21324/CREC-2018-11-14-pt1-PgH9514-000062-21324.txt | 2018-11-14 | 2018 | 21324 |
data/txt/2018/21324/CREC-2018-11-14-pt1-PgH9514-000063-21324.txt | 2018-11-14 | 2018 | 21324 |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1495-3-000008-21332.txt | 2018-11-06 | 2018 | 21332 |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1495-3-000009-21332.txt | 2018-11-06 | 2018 | 21332 |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1498-000010-21332.txt | 2018-11-06 | 2018 | 21332 |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1498-000011-21332.txt | 2018-11-06 | 2018 | 21332 |
data/txt/2018/21332/CREC-2018-11-14-pt1-PgE1520-000057-21332.txt | 2018-11-14 | 2018 | 21332 |
data/txt/2018/21332/CREC-2018-11-14-pt1-PgE1520-000058-21332.txt | 2018-11-14 | 2018 | 21332 |
data/txt/2018/21358/CREC-2018-11-13-pt1-PgE1511-2-000043-21358.txt | 2018-11-13 | 2018 | 21358 |
data/txt/2018/21358/CREC-2018-11-13-pt1-PgE1511-2-000044-21358.txt | 2018-11-13 | 2018 | 21358 |
data/txt/2018/21368/CREC-2018-10-05-pt1-PgH9420-11-000008-21368.txt | 2018-10-05 | 2018 | 21368 |
data/txt/2018/21521/CREC-2018-11-02-pt1-PgE1489-2-000003-21521.txt | 2018-11-02 | 2018 | 21521 |
data/txt/2018/21521/CREC-2018-11-02-pt1-PgE1489-2-000004-21521.txt | 2018-11-02 | 2018 | 21521 |
data/txt/2018/21525/CREC-2018-11-14-pt1-PgH9526-8-000068-21525.txt | 2018-11-14 | 2018 | 21525 |
data/txt/2018/21537/CREC-2018-10-09-pt1-PgH9425-9-000011-21537.txt | 2018-10-09 | 2018 | 21537 |
data/txt/2018/21704/CREC-2018-11-13-pt1-PgE1516-000024-21704.txt | 2018-11-13 | 2018 | 21704 |
data/txt/2018/21704/CREC-2018-11-13-pt1-PgE1516-000025-21704.txt | 2018-11-13 | 2018 | 21704 |
data/txt/2018/21708/CREC-2018-10-23-pt1-PgH9443-5-000030-21708.txt | 2018-10-23 | 2018 | 21708 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000045-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000046-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000047-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000048-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000049-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000050-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000051-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000052-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000053-21714.txt | 2018-11-13 | 2018 | 21714 |
data/txt/2018/21740/CREC-2018-10-16-pt1-PgH9436-9-000026-21740.txt | 2018-10-16 | 2018 | 21740 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000030-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000031-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000032-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000033-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000034-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000035-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000036-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29321/CREC-2018-11-13-pt1-PgH9495-000037-29321.txt | 2018-11-13 | 2018 | 29321 |
data/txt/2018/29323/CREC-2018-11-14-pt1-PgE1523-000059-29323.txt | 2018-11-14 | 2018 | 29323 |
data/txt/2018/29323/CREC-2018-11-14-pt1-PgE1523-000060-29323.txt | 2018-11-14 | 2018 | 29323 |
data/txt/2018/29373/CREC-2018-10-10-pt1-PgS6767-2-000012-29373.txt | 2018-10-10 | 2018 | 29373 |
data/txt/2018/29378/CREC-2018-10-12-pt1-PgH9431-7-000023-29378.txt | 2018-10-12 | 2018 | 29378 |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000038-29561.txt | 2018-11-13 | 2018 | 29561 |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000039-29561.txt | 2018-11-13 | 2018 | 29561 |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000040-29561.txt | 2018-11-13 | 2018 | 29561 |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000041-29561.txt | 2018-11-13 | 2018 | 29561 |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000042-29561.txt | 2018-11-13 | 2018 | 29561 |
data/txt/2018/29768/CREC-2018-10-05-pt1-PgE1372-4-000006-29768.txt | 2018-10-05 | 2018 | 29768 |
data/txt/2018/29768/CREC-2018-10-05-pt1-PgE1372-4-000007-29768.txt | 2018-10-05 | 2018 | 29768 |
data/txt/2018/41101/CREC-2018-10-01-pt1-PgS6414-2-000001-41101.txt | 2018-10-01 | 2018 | 41101 |
data/txt/2018/41703/CREC-2018-10-11-pt1-PgS6793-2-000013-41703.txt | 2018-10-11 | 2018 | 41703 |
data/txt/2018/41703/CREC-2018-10-11-pt1-PgS6793-2-000014-41703.txt | 2018-10-11 | 2018 | 41703 |
data/txt/2018/41703/CREC-2018-10-11-pt1-PgS6793-2-000015-41703.txt | 2018-10-11 | 2018 | 41703 |
data/txt/2018/41703/CREC-2018-10-11-pt1-PgS6793-2-000016-41703.txt | 2018-10-11 | 2018 | 41703 |
data/txt/2018/NA/CREC-2018-10-02-pt1-PgH9411-3-000002-NA.txt | 2018-10-02 | 2018 | NA |
%<>% filter(icpsr != "NA")
d
# a function to grab sentences with keywords
function(file, word){
keyword_sentence <- read_lines(here::here(file)) %>%
text <- str_c(collapse = " ") %>%
str_squish()
if( str_detect(text, regex(word, ignore_case = T) ) ){
%<>%
text enframe(name = NULL, value = "text") %>%
unnest_tokens(sentence, text, token = "sentences") %>%
filter(str_detect(sentence, word)) %>%
.$sentence %>%
str_c(collapse = "...")
else {
} NA
text <-
}
return(text)
}
## test
# keyword_sentence(d$file[1], "i am")
%<>% mutate(district_sentences = purrr::map_chr(d$file, keyword_sentence, word = "district"))
d
%>% filter(!is.na(district_sentences)) %>% kablebox() d
file | date | year | icpsr | district_sentences |
---|---|---|---|---|
data/txt/2018/15603/CREC-2018-11-09-pt1-PgE1501-2-000015-15603.txt | 2018-11-09 | 2018 | 15603 | speaker, saint joseph’s medical center located in my district will host its annual ball on november 3 and has chosen an outstanding slate of honorees for the event. |
data/txt/2018/20501/CREC-2018-11-09-pt1-PgE1506-2-000017-20501.txt | 2018-11-09 | 2018 | 20501 | for over 30 years, gary has devoted himself to the fresno agricultural community by serving as an advocate and leader in his position as general manager of the fresno irrigation district….gary joined the fresno irrigation district in 1986, where he held the positions of watermaster and assistant manager of operations….as a result of gary’s hard work and significant contributions, he was appointed the general manager of the fresno irrigation district in 2000….as general manager, gary has skillfully navigated the district through some of the most trying times for california agriculture….it is both fitting and appropriate that he is named the 2018 agriculturist of the year, as he retires from his position as general manager of the fresno irrigation district at the end of this year. |
data/txt/2018/20958/CREC-2018-11-02-pt1-PgE1487-3-000037-20958.txt | 2018-11-02 | 2018 | 20958 | as president of the willows unified school district board she works with other board members to ensure the school district provides a safe and engaging learning environment where each student has the opportunity to realize their full potential, develop respect and tolerance for others, and ultimately become a productive member of the community….taylor is an integral part of realizing the school district’s mission of preparing today’s students for tomorrow’s challenges….taylor was also a founding member of the painted ladies, a group that worked to fill the gaps in school budgets by cleaning and painting classrooms throughout the district. |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1495-3-000009-21332.txt | 2018-11-06 | 2018 | 21332 | speaker, i rise today, on behalf of the entire 6th congressional district of indiana, to recognize state senator jean leising for her contribution to our state….as the chair of the senate committee on agriculture, jean has been an advocate for farmers in her district, leading on issues that impact family farms across indiana. |
data/txt/2018/21332/CREC-2018-11-06-pt1-PgE1498-000011-21332.txt | 2018-11-06 | 2018 | 21332 | speaker, i rise today, on behalf of the entire 6th congressional district of indiana, to recognize mayor chuck fewell for his contribution to our state and the city of greenfield. |
data/txt/2018/21332/CREC-2018-11-14-pt1-PgE1520-000058-21332.txt | 2018-11-14 | 2018 | 21332 | speaker, i rise today, on behalf of the entire 6th congressional district of indiana, to recognize john meredith for his contribution to our state and wayne county. |
data/txt/2018/21704/CREC-2018-11-13-pt1-PgE1516-000025-21704.txt | 2018-11-13 | 2018 | 21704 | as a proud veteran of the second world war and an upstanding member of his community, john is an indispensable part of michigan’s first district….veterans day weekend gave those in michigan’s first district an opportunity to show their thanks for the sacrifices demanded of john and all of our servicemen and women….i ask that you join with me and the people of michigan’s first district in thanking him for his unwavering commitment to our nation and its people. ____________________ </pre></body></html> |
data/txt/2018/21714/CREC-2018-11-13-pt1-PgH9510-000047-21714.txt | 2018-11-13 | 2018 | 21714 | a bill to amend title 40, united states code, to prohibit the commission of fine arts from exercising authority over non-federal property in the district of columbia, and for other purposes; to the committee on oversight and government reform. |
data/txt/2018/29323/CREC-2018-11-14-pt1-PgE1523-000060-29323.txt | 2018-11-14 | 2018 | 29323 | numerous county offices of education and school districts throughout california operate similar programs. |
data/txt/2018/29561/CREC-2018-11-13-pt1-PgH9500-2-000042-29561.txt | 2018-11-13 | 2018 | 29561 | speaker, i chose tonight to be sworn in on our constitution, a document that begins with our uniquely american creed, ``we the people,’’ a charge and a challenge to faithfully represent the people of my district and the entire country, a charge i promise to honor every day with all of my might. |
data/txt/2018/29768/CREC-2018-10-05-pt1-PgE1372-4-000007-29768.txt | 2018-10-05 | 2018 | 29768 | speaker, it is with great honor that i recognize ichs for their essential work in the 9th district and the surrounding area, and i wish them continued success in their mission. ____________________ </pre></body></html> |
data/txt/2020/14854/CREC-2020-02-06-pt1-PgE143-000050-14854.txt | 2020-02-06 | 2020 | 14854 | madam speaker, i rise today to pay tribute to mark davis in appreciation of his dedicated service to the people of kentucky’s fifth congressional district, specifically through his tireless work with operation unite and the eastern kentucky pride organization over the last two decades. |
data/txt/2020/14873/CREC-2020-02-06-pt1-PgE144-2-000052-14873.txt | 2020-02-06 | 2020 | 14873 | we are almost in the city center, in a district called bayerisches viertel, the bavarian quarter….this was a district inhabited by german intelligentsia of jewish origin. |
data/txt/2020/14921/CREC-2020-07-02-pt1-PgS4211-2-000030-14921.txt | 2020-07-02 | 2020 | 14921 | the senior assistant legislative clerk read the nominations of owen mccurdy cypher, of michigan, to be united states marshal for the eastern district of michigan for the term of four years; thomas l….foster, of virginia, to be united states marshal for the western district of virginia for the term of four years; and tyreece l….miller, of tennessee, to be united states marshal for the western district of tennessee for the term of four years. |
data/txt/2020/14921/CREC-2020-07-02-pt1-PgS4211-2-000031-14921.txt | 2020-07-02 | 2020 | 14921 | the senior assistant legislative clerk read the nominations of owen mccurdy cypher, of michigan, to be united states marshal for the eastern district of michigan for the term of four years; thomas l….foster, of virginia, to be united states marshal for the western district of virginia for the term of four years; and tyreece l….miller, of tennessee, to be united states marshal for the western district of tennessee for the term of four years. |
data/txt/2020/15431/CREC-2020-07-01-pt1-PgE608-5-000004-15431.txt | 2020-07-01 | 2020 | 15431 | in my district, many communities predate the interstate system….our amendment overturns a federal rule on sales taxes that uniquely affects clayton county in my district. |
data/txt/2020/20138/CREC-2020-06-04-pt1-PgE512-3-000011-20138.txt | 2020-06-04 | 2020 | 20138 | madam speaker, as i travel throughout the second congressional district, i have been inspired by the outpouring of support and generosity for those in need defeating the wuhan virus. |
data/txt/2020/20340/CREC-2020-02-06-pt1-PgE140-000019-20340.txt | 2020-02-06 | 2020 | 20340 | on behalf of the united states house of representatives and the people of the first district of north carolina, i express appreciation to mrs. |
data/txt/2020/20907/CREC-2020-03-09-pt1-PgE275-3-000135-20907.txt | 2020-03-09 | 2020 | 20907 | staples high school joins the ranks of several state champions stemming from the fourth congressional district….i join the westport community and fourth congressional district in recognizing these students, along with their teacher suzanne kammerman, on this accomplishment. ____________________ </pre></body></html> |
data/txt/2020/20946/CREC-2020-03-02-pt1-PgE241-4-000004-20946.txt | 2020-03-02 | 2020 | 20946 | she attended school and graduated from palm beach county school district….she was initially hired in the school district as a high school english teacher….after five years, she became the jefferson county upper elementary principal and during her tenure it led her to the appointment as the district’s school improvement officer and curriculum director….faye brown for her hard work and dedication in the jefferson county school district. ____________________ </pre></body></html> |
data/txt/2020/20948/CREC-2020-04-22-pt1-PgE380-3-000032-20948.txt | 2020-04-22 | 2020 | 20948 | on behalf of the twenty-second congressional district of texas, we extend our deep gratitude to forward science for their commendable actions filling the gap and helping our healthcare professionals at this critical time. |
data/txt/2020/21149/CREC-2020-02-05-pt1-PgH772-2-000018-21149.txt | 2020-02-05 | 2020 | 21149 | i rise today to pay tribute to a resident of missouri’s fourth district who was recently honored as one of the milken family foundation’s outstanding educators. |
data/txt/2020/21149/CREC-2020-02-05-pt1-PgH772-2-000019-21149.txt | 2020-02-05 | 2020 | 21149 | i rise today to pay tribute to a resident of missouri’s fourth district who was recently honored as one of the milken family foundation’s outstanding educators. |
data/txt/2020/21149/CREC-2020-02-05-pt1-PgH772-2-21149.txt | 2020-02-05 | 2020 | 21149 | i rise today to pay tribute to a resident of missouri’s fourth district who was recently honored as one of the milken family foundation’s outstanding educators. |
data/txt/2020/21150/CREC-2020-02-06-pt1-PgE140-5-000017-21150.txt | 2020-02-06 | 2020 | 21150 | i am honored to recognize their achievements, and i thank each team for making missouri’s 7th district proud. ____________________ </pre></body></html> |
data/txt/2020/21329/CREC-2020-05-01-pt1-PgE413-000004-21329.txt | 2020-05-01 | 2020 | 21329 | it is because of student leaders such as bryce faworski that i am especially proud to serve illinois’ 17th congressional district. |
data/txt/2020/21329/CREC-2020-05-05-pt1-PgE423-4-000014-21329.txt | 2020-05-05 | 2020 | 21329 | it is because of student leaders such as dallas krueger that i am especially proud to serve illinois’ 17th congressional district. |
data/txt/2020/21329/CREC-2020-07-01-pt1-PgH2993-5-000009-21329.txt | 2020-07-01 | 2020 | 21329 | i represent a district where 85 percent of the towns are 5,000 people or fewer, and 60 percent of the towns are 1,000 people or fewer….earlier this year, i reached out to every mayor and city administrator and county administrator and village president, 151 leaders, representing towns and counties in the 7,000 square miles in the congressional district that i serve….today, for the district i represent, and for all rural communities, i will be proud to cast my vote in favor of this important legislation. ____________________ </pre></body></html> |
data/txt/2020/21545/CREC-2020-02-06-pt1-PgH851-4-000041-21545.txt | 2020-02-06 | 2020 | 21545 | last month, i had the opportunity to meet with jules oringel of charlotte when she led the pledge of allegiance at my state of the district address….the towns of the 12th congressional district lost people to gun violence as well. |
data/txt/2020/21545/CREC-2020-02-06-pt1-PgH851-4-21545.txt | 2020-02-06 | 2020 | 21545 | last month, i had the opportunity to meet with jules oringel of charlotte when she led the pledge of allegiance at my state of the district address….the towns of the 12th congressional district lost people to gun violence as well. |
data/txt/2020/21723/CREC-2020-02-07-pt1-PgH940-000055-21723.txt | 2020-02-07 | 2020 | 21723 | as representative of the fifth congressional district, i am very proud to stand beside yele’s many friends and neighbors throughout north jersey whose hope is that he will soon be able to be reunited with his family. |
data/txt/2020/21723/CREC-2020-02-07-pt1-PgH940-21723.txt | 2020-02-07 | 2020 | 21723 | as representative of the fifth congressional district, i am very proud to stand beside yele’s many friends and neighbors throughout north jersey whose hope is that he will soon be able to be reunited with his family. |
data/txt/2020/21730/CREC-2020-02-06-pt1-PgE139-2-000002-21730.txt | 2020-02-06 | 2020 | 21730 | the 8th congressional district of illinois is home to many sikh-americans and sikh faithful….madam speaker, on this 550th anniversary of his birth i want to recognize the founding contribution of guru nanak to sikhism, one of the world’s great religions, and also the many gurdwaras in the greater chicagoland area and the sikh faithful in the 8th congressional district, for their service to their communities. ____________________ </pre></body></html> |
data/txt/2020/21986/CREC-2020-03-05-pt1-PgE267-3-000044-21986.txt | 2020-03-05 | 2020 | 21986 | madam speaker, on february 7, 2020, i was unable to vote because i was with constituents from my district to participate in an official event with the president of the united states in north carolina. |
data/txt/2020/29323/CREC-2020-02-06-pt1-PgE139-4-000006-29323.txt | 2020-02-06 | 2020 | 29323 | dean has contributed immensely to the betterment of our region and i am proud to call him a friend, a fellow community member, american and a constituent of the 42nd congressional district of california. |
data/txt/2020/29368/CREC-2020-03-02-pt1-PgE241-4-000003-29368.txt | 2020-03-02 | 2020 | 29368 | she attended school and graduated from palm beach county school district….she was initially hired in the school district as a high school english teacher….after five years, she became the jefferson county upper elementary principal and during her tenure it led her to the appointment as the district’s school improvement officer and curriculum director….faye brown for her hard work and dedication in the jefferson county school district. ____________________ </pre></body></html> |
data/txt/2020/29901/CREC-2020-02-06-pt1-PgE139-5-000008-29901.txt | 2020-02-06 | 2020 | 29901 | madam speaker, i rise today to honor greater vallejo recreation district (gvrd) on the 75th anniversary of its founding….greater vallejo recreation district was established in june of 1944 and has provided exceptional services to better the quality of life in our community ever since….this independent special service district offers recreational activities and leisure services to the people of vallejo….greater vallejo recreation district manages mostly city-owned recreational properties and an additional 1000 acres of public land….the district organizes basketball and baseball trainings for children and teens along with cardio and weight lifting classes for adults….madam speaker, the greater vallejo recreation district has been instrumental in fulfilling the educational and amusement needs of our community….it is therefore fitting and proper that we honor the greater vallejo recreation district on the 75th anniversary of its founding. |
data/txt/2020/29901/CREC-2020-02-06-pt1-PgE142-2-000036-29901.txt | 2020-02-06 | 2020 | 29901 | jolly has served our district and our country for decades. |
%>% mutate(district_preface = str_extract_all(district_sentences, "\\w+ district")) %>%
d unnest(district_preface) %>% count(district_preface, sort = T) %>% drop_na(district_preface) %>% kablebox()
district_preface | n |
---|---|
congressional district | 18 |
school district | 10 |
the district | 9 |
my district | 5 |
recreation district | 5 |
first district | 4 |
irrigation district | 4 |
western district | 4 |
a district | 3 |
fourth district | 3 |
eastern district | 2 |
7th district | 1 |
9th district | 1 |
her district | 1 |
our district | 1 |
service district | 1 |
%>% add_count(icpsr, name = "Speeches") %>%
d drop_na(district_sentences, icpsr) %>%
add_count(icpsr, name = "Speeches_with_district") %>%
mutate(district_sentences = str_split(district_sentences, "\\.\\.\\.")) %>%
unnest(district_sentences) %>%
count(icpsr, Speeches, Speeches_with_district, name = "Total mentions of distirct") %>%
kablebox()
icpsr | Speeches | Speeches_with_district | Total mentions of distirct |
---|---|---|---|
14854 | 2 | 1 | 1 |
14873 | 15 | 1 | 2 |
14921 | 24 | 2 | 6 |
15431 | 3 | 1 | 2 |
15603 | 2 | 1 | 1 |
20138 | 4 | 1 | 1 |
20340 | 4 | 1 | 1 |
20501 | 4 | 1 | 5 |
20907 | 2 | 1 | 2 |
20946 | 6 | 1 | 4 |
20948 | 2 | 1 | 1 |
20958 | 2 | 1 | 3 |
21149 | 3 | 3 | 3 |
21150 | 2 | 1 | 1 |
21329 | 6 | 3 | 5 |
21332 | 6 | 3 | 4 |
21545 | 2 | 2 | 4 |
21704 | 2 | 1 | 3 |
21714 | 49 | 1 | 1 |
21723 | 3 | 2 | 2 |
21730 | 2 | 1 | 2 |
21986 | 2 | 1 | 1 |
29323 | 6 | 2 | 2 |
29368 | 6 | 1 | 4 |
29561 | 5 | 1 | 1 |
29768 | 2 | 1 | 1 |
29901 | 14 | 2 | 8 |