Scribed by Sage P.
“Room A Section 1: Topic- Intro to API’s
DPLA- Digital Public Library
API-Application programming interface
Behind the scenes, to get data
Why?
Scrape data from webite
Data in a structured way
need
Chrome
Postman in chrome (Add app)
(Don’t have to sign up)
hit “send”
It will email a key
API request
Protocol-Domain-version-action-parameter
Switch to GET. Enter domain: api/dpla/v2/items
Params (parameter)
Key & Value
Api_key… # from email
Key: q (query)
Value: (search)
Click send
0 is first not 1
Count -# of matched
Start Limit Docs
Different way to search, defined on Dpla website
Specific info back
(ID & titles) put in fields Pagination Saving it is by send & download
More way to access
Web console
Web application- developed gwu library
Commandline- twarc (twitter app)
Code- can always write code
(depend on API and what you want)
Read documentation (to understand)
You may not understand data if you don’t
Fairly stable
Tweeting slides”
]]>Rebecca Benefiel presented and talked about using tools for data analysis that would be useful for businesses, social media, and email communications. Rebecca said that when you analyze text, which approaches do you use depends on what is the focus of your research. Specifically, for social research your often confronted with a lot of language material. In essence, when you do the analysis with one of the tools you focus on detailed analysis of text. Some of the uses of text analysis tools that were mentioned and briefly explained are:
Also, Rebecca mentioned Gephi; another tool platform, which is used for data analysts and scientists keen to explore and understand graphs. A tool in which “the user interacts with the representation, manipulate the structures, shapes and colors to reveal hidden patterns”.
Some of the tools for text analysis mentioned are:
Dirt Directory (dirtdirectory.org)
Voyant (voyant-tools.org) for text analysis
Open Refine (openrefine.org)
AND Notes from Neil on the same workshop Tool Sharing w/ Rebecca (Text Analysis
THAT Camp Conference Summary
Transcribed by: Manuel Fiallos Garcia
The conference I was in talked about Wikipedia. It focused in the fact that there’s a lot of information in Wikipedia that you can come to believe it’s fake. That is because people could edit information easily and there was no problem with it, even if it was fake. People started doubting about using Wikipedia as a credible source and developed a tendency to disapprove it as a primary source of information. In the conference, they explained that there are actually back up proves of who the author is and what he added to the information, bringing more credibility to the source. As well, now you need to become an approved user to be able to edit in Wikipedia. They told us that also if you get to add too much fake information, people can flag you and you can get to a point were you are blocked from editing the content in Wikipedia. There was also someone who had edited a page in Wikipedia and showed us where her user appeared and what she had edited. A lady was saying that there might be content that even though it is backed up by an author’s biography website or some sort of website containing the content he added up; there might a probability that what he added is fake and could have make up that information in the website just to add content to Wikipedia. We saw a video containing how to edit Wikipedia and how people really use this tool to spread the information they know and contribute to the digital information world. This is the audio file that contains the conference, it is not that great the audio but it can provide a further insight of what they talked in the session. Thanks.
]]>
Hosted by: David M- PHD at George Mason
Excel
Pros:
-Helpful for social media analysis
Cons:
-Information overload
Google Sheets, Google Docs, etc.
Pros:
-Easy to share with a broader community
-Able to receive feedback in real-time
-Can access data from anywhere with a wifi connection
-Free, no costly fees of database subscriptions, software expenses etc.
-User friendly
Cons:
-Runs risk of glitches
-Less shortcuts than excel
-Not as advanced as excel when it comes to visual analyzation ie: pivot tables, etc.
Alternative methods:
-Database(s)
-Open Refined Program (refined.org)
-Sequel
Conclusion:
-All comes down to personal preference
-It is possible to use various programs simultaniously for different needs
-Tailor your user experience to match project needs
-Research alternative methods to expedite process
-Share tools + tips with the digital community for enhanced user experience
What are people using?
Session 1 – WordPress – Ruth & Stephen B.
Free & Open.
WordPress runs about 27% websites on the internet
Started with blogs
Difference btw wordpress.com (hosted solution) and wordpress.org
WordPress is open built software
Matt Mullenwag created wordpress – founded company called automatic in mid 2000s
Unlike Facebook, can get the wordpress software (opensource)
WordPress.com vs. WordPress.org
The way WordPress is built is to be “extensible”
Totally true that if you get the wordpress software, you can do whatever you want with it on your own website. Part of the reason why it took off so much. People like to customize.
Extensible nature comes into play with themes and plugins.
Themes control how site works — can customize themes and plugins on your own site
Plug-ins are simple idea, instead of customizing the look and feel of your website, they allow you to customize what your website can do. There are plugins for everything. Lists of plugins can be found on wordpress.org
You can embed everything from videos on youtube to maps on google and even documents.
Software is basically the same on either
People will develop for own site and then give it away for other people to use
Others use it as a portfolio for job searches
Is there a difference between plug in (function) and embedding (content)
WordPress.com is a great tool to teach people about WordPress.org, and a great way to learn web design, code, etc.
Iframe could be helpful for people who are looking to embed/plug in things that are more complex
W3schools.com
Can control pixel width and height — control size of images — in iframe code
The blog part is also valuable, albeit “old fashioned.”
Can be used in a classroom setting to make the discussion more interactive
How to get a high profile?
-don’t use a hosted site
Reclaim Hosting — place to buy web hosting
What are you best uses for WordPress as undergrad students/people preparing to go into career after graduation?
The kinds of things you can put up on your own website that you can’t do on LinkedIn. Can post coursework, papers that you’ve done, actually documenting what you did in courses.
Really good for a portfolio
Benefits of having your own websites
It’s very customizable
Can add your twitter feed, LibraryThing (what you’re currently reading)
There are plugins that allow you to integrate your social media for “sharing” purposes
Medium.com also can be used for long form writing
How to gain/improve readership
Be lucky (lol)
There are so many systems of engagement, work to connect them all.
Orchid — researcher identifier
Be authentic and water your presence like a garden
Content and FREQUENT content are very important.
Keep things up to date
How to approach DH?
-Text analysis
-Social network analysis
-Geo-spatial mapping
-Distance reading / content analysis
-Visual/sound analysis
-Visualization
Resources
Dirt Directory (dirtdirectory.org)
-comprehensive website/registry listing resources to help you conduct research
-can be categorized by your approach (text analysis, numeric data, etc.)
Tags (for twitter date collection)
-allows you to collect any tweet you want by the minute
-only need twitter and gmail account
-using twitter’s API including location, vast amounts of data
Voyant (voyant-tools.org) for text analysis
-load your own dataset
-enables you to quantify the humanities into datasets just as scientists and social scientists do
-shows (from left to right) a word cloud, an automatic summary (including words per sentence, frequent words, distinctive words, vocabulary density, etc.), the top five words, and words preceding and following specific words
-tool to exclude phrases you do not want to count as words
Programming Historian (programminghistorian.org)
-valuable especially for isolated regions where resources may be more limited
-always looking for contributors
-tutorials are well-written
-using regular expression to clean OCR text
Open Refine (openrefine.org)
Text grid labs – downloadable application for text analysis
-upload photos of manuscript
-can embed links, etc.
Gephi (gephi.org) for visualization
Palladio (hdlab.standford.edu/palladio) for visualizing historical data
-perfect for exploring and catered to be user-friendly
-partially funded by NEH
Google nGram
Social network analysis
-lots of statistics
-all you need is two columns of two related persons
-difference from Palladio – shows nodes (persons beyond the first degree of separation)
-analysis includes:
-maximum geodesic distance – diameter (“hops” of degrees of separation from one side of the chart to the other side)
-centrality (how many times people have go through you to get to another relation)
-exemplifies “power law curve”
-Eigenvector unit – “proximity to power” (how close you are to people with high scores of centrality)
Oxygen
Omeka
-omeka.org and omeka.net
-free, easy, nice to use
-really good at presenting all the metadata, making it very accessible
-comprehensive source for manuscript, images, audio, video
Zotero
-good for articles, books, embedding
-create things in zotero and you can embed on Omeka using a connecting tool
Wikis and Wikithons
Transcribed by: Hope Gillespie
Transcribed by: Rachel Cousins