a11y: abbreviation for computer accessibility for all people regardless of disability. See http://a11yproject.com/.
algorithm: “A rigid, logical argument made in regularized terms” (Lisa Rhody).
API (application program interface): provides the link between two systems, allowing them to communicate. On the internet, an API allows you to access a web service with another program or software. For instance, a program you write on your computer might ask a museum database for results that match a certain criteria.
API key: when using an API, you need a unique key for access. Usually provided by the API creator when you sign up for the service.
backchannel: a secondary conversation, often taking place on Twitter using a hashtag, where people share relevant links and clarify terms.
backend: administrative side where you can make technical and content changes that is not public-facing, aka “control panel” or “dashboard.”
borked: broken (for the moment).
CamelCase: writing a word without spaces but with the first letter of each word capitalized (e.g., CamelCase, MarySue, PowerPoint, VistaVision, HyperCard).
CMS (content management system): a computer program (e.g., Drupal Gardens) that allows publishing, editing, and modifying content as well as maintenance from a central interface. Such systems of content management provide procedures to manage workflow in a collaborative environment. CMSs have been available since the late 1990s. CMSs are often used to run websites containing blogs, news, and shopping. CMSs typically aim to avoid the need for hand coding but may support it for specific elements or entire pages (from https://en.wikipedia.org/wiki/Content_management_system).
corpus: a set of texts used for analysis. The corpus has usually been normalized and compiled into a single file format.
CSS (Cascading Style Sheets): a markup language (code) to describe the “look and formatting” of a document or webpage (from http://en.wikipedia.org/wiki/Cascading_Style_Sheets). See also http://www.w3schools.com/css/.
CSV (comma separated values): a.k.a. character separated values. A file with a series of records made up of fields, where each field is separated by a comma or other specific character (; | / ). Easily created via a spreadsheet program like Excel, GoogleDocs, or Numbers. A good way to move information between databases/platforms. See http://en.wikipedia.org/wiki/Comma-separated_values.
DAMS (digital asset management systems): computer software and hardware for “downloading, renaming, backing up, rating, grouping, archiving, optimizing, maintaining, thinning, and exporting files” (from http://en.wikipedia.org/wiki/Digital_asset_management).
Distant Reading: from Franco Moretti, looking for trends over large corpora of works.
doi (digital object identifier): a managed, persistent, trackable link to an online publication. See www.doi.org.
Dublin Core: an internationally recognized metadata standard for describing any conceivable resource, comprised of fifteen elements, including “title,” “description,” “date,” and “format” (definition adapted from http://omeka.org/codex/Creating_an_Element_Set).
field: “Any one of a number of places where a user is expected to enter a single item of a particular type of data; an item of such data; esp. one in a database record” (OED definition 19).
FTP (File Transfer Protocol) Client: a program that lets a user transfer computer files from one host, such as your local computer, to a web-based server so that it can be available or viewed on the Web. See also SFTP: Secure File Transfer Protocol.
GIS (geographic information systems): a computer system (or web-based system) designed to “capture, store, manipulate, analyze, manage, and present” information about geographic data (from https://en.wikipedia.org/wiki/Geographic_information_system). Although GIS can be used to create maps, they are also capable of creating different forms of representation.
Github: a place for sharing open source code and any other kinds of files that someone else can grab.
GLAM: acronym for “galleries, libraries, archives, and museums.”
HTML (HyperText Markup Language): “the standard markup language used to create webpages” (from http://en.wikipedia.org/wiki/HTML). Markup in this case means formatting things like links, emphasis (bold, italics), and header. See also http://www.w3schools.com/html/.
KML (Keyhole Markup Language)/KMZ file: XML-based file format used to display geographic data. Google KML documentation: https://developers.google.com/kml/.
LAMP (Linux, Apache, MySQL, PHP/Python): a software combination for servers where Linux is the operating system, Apache is the web server, MySQL is the database, and PHP/Python is the scripting language. See http://en.wikipedia.org/wiki/LAMP_(software_bundle).
LMS (learning management system): a program that facilitates course management, content, and administration (e.g., Blackboard).
machine readable: data formatted in a way that allows it to be immediately read and processed by a computer. See http://opendatahandbook.org/glossary/en/terms/machine-readable/.
metadata: data about data, or descriptive information about a thing. Metadata is what you read in library catalog records or museum collections management systems. Wikipedia has a list of available metadata systems. Getty provides a glossary for metadata.
NLP (natural language processing): enables computers to parse information from “human language” (prose). See http://en.wikipedia.org/wiki/Natural_language_processing.
OAI-PMH (Open Archives Initiative Protocal for Metadata Harvesting): “a low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP” (from http://www.openarchives.org/pmh/).
OCR (optical character recognition): conversion of images (photographs, scans) to machine/computer-readable text. See http://en.wikipedia.org/wiki/Optical_character_recognition.
Omeka: open-source content management system (see above) which uses an item (object/image/document) as the primary piece (as opposed to WordPress, which uses the post). See www.omeka.org.
open source: a software development model that encourages community collaboration by making the underlying code available to all for free, to be enhanced, remixed, and repurposed. See https://en.wikipedia.org/wiki/Open-source_model.
programming languages: used to write the programs, functions, and algorithms that provide the background functionality of websites and software (e.g., Python, R, Ruby, C++, and many, many more).
public history: “public history describes the many and diverse ways in which history is put to work in the world. In this sense, it is history that is applied to real-world issues. In fact, applied history was a term used synonymously and interchangeably with public history for a number of years. Although public history has gained ascendance in recent years as the preferred nomenclature especially in the academic world, applied history probably remains the more intuitive and self-defining term” (from http://ncph.org/cms/what-is-public-history/).
RDF (Resource Description Framework): originally built as a metadata model, RDF is machine-readable and often used with web resources.
Responsive: “a web design approach aimed at crafting sites to provide an optimal viewing experience—easy reading and navigation with a minimum of resizing, panning, and scrolling—across a wide range of devices (from mobile phones to desktop computer monitors)” (from http://en.wikipedia.org/wiki/Responsive_web_design).
SFTP (Secure File Transfer Protocol): See above, FTP Client.
slug: (in Omeka) the last part of the URL for a page, such as an exhibit page, simple page, or blog post (e.g., in http://mallhistory.org/explorations/show/operasinger, the slug is operasinger).
SQL (Structured Query Language): most widely used programming language for relational databases. For instance, when you create a WordPress post, the content is stored in a database, which is created and accessed using SQL. See http://en.wikipedia.org/wiki/SQL.
stopwords: words that are filtered from a textual data set before it is processed. Stopword lists generally include common function words (e.g., “the,” “and,” “a”).
structured data: data that follows a system of organization that makes it easier for the computer to manipulate it (e.g., XML files, databases).
SVG (Scalable Vector Graphic): XML-based vector image. These can be edited in some image-editing programs, like Adobe Illustrator, and then exported for use on the web.
TMS (The Museum System): a collection management system for creating and managing metadata offered by GallerySystems.
Unstructured Data: free-form files with information that needs to be discovered and organized to be usable (e.g., PDFs, webpages, .doc files).
XML (EXtensible Markup Language): a file format to describe, transport, and store data/information. See W3schools on the difference between XML and HTML: http://www.w3schools.com/xml/xml_whatis.asp.
vaporware: hardware or software which is proposed, announced, and never actually exists.
web hosting service: there are numerous ways to publish content to the internet. Most of the websites you visit or create will use one the following types:
- Free web hosting service: offered by different companies with limited services, sometimes supported by advertisements, and often limited when compared to paid hosting. For example, WordPress offers free blogs with limited capabilities.
- Managed hosting service: the user gets his or her own Web server but is not allowed full control over it; however, they are allowed to manage their data via FTP or other remote management tools. For example, bluehost offers server space where users can install their own management systems and publish content.
The difference is important: free WordPress blogs are limited, but easy to use. Access to your own server space is flexible and capable, but requires payment and more skill to manage.
WYSIWYG: “What You See Is What You Get” editors provide a toolbar at the top of the text box that allows you to change the formatting of the content. They provide an alternative to tag- and code-based formatting.