|
|
Digitising History
  |
|
|
7.1 Glossary A widely used relational database system part of the Microsoft Office suite. Arts and Humanities Data Service. American Standard Code for Information Interchange. An international standard allowing computers to exchange and display character-based data. Most relational database systems can import and export ASCII data in a number of delimited ASCII data formats. Data that do not adhere to a character coding scheme such as ASCII or EBCDIC. Binary format data can only be read with specific software packages usually on specific computer platforms. BitMaP. A widely used graphic image format common in Windows applications. See record. Terminology usually associated with statistical analysis. Central Computer and Telecommunications Agency http://www.ccta.gov.uk/cctahome.htm Census Enumerators Books A popular method of associating textual values with numeric codes. Most important for classifying complex data in order to facilitate meaningful analysis. The most common coding framework in historical data (and contemporary data) applies to the classification and categorisation of occupations. Also known as CSV or Comma Separated Values. An ASCII data format where each field in a record is separated by a comma character. The longest 'distance' or time to complete a project. If any activity on the critical path takes longer than planned, the end date of the project slips accordingly. Refers to the way in which the contents of a data file are organised. Many data formats are specific to one software application. Comma separated and tab delimited ASCII are common formats which can be read by many analysis packages and into standard text editors. Most desktop database systems are able to read data from a number of formats. The theoretical model by which data are structured. Common data models include relational, network, hierarchical and object-oriented. Data modelling is a methodology for structuring data for use in a database system. See also ERM and RDA. A database system term that is used to define the characteristic of a particular field. The system will then expect all values in this field to conform to given rules on this type, i.e. numeric types cannot contain word characters. A generic term commonly used to describe a structured collection of data. Databases can take many forms including unstructured full text, images, maps, statistics or a mixture of data sources. For this guide, the term database refers to essentially tabular data containing text and numbers. The software application that the user employs to define, create, manage and analyse a database. This guide is concerned with relational database systems primarily, although these are by no means the only type of database system software available. A widely used relational database system. An ASCII data format where each field in a record is separated by a specified character e.g. comma or tab. An item which the project has to create as part of the requirements. It may be part of the final outcome or an intermediate element on which one or more subsequent deliverables are dependent. According to the type of project, another name for a deliverable is 'product'. Entity Relationship Modelling. A methodology for designing a database. Used to specify what entities are of interest and in what ways these entities are linked. A finalised entity will form a table in the database itself and its attributes will form fields. A widely used spreadsheet package part of the Microsoft Office suite. A facility provided by most database systems to convert or export data into a different data format. A precise element of information within a table taking the form of a column. Sometimes referred to as a variable in statistical analysis. An ASCII data format where each field in a record begins at a set column position and has a set width. Describes data that are structured (modelled) into one single table. A common on-screen interface to databases allowing users to enter data, maintain data and query data within the database system. Often used to avoid complex SQL commands and to automate some database processes. A widely used relational database system. File Transfer Protocol Graphics Interchange Format. A widely used graphic image format owned by CompuServe. Geographical Information System. A database system that links digital maps with attribute data, usually tabular in structure. Provides powerful and complex tools to visualise and analyse data in a spatial context. History Data Service HyperText Markup Language. A widely used document format. HTML uses tags as part of a general framework for describing a document structure primarily for the World Wide Web. A facility provided by most database systems to convert or import data into the database system’s standard data format. A widely used spreadsheet package part of the Lotus SmartSuite. The storage medium used to hold data. Common examples include CD-ROM, DAT, DVD-ROM, floppy disk, hard disk and magnetic tape. A methodology used by historians in order to fit their sources into a structured data model, usually resulting in some loss or restructuring of information. Commonly associated with the application of the relational data model. A term used mainly within database systems, SQL and programming languages, to indicate that a field has no value. This is subtly, but importantly different, from assigning a value which means 'blank', 'empty' or 'no data'. A widely used relational database system. Portable Document Format. A widely used document format promoted by Adobe. A term that defines both the operating system of the computer and its hardware base, usually referring to the central processing unit. For example, the most common computing platform today is Windows/Intel Pentium. Portable Network Graphics. Pronounced 'ping'. A relatively new graphic image format. PRojects IN Controlled Environments. A method which supports some selected aspects of project management. A widely used spreadsheet package. Relational Data Analysis. A methodology for designing a database from the bottom-up. Used to isolate each discrete item of information which are then normalised and restructured to form entities. Designed to remove data duplication and data redundancy. Relational Database Management System. See database system. A complete item of information within a table taking the form of a row. Contains one or more descriptive fields and is usually a list of subjects, e.g. people, places, goods, items, etc. Refers to the computer-based procedure of linking records from (often) different sources. The most common example is nominal record-linkage which is a methodology for linking people across time from different sources, based on their names and other attributes. Popular with prosopographers. Describes a particular type of data model which structures data into individual tables, each made up of fields which are linked together (related) through a system of key fields. The chance of exposure to the adverse consequences of future events. Rich Text Format. A widely used document format. A widely used statistical package. Export format for SAS. Standard Generalised Mark-up Language. A widely used document format. SGML uses tags as part of a general framework for describing a document structure. A widely used statistical package. Export format for SIR. A series of SIR commands which define a database's structure. In relation to project activities, slack or slack time means that there is more time available to complete an activity than needed. Activities with slack are not on the critical path. A methodology used by historians to create databases that are able to represent every single aspect of the original source material without loss. Often associated with the concept of the critical edition advocated by Manfred Thaller. The database system KLEIO is commonly understood to be source-oriented. Still Picture Interchange File Format. A relatively new graphic image format. A widely used statistical package. Export format for SPSS which is the best way to transfer SPSS data files from one type of machine to another. Variable names, variable labels, value labels, formats, and missing-value specifications are saved along with the data. A series of SPSS commands which define a database's structure. Structured Query Language. A computer language developed for use with relational database systems. Has an extensive set of commands that allow the user to define, manage and analyse/query the database. Most desktop applications now include an SQL component. A series of SQL commands which define a database's structure. A division of a project for management purposes. According to PRINCE, a 'Project Board' approves the project to proceed one stage at a time. A widely used statistical package. Export format for Stata. An ASCII data format where each field in a record is separated (delimited) by a tab character. A complete table of information within a database. Tagged Interchange File Format. TIF (PC) or TIFF (Macintosh). A widely used graphic image format. See field. A widely used word-processing package part of the Microsoft Office suite. A widely used word-processing package. |
|