Content - 3733a9b7160ccef2f9fa32e2dc9701b9aa25501f - 9f70618/TS.html

visit type:
Tip revision: 347c64fac3fa1a64ade0d8d1842813d4b8f7acec authored by Hugh Cayless on 12 May 2017, 16:57:59 UTC
Updates.
Tip revision: 347c64f
TS.html

<!DOCTYPE html
  SYSTEM "about:legacy-compat">
<html xmlns="http://www.w3.org/1999/xhtml" lang="en"><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8" /><!--THIS FILE IS GENERATED FROM AN XML MASTER. DO NOT EDIT (4)--><title>8 Transcriptions of Speech - The TEI Guidelines</title><meta property="Language" content="en" /><meta property="DC.Title" content="8 Transcriptions of Speech - The TEI Guidelines" /><meta property="DC.Language" content="SCHEME=iso639 en" /><meta property="DC.Creator.Address" content="tei@oucs.ox.ac.uk" /><meta charset="utf-8" /><link href="guidelines.css" rel="stylesheet" type="text/css" /><link href="odd.css" rel="stylesheet" type="text/css" /><link rel="stylesheet" media="print" type="text/css" href="guidelines-print.css" /><script type="text/javascript" src="jquery-1.2.6.min.js"></script><script type="text/javascript" src="columnlist.js"></script><script type="text/javascript" src="popupFootnotes.js"></script><script type="text/javascript">
        $(function() {
         $('ul.attrefs-class').columnizeList({cols:3,width:30,unit:'%'});
         $('ul.attrefs-element').columnizeList({cols:3,width:30,unit:'%'});
         $(".displayRelaxButton").click(function() {
           $(this).parent().find('.RNG_XML').toggle();
           $(this).parent().find('.RNG_Compact').toggle();
         });
         $(".tocTree .showhide").click(function() {
          $(this).find(".tocShow,.tocHide").toggle();
          $(this).parent().find("ul.continuedtoc").toggle();
	  });
        })
    </script><script type="text/javascript"><!--
var displayXML=0;
states=new Array()
states[0]="element-a"
states[1]="element-b"
states[2]="element-c"
states[3]="element-d"
states[4]="element-e"
states[5]="element-f"
states[6]="element-g"
states[7]="element-h"
states[8]="element-i"
states[9]="element-j"
states[10]="element-k"
states[11]="element-l"
states[12]="element-m"
states[13]="element-n"
states[14]="element-o"
states[15]="element-p"
states[16]="element-q"
states[17]="element-r"
states[18]="element-s"
states[19]="element-t"
states[20]="element-u"
states[21]="element-v"
states[22]="element-w"
states[23]="element-x"
states[24]="element-y"
states[25]="element-z"

function startUp() {

}

function hideallExcept(elm) {
for (var i = 0; i < states.length; i++) {
 var layer;
 if (layer = document.getElementById(states[i]) ) {
  if (states[i] != elm) {
    layer.style.display = "none";
  }
  else {
   layer.style.display = "block";
      }
  }
 }
 var mod;
 if ( mod = document.getElementById('byMod') ) {
     mod.style.display = "none";
 }
}

function showall() {
 for (var i = 0; i < states.length; i++) {
   var layer;
   if (layer = document.getElementById(states[i]) ) {
      layer.style.display = "block";
      }
  }
}

function showByMod() {
  hideallExcept('');
  var mod;
  if (mod = document.getElementById('byMod') ) {
     mod.style.display = "block";
     }
}

	--></script></head><body><div id="container"><div id="banner"><img src="Images/banner.jpg" alt="Text Encoding Initiative logo and banner" /></div></div><div class="mainhead"><h1>P5: 
    Guidelines for Electronic Text Encoding and Interchange</h1><p>Version 3.1.1a. Last updated on
	10th May 2017, revision bd8dda3</p></div><div id="onecol" class="main-content"><h2><span class="headingNumber">8 </span>Transcriptions of Speech</h2><div class="div1" id="TS"><div class="miniTOC miniTOC_left"><p><span class="subtochead">Table of contents</span></p><div class="subtoc"><ul class="subtoc"><li class="subtoc"><a class="subtoc" href="TS.html#TSOV" title="General Considerations and Overview">8.1 General Considerations and Overview</a></li><li class="subtoc"><a class="subtoc" href="TS.html#HD32" title="Documenting the Source of Transcribed Speech">8.2 Documenting the Source of Transcribed Speech</a></li><li class="subtoc"><a class="subtoc" href="TS.html#TSBA" title="Elements Unique to Spoken Texts">8.3 Elements Unique to Spoken Texts</a></li><li class="subtoc"><a class="subtoc" href="TS.html#TSSA" title="Elements Defined Elsewhere">8.4 Elements Defined Elsewhere</a></li><li class="subtoc"><a class="subtoc" href="TS.html#index-body.1_div.8_div.5">8.5 Module for Transcribed Speech</a></li></ul></div><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="DR.html"><span class="headingNumber">7 </span>Performance Texts</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="DI.html"><span class="headingNumber">9 </span>Dictionaries</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><p>The module described in this chapter is intended for use with a wide variety of transcribed spoken material. It should be stressed, however, that the present proposals are not intended to support unmodified every variety of research undertaken upon spoken material now or in the future; some discourse analysts, some phonologists, and doubtless others may wish to extend the scheme presented here to express more precisely the set of distinctions they wish to draw in their transcriptions. Speech regarded as a purely acoustic phenomenon may well require different methods from those outlined here, as may speech regarded solely as a process of social interaction. </p><p>This chapter begins with a discussion of some of the problems commonly encountered in transcribing spoken language (section <a class="link_ptr" href="TS.html#TSOV" title="General Considerations and Overview"><span class="headingNumber">8.1 </span>General Considerations and Overview</a>). Section <a class="link_ptr" href="TS.html#HD32" title="Documenting the Source of Transcribed Speech"><span class="headingNumber">8.2 </span>Documenting the Source of Transcribed Speech</a> documents some additional TEI header elements which may be used to document the recording or other source from which transcribed text is taken. Section <a class="link_ptr" href="TS.html#TSBA" title="Elements Unique to Spoken Texts"><span class="headingNumber">8.3 </span>Elements Unique to Spoken Texts</a> describes the basic structural elements provided by this module. Finally, section <a class="link_ptr" href="TS.html#TSSA" title="Elements Defined Elsewhere"><span class="headingNumber">8.4 </span>Elements Defined Elsewhere</a> of this chapter reviews further problems specific to the encoding of spoken language, demonstrating how mechanisms and elements discussed elsewhere in these Guidelines may be applied to them.</p><div class="div2" id="TSOV"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#HD32"><span class="headingNumber">8.2 </span>Documenting the Source of Transcribed Speech</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h3><span class="bookmarklink"><a class="bookmarklink" href="#TSOV" title="link to this section "><span class="invisible">TEI: General Considerations and Overview</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.1 </span><span class="head">General Considerations and Overview</span></h3><p>There is great variation in the ways different researchers have chosen to represent speech using the written medium.<span id="Note69_return"><a class="notelink" title="For a discussion of several of these see ; ; and ." href="#Note69"><sup>32</sup></a></span> This reflects the special difficulties which apply to the encoding or <span class="noindex">transcription</span> of speech. Speech varies according to a large number of dimensions, many of which have no counterpart in writing (for example, tempo, loudness, pitch, etc.). The audibility of speech recorded in natural communication situations is often less than perfect, affecting the accuracy of the transcription. Spoken material may be transcribed in the course of linguistic, acoustic, anthropological, psychological, ethnographic, journalistic, or many other types of research. Even in the same field, the interests and theoretical perspectives of different transcribers may lead them to prefer different levels of detail in the transcript and different styles of visual display. The production and comprehension of speech are intimately bound up with the situation in which speech occurs, far more so than is the case for written texts. A speech transcript must therefore include some contextual features; determining which are relevant is not always simple. Moreover, the ethical problems in recording and making public what was produced in a private setting and intended for a limited audience are more frequently encountered in dealing with spoken texts than with written ones.</p><p>Speech also poses difficult structural problems. Unlike a written text, a speech event takes place in time. Its beginning and end may be hard to determine and its internal composition difficult to define. Most researchers agree that the utterances or <span class="term">turns</span> of individual speakers form an important structural component in most kinds of speech, but these are rarely as well-behaved (in the structural sense) as paragraphs or other analogous units in written texts: speakers frequently interrupt each other, use gestures as well as words, leave remarks unfinished and so on. Speech itself, though it may be represented as words, frequently contains items such as vocalized pauses which, although only semi-lexical, have immense importance in the analysis of spoken text. Even non-vocal elements such as gestures may be regarded as forming a component of spoken text for some analytic purposes. Below the level of the individual utterance, speech may be segmented into units defined by phonological, prosodic, or syntactic phenomena; no clear agreement exists, however, even as to appropriate names for such segments.</p><p>Spoken texts transcribed according to the guidelines presented here are organized as follows. The overall structure of a TEI spoken text is identical to that of any other TEI text: the <a class="gi" title="(TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resourceLike class. Multiple &lt;TEI&gt; elements may be combined to form a &lt;teiCorpus&gt; element." href="ref-TEI.html">TEI</a> element for a spoken text contains a <a class="gi" title="(TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources." href="ref-teiHeader.html">teiHeader</a> element, followed by a <a class="gi" title="contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample." href="ref-text.html">text</a> element. Even texts primarily composed of transcribed speech may also include conventional front and back matter, and may even be organized into divisions like printed texts.</p><p>We may say, therefore, that these Guidelines regard transcribed speech as being composed of arbitrary high-level units called <span class="noindex">texts</span>. A spoken <a class="gi" title="contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample." href="ref-text.html">text</a> might typically be a conversation between a small number of people, a lecture, a broadcast TV item, or a similar event. Each such unit has associated with it a <a class="gi" title="(TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources." href="ref-teiHeader.html">teiHeader</a> providing detailed contextual information such as the source of the transcript, the identity of the participants, whether the speech is scripted or spontaneous, the physical and social setting in which the discourse takes place and a range of other aspects. Details of the header in general are provided in chapter <a class="link_ptr" href="HD.html" title="5"><span class="headingNumber">2 </span>The TEI Header</a>; the particular elements it provides for use with spoken texts are described below (<a class="link_ptr" href="TS.html#HD32" title="Documenting the Source of Transcribed Speech"><span class="headingNumber">8.2 </span>Documenting the Source of Transcribed Speech</a>). Details concerning additional elements which may be used for the documentation of participant and contextual information are given in <a class="link_ptr" href="CC.html#CCAH" title="Contextual Information"><span class="headingNumber">15.2 </span>Contextual Information</a>.</p><p>Defining the bounds of a spoken text is frequently a matter of arbitrary convention or convenience. In public or semi-public contexts, a text may be regarded as synonymous with, for example, a <span class="noindex">lecture</span>, a <span class="noindex">broadcast item</span>, a <span class="noindex">meeting</span>, etc. In informal or private contexts, a text may be simply a conversation involving a specific group of participants. Alternatively, researchers may elect to define spoken texts solely in terms of their duration in time or length in words. By default, these Guidelines assume of a text only that: </p><ul class="bulleted"><li class="item">it is internally cohesive,</li><li class="item">it is describable by a single header, and</li><li class="item">it represents a single stretch of time with no significant discontinuities.</li></ul><p> Deviation from these assumptions may be specified (for example, the <span class="att">org</span> attribute on the <a class="gi" title="contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample." href="ref-text.html">text</a> element may take the value <span class="val">compos</span> to specify that the components of the text are discrete) but is not recommended.</p><p>Within a <a class="gi" title="contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample." href="ref-text.html">text</a> it may be necessary to identify subdivisions of various kinds, if only for convenience of handling. The neutral <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a> element discussed in section <a class="link_ptr" href="DS.html#DSDIV" title="Divisions of the Body"><span class="headingNumber">4.1 </span>Divisions of the Body</a> is recommended for this purpose. It may be found useful also for representing subdivisions relating to discourse structure, speech act theory, transactional analysis, etc., provided only that these divisions are hierarchically well-behaved. Where they are not, as is often the case, the mechanisms discussed in chapters <a class="link_ptr" href="SA.html" title="14"><span class="headingNumber">16 </span>Linking, Segmentation, and Alignment</a> and <a class="link_ptr" href="NH.html" title="31"><span class="headingNumber">20 </span>Non-hierarchical Structures</a> may be used.</p><p>A spoken text may contain any of the following components: </p><ul class="bulleted"><li class="item">utterances</li><li class="item">pauses</li><li class="item">vocalized but non-lexical phenomena such as coughs</li><li class="item">kinesic (non-verbal, non-lexical) phenomena such as gestures</li><li class="item">entirely non-linguistic incidents occurring during and possibly influencing the course of speech</li><li class="item">writing, regarded as a special class of incident in that it can be transcribed, for example captions or overheads displayed during a lecture</li><li class="item">shifts or changes in vocal quality</li></ul><p>Elements to represent all of these features of spoken language are discussed in section <a class="link_ptr" href="TS.html#TSBA" title="Elements Unique to Spoken Texts"><span class="headingNumber">8.3 </span>Elements Unique to Spoken Texts</a> below.</p><p>An utterance (tagged <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a>) may contain lexical items interspersed with pauses and non-lexical vocal sounds; during an utterance, non-linguistic incidents may occur and written materials may be presented. The <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element can thus contain any of the other elements listed, interspersed with a transcription of the lexical items of the utterance; the other elements may all appear between utterances or next to each other, but except for <a class="gi" title="contains a passage of written text revealed to participants in the course of a spoken text." href="ref-writing.html">writing</a> they do not contain any other elements nor any data.</p><p>A spoken text itself may be without substructure, that is, it may consist simply of units such as utterances or pauses, not grouped together in any way, or it may be subdivided. If the notion of what constitutes a ‘text’ in spoken discourse is inevitably rather an arbitrary one, the notion of formal subdivisions within such a ‘text’ may appear even more debatable. Nevertheless, such divisions may be useful for such types of discourse as debates, broadcasts, etc., where structural subdivisions can easily be identified, or more generally wherever it is desired to aggregate utterances or other parts of a transcript into units smaller than a complete ‘text’. Examples might include <span class="q">‘conversations’</span> or <span class="q">‘discourse fragments’</span>, or more narrowly, <span class="q">‘that part of the conversation where topic x was discussed’</span>, provided only that the set of all such divisions is coextensive with the text.</p><div class="p">Each such division of a spoken text should be represented by the numbered or unnumbered <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a> elements defined in chapter <a class="link_ptr" href="DS.html" title="7"><span class="headingNumber">4 </span>Default Text Structure</a>. For some detailed kinds of analysis a hierarchy of such divisions may be found useful; nested <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a> elements may be used for this purpose, as in the following example showing how a collection made up of transcribed ‘sound bites’ taken from speeches given by a politician on different occasions might be encoded. Each extract is regarded as a distinct <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a>, nested within a single composite <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a> as follows: <div id="index-egXML-d52e67990" class="pre egXML_valid"><span class="element">&lt;div <span class="attribute">type</span>="<span class="attributevalue">soundbites</span>"<br /> <span class="attribute">subtype</span>="<span class="attributevalue">conservative</span>" <span class="attribute">org</span>="<span class="attributevalue">composite</span>"&gt;</span><br /> <span class="element">&lt;div <span class="attribute">sample</span>="<span class="attributevalue">medial</span>"/&gt;</span><br /> <span class="element">&lt;div <span class="attribute">sample</span>="<span class="attributevalue">medial</span>"/&gt;</span><br /> <span class="element">&lt;div <span class="attribute">sample</span>="<span class="attributevalue">initial</span>"/&gt;</span><br /><span class="element">&lt;/div&gt;</span></div></div><p>As a member of the class <a class="link_odd" title="provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element." href="ref-att.declaring.html">att.declaring</a>, the <a class="gi" title="(text division) contains a subdivision of the front, body, or back of a text." href="ref-div.html">div</a> element may also carry a <span class="att">decls</span> attribute, for use where the divisions of a text do not all share the same set of the contextual declarations specified in the TEI header. (See further section <a class="link_ptr" href="CC.html#CCAS" title="Associating Contextual Information with a Text"><span class="headingNumber">15.3 </span>Associating Contextual Information with a Text</a>).</p></div><div class="div3" id="HD32"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSOV"><span class="headingNumber">8.1 </span>General Considerations and Overview</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSBA"><span class="headingNumber">8.3 </span>Elements Unique to Spoken Texts</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h3><span class="bookmarklink"><a class="bookmarklink" href="#HD32" title="link to this section "><span class="invisible">TEI: Documenting the Source of Transcribed Speech</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.2 </span><span class="head">Documenting the Source of Transcribed Speech</span></h3><p>Where a computer file is derived from a spoken text rather than a written one, it will usually be desirable to record additional information about the recording or broadcast which constitutes its source. Several additional elements are provided for this purpose within the source description component of the TEI header: </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-scriptStmt.html">scriptStmt</a></span> (script statement) contains a citation giving details of the script used for a spoken text.</li><li><span class="specList-elementSpec"><a href="ref-recordingStmt.html">recordingStmt</a></span> (recording statement) describes a set of recordings used as the basis for transcription of a spoken text.</li><li><span class="specList-elementSpec"><a href="ref-recording.html">recording</a></span> (recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast.<table class="specDesc"><tr><td class="Attribute"><span class="att">type</span></td><td>the kind of recording.</td></tr></table></li><li><span class="specList-elementSpec"><a href="ref-transcriptionDesc.html">transcriptionDesc</a></span> describes the set of transcription conventions used, particularly for spoken material.<table class="specDesc"><tr><td class="Attribute"><span class="att">ident</span></td><td>supplies an identifier for the encoding convention, independent of any version number.</td></tr><tr><td class="Attribute"><span class="att">version</span></td><td>supplies a version number for the encoding conventions used, if any.</td></tr></table></li></ul><p> As a member of the <a class="link_odd" title="provides attributes for normalization of elements that contain datable events." href="ref-att.duration.html">att.duration</a> class, the <a class="gi" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> element inherits the following attribute: </p><ul class="specList"><li><span class="specList-classSpec"><a href="ref-att.duration.w3c.html">att.duration.w3c</a></span> provides attributes for recording normalized temporal durations.<table class="specDesc"><tr><td class="Attribute"><span class="att">dur</span></td><td>(duration) indicates the length of this element in time.</td></tr></table></li></ul><p>Note that detailed information about the participants or setting of an interview or other transcript of spoken language should be recorded in the appropriate division of the profile description, discussed in chapter <a class="link_ptr" href="CC.html" title="23"><span class="headingNumber">15 </span>Language Corpora</a>, rather than as part of the source description. The source description is used to hold information only about the source from which the transcribed speech was taken, for example, any script being read and any technical details of how the recording was produced. If the source was a previously-created transcript, it should be treated in the same way as any other source text.</p><div class="p">The <a class="gi" title="(script statement) contains a citation giving details of the script used for a spoken text." href="ref-scriptStmt.html">scriptStmt</a> element should be used where it is known that one or more of the participants in a spoken text is speaking from a previously prepared script. The script itself should be documented in the same way as any other written text, using one of the three citation tags mentioned above. Utterances or groups of utterances may be linked to the script concerned by means of the <span class="att">decls</span> attribute, described in section <a class="link_ptr" href="CC.html#CCAS" title="Associating Contextual Information with a Text"><span class="headingNumber">15.3 </span>Associating Contextual Information with a Text</a>. <div id="index-egXML-d52e68043" class="pre egXML_valid"><span class="element">&lt;sourceDesc&gt;</span><br /> <span class="element">&lt;scriptStmt <span class="attribute">xml:id</span>="<span class="attributevalue">CNN12</span>"&gt;</span><br />  <span class="element">&lt;bibl&gt;</span><br />   <span class="element">&lt;author&gt;</span>CNN Network News<span class="element">&lt;/author&gt;</span><br />   <span class="element">&lt;title&gt;</span>News headlines<span class="element">&lt;/title&gt;</span><br />   <span class="element">&lt;date <span class="attribute">when</span>="<span class="attributevalue">1991-06-12</span>"&gt;</span>12 Jun 91<span class="element">&lt;/date&gt;</span><br />  <span class="element">&lt;/bibl&gt;</span><br /> <span class="element">&lt;/scriptStmt&gt;</span><br /><span class="element">&lt;/sourceDesc&gt;</span></div></div><p>The <a class="gi" title="(recording statement) describes a set of recordings used as the basis for transcription of a spoken text." href="ref-recordingStmt.html">recordingStmt</a> is used to group together information relating to the recordings from which the spoken text was transcribed. The element may contain either a prose description or, more helpfully, one or more <a class="gi" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> elements, each corresponding with a particular recording. The linkage between utterances or groups of utterances and the relevant recording statement is made by means of the <span class="att">decls</span> attribute, described in section <a class="link_ptr" href="CC.html#CCAS" title="Associating Contextual Information with a Text"><span class="headingNumber">15.3 </span>Associating Contextual Information with a Text</a>.</p><p>The <a class="gi" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> element should be used to provide a description of how and by whom a recording was made. This information may be provided in the form of a prose description, within which such items as statements of responsibility, names, places, and dates may be identified using the appropriate phrase-level tags. Alternatively, a selection of elements from the <a class="link_odd" title="groups elements used to describe details of an audio or video recording." href="ref-model.recordingPart.html">model.recordingPart</a> class may be provided. This element class makes available the following elements: </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-date.html">date</a></span> contains a date in any format.</li><li><span class="specList-elementSpec"><a href="ref-time.html">time</a></span> contains a phrase defining a time of day in any format.</li><li><span class="specList-elementSpec"><a href="ref-respStmt.html">respStmt</a></span> (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work.</li><li><span class="specList-elementSpec"><a href="ref-equipment.html">equipment</a></span> provides technical details of the equipment and media used for an audio or video recording used as the source for a spoken text.</li><li><span class="specList-elementSpec"><a href="ref-broadcast.html">broadcast</a></span> describes a broadcast used as the source of a spoken text.</li></ul><div class="p">Specialized collections may wish to add further sub-elements to these major components. These elements should be used only for information relating to the recording process itself; information about the setting or participants (for example) is recorded elsewhere: see sections <a class="link_ptr" href="CC.html#CCAHSE" title="The Setting Description"><span class="headingNumber">15.2.3 </span>The Setting Description</a> and <a class="link_ptr" href="CC.html#CCAHPA" title="The Participant Description"><span class="headingNumber">15.2.2 </span>The Participant Description</a>. <div id="index-egXML-d52e68089" class="pre egXML_valid"><span class="element">&lt;recordingStmt&gt;</span><br /> <span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">video</span>"&gt;</span><br />  <span class="element">&lt;p&gt;</span>U-matic recording made by college audio-visual department staff,<br />       available as PAL-standard VHS transfer or sound-only cassette<span class="element">&lt;/p&gt;</span><br /> <span class="element">&lt;/recording&gt;</span><br /><span class="element">&lt;/recordingStmt&gt;</span></div> <div id="index-egXML-d52e68095" class="pre egXML_valid"><span class="element">&lt;recordingStmt&gt;</span><br /> <span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">audio</span>" <span class="attribute">dur</span>="<span class="attributevalue">P30M</span>"&gt;</span><br />  <span class="element">&lt;respStmt&gt;</span><br />   <span class="element">&lt;resp&gt;</span>Location recording by<span class="element">&lt;/resp&gt;</span><br />   <span class="element">&lt;name&gt;</span>Sound Services Ltd.<span class="element">&lt;/name&gt;</span><br />  <span class="element">&lt;/respStmt&gt;</span><br />  <span class="element">&lt;equipment&gt;</span><br />   <span class="element">&lt;p&gt;</span>Multiple close microphones mixed down to stereo Digital<br />         Audio Tape, standard play, 44.1 KHz sampling frequency<span class="element">&lt;/p&gt;</span><br />  <span class="element">&lt;/equipment&gt;</span><br />  <span class="element">&lt;date&gt;</span>12 Jan 1987<span class="element">&lt;/date&gt;</span><br /> <span class="element">&lt;/recording&gt;</span><br /><span class="element">&lt;/recordingStmt&gt;</span></div> <div id="index-egXML-d52e68109" class="pre egXML_valid"><span class="element">&lt;recordingStmt&gt;</span><br /> <span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">audio</span>" <span class="attribute">dur</span>="<span class="attributevalue">P15M</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">rec-3001</span>"&gt;</span><br />  <span class="element">&lt;date&gt;</span>14 Feb 2001<span class="element">&lt;/date&gt;</span><br /> <span class="element">&lt;/recording&gt;</span><br /> <span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">audio</span>" <span class="attribute">dur</span>="<span class="attributevalue">P15M</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">rec-3002</span>"&gt;</span><br />  <span class="element">&lt;date&gt;</span>17 Feb 2001<span class="element">&lt;/date&gt;</span><br /> <span class="element">&lt;/recording&gt;</span><br /> <span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">audio</span>" <span class="attribute">dur</span>="<span class="attributevalue">P15M</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">rec-3003</span>"&gt;</span><br />  <span class="element">&lt;date&gt;</span>22 Feb 2001<span class="element">&lt;/date&gt;</span><br /> <span class="element">&lt;/recording&gt;</span><br /><span class="element">&lt;/recordingStmt&gt;</span></div></div><div class="p">When a recording has been made from a public broadcast, details of the broadcast itself should be supplied within the <a class="gi" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> element, as a nested <a class="gi" title="describes a broadcast used as the source of a spoken text." href="ref-broadcast.html">broadcast</a> element. A broadcast is closely analogous to a publication and the <a class="gi" title="describes a broadcast used as the source of a spoken text." href="ref-broadcast.html">broadcast</a> element should therefore contain one or the other of the bibliographic citation elements <a class="gi" title="(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged." href="ref-bibl.html">bibl</a>, <a class="gi" title="(structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements appear and in a specified order." href="ref-biblStruct.html">biblStruct</a>, or <a class="gi" title="(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present." href="ref-biblFull.html">biblFull</a>. The broadcasting agency responsible for a broadcast is regarded as its author, while other participants (for example interviewers, interviewees, script writers, directors, producers, etc.) should be specified using the <a class="gi" title="(statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work." href="ref-respStmt.html">respStmt</a> or <a class="gi" title="contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc." href="ref-editor.html">editor</a> element with an appropriate <a class="gi" title="(responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work." href="ref-resp.html">resp</a> (see further section <a class="link_ptr" href="CO.html#COBI" title="Bibliographic Citations and References"><span class="headingNumber">3.11 </span>Bibliographic Citations and References</a>). <div id="index-egXML-d52e68152" class="pre egXML_valid"><span class="element">&lt;recording <span class="attribute">type</span>="<span class="attributevalue">audio</span>" <span class="attribute">dur</span>="<span class="attributevalue">P10M</span>"&gt;</span><br /> <span class="element">&lt;equipment&gt;</span><br />  <span class="element">&lt;p&gt;</span>Recorded from FM Radio to digital tape<span class="element">&lt;/p&gt;</span><br /> <span class="element">&lt;/equipment&gt;</span><br /> <span class="element">&lt;broadcast&gt;</span><br />  <span class="element">&lt;bibl&gt;</span><br />   <span class="element">&lt;title&gt;</span>Interview on foreign policy<span class="element">&lt;/title&gt;</span><br />   <span class="element">&lt;author&gt;</span>BBC Radio 5<span class="element">&lt;/author&gt;</span><br />   <span class="element">&lt;respStmt&gt;</span><br />    <span class="element">&lt;resp&gt;</span>interviewer<span class="element">&lt;/resp&gt;</span><br />    <span class="element">&lt;name&gt;</span>Robin Day<span class="element">&lt;/name&gt;</span><br />   <span class="element">&lt;/respStmt&gt;</span><br />   <span class="element">&lt;respStmt&gt;</span><br />    <span class="element">&lt;resp&gt;</span>interviewee<span class="element">&lt;/resp&gt;</span><br />    <span class="element">&lt;name&gt;</span>Margaret Thatcher<span class="element">&lt;/name&gt;</span><br />   <span class="element">&lt;/respStmt&gt;</span><br />   <span class="element">&lt;series&gt;</span><br />    <span class="element">&lt;title&gt;</span>The World Tonight<span class="element">&lt;/title&gt;</span><br />   <span class="element">&lt;/series&gt;</span><br />   <span class="element">&lt;note&gt;</span>First broadcast on <span class="element">&lt;date <span class="attribute">when</span>="<span class="attributevalue">1989-11-27</span>"&gt;</span>27 Nov 1989<span class="element">&lt;/date&gt;</span><span class="element">&lt;/note&gt;</span><br />  <span class="element">&lt;/bibl&gt;</span><br /> <span class="element">&lt;/broadcast&gt;</span><br /><span class="element">&lt;/recording&gt;</span></div></div><div class="p">When a broadcast contains several distinct recordings (for example a compilation), additional <a class="gi" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> elements may be further nested within the <a class="gi" title="describes a broadcast used as the source of a spoken text." href="ref-broadcast.html">broadcast</a> element. <div id="index-egXML-d52e68188" class="pre egXML_valid"><span class="element">&lt;recording <span class="attribute">dur</span>="<span class="attributevalue">P100M</span>"&gt;</span><br /> <span class="element">&lt;broadcast&gt;</span><br />  <span class="element">&lt;recording/&gt;</span><br /> <span class="element">&lt;/broadcast&gt;</span><br /><span class="element">&lt;/recording&gt;</span></div>  </div><div class="p">The <a class="gi" title="describes the set of transcription conventions used, particularly for spoken material." href="ref-transcriptionDesc.html">transcriptionDesc</a> element can be used to document the particular transcription conventions (use of space, punctuation, special characters etc.) used in making the transcription. A number of sets of such conventions have been defined within particular research communities, or by users of particular transcription tools. The attributes <span class="att">ident</span> and <span class="att">version</span> may be used to refer to such conventions in a machine tractable way, where this is appropriate. <div id="index-egXML-d52e68208" class="pre egXML_valid"><span class="element">&lt;transcriptionDesc <span class="attribute">ident</span>="<span class="attributevalue">HIAT</span>"<br /> <span class="attribute">version</span>="<span class="attributevalue">2004</span>"/&gt;</span></div></div></div><div class="div2" id="TSBA"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#HD32"><span class="headingNumber">8.2 </span>Documenting the Source of Transcribed Speech</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSSA"><span class="headingNumber">8.4 </span>Elements Defined Elsewhere</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h3><span class="bookmarklink"><a class="bookmarklink" href="#TSBA" title="link to this section "><span class="invisible">TEI: Elements Unique to Spoken Texts</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3 </span><span class="head">Elements Unique to Spoken Texts</span></h3><p>The following elements characterize spoken texts, transcribed according to these Guidelines: </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-u.html">u</a></span> (utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker.</li><li><span class="specList-elementSpec"><a href="ref-pause.html">pause</a></span> marks a pause either between or within utterances.</li><li><span class="specList-elementSpec"><a href="ref-vocal.html">vocal</a></span> marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc.</li><li><span class="specList-elementSpec"><a href="ref-kinesic.html">kinesic</a></span> marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc.</li><li><span class="specList-elementSpec"><a href="ref-incident.html">incident</a></span> marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication.</li><li><span class="specList-elementSpec"><a href="ref-writing.html">writing</a></span> contains a passage of written text revealed to participants in the course of a spoken text.</li><li><span class="specList-elementSpec"><a href="ref-shift.html">shift</a></span> marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes.</li></ul><p>The <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element may appear directly within a spoken text, and may contain any of the others; the others may also appear directly (for example, a <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a> may appear between two utterances) but cannot contain a <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element. In terms of the basic TEI model, therefore, we regard the <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element as analogous to a paragraph, and the others as analogous to ‘phrase’ elements, but with the important difference that they can exist either as siblings or as children of utterances. The class <a class="link_odd" title="groups elements structurally analogous to paragraphs within spoken texts." href="ref-model.divPart.spoken.html">model.divPart.spoken</a> provides the <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element; the class <a class="link_odd" title="groups elements which may appear globally within spoken texts." href="ref-model.global.spoken.html">model.global.spoken</a> provides the six other elements listed above.</p><p>As members of the <a class="link_odd" title="provides attributes for elements representing speech or action that can be ascribed to a specific individual." href="ref-att.ascribed.html">att.ascribed</a> class, all of these elements share the following attribute: </p><ul class="specList"><li><span class="specList-classSpec"><a href="ref-att.ascribed.html">att.ascribed</a></span> provides attributes for elements representing speech or action that can be ascribed to a specific individual.<table class="specDesc"><tr><td class="Attribute"><span class="att">who</span></td><td>indicates the person, or group of people, to whom the element content is ascribed.</td></tr></table></li></ul><p> As members of the <a class="link_odd" title="provides attributes which can be used to classify or subclassify elements in any way." href="ref-att.typed.html">att.typed</a>, <a class="link_odd" title="provides attributes common to those elements which have a duration in time, expressed either absolutely or by reference to an alignment map." href="ref-att.timed.html">att.timed</a> and <a class="link_odd" title="provides attributes for normalization of elements that contain datable events." href="ref-att.duration.html">att.duration</a> classes, all of these elements except <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> share the following attribute: </p><ul class="specList"><li><span class="specList-classSpec"><a href="ref-att.typed.html">att.typed</a></span> provides attributes which can be used to classify or subclassify elements in any way.<table class="specDesc"><tr><td class="Attribute"><span class="att">type</span></td><td>characterizes the element in some sense, using any convenient classification scheme or typology.</td></tr><tr><td class="Attribute"><span class="att">subtype</span></td><td>provides a sub-categorization of the element, if needed</td></tr></table></li><li><span class="specList-classSpec"><a href="ref-att.timed.html">att.timed</a></span> provides attributes common to those elements which have a duration in time, expressed either absolutely or by reference to an alignment map.<table class="specDesc"><tr><td class="Attribute"><span class="att">start</span></td><td>indicates the location within a temporal alignment at which this element begins.</td></tr><tr><td class="Attribute"><span class="att">end</span></td><td>indicates the location within a temporal alignment at which this element ends.</td></tr></table></li><li><span class="specList-classSpec"><a href="ref-att.duration.w3c.html">att.duration.w3c</a></span> provides attributes for recording normalized temporal durations.<table class="specDesc"><tr><td class="Attribute"><span class="att">dur</span></td><td>(duration) indicates the length of this element in time.</td></tr></table></li></ul><p>Each of these elements is further discussed and specified in sections <a class="link_ptr" href="TS.html#TSBAUT" title="Utterances"><span class="headingNumber">8.3.1 </span>Utterances</a> to <a class="link_ptr" href="TS.html#TSBAWR" title="Writing"><span class="headingNumber">8.3.4 </span>Writing</a>.</p><p>We can show the relationship between four of these constituents of speech using the features <span class="mentioned">eventive</span>, <span class="mentioned">communicative</span>, <span class="mentioned">anthropophonic</span> (for sounds produced by the human vocal apparatus), and <span class="mentioned">lexical</span>: </p><div class="table"><table><tr class="label"><td> </td><td>eventive</td><td>communicative</td><td>anthropophonic</td><td>lexical</td></tr><tr><td>incident</td><td>+</td><td>-</td><td>-</td><td>-</td></tr><tr><td>kinesic</td><td>+</td><td>+</td><td>-</td><td>-</td></tr><tr><td>vocal</td><td>+</td><td>+</td><td>+</td><td>-</td></tr><tr><td>utterance</td><td>+</td><td>+</td><td>+</td><td>+</td></tr></table></div><p> The differences are not always clear-cut. Among <span class="noindex">incidents</span> might be included actions like slamming the door, which can certainly be communicative. <span class="noindex">Vocals</span> include coughing and sneezing, which are usually involuntary noises. Equally, the distinction between utterances and vocals is not always clear, although for many analytic purposes it will be convenient to regard them as distinct. Individual scholars may differ in the way borderlines are drawn and should declare their definitions in the <a class="gi" title="(editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text." href="ref-editorialDecl.html">editorialDecl</a> element of the header (see <a class="link_ptr" href="HD.html#HD53" title="The Editorial Practices Declaration"><span class="headingNumber">2.3.3 </span>The Editorial Practices Declaration</a>).</p><div class="p">The following short extract exemplifies several of these elements. It is recoded from a text originally transcribed in the CHILDES format.<span id="Note70_return"><a class="notelink" title="The original is a conversation between two children and their parents, recorded in 1987, and discussed in" href="#Note70"><sup>33</sup></a></span> Each utterance is encoded using a <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element (see section <a class="link_ptr" href="TS.html#TSBAUT" title="Utterances"><span class="headingNumber">8.3.1 </span>Utterances</a>). The speakers are defined using the <a class="gi" title="(list of persons) contains a list of descriptions, each of which provides information about an identifiable person or a group of people, for example the participants in a language interaction, or the people referred to in a historical source." href="ref-listPerson.html">listPerson</a> element discussed in <a class="link_ptr" href="ND.html#NDPERSE" title="The Person Element"><span class="headingNumber">13.3.2 </span>The Person Element</a> and each is given a unique identifier also used to identify their speech. Pauses marked by the transcriber are indicated using the <a class="gi" title="marks a pause either between or within utterances." href="ref-pause.html">pause</a> element (see section <a class="link_ptr" href="TS.html#TSBAPA" title="Pausing"><span class="headingNumber">8.3.2 </span>Pausing</a>). Non-verbal vocal effects such as the child's meowing are indicated either with orthographic transcriptions or with the <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a> element, and entirely non-linguistic but significant incidents such as the sound of the toy cat are represented by the <a class="gi" title="marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication." href="ref-incident.html">incident</a> elements (see section <a class="link_ptr" href="TS.html#TSBAVO" title="Vocal Kinesic Incident"><span class="headingNumber">8.3.3 </span>Vocal, Kinesic, Incident</a>). <div id="index-egXML-d52e68971" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#mar</span>"&gt;</span>you<br />   never <span class="element">&lt;pause/&gt;</span> take this cat for show and tell<br /> <span class="element">&lt;pause/&gt;</span> meow meow<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#ros</span>"&gt;</span>yeah well I dont want to<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;incident&gt;</span><br /> <span class="element">&lt;desc&gt;</span>toy cat has bell in tail which continues to make a tinkling sound<span class="element">&lt;/desc&gt;</span><br /><span class="element">&lt;/incident&gt;</span><br /><span class="element">&lt;vocal <span class="attribute">who</span>="<span class="attributevalue">#mar</span>"&gt;</span><br /> <span class="element">&lt;desc&gt;</span>meows<span class="element">&lt;/desc&gt;</span><br /><span class="element">&lt;/vocal&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#ros</span>"&gt;</span>because it is so old<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#mar</span>"&gt;</span>how <span class="element">&lt;choice&gt;</span><br />  <span class="element">&lt;orig&gt;</span>bout<span class="element">&lt;/orig&gt;</span><br />  <span class="element">&lt;reg&gt;</span>about<span class="element">&lt;/reg&gt;</span><br /> <span class="element">&lt;/choice&gt;</span><br /> <span class="element">&lt;emph&gt;</span>your<span class="element">&lt;/emph&gt;</span> cat <span class="element">&lt;pause/&gt;</span>yours is <span class="element">&lt;emph&gt;</span>new<span class="element">&lt;/emph&gt;</span><br /> <span class="element">&lt;kinesic&gt;</span><br />  <span class="element">&lt;desc&gt;</span>shows Father the cat<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/kinesic&gt;</span><span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">trans</span>="<span class="attributevalue">pause</span>" <span class="attribute">who</span>="<span class="attributevalue">#fat</span>"&gt;</span>thats <span class="element">&lt;pause/&gt;</span> darling<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#mar</span>"&gt;</span><br /> <span class="element">&lt;seg&gt;</span>no <span class="element">&lt;emph&gt;</span>mine<span class="element">&lt;/emph&gt;</span> isnt old<span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg&gt;</span>mine is just um a little dirty<span class="element">&lt;/seg&gt;</span><br /><span class="element">&lt;/u&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /><span class="element">&lt;listPerson&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">mar</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">ros</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">fat</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /><span class="element">&lt;/listPerson&gt;</span><div style="float: right;"><a href="BIB.html#TSBA-eg-19">bibliography</a> </div></div> </div><p>This example also uses some elements common to all TEI texts, notably the <a class="gi" title="(regularization) contains a reading which has been regularized or normalized in some sense." href="ref-reg.html">reg</a> tag for editorial regularization. Unusually stressed syllables have been encoded with the <a class="gi" title="(emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect." href="ref-emph.html">emph</a> element. The <a class="gi" title="(arbitrary segment) represents any segmentation of text below the ‘chunk’ level." href="ref-seg.html">seg</a> element has also been used to segment the last utterance. Further discussion of all of such options is provided in section <a class="link_ptr" href="TS.html#TSSA" title="Elements Defined Elsewhere"><span class="headingNumber">8.4 </span>Elements Defined Elsewhere</a>.</p><p>Contextual information is of particular importance in spoken texts, and should be provided by the TEI header of a text. In general, all of the information in a header is understood to be relevant to the whole of the associated text. The element <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> as a member of the <a class="link_odd" title="provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element." href="ref-att.declaring.html">att.declaring</a> class, may however specify a different context by means of the <span class="att">decls</span> attribute (see further section <a class="link_ptr" href="CC.html#CCAS" title="Associating Contextual Information with a Text"><span class="headingNumber">15.3 </span>Associating Contextual Information with a Text</a>).</p><div class="div3" id="TSBAUT"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSBAPA"><span class="headingNumber">8.3.2 </span>Pausing</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSBAUT" title="link to this section "><span class="invisible">TEI: Utterances</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.1 </span><span class="head">Utterances</span></h4><p>Each distinct <span class="term">utterance</span> in a spoken text is represented by a <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element, described as follows: </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-u.html">u</a></span> (utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker.<table class="specDesc"><tr><td class="Attribute"><span class="att">trans</span></td><td>(transition) indicates the nature of the transition between this utterance and the previous one.</td></tr></table></li></ul><p>Use of the <span class="att">who</span> attribute to associate the utterance with a particular speaker is recommended but not required. Its use implies as a further requirement that all speakers be identified by a <a class="gi" title="provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a historical source." href="ref-person.html">person</a> or <a class="gi" title="(personal group) describes a group of individuals treated as a single person for analytic purposes." href="ref-personGrp.html">personGrp</a> element in the TEI header (see section <a class="link_ptr" href="CC.html#CCAHPA" title="The Participant Description"><span class="headingNumber">15.2.2 </span>The Participant Description</a>), but it may also point to another external source of information about the speaker. Where utterances or other parts of the transcription cannot be attributed with confidence to any particular participant or group of participants, the encoder may choose to create <a class="gi" title="(personal group) describes a group of individuals treated as a single person for analytic purposes." href="ref-personGrp.html">personGrp</a> elements with <span class="att">xml:id</span> attributes such as <span class="val">various</span> or <span class="val">unknown</span>, and perhaps give the root <a class="gi" title="(list of persons) contains a list of descriptions, each of which provides information about an identifiable person or a group of people, for example the participants in a language interaction, or the people referred to in a historical source." href="ref-listPerson.html">listPerson</a> element an <span class="att">xml:id</span> value of <span class="val">all</span>, then point to those as appropriate using <span class="att">who</span>.</p><div class="p">The <span class="att">trans</span> attribute is provided as a means of characterizing the transition from one utterance to the next at a simpler level of detail than that provided by the temporal alignment mechanism discussed in section <a class="link_ptr" href="SA.html#SASY" title="Synchronization"><span class="headingNumber">16.4 </span>Synchronization</a>. The value specified applies to the transition from the preceding utterance into the utterance bearing the attribute. For example:<span id="Note71_return"><a class="notelink" title="For the most part, the examples in this chapter use no sentence punctuation except to mark the rising intonation often found in interrogative statemen…" href="#Note71"><sup>34</sup></a></span> <div id="index-egXML-d52e69118" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">ts_a1</span>" <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>Have you heard the<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">ts_b1</span>" <span class="attribute">trans</span>="<span class="attributevalue">latching</span>" <span class="attribute">who</span>="<span class="attributevalue">#b</span>"&gt;</span>the election results? yes<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">ts_a2</span>" <span class="attribute">trans</span>="<span class="attributevalue">pause</span>" <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>it's a disaster<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">ts_b2</span>" <span class="attribute">trans</span>="<span class="attributevalue">overlap</span>" <span class="attribute">who</span>="<span class="attributevalue">#b</span>"&gt;</span>it's a miracle<span class="element">&lt;/u&gt;</span></div> In this example, utterance <span class="val">ts_b1</span> latches on to utterance <span class="val">ts_a1</span>, while there is a marked pause between <span class="val">ts_b1</span> and <span class="val">ts_a2</span>. <span class="val">ts_b2</span> and <span class="val">ts_a2</span> overlap, but by an unspecified amount. For ways of providing a more precise indication of the degree of overlap, see section <a class="link_ptr" href="TS.html#TSSAPA" title="Synchronization and Overlap"><span class="headingNumber">8.4.2 </span>Synchronization and Overlap</a>.</div><p>An utterance may contain either running text, or text within which other basic structural elements are nested. Where such nesting occurs, the <span class="att">who</span> attribute is considered to be inherited for the elements <a class="gi" title="marks a pause either between or within utterances." href="ref-pause.html">pause</a>, <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a>, <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> and <a class="gi" title="marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc." href="ref-kinesic.html">kinesic</a>; that is, a pause or shift (etc.) within an utterance is regarded as being produced by that speaker only, while a pause between utterances applies to all speakers.</p><div class="p">Occasionally, an utterance may seem to contain other utterances, for example where one speaker interrupts himself, or when another speaker produces a ‘back-channel’ while they are still speaking. The present version of these Guidelines does not support nesting of one <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element within another. The transcriber must therefore decide whether such interruptions constitute a change of utterance, or whether other elements may be used. In the case of self-interruption, the <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> element may be used to show that the speaker has changed the quality of their speech: <div id="index-egXML-d52e69177" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>Listen to this <span class="element">&lt;shift <span class="attribute">new</span>="<span class="attributevalue">reading</span>"/&gt;</span>The government is<br />   confident, he said, that the current economic problems will be<br />   completely overcome by June<span class="element">&lt;shift <span class="attribute">new</span>="<span class="attributevalue">normal</span>"/&gt;</span> what nonsense<span class="element">&lt;/u&gt;</span></div> Alternatively the <a class="gi" title="marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication." href="ref-incident.html">incident</a> element described in section <a class="link_ptr" href="TS.html#TSBAVO" title="Vocal Kinesic Incident"><span class="headingNumber">8.3.3 </span>Vocal, Kinesic, Incident</a> might be used, without transcribing the read material: <div id="index-egXML-d52e69191" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>Listen to this<br /> <span class="element">&lt;incident&gt;</span><br />  <span class="element">&lt;desc&gt;</span>reads aloud from newspaper<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/incident&gt;</span> what<br />   nonsense<span class="element">&lt;/u&gt;</span></div></div><div class="p">Often, back-channelling is only semi-lexicalized and may therefore be represented using the <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a> element: <div id="index-egXML-d52e69204" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>So what could I have done <span class="element">&lt;vocal <span class="attribute">who</span>="<span class="attributevalue">#b</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>tut-tutting<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/vocal&gt;</span> about it anyway?<span class="element">&lt;/u&gt;</span></div> Where this is not possible, it is simplest to regard the back-channel as a distinct utterance.</div></div><div class="div3" id="TSBAPA"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBAUT"><span class="headingNumber">8.3.1 </span>Utterances</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSBAVO"><span class="headingNumber">8.3.3 </span>Vocal, Kinesic, Incident</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSBAPA" title="link to this section "><span class="invisible">TEI: Pausing</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.2 </span><span class="head">Pausing</span></h4><div class="p">Speakers differ very much in their rhythm and in particular in the amount of time they leave between words. The following element is provided to mark occasions where the transcriber judges that speech has been paused, irrespective of the actual amount of silence: <ul class="specList"><li><span class="specList-elementSpec"><a href="ref-pause.html">pause</a></span> marks a pause either between or within utterances.</li></ul> A pause contained by an utterance applies to the speaker of that utterance. A pause between utterances applies to all speakers. The <span class="att">type</span> attribute may be used to categorize the pause, for example as short, medium, or long; alternatively the attribute <span class="att">dur</span> may be used to indicate its length more exactly, as in the following example: <div id="index-egXML-d52e69226" class="pre egXML_valid"><span class="element">&lt;u&gt;</span>Okay <span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT2M</span>"/&gt;</span>U-m<span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT75S</span>"/&gt;</span>the scene opens up<br /> <span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT50S</span>"/&gt;</span> with <span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT20S</span>"/&gt;</span> um <span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT145S</span>"/&gt;</span> you see<br />   a tree okay?<span class="element">&lt;/u&gt;</span><div style="float: right;"><a href="BIB.html#TSBAPA-eg-24">bibliography</a> </div></div>  If detailed synchronization of pausing with other vocal phenomena is required, the alignment mechanism defined at section <a class="link_ptr" href="SA.html#SASY" title="Synchronization"><span class="headingNumber">16.4 </span>Synchronization</a> and discussed informally below should be used. Note that the <span class="att">trans</span> attribute mentioned in the previous section may also be used to characterize the degree of pausing between (but not within) utterances.</div></div><div class="div3" id="TSBAVO"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBAPA"><span class="headingNumber">8.3.2 </span>Pausing</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSBAWR"><span class="headingNumber">8.3.4 </span>Writing</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSBAVO" title="link to this section "><span class="invisible">TEI: Vocal, Kinesic, Incident</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.3 </span><span class="head">Vocal, Kinesic, Incident</span></h4><p>The presence of non-transcribed semi-lexical or non-lexical phenomena either between or within utterances may be indicated with the following three elements. </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-vocal.html">vocal</a></span> marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc.</li><li><span class="specList-elementSpec"><a href="ref-kinesic.html">kinesic</a></span> marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc.</li><li><span class="specList-elementSpec"><a href="ref-incident.html">incident</a></span> marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication.</li></ul><p>The <span class="att">who</span> attribute should be used to specify the person or group responsible for a <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a>, <a class="gi" title="marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc." href="ref-kinesic.html">kinesic</a>, or <a class="gi" title="marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication." href="ref-incident.html">incident</a> which is contained within an utterance, if this differs from that of the enclosing utterance. The attribute must be supplied for a <a class="gi" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a>, <a class="gi" title="marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc." href="ref-kinesic.html">kinesic</a>, or <a class="gi" title="marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication." href="ref-incident.html">incident</a> which is not contained within an utterance.</p><p>The <span class="att">iterated</span> attribute may be used to indicate that the vocal, kinesic, or incident is repeated, for example <span class="val">laughter</span> as opposed to <span class="val">laugh</span>. These should both be distinguished from <span class="val">laughing</span>, where what is being encoded is a shift in voice quality. For this last case, the <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> element discussed in section <a class="link_ptr" href="TS.html#TSSASH" title="Shifts"><span class="headingNumber">8.3.6 </span>Shifts</a> should be used.</p><p>A child <a class="gi" title="(description) contains a brief description of the object documented by its parent element, typically a documentation element or an entity." href="ref-desc.html">desc</a> element may be used to supply a conventional representation for the phenomenon, for example: </p><dl><dt><span>non-lexical </span></dt><dd>burp, click, cough, exhale, giggle, gulp, inhale, laugh, sneeze, sniff, snort, sob, swallow, throat, yawn </dd><dt><span>semi-lexical </span></dt><dd>ah, aha, aw, eh, ehm, er, erm, hmm, huh, mm, mmhm, oh, ooh, oops, phew, tsk, uh, uh-huh, uh-uh, um, urgh, yup</dd></dl><p> Researchers may prefer to regard some semi-lexical phenomena as ‘words’ within the bounds of the <a class="gi" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> element. See further the discussion at section <a class="link_ptr" href="TS.html#TSREG" title="Regularization of Word Forms"><span class="headingNumber">8.4.3 </span>Regularization of Word Forms</a> below. As for all basic categories, the definition should be made clear in the <a class="gi" title="(encoding description) documents the relationship between an electronic text and the source or sources from which it was derived." href="ref-encodingDesc.html">encodingDesc</a> element of the TEI header.</p><div class="p">Some typical examples follow: <div id="index-egXML-d52e69332" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#jan</span>"&gt;</span>This is just delicious<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;incident&gt;</span><br /> <span class="element">&lt;desc&gt;</span>telephone rings<span class="element">&lt;/desc&gt;</span><br /><span class="element">&lt;/incident&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#ann</span>"&gt;</span>I'll get it<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#tom</span>"&gt;</span>I used to <span class="element">&lt;vocal&gt;</span><br />  <span class="element">&lt;desc&gt;</span>cough<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/vocal&gt;</span> smoke a lot<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span><br /> <span class="element">&lt;vocal&gt;</span><br />  <span class="element">&lt;desc&gt;</span>sniffs<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/vocal&gt;</span>He thinks he's tough<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;vocal <span class="attribute">who</span>="<span class="attributevalue">#ann</span>"&gt;</span><br /> <span class="element">&lt;desc&gt;</span>snorts<span class="element">&lt;/desc&gt;</span><br /><span class="element">&lt;/vocal&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /><span class="element">&lt;listPerson&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">ann</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">bob</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">jan</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">kim</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">tom</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /><span class="element">&lt;/listPerson&gt;</span><div style="float: right;"><a href="BIB.html#TSSASE-eg-20">bibliography</a> </div></div> Note that Ann's snorting could equally well be encoded as follows: <div id="index-egXML-d52e69367" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#ann</span>"&gt;</span><br /> <span class="element">&lt;vocal&gt;</span><br />  <span class="element">&lt;desc&gt;</span>snorts<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/vocal&gt;</span><br /><span class="element">&lt;/u&gt;</span></div></div><p>The extent to which encoding of incidents or kinesics is included in a transcription will depend entirely on the purpose for which the transcription was made. As elsewhere, this will depend on the particular research agenda and the extent to which their presence is felt to be significant for the interpretation of spoken interactions. </p></div><div class="div3" id="TSBAWR"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBAVO"><span class="headingNumber">8.3.3 </span>Vocal, Kinesic, Incident</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSBATI"><span class="headingNumber">8.3.5 </span>Temporal Information</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSBAWR" title="link to this section "><span class="invisible">TEI: Writing</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.4 </span><span class="head">Writing</span></h4><div class="p">Written text may also be encountered when speech is transcribed, for example in a television broadcast or cinema performance, or where one participant shows written text to another. The <a class="gi" title="contains a passage of written text revealed to participants in the course of a spoken text." href="ref-writing.html">writing</a> element may be used to distinguish such written elements from the spoken text in which they are embedded. <ul class="specList"><li><span class="specList-elementSpec"><a href="ref-writing.html">writing</a></span> contains a passage of written text revealed to participants in the course of a spoken text.<table class="specDesc"><tr><td class="Attribute"><span class="att">gradual</span></td><td>indicates whether the writing is revealed all at once or gradually.</td></tr></table></li><li><span class="specList-classSpec"><a href="ref-att.global.source.html">att.global.source</a></span> provides an attribute used by elements to point to an external source.</li></ul>  For example, if speaker A in the breakfast table conversation in section <a class="link_ptr" href="TS.html#TSBAUT" title="Utterances"><span class="headingNumber">8.3.1 </span>Utterances</a> above had simply shown the newspaper passage to her interlocutor instead of reading it, the interaction might have been encoded as follows: <div id="index-egXML-d52e69391" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>look at this<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;writing <span class="attribute">who</span>="<span class="attributevalue">#a</span>" <span class="attribute">type</span>="<span class="attributevalue">newspaper</span>"<br /> <span class="attribute">gradual</span>="<span class="attributevalue">false</span>"&gt;</span>Government claims economic problems<br /> <span class="element">&lt;soCalled&gt;</span>over by June<span class="element">&lt;/soCalled&gt;</span><span class="element">&lt;/writing&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#a</span>"&gt;</span>what nonsense!<span class="element">&lt;/u&gt;</span></div></div><div class="p">If the source of the writing being displayed is known, bibliographic information about it may be stored in a <a class="gi" title="(citation list) contains a list of bibliographic citations of any kind." href="ref-listBibl.html">listBibl</a> within the <a class="gi" title="(source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as &#34;born digital&#34; for a text which has no previous existence." href="ref-sourceDesc.html">sourceDesc</a> element of the TEI header, and then pointed to using the <span class="att">source</span> attribute. For example, in the following imaginary example, a lecturer displays two different versions of the same passage of text: <div id="index-egXML-d52e69412" class="pre egXML_valid"><span class="element">&lt;sourceDesc&gt;</span><br /><span class="comment">&lt;!-- ...--&gt;</span><br /> <span class="element">&lt;bibl <span class="attribute">xml:id</span>="<span class="attributevalue">FOL1</span>"&gt;</span>Shakespeare First Folio text<span class="element">&lt;/bibl&gt;</span><br /> <span class="element">&lt;bibl <span class="attribute">xml:id</span>="<span class="attributevalue">FOL2</span>"&gt;</span>Shakespeare Second Folio text<span class="element">&lt;/bibl&gt;</span><br /><span class="comment">&lt;!-- ...--&gt;</span><br /><span class="element">&lt;/sourceDesc&gt;</span><br /><span class="comment">&lt;!-- ...--&gt;</span><br /><span class="element">&lt;u&gt;</span>[...] now compare the punctuation of lines 12 and 14 in these two<br />   versions of page 42...<br /> <span class="element">&lt;writing <span class="attribute">source</span>="<span class="attributevalue">#FOL1</span>"&gt;</span>[...]<span class="element">&lt;/writing&gt;</span><br /> <span class="element">&lt;writing <span class="attribute">source</span>="<span class="attributevalue">#FOL2</span>"&gt;</span>[...]<span class="element">&lt;/writing&gt;</span><span class="element">&lt;/u&gt;</span></div></div></div><div class="div3" id="TSBATI"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBAWR"><span class="headingNumber">8.3.4 </span>Writing</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSSASH"><span class="headingNumber">8.3.6 </span>Shifts</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSBATI" title="link to this section "><span class="invisible">TEI: Temporal Information</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.5 </span><span class="head">Temporal Information</span></h4><p>As noted above, utterances, vocals, pauses, kinesics, incidents, and writing elements all inherit attributes providing information about their position in time from the classes <a class="link_odd" title="provides attributes common to those elements which have a duration in time, expressed either absolutely or by reference to an alignment map." href="ref-att.timed.html">att.timed</a> and <a class="link_odd" title="provides attributes for normalization of elements that contain datable events." href="ref-att.duration.html">att.duration</a>. These attributes can be used to link parts of the transcription very exactly with points on a timeline, or simply to indicate their duration. Note that if <span class="att">start</span> and <span class="att">end</span> point to <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> elements whose temporal distance from each other is specified in a timeline, then <span class="att">dur</span> is ignored.</p><p>The <a class="gi" title="(anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element." href="ref-anchor.html">anchor</a> element (see <a class="link_ptr" href="SA.html#SACS" title="Correspondence and Alignment"><span class="headingNumber">16.5 </span>Correspondence and Alignment</a>) may be used as an alternative means of aligning the start and end of timed elements, and is required when the temporal alignment involves points within an element.</p><p>For further discussion of temporal alignment and synchronization see <a class="link_ptr" href="TS.html#TSSAPA" title="Synchronization and Overlap"><span class="headingNumber">8.4.2 </span>Synchronization and Overlap</a> below.</p></div><div class="div3" id="TSSASH"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBATI"><span class="headingNumber">8.3.5 </span>Temporal Information</a></li><li class="subtoc"></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSSASH" title="link to this section "><span class="invisible">TEI: Shifts</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.3.6 </span><span class="head">Shifts</span></h4><div class="p">A common requirement in transcribing spoken language is to mark positions at which a variety of prosodic features change. Many paralinguistic features (pitch, prominence, loudness, etc.) characterize stretches of speech which are not co-extensive with utterances or any of the other units discussed so far. One simple method of encoding such units is simply to mark their boundaries. An empty element called <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> is provided for this purpose. <ul class="specList"><li><span class="specList-elementSpec"><a href="ref-shift.html">shift</a></span> marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes.<table class="specDesc"><tr><td class="Attribute"><span class="att">feature</span></td><td>a paralinguistic feature.
Suggested values include: 1] tempo; 2] loud; 3] pitch; 4] tension; 5] rhythm; 6] voice</td></tr><tr><td class="Attribute"><span class="att">new</span></td><td>specifies the new state of the paralinguistic feature specified.</td></tr></table></li></ul> A <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> element may appear within an utterance or a segment to mark a significant change in the particular feature defined by its attributes, which is then understood to apply to all subsequent utterances for the same speaker, unless changed by a new shift for the same feature in the same speaker. Intervening utterances by other speakers do not normally carry the same feature. For example: <div id="index-egXML-d52e69476" class="pre egXML_valid"><span class="element">&lt;u&gt;</span><br /> <span class="element">&lt;shift <span class="attribute">feature</span>="<span class="attributevalue">loud</span>" <span class="attribute">new</span>="<span class="attributevalue">f</span>"/&gt;</span>Elizabeth<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u&gt;</span>Yes<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u&gt;</span><br /> <span class="element">&lt;shift <span class="attribute">feature</span>="<span class="attributevalue">loud</span>" <span class="attribute">new</span>="<span class="attributevalue">normal</span>"/&gt;</span>Come and try this <span class="element">&lt;pause/&gt;</span><br /> <span class="element">&lt;shift <span class="attribute">feature</span>="<span class="attributevalue">loud</span>" <span class="attribute">new</span>="<span class="attributevalue">ff</span>"/&gt;</span>come on<span class="element">&lt;/u&gt;</span></div> In this example, the word <span class="mentioned">Elizabeth</span> is spoken loudly, the words <span class="mentioned">Yes</span> and <span class="mentioned">Come and try this</span> with normal volume, and the words <span class="mentioned">come on</span> very loudly.</div><p>The values proposed here for the <span class="att">feature</span> attribute are based on those used by the Survey of English Usage (see further <span class="ref">Boase 1990</span>); this list may be revised or supplemented using the methods outlined in section <a class="link_ptr" href="USE.html#MD" title="Customization"><span class="headingNumber">23.3 </span>Customization</a>.</p><p>The <span class="att">new</span> attribute specifies the new state of the feature following the shift. If this attribute has the special value <span class="val">normal</span>, the implication is that the feature concerned ceases to be remarkable at this point.</p><p>A list of suggested values for each of the features proposed follows: </p><ul class="bulleted"><li class="item">tempo <dl><dt><span>a </span></dt><dd>allegro (fast)</dd><dt><span>aa </span></dt><dd>very fast</dd><dt><span>acc </span></dt><dd>accelerando (getting faster)</dd><dt><span>l </span></dt><dd>lento (slow)</dd><dt><span>ll </span></dt><dd>very slow</dd><dt><span>rall </span></dt><dd>rallentando (getting slower)</dd></dl></li><li class="item">loud (for loudness): <dl><dt><span>f </span></dt><dd>forte (loud)</dd><dt><span>ff </span></dt><dd>very loud</dd><dt><span>cresc </span></dt><dd>crescendo (getting louder)</dd><dt><span>p </span></dt><dd>piano (soft)</dd><dt><span>pp </span></dt><dd>very soft</dd><dt><span>dimin </span></dt><dd>diminuendo (getting softer)</dd></dl></li><li class="item">pitch (for pitch range): <dl><dt><span>high </span></dt><dd>high pitch-range</dd><dt><span>low </span></dt><dd>low pitch-range</dd><dt><span>wide </span></dt><dd>wide pitch-range</dd><dt><span>narrow</span></dt><dd>narrow pitch-range</dd><dt><span>asc </span></dt><dd>ascending</dd><dt><span>desc </span></dt><dd>descending</dd><dt><span>monot </span></dt><dd>monotonous</dd><dt><span>scand </span></dt><dd>scandent, each succeeding syllable higher than the last, generally ending in a falling tone</dd></dl></li><li class="item">tension: <dl><dt><span>sl </span></dt><dd>slurred</dd><dt><span>lax </span></dt><dd>lax, a little slurred</dd><dt><span>ten </span></dt><dd>tense</dd><dt><span>pr </span></dt><dd>very precise</dd><dt><span>st </span></dt><dd>staccato, every stressed syllable being doubly stressed</dd><dt><span>leg </span></dt><dd>legato, every syllable receiving more or less equal stress</dd></dl></li><li class="item">rhythm: <dl><dt><span>rh </span></dt><dd>beatable rhythm</dd><dt><span>arrh </span></dt><dd>arrhythmic, particularly halting</dd><dt><span>spr </span></dt><dd>spiky rising, with markedly higher unstressed syllables</dd><dt><span>spf </span></dt><dd>spiky falling, with markedly lower unstressed syllables</dd><dt><span>glr </span></dt><dd>glissando rising, like spiky rising but the unstressed syllables, usually several, also rise in pitch relative to each other</dd><dt><span>glf </span></dt><dd>glissando falling, like spiky falling but with the unstressed syllables also falling in pitch relative to each other</dd></dl></li><li class="item">voice (for voice quality): <dl><dt><span>whisp </span></dt><dd>whisper</dd><dt><span>breath </span></dt><dd>breathy</dd><dt><span>husk </span></dt><dd>husky</dd><dt><span>creak </span></dt><dd>creaky</dd><dt><span>fals </span></dt><dd>falsetto</dd><dt><span>reson </span></dt><dd>resonant</dd><dt><span>giggle </span></dt><dd>unvoiced laugh or giggle</dd><dt><span>laugh </span></dt><dd>voiced laugh</dd><dt><span>trem </span></dt><dd>tremulous</dd><dt><span>sob </span></dt><dd>sobbing</dd><dt><span>yawn </span></dt><dd>yawning</dd><dt><span>sigh </span></dt><dd>sighing</dd></dl></li></ul><p>A full definition of the sense of the values provided for each feature may be provided either in the encoding description section of the text header (see section <a class="link_ptr" href="HD.html#HD5" title="The Encoding Description"><span class="headingNumber">2.3 </span>The Encoding Description</a>) or as part of a TEI customization, as described in section <a class="link_ptr" href="USE.html#MD" title="Customization"><span class="headingNumber">23.3 </span>Customization</a>.</p></div></div><div class="div2" id="TSSA"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSBA"><span class="headingNumber">8.3 </span>Elements Unique to Spoken Texts</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#index-body.1_div.8_div.5"><span class="headingNumber">8.5 </span>Module for Transcribed Speech</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h3><span class="bookmarklink"><a class="bookmarklink" href="#TSSA" title="link to this section "><span class="invisible">TEI: Elements Defined Elsewhere</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4 </span><span class="head">Elements Defined Elsewhere</span></h3><p>This section describes the following features characteristic of spoken texts for which elements are defined elsewhere in these Guidelines: </p><ul class="bulleted"><li class="item">segmentation below the utterance level</li><li class="item">synchronization and overlap</li><li class="item">regularization of orthography</li></ul><p> The elements discussed here are not provided by the module for spoken texts. Some of them are included in the core module and others are contained in the modules for linking and for analysis respectively. The selection of modules and their combination to define a TEI schema is discussed in section <a class="link_ptr" href="ST.html#STIN" title="Defining a TEI Schema"><span class="headingNumber">1.2 </span>Defining a TEI Schema</a>. </p><div class="div3" id="TSSASE"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSSAPA"><span class="headingNumber">8.4.2 </span>Synchronization and Overlap</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSSASE" title="link to this section "><span class="invisible">TEI: Segmentation</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.1 </span><span class="head">Segmentation</span></h4><p>For some analytic purposes it may be desirable to subdivide the divisions of a spoken text into units smaller than the individual utterance or turn. Segmentation may be performed for a number of different purposes and in terms of a variety of speech phenomena. Common examples include units defined both prosodically (by intonation, pausing, etc.) and syntactically (clauses, phrases, etc.) The term <span class="term">macrosyntagm</span> has been used by a number of researchers to define units peculiar to speech transcripts.<span id="Note72_return"><a class="notelink" title="The term was apparently first proposed by , where it is defined as follows: A text can be analysed as a sequence of segments which are internally conn…" href="#Note72"><sup>35</sup></a></span></p><p>These Guidelines propose that such analyses be performed in terms of neutrally-named <span class="term">segments</span>, represented by the <a class="gi" title="(arbitrary segment) represents any segmentation of text below the ‘chunk’ level." href="ref-seg.html">seg</a> element, which is discussed more fully in section <a class="link_ptr" href="SA.html#SASE" title="Blocks Segments and Anchors"><span class="headingNumber">16.3 </span>Blocks, Segments, and Anchors</a>. This element may take a <span class="att">type</span> attribute to specify the kind of segmentation applicable to a particular segment, if more than one is possible in a text. A full definition of the segmentation scheme or schemes used should be provided in the <a class="gi" title="describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc." href="ref-segmentation.html">segmentation</a> element of the <a class="gi" title="(editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text." href="ref-editorialDecl.html">editorialDecl</a> element in the TEI header (see <a class="link_ptr" href="HD.html#HD53" title="The Editorial Practices Declaration"><span class="headingNumber">2.3.3 </span>The Editorial Practices Declaration</a>).</p><div class="p">In the first example below, an utterance has been segmented according to a notion of syntactic completeness not necessarily marked by the speech, although in this case a pause has been recorded between the two sentence-like units. In the second, the segments are defined prosodically (an acute accent has been used to mark the position immediately following the syllable bearing the primary accent or stress), and may be thought of as ‘tone units’. <div id="index-egXML-d52e70614" class="pre egXML_valid"><span class="element">&lt;u&gt;</span><br /> <span class="element">&lt;seg&gt;</span>we went to the pub yesterday<span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;pause/&gt;</span><br /> <span class="element">&lt;seg&gt;</span>there was no one there<span class="element">&lt;/seg&gt;</span><br /><span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u&gt;</span><br /> <span class="element">&lt;seg&gt;</span>although its an old ide´a<span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg&gt;</span>it hasnt been on the mar´ket very long<span class="element">&lt;/seg&gt;</span><br /><span class="element">&lt;/u&gt;</span><div style="float: right;"><a href="BIB.html#TSSASE-eg-37">bibliography</a> </div></div>  In either case, the <a class="gi" title="describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc." href="ref-segmentation.html">segmentation</a> element in the header of the text should specify the principles adopted to define the segments marked in this way.</div><p>When utterances are segmented end-to-end in the same way as the s-units in written texts, the <a class="gi" title="(s-unit) contains a sentence-like division of a text." href="ref-s.html">s</a> element discussed in chapter <a class="link_ptr" href="AI.html" title="15"><span class="headingNumber">17 </span>Simple Analytic Mechanisms</a> may be used, either as an alternative or in addition to the more general purpose <a class="gi" title="(arbitrary segment) represents any segmentation of text below the ‘chunk’ level." href="ref-seg.html">seg</a> element. The <a class="gi" title="(s-unit) contains a sentence-like division of a text." href="ref-s.html">s</a> element is available without formality in all texts, but does not allow segments to nest within each other.</p><div class="p">Where segments of different kinds are to be distinguished within the same stretch of speech, the <span class="att">type</span> attribute may be used, as in the following example: <div id="index-egXML-d52e70650" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#T1</span>"&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>I think <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>this chap was writing <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>and he <span class="element">&lt;del <span class="attribute">type</span>="<span class="attributevalue">repeated</span>"&gt;</span>said hello<span class="element">&lt;/del&gt;</span> said <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">M</span>"&gt;</span>hello <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>and he said <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>I'm going to a gate<br />     at twenty past seven <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>he said <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">M</span>"&gt;</span>ok <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">M</span>"&gt;</span>right away <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>and so <span class="element">&lt;gap <span class="attribute">extent</span>="<span class="attributevalue">1 syll</span>"/&gt;</span> on they went <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>and they were <span class="element">&lt;gap <span class="attribute">extent</span>="<span class="attributevalue">3 sylls</span>"/&gt;</span><br />     writing there <span class="element">&lt;/seg&gt;</span><br /><span class="element">&lt;/u&gt;</span></div>  In this example, recoded from a corpus of language-impaired speech prepared by Fletcher and Garman, the speaker's utterance has been fully segmented into clausal (<code>type="C"</code>) or minor (<code>type="M"</code>) units.</div><div class="p">For some features, it may be more appropriate or convenient to introduce a new element in a custom namespace: <div id="index-egXML-d52e70692" class="pre egXML_invalid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#T1</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>and he said <span class="element">&lt;/seg&gt;</span><br /> <span class="element">&lt;seg <span class="attribute">type</span>="<span class="attributevalue">C</span>"&gt;</span>I'm going to a<br />  <span class="element">&lt;ext:paraphasia&gt;</span>gate<span class="element">&lt;/ext:paraphasia&gt;</span><br />     at twenty past seven <span class="element">&lt;/seg&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /><span class="element">&lt;/u&gt;</span></div> Here, <span class="gi">&lt;ext:paraphasia&gt;</span> has been used to define a particular characteristic of this corpus for which no element exists in the TEI scheme. See further chapter <a class="link_ptr" href="USE.html#MD" title="Customization"><span class="headingNumber">23.3 </span>Customization</a> for a discussion of the way in which this kind of user-defined extension of the TEI scheme may be performed and chapter <a class="link_ptr" href="ST.html" title="3"><span class="headingNumber">1 </span>The TEI Infrastructure</a> for the mechanisms on which it depends.</div><p>This example also uses the core elements <a class="gi" title="indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible." href="ref-gap.html">gap</a> and <a class="gi" title="(deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector." href="ref-del.html">del</a> to mark editorial decisions concerning matter completely omitted from the transcript (because of inaudibility), and words which have been transcribed but which the transcriber wishes to exclude from the segment because they are repeated, respectively. See section <a class="link_ptr" href="CO.html#COED" title="Simple Editorial Changes"><span class="headingNumber">3.4 </span>Simple Editorial Changes</a> for a discussion of these and related elements.</p><p>It is often the case that the desired segmentation does not respect utterance boundaries; for example, syntactic units may cross utterance boundaries. For a detailed discussion of this problem, and the various methods proposed by these Guidelines for handling it, see chapter <a class="link_ptr" href="NH.html" title="31"><span class="headingNumber">20 </span>Non-hierarchical Structures</a>. Methods discussed there include these: </p><ul class="bulleted"><li class="item">‘milestone’ tags may be used; the special-purpose <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> tag discussed in section <a class="link_ptr" href="TS.html#TSSASH" title="Shifts"><span class="headingNumber">8.3.6 </span>Shifts</a> is an extension of this method</li><li class="item">where several discontinuous segments are to be grouped together to form a syntactic unit (e.g. a phrasal verb with interposed complement), the <a class="gi" title="identifies a possibly fragmented segment of text, by pointing at the possibly discontiguous elements which compose it." href="ref-join.html">join</a> element may be used</li></ul></div><div class="div3" id="TSSAPA"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSSASE"><span class="headingNumber">8.4.1 </span>Segmentation</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSREG"><span class="headingNumber">8.4.3 </span>Regularization of Word Forms</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSSAPA" title="link to this section "><span class="invisible">TEI: Synchronization and Overlap</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.2 </span><span class="head">Synchronization and Overlap</span></h4><div class="p">A major difference between spoken and written texts is the importance of the temporal dimension to the former. As a very simple example, consider the following, first as it might be represented in a playscript: <div id="index-egXML-d52e70745" class="pre cdata egXML_valid"> Jane: Have you read Vanity Fair?
Stig: Yes
Lou: (nods vigorously)</div> To encode this, we first define the participants: <div id="index-egXML-d52e70748" class="pre egXML_valid"><span class="element">&lt;listPerson&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">stig</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">lou</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">jane</span>"&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /> <span class="element">&lt;/person&gt;</span><br /><span class="element">&lt;/listPerson&gt;</span></div> Let us assume that Stig and Lou respond to Jane's question before she has finished asking it—a fairly normal situation in spontaneous speech. The simplest way of representing this <span class="term">overlap</span> would be to use the <span class="att">trans</span> attribute previously discussed: <div id="index-egXML-d52e70763" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#jane</span>"&gt;</span>have you read Vanity Fair<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">trans</span>="<span class="attributevalue">overlap</span>" <span class="attribute">who</span>="<span class="attributevalue">#stig</span>"&gt;</span>yes<span class="element">&lt;/u&gt;</span></div> However, this does not allow us to indicate either the extent to which Stig's utterance is overlapped, nor does it show that there are in fact three things which are synchronous: the end of Jane's utterance, Stig's whole utterance, and Lou's kinesic. To overcome these problems, more sophisticated techniques, employing the mechanisms for pointing and alignment discussed in detail in section <a class="link_ptr" href="SA.html#SASY" title="Synchronization"><span class="headingNumber">16.4 </span>Synchronization</a>, are needed. If the module for linking has been enabled (as described in section <a class="link_ptr" href="TS.html#TSSASE" title="Segmentation"><span class="headingNumber">8.4.1 </span>Segmentation</a> above), one way to represent the simple example above would be as follows: <div id="index-egXML-d52e70774" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">utt1</span>" <span class="attribute">who</span>="<span class="attributevalue">#jane</span>"&gt;</span>have you read Vanity <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#utt2 #k1</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">a1</span>"/&gt;</span> Fair<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">utt2</span>" <span class="attribute">who</span>="<span class="attributevalue">#stig</span>"&gt;</span>yes<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;kinesic <span class="attribute">xml:id</span>="<span class="attributevalue">k1</span>" <span class="attribute">who</span>="<span class="attributevalue">#lou</span>"<br /> <span class="attribute">iterated</span>="<span class="attributevalue">true</span>"&gt;</span><br /> <span class="element">&lt;desc&gt;</span>nods head vertically<span class="element">&lt;/desc&gt;</span><br /><span class="element">&lt;/kinesic&gt;</span></div></div><p>For a full discussion of this and related mechanisms, section <a class="link_ptr" href="SA.html#SASYMP" title="Placing Synchronous Events in Time"><span class="headingNumber">16.4.2 </span>Placing Synchronous Events in Time</a> should be consulted. The rest of the present section, which should be read in conjunction with that more detailed discussion, presents a number of ways in which these mechanisms may be applied to the specific problem of representing temporal alignment, synchrony, or overlap in transcribing spoken texts.</p><p>In the simple example above, the first utterance (that with identifier <span class="val">utt1</span>) contains an <a class="gi" title="(anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element." href="ref-anchor.html">anchor</a> element, the function of which is simply to mark a point within it. The <span class="att">synch</span> attribute associated with this anchor point specifies the identifiers of the other two elements which are to be synchronized with it: specifically, the second utterance (<span class="val">utt2</span>) and the kinesic (k1). Note that one of these elements has content and the other is empty.</p><p>This example demonstrates only a way of indicating a point within one utterance at which it can be synchronized with another utterance and a kinesic. For more complex kinds of alignment, involving possibly multiple synchronization points, an additional element is provided, known as a <a class="gi" title="provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text." href="ref-timeline.html">timeline</a>. This consists of a series of <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> elements, each representing a point in time, and bearing attributes which indicate its exact temporal position relative to other elements in the same timeline, in addition to the sequencing implied by its position within it.</p><div class="p">For example: <div id="index-egXML-d52e70812" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">unit</span>="<span class="attributevalue">s</span>" <span class="attribute">origin</span>="<span class="attributevalue">#TS-P1</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-P1</span>"<br />  <span class="attribute">absolute</span>="<span class="attributevalue">12:20:01+01:00</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-P2</span>" <span class="attribute">interval</span>="<span class="attributevalue">4.5</span>"<br />  <span class="attribute">since</span>="<span class="attributevalue">#TS-P1</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-P6</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-P3</span>" <span class="attribute">interval</span>="<span class="attributevalue">1.5</span>"<br />  <span class="attribute">since</span>="<span class="attributevalue">#TS-P6</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span></div> This timeline represents four points in time, named TS-P1, TS-P2, TS-P6, and TS-P3 (as with all attributes named <span class="att">xml:id</span> in the TEI scheme, the names must be unique within the document but have no other significance). TS-P1 is located absolutely, at 12:20:01:01 BST. TS-P2 is 4.5 seconds later than TS-P2 (i.e. at 12:20:46). TS-P6 is at some unspecified time later than TS-P2 and previous to TS-P3 (this is implied by its position within the timeline, as no attribute values have been specified for it). The fourth point, TS-P3, is 1.5 seconds later than TS-P6.</div><p>One or more such timelines may be specified within a spoken text, to suit the encoder's convenience. If more than one is supplied, the <span class="att">origin</span> attribute may be used on each to specify which other <a class="gi" title="provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text." href="ref-timeline.html">timeline</a> element it follows. The <span class="att">unit</span> attribute indicates the units used for timings given on <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> elements contained by the alignment map. Alternatively, to avoid the need to specify times explicitly, the <span class="att">interval</span> attribute may be used to indicate that all the <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> elements in a time line are a fixed distance apart.</p><p>Three methods are available for aligning points or elements within a spoken text with the points in time defined by the <a class="gi" title="provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text." href="ref-timeline.html">timeline</a>: </p><ul class="bulleted"><li class="item">The elements to be synchronized may specify the identifier of a <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> element as the value of one of the <span class="att">start</span>, <span class="att">end</span>, or <span class="att">synch</span> attributes</li><li class="item">The <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> element may specify the identifiers of all the elements to be synchronized with it using the <span class="att">synch</span> attribute</li><li class="item">A free-standing <a class="gi" title="defines an association or hypertextual link among elements or passages, of some type not more precisely specifiable by other elements." href="ref-link.html">link</a> element may be used to associate the <a class="gi" title="indicates a point in time either relative to other elements in the same timeline tag, or absolutely." href="ref-when.html">when</a> element and the elements synchronized with it by specifying their identifiers as values for its <span class="att">target</span> attribute.</li></ul><div class="p">For example, using the timeline given above: <div id="index-egXML-d52e70884" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">TS-U1</span>" <span class="attribute">start</span>="<span class="attributevalue">#TS-P2</span>"<br /> <span class="attribute">end</span>="<span class="attributevalue">#TS-P3</span>"&gt;</span>This is my <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-P6</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">TS-P6A</span>"/&gt;</span> turn<span class="element">&lt;/u&gt;</span></div> The start of utterance <span class="val">TS-U1</span> is aligned with <span class="val">TS-P2</span> and its end with <span class="val">TS-P3</span>. The transition between the words <span class="mentioned">my</span> and <span class="mentioned">turn</span> occurs at point <span class="val">TS-P6A</span>, which is synchronous with point <span class="val">TS-P6</span> on the timeline.</div><div class="p">The synchronization represented by the preceding examples could equally well be represented as follows: <div id="index-egXML-d52e70914" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#ts-p1</span>" <span class="attribute">unit</span>="<span class="attributevalue">s</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">ts-p1</span>"<br />  <span class="attribute">absolute</span>="<span class="attributevalue">12:20:01+01:00</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#ts-u1</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">ts-p2</span>"<br />  <span class="attribute">interval</span>="<span class="attributevalue">4.5</span>" <span class="attribute">since</span>="<span class="attributevalue">#ts-p1</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#ts-x1</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">ts-p6</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#ts-u1</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">ts-p3</span>"<br />  <span class="attribute">interval</span>="<span class="attributevalue">1.5</span>" <span class="attribute">since</span>="<span class="attributevalue">#ts-p6</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">ts-u1</span>"&gt;</span>This is my <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">ts-x1</span>"/&gt;</span> turn<span class="element">&lt;/u&gt;</span></div> Here, the whole of the object with identifier <span class="val">ts-u1</span> (the utterance) has been aligned with two different points, <span class="val">ts-p2</span> and <span class="val">ts-p3</span>. This is interpreted to mean that the utterance spans at least those two points.</div><div class="p">Finally, a <a class="gi" title="(link group) defines a collection of associations or hypertextual links." href="ref-linkGrp.html">linkGrp</a> may be used as an alternative to the <span class="att">synch</span> attribute: <div id="index-egXML-d52e70942" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#TS-p1</span>" <span class="attribute">unit</span>="<span class="attributevalue">s</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p1</span>" <span class="attribute">absolute</span>="<span class="attributevalue">12:20:01</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p2</span>" <span class="attribute">interval</span>="<span class="attributevalue">4.5</span>"<br />  <span class="attribute">since</span>="<span class="attributevalue">#TS-p1</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p6</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p3</span>" <span class="attribute">interval</span>="<span class="attributevalue">1.5</span>"<br />  <span class="attribute">since</span>="<span class="attributevalue">#TS-p6</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">TS-u1</span>"&gt;</span><br /> <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-u1start</span>"/&gt;</span><br />   This is my <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-x1</span>"/&gt;</span> turn<br /> <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-u1end</span>"/&gt;</span><span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;linkGrp <span class="attribute">type</span>="<span class="attributevalue">synchronous</span>"&gt;</span><br /> <span class="element">&lt;link <span class="attribute">target</span>="<span class="attributevalue">#TS-u1start #TS-p1</span>"/&gt;</span><br /> <span class="element">&lt;link <span class="attribute">target</span>="<span class="attributevalue">#TS-u1end #TS-p2</span>"/&gt;</span><br /> <span class="element">&lt;link <span class="attribute">target</span>="<span class="attributevalue">#TS-x1 #TS-p6</span>"/&gt;</span><br /><span class="element">&lt;/linkGrp&gt;</span></div></div><div class="p">As a further example of the three possibilities, consider the following dialogue, represented first as it might appear in a conventional playscript: <div id="index-egXML-d52e70961" class="pre cdata egXML_valid">Tom: I used to smoke - -
Bob: (interrupting) You used to smoke?
Tom: (at the same time) a lot more than this.  But I never
     inhaled the smoke<div style="float: right;"><a href="BIB.html#TSSASE-eg-20">bibliography</a> </div></div>  A commonly used convention might be to transcribe such a passage as follows: <div id="index-egXML-d52e70966" class="pre cdata egXML_valid"> (1) I used to smoke [ a lot more than this ]
(2)                 [ you used to smoke ]
(1) but I never inhaled the smoke</div> Such conventions have the drawback that they are hard to generalize or to extend beyond the very simple case presented here. Their reliance on the accidentals of physical layout may also make them difficult to transport and to process computationally. These Guidelines recommend the following mechanisms to encode this.</div><div class="p">Where the whole of one or another utterance is to be synchronized, the <span class="att">start</span> and <span class="att">end</span> attributes may be used: <div id="index-egXML-d52e70977" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#tom</span>"&gt;</span>I used to smoke <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p10</span>"/&gt;</span> a lot more than this<br /> <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-p20</span>"/&gt;</span>but I never inhaled the smoke<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">start</span>="<span class="attributevalue">#TS-p10</span>" <span class="attribute">end</span>="<span class="attributevalue">#TS-p20</span>" <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span>You used to smoke<span class="element">&lt;/u&gt;</span></div> Note that the second utterance above could equally well be encoded as follows with exactly the same effect: <div id="index-egXML-d52e70987" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span><br /> <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-p10</span>"/&gt;</span>You used to smoke<span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-p20</span>"/&gt;</span><span class="element">&lt;/u&gt;</span></div></div><div class="p">If synchronization with specific timing information is required, a <a class="gi" title="provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text." href="ref-timeline.html">timeline</a> must be included: <div id="index-egXML-d52e70997" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#TS-t01</span>" <span class="attribute">unit</span>="<span class="attributevalue">s</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-t01</span>" <span class="attribute">absolute</span>="<span class="attributevalue">15:33:01Z</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">TS-t02</span>" <span class="attribute">interval</span>="<span class="attributevalue">2.5</span>"<br />  <span class="attribute">since</span>="<span class="attributevalue">#TS-t01</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#tom</span>"&gt;</span>I used to smoke<br /> <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-t01</span>"/&gt;</span>a lot more than this<br /> <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-t02</span>"/&gt;</span>but I never inhaled the smoke<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span><br /> <span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-t01</span>"/&gt;</span>You used to smoke<span class="element">&lt;anchor <span class="attribute">synch</span>="<span class="attributevalue">#TS-t02</span>"/&gt;</span><span class="element">&lt;/u&gt;</span></div> (Note that If only the ordering or sequencing of utterances is needed, then specific timing information shown here in <span class="att">unit</span>, <span class="att">absolute</span> and <span class="att">interval</span> does not need to be provided.)</div><div class="p">As above, since the whole of Bob's utterance is to be aligned, the <span class="att">start</span> and <span class="att">end</span> attributes may be used as an alternative to the second pair of <a class="gi" title="(anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element." href="ref-anchor.html">anchor</a> elements: <div id="index-egXML-d52e71032" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">start</span>="<span class="attributevalue">#TS-t01</span>" <span class="attribute">end</span>="<span class="attributevalue">#TS-t02</span>" <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span>You used to smoke<span class="element">&lt;/u&gt;</span></div></div><div class="p">An alternative approach is to mark the synchronization by pointing from the <a class="gi" title="provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text." href="ref-timeline.html">timeline</a> to the text: <div id="index-egXML-d52e71040" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#TS-T01</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#TS-nm1 #bob-u2</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">TS-T01</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#TS-nm2 #bob-u2</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">TS-T02</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#tom</span>"&gt;</span>I used to smoke<br /> <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-nm1</span>"/&gt;</span>a lot more than this<br /> <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TS-nm2</span>"/&gt;</span>but I never inhaled the smoke<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">bob-u2</span>" <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span>You used to smoke<span class="element">&lt;/u&gt;</span></div> To avoid deciding whether to point from the timeline to the text or vice versa, a <a class="gi" title="(link group) defines a collection of associations or hypertextual links." href="ref-linkGrp.html">linkGrp</a> may be used: <div id="index-egXML-d52e71056" class="pre egXML_valid"><span class="element">&lt;body&gt;</span><br /> <span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#T001</span>"&gt;</span><br />  <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">T001</span>"/&gt;</span><br />  <span class="element">&lt;when <span class="attribute">xml:id</span>="<span class="attributevalue">T002</span>"/&gt;</span><br /> <span class="element">&lt;/timeline&gt;</span><br /> <span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#tom</span>"&gt;</span>I used to smoke<br />  <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">NM01</span>"/&gt;</span>a lot more than this<br />  <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">NM02</span>"/&gt;</span>but I never inhaled the smoke<span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">xml:id</span>="<span class="attributevalue">bob-U2</span>" <span class="attribute">who</span>="<span class="attributevalue">#bob</span>"&gt;</span>You used to smoke<span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;linkGrp <span class="attribute">type</span>="<span class="attributevalue">synchronize</span>"&gt;</span><br />  <span class="element">&lt;link <span class="attribute">target</span>="<span class="attributevalue">#T001 #NM01 #bob-U2</span>"/&gt;</span><br />  <span class="element">&lt;link <span class="attribute">target</span>="<span class="attributevalue">#T002 #NM02 #bob-U2</span>"/&gt;</span><br /> <span class="element">&lt;/linkGrp&gt;</span><br /><span class="element">&lt;/body&gt;</span></div></div><p>Note that in each case, although Bob's utterance follows Tom's sequentially in the text, it is aligned temporally with its middle, without any need to disrupt the normal syntax of the text.</p><div class="p">As a final example, consider the following exchange, first as it might be represented using a musical-score-like notation, in which points of synchronization are represented by vertical alignment of the text: <div id="index-egXML-d52e71076" class="pre cdata egXML_valid"> Stig: This is |my  |turn
Jane:         |Balderdash
Lou :         |No, |it's mine</div> All three speakers are simultaneous at the words <span class="mentioned">my</span>, <span class="mentioned">Balderdash</span>, and <span class="mentioned">No</span>; speakers Stig and Lou are simultaneous at the words <span class="mentioned">turn</span> and <span class="mentioned">it's</span>. This could be encoded as follows, using pointers from the alignment map into the text: <div id="index-egXML-d52e71095" class="pre egXML_valid"><span class="element">&lt;timeline <span class="attribute">origin</span>="<span class="attributevalue">#TSp1</span>"&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#TSa1 #TSb1 #TSc1</span>"<br />  <span class="attribute">xml:id</span>="<span class="attributevalue">TSp1</span>"/&gt;</span><br /> <span class="element">&lt;when <span class="attribute">synch</span>="<span class="attributevalue">#TSa2 #TSc2</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">TSp2</span>"/&gt;</span><br /><span class="element">&lt;/timeline&gt;</span><br /><span class="comment">&lt;!-- ... --&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#stig</span>"&gt;</span>this is <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TSa1</span>"/&gt;</span> my <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TSa2</span>"/&gt;</span> turn<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#jane</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">TSb1</span>"&gt;</span>balderdash<span class="element">&lt;/u&gt;</span><br /><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#lou</span>" <span class="attribute">xml:id</span>="<span class="attributevalue">TSc1</span>"&gt;</span> no <span class="element">&lt;anchor <span class="attribute">xml:id</span>="<span class="attributevalue">TSc2</span>"/&gt;</span> it's mine<span class="element">&lt;/u&gt;</span></div></div></div><div class="div3" id="TSREG"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSSAPA"><span class="headingNumber">8.4.2 </span>Synchronization and Overlap</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSTPPR"><span class="headingNumber">8.4.4 </span>Prosody</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSREG" title="link to this section "><span class="invisible">TEI: Regularization of Word Forms</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.3 </span><span class="head">Regularization of Word Forms</span></h4><p>When speech is transcribed using ordinary orthographic notation, as is customary, some compromise must be made between the sounds produced and conventional orthography. Particularly when dealing with informal, dialectal, or other varieties of language, the transcriber will frequently have to decide whether a particular sound is to be treated as a distinct vocabulary item or not. For example, while in a given project <span class="mentioned">kinda</span> may not be worth distinguishing as a vocabulary item from <span class="mentioned">kind of</span>, <span class="mentioned">isn't</span> may clearly be worth distinguishing from <span class="mentioned">is not</span>; for some purposes, the regional variant <span class="mentioned">isnae</span> might also be worth distinguishing in the same way.</p><p>One rule of thumb might be to allow such variation only where a generally accepted orthographic form exists, for example, in published dictionaries of the language register being encoded; this has the disadvantage that such dictionaries may not exist. Another is to maintain a controlled (but extensible) set of normalized forms for all such words; this has the advantage of enforcing some degree of consistency among different transcribers. Occasionally, as for example when transcribing abbreviations or acronyms, it may be felt necessary to depart from conventional spelling to distinguish between cases where the abbreviation is spelled out letter by letter (e.g. <span class="mentioned">B B C</span> or <span class="mentioned">V A T</span>) and where it is pronounced as a single word (<span class="mentioned">VAT</span> or <span class="mentioned">RADA</span>). Similar considerations might apply to pronunciation of foreign words (e.g. <span class="mentioned">Monsewer</span> vs. <span class="mentioned">Monsieur</span>).</p><p>In general, use of punctuation, capitalization, etc., in spoken transcripts should be carefully controlled. It is important to distinguish the transcriber's intuition as to what the punctuation should be from the marking of prosodic features such as pausing, intonation, etc.</p><p>Whatever practice is adopted, it is essential that it be clearly and fully documented in the editorial declarations section of the header. It may also be found helpful to include normalized forms of non-conventional spellings within the text, using the elements for simple editorial changes described in section <a class="link_ptr" href="CO.html#COED" title="Simple Editorial Changes"><span class="headingNumber">3.4 </span>Simple Editorial Changes</a> (see further section <a class="link_ptr" href="TS.html#TSTPSM" title="Speech Management"><span class="headingNumber">8.4.5 </span>Speech Management</a>).</p></div><div class="div3" id="TSTPPR"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSREG"><span class="headingNumber">8.4.3 </span>Regularization of Word Forms</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSTPSM"><span class="headingNumber">8.4.5 </span>Speech Management</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSTPPR" title="link to this section "><span class="invisible">TEI: Prosody</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.4 </span><span class="head">Prosody</span></h4><p>In the absence of conventional punctuation, the marking of prosodic features assumes paramount importance, since these structure and organize the spoken message. Indeed, such prosodic features as points of primary or secondary stress may be represented by specialized punctuation marks, or other characters such as those provided by the Unicode Spacing Modifier Letters block. Pauses have already been dealt with in section <a class="link_ptr" href="TS.html#TSBAPA" title="Pausing"><span class="headingNumber">8.3.2 </span>Pausing</a>; while tone units (or intonational phrases) can be indicated by the segmentation tag discussed in section <a class="link_ptr" href="TS.html#TSSASE" title="Segmentation"><span class="headingNumber">8.4.1 </span>Segmentation</a>. The <a class="gi" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> element discussed in section <a class="link_ptr" href="TS.html#TSSASH" title="Shifts"><span class="headingNumber">8.3.6 </span>Shifts</a> may also be used to encode some prosodic features, for example where all that is required is the ability to record shifts in voice quality.</p><p>In a more detailed phonological transcript, it is common practice to include a number of conventional signs to mark prosodic features of the surrounding or (more usually) preceding speech. Such signs may be used to record, for example, particular intonation patterns, truncation, vowel quality (long or short) etc. These signs may be preserved in a transcript either by using conventional punctuation or by marking their presence by <a class="gi" title="(character or glyph) represents a glyph, or a non-standard character." href="ref-g.html">g</a> elements. Where a transcript includes many phonetic or phonemic aspects, it will generally be more convenient to use the appropriate Unicode characters (see further chapters <a class="link_ptr" href="CH.html" title="4"><span class="headingNumber">vi. </span>Languages and Character Sets</a> and <a class="link_ptr" href="WD.html" title="25"><span class="headingNumber">5 </span>Characters, Glyphs, and Writing Modes</a>). For representation of phonemic information, the use of the International Phonetic Alphabet, which can be represented in Unicode characters, is recommended.</p><div class="p">In the following example, special characters have been defined as follows within the <a class="gi" title="(encoding description) documents the relationship between an electronic text and the source or sources from which it was derived." href="ref-encodingDesc.html">encodingDesc</a> of the TEI header <div id="index-egXML-d52e71189" class="pre egXML_valid"><span class="element">&lt;charDecl&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">lf</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>low fall intonation<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">lr</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>low rise intonation<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">fr</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>fall rise intonation<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">rf</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>rise fall intonation<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">long</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>lengthened syllable<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /> <span class="element">&lt;char <span class="attribute">xml:id</span>="<span class="attributevalue">short</span>"&gt;</span><br />  <span class="element">&lt;desc&gt;</span>shortened syllable<span class="element">&lt;/desc&gt;</span><br /> <span class="element">&lt;/char&gt;</span><br /><span class="element">&lt;/charDecl&gt;</span></div> These declarations might additionally provide information about how the characters concerned should be rendered, their equivalent IPA form, etc. In the transcript itself references to them can then be included as follows: <div id="index-egXML-d52e71210" class="pre egXML_valid"><span class="element">&lt;div <span class="attribute">n</span>="<span class="attributevalue">Lod E-03</span>" <span class="attribute">type</span>="<span class="attributevalue">exchange</span>"&gt;</span><br /> <span class="element">&lt;note&gt;</span>C is with a friend<span class="element">&lt;/note&gt;</span><br /> <span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#cwn</span>"&gt;</span><br />  <span class="element">&lt;unclear&gt;</span>Excuse me<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><span class="element">&lt;/unclear&gt;</span><br />  <span class="element">&lt;pause/&gt;</span> You dont have some<br />     aesthetic<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#short</span>"/&gt;</span><br />  <span class="element">&lt;pause/&gt;</span><br />  <span class="element">&lt;unclear&gt;</span>specially on early<span class="element">&lt;/unclear&gt;</span><br />     aesthetics terminology <span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lr</span>"/&gt;</span><span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#aj</span>"&gt;</span> No<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><br />  <span class="element">&lt;pause/&gt;</span>No<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><br />  <span class="element">&lt;gap <span class="attribute">extent</span>="<span class="attributevalue">2 beats</span>"/&gt;</span> I'm<br />     afraid<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">trans</span>="<span class="attributevalue">latching</span>" <span class="attribute">who</span>="<span class="attributevalue">#cwn</span>"&gt;</span> No<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lr</span>"/&gt;</span><br />  <span class="element">&lt;unclear&gt;</span>Well<span class="element">&lt;/unclear&gt;</span> thanks<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lr</span>"/&gt;</span><br />  <span class="element">&lt;pause/&gt;</span> Oh<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#short</span>"/&gt;</span><br />  <span class="element">&lt;unclear&gt;</span>you couldnt<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#short</span>"/&gt;</span> can we<span class="element">&lt;/unclear&gt;</span> kind of<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#long</span>"/&gt;</span><br />  <span class="element">&lt;pause/&gt;</span>I mean ask you to order it for us<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#long</span>"/&gt;</span><br />  <span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#fr</span>"/&gt;</span><span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">trans</span>="<span class="attributevalue">latching</span>" <span class="attribute">who</span>="<span class="attributevalue">#aj</span>"&gt;</span> Yes<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#fr</span>"/&gt;</span> if you know the title<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span> Yeah<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#cwn</span>"&gt;</span><br />  <span class="element">&lt;gap <span class="attribute">extent</span>="<span class="attributevalue">4 beats</span>"/&gt;</span><br /> <span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#aj</span>"&gt;</span> Yes thats fine. <span class="element">&lt;unclear&gt;</span>just as soon as it comes in we'll send<br />       you a postcard<span class="element">&lt;g <span class="attribute">ref</span>="<span class="attributevalue">#lf</span>"/&gt;</span><span class="element">&lt;/unclear&gt;</span><span class="element">&lt;/u&gt;</span><br /> <span class="element">&lt;listPerson&gt;</span><br />  <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">cwn</span>"&gt;</span><br />   <span class="element">&lt;p&gt;</span>Customer WN<span class="element">&lt;/p&gt;</span><br />  <span class="element">&lt;/person&gt;</span><br />  <span class="element">&lt;person <span class="attribute">xml:id</span>="<span class="attributevalue">aj</span>"&gt;</span><br />   <span class="element">&lt;p&gt;</span>Assistant K<span class="element">&lt;/p&gt;</span><br />  <span class="element">&lt;/person&gt;</span><br /> <span class="element">&lt;/listPerson&gt;</span><br /><span class="element">&lt;/div&gt;</span><div style="float: right;"><a href="BIB.html#TSTPPR-eg-58">bibliography</a> </div></div>  </div><p>This example, which is taken from a corpus of bookshop service encounters,  also demonstrates the use of the <a class="gi" title="contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source." href="ref-unclear.html">unclear</a> and <a class="gi" title="indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible." href="ref-gap.html">gap</a> elements discussed in section <a class="link_ptr" href="CO.html#COED" title="Simple Editorial Changes"><span class="headingNumber">3.4 </span>Simple Editorial Changes</a>. Where words are so unclear that only their extent can be recorded, the empty <a class="gi" title="indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible." href="ref-gap.html">gap</a> element may be used; where the encoder can identify the words but wishes to record a degree of uncertainty about their accuracy, the <a class="gi" title="contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source." href="ref-unclear.html">unclear</a> element may be used. More flexible and detailed methods of indicating uncertainty are discussed in chapter <a class="link_ptr" href="CE.html" title="17"><span class="headingNumber">21 </span>Certainty, Precision, and Responsibility</a>.</p><p>For more detailed work, involving a detailed phonological transcript including representation of stress and pitch patterns, it is probably best to maintain the prosodic description in parallel with the conventional written transcript, rather than attempt to embed detailed prosodic information within it. The two parallel streams may be aligned with each other and with other streams, for example an acoustic encoding, using the general alignment mechanisms discussed in section <a class="link_ptr" href="TS.html#TSSASH" title="Shifts"><span class="headingNumber">8.3.6 </span>Shifts</a>.</p></div><div class="div3" id="TSTPSM"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSTPPR"><span class="headingNumber">8.4.4 </span>Prosody</a></li><li class="subtoc"><span class="nextLink"> » </span><a class="navigation" href="TS.html#TSTPAC"><span class="headingNumber">8.4.6 </span>Analytic Coding</a></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSTPSM" title="link to this section "><span class="invisible">TEI: Speech Management</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.5 </span><span class="head">Speech Management</span></h4><p>Phenomena of <span class="term">speech management</span> include disfluencies such as filled and unfilled pauses, interrupted or repeated words, corrections, and reformulations as well as interactional devices asking for or providing feedback. Depending on the importance attached to such features, transcribers may choose to adopt conventionalized representations for them (as discussed in section <a class="link_ptr" href="TS.html#TSREG" title="Regularization of Word Forms"><span class="headingNumber">8.4.3 </span>Regularization of Word Forms</a> above), or to transcribe them using IPA or some other transcription system. To simplify analysis of the lexical features of a speech transcript, it may be felt useful to ‘tidy away’ many of these disfluencies. Where this policy has been adopted, these Guidelines recommend the use of the tags for simple editorial intervention discussed in section <a class="link_ptr" href="CO.html#COED" title="Simple Editorial Changes"><span class="headingNumber">3.4 </span>Simple Editorial Changes</a>, to make explicit the extent of regularization or normalization performed by the transcriber.</p><div class="p">For example, false starts, repetition, and truncated words might all be included within a transcript, but marked as editorially deleted, in the following way: <div id="index-egXML-d52e71323" class="pre egXML_valid"><span class="element">&lt;u&gt;</span><br /> <span class="element">&lt;del <span class="attribute">type</span>="<span class="attributevalue">truncation</span>"&gt;</span>s<span class="element">&lt;/del&gt;</span>see<br /> <span class="element">&lt;del <span class="attribute">type</span>="<span class="attributevalue">repetition</span>"&gt;</span>you you<span class="element">&lt;/del&gt;</span> you know<br /> <span class="element">&lt;del <span class="attribute">type</span>="<span class="attributevalue">falseStart</span>"&gt;</span>it's<span class="element">&lt;/del&gt;</span> he's crazy<span class="element">&lt;/u&gt;</span></div></div><div class="p">As previously noted, the <a class="gi" title="indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible." href="ref-gap.html">gap</a> element may be used to mark points within a transcript where words have been omitted, for example because they are inaudible, as in the following example in which 5 seconds of speech is drowned out by an external event: <div id="index-egXML-d52e71339" class="pre egXML_valid"><span class="element">&lt;gap <span class="attribute">reason</span>="<span class="attributevalue">passing-truck</span>" <span class="attribute">quantity</span>="<span class="attributevalue">5</span>"<br /> <span class="attribute">unit</span>="<span class="attributevalue">s</span>"/&gt;</span></div></div><div class="p">The <a class="gi" title="contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source." href="ref-unclear.html">unclear</a> element may be used to mark words which have been included although the transcriber is unsure of their accuracy: <div id="index-egXML-d52e71346" class="pre egXML_valid"><span class="element">&lt;u&gt;</span>...and then <span class="element">&lt;unclear <span class="attribute">reason</span>="<span class="attributevalue">passing-truck</span>"&gt;</span>marbled queen<span class="element">&lt;/unclear&gt;</span><span class="element">&lt;/u&gt;</span></div></div><div class="p">Where a transcriber is believed to have incorrectly identified a word, the elements <a class="gi" title="(correction) contains the correct form of a passage apparently erroneous in the copy text." href="ref-corr.html">corr</a> or <a class="gi" title="(Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate." href="ref-sic.html">sic</a> embedded within a <a class="gi" title="groups a number of alternative encodings for the same point in a text." href="ref-choice.html">choice</a> element may be used to indicate both the original and a corrected form of it: <div id="index-egXML-d52e71362" class="pre egXML_valid"><span class="element">&lt;choice&gt;</span><br /> <span class="element">&lt;corr&gt;</span>SCSI<span class="element">&lt;/corr&gt;</span><br /> <span class="element">&lt;sic&gt;</span>skuzzy<span class="element">&lt;/sic&gt;</span><br /><span class="element">&lt;/choice&gt;</span></div> These elements are further discussed in section <a class="link_ptr" href="CO.html#COEDCOR" title="Apparent Errors"><span class="headingNumber">3.4.1 </span>Apparent Errors</a>.</div><div class="p">Finally phenomena such as <span class="term">code-switching</span>, where a speaker switches from one language to another, may easily be represented in a transcript by using the <a class="gi" title="identifies a word or phrase as belonging to some language other than that of the surrounding text." href="ref-foreign.html">foreign</a> element provided by the core tagset: <div id="index-egXML-d52e71379" class="pre egXML_valid"><span class="element">&lt;u <span class="attribute">who</span>="<span class="attributevalue">#P1</span>"&gt;</span>I proposed that <span class="element">&lt;foreign <span class="attribute">xml:lang</span>="<span class="attributevalue">de</span>"&gt;</span> wir können<br />  <span class="element">&lt;pause <span class="attribute">dur</span>="<span class="attributevalue">PT1S</span>"/&gt;</span> vielleicht <span class="element">&lt;/foreign&gt;</span> go to warsaw<br />   and <span class="element">&lt;emph&gt;</span>vienna<span class="element">&lt;/emph&gt;</span><span class="element">&lt;/u&gt;</span></div> </div></div><div class="div3" id="TSTPAC"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSTPSM"><span class="headingNumber">8.4.5 </span>Speech Management</a></li><li class="subtoc"></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h4><span class="bookmarklink"><a class="bookmarklink" href="#TSTPAC" title="link to this section "><span class="invisible">TEI: Analytic Coding</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.4.6 </span><span class="head">Analytic Coding</span></h4><p>The recommendations made here only concern the establishment of a basic text. Where a more sophisticated analysis is needed, more sophisticated methods of markup will also be appropriate, for example, using stand-off markup to indicate multiple segmentation of the stream of discourse, or complex alignment of several segments within it. Where additional annotations (sometimes called ‘codes’ or ‘tags’) are used to represent such features as linguistic word class (noun, verb, etc.), type of speech act (imperative, concessive, etc.), or information status (theme/rheme, given/new, active/semi-active/new), etc., a selection from the general purpose analytic tools discussed in chapters <a class="link_ptr" href="SA.html" title="14"><span class="headingNumber">16 </span>Linking, Segmentation, and Alignment</a>, <a class="link_ptr" href="AI.html" title="15"><span class="headingNumber">17 </span>Simple Analytic Mechanisms</a>, and <a class="link_ptr" href="FS.html" title="16"><span class="headingNumber">18 </span>Feature Structures</a> may be used to advantage.</p><p>The general-purpose <a class="gi" title="groups together various annotations, e.g. for parallel interpretations of a spoken segment." href="ref-annotationBlock.html">annotationBlock</a> element may be used to group together a transcription and multiple layers of annotation. It also serves to divide a transcribed text up into meaningful analytic sections. </p><ul class="specList"><li><span class="specList-elementSpec"><a href="ref-annotationBlock.html">annotationBlock</a></span> groups together various annotations, e.g. for parallel interpretations of a spoken segment.</li></ul></div></div><div class="teidiv1" id="index-body.1_div.8_div.5"><div class="miniTOC miniTOC_right"><ul class="subtoc"><li class="subtoc"><span class="previousLink"> « </span><a class="navigation" href="TS.html#TSSA"><span class="headingNumber">8.4 </span>Elements Defined Elsewhere</a></li><li class="subtoc"></li><li class="subtoc"><a class="navigation" href="index.html">Home</a></li></ul></div><h3><span class="bookmarklink"><a class="bookmarklink" href="#index-body.1_div.8_div.5" title="link to this section "><span class="invisible">TEI: Module for Transcribed Speech</span><span class="pilcrow">¶</span></a></span><span class="headingNumber">8.5 </span><span class="head">Module for Transcribed Speech</span></h3><p>The module described in this chapter makes available the following components: </p><dl class="moduleSpec"><dt class="moduleSpecHead"><span lang="en">Module</span> spoken: Transcribed Speech</dt><dd><ul><li><span lang="en">Elements defined</span>: <a class="link_odd" title="groups together various annotations, e.g. for parallel interpretations of a spoken segment." href="ref-annotationBlock.html">annotationBlock</a> <a class="link_odd" title="describes a broadcast used as the source of a spoken text." href="ref-broadcast.html">broadcast</a> <a class="link_odd" title="provides technical details of the equipment and media used for an audio or video recording used as the source for a spoken text." href="ref-equipment.html">equipment</a> <a class="link_odd" title="marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication." href="ref-incident.html">incident</a> <a class="link_odd" title="marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc." href="ref-kinesic.html">kinesic</a> <a class="link_odd" title="marks a pause either between or within utterances." href="ref-pause.html">pause</a> <a class="link_odd" title="(recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast." href="ref-recording.html">recording</a> <a class="link_odd" title="(recording statement) describes a set of recordings used as the basis for transcription of a spoken text." href="ref-recordingStmt.html">recordingStmt</a> <a class="link_odd" title="(script statement) contains a citation giving details of the script used for a spoken text." href="ref-scriptStmt.html">scriptStmt</a> <a class="link_odd" title="marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes." href="ref-shift.html">shift</a> <a class="link_odd" title="describes the set of transcription conventions used, particularly for spoken material." href="ref-transcriptionDesc.html">transcriptionDesc</a> <a class="link_odd" title="(utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker." href="ref-u.html">u</a> <a class="link_odd" title="marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc." href="ref-vocal.html">vocal</a> <a class="link_odd" title="contains a passage of written text revealed to participants in the course of a spoken text." href="ref-writing.html">writing</a></li><li><span lang="en">Classes defined</span>: <a class="link_odd" title="provides attributes for normalization of elements that contain datable events." href="ref-att.duration.html">att.duration</a> <a class="link_odd" title="groups elements structurally analogous to paragraphs within spoken texts." href="ref-model.divPart.spoken.html">model.divPart.spoken</a> <a class="link_odd" title="groups elements which may appear globally within spoken texts." href="ref-model.global.spoken.html">model.global.spoken</a> <a class="link_odd" title="groups elements used to describe details of an audio or video recording." href="ref-model.recordingPart.html">model.recordingPart</a></li></ul></dd></dl><p> The selection and combination of modules to form a TEI schema is described in <a class="link_ptr" href="ST.html#STIN" title="Defining a TEI Schema"><span class="headingNumber">1.2 </span>Defining a TEI Schema</a>.</p></div></div><nav class="left"><span class="upLink"> ↑ </span><a class="navigation" href="index.html">TEI P5 Guidelines</a><span class="previousLink"> « </span><a class="navigation" href="DR.html"><span class="headingNumber">7 </span>Performance Texts</a><span class="nextLink"> » </span><a class="navigation" href="DI.html"><span class="headingNumber">9 </span>Dictionaries</a></nav><!--Notes in [div]--><div class="notes"><div class="noteHeading">Notes</div><div class="note" id="Note69"><span class="noteLabel">32 </span><div class="noteBody">For a discussion of several of these see <a class="citlink" href="BIB.html#TS-BIBL-1">Edwards and Lampert (eds.) (1993)</a>; <a class="citlink" href="BIB.html#TS-BIBL-2">Johansson (1994)</a>; and <a class="citlink" href="BIB.html#TS-BIBL-3">Johansson et al. (1991)</a>.</div> <a class="link_return" title="Go back to text" href="#Note69_return">↵</a></div><div class="note" id="Note70"><span class="noteLabel">33 </span><div class="noteBody">The original is a conversation between two children and their parents, recorded in 1987, and discussed in <a class="citlink" href="BIB.html#TS-BIBL-4">MacWhinney (1988)</a></div> <a class="link_return" title="Go back to text" href="#Note70_return">↵</a></div><div class="note" id="Note71"><span class="noteLabel">34 </span><div class="noteBody">For the most part, the examples in this chapter use no sentence punctuation except to mark the rising intonation often found in interrogative statements; for further discussion, see section <a class="link_ptr" href="TS.html#TSREG" title="Regularization of Word Forms"><span class="headingNumber">8.4.3 </span>Regularization of Word Forms</a>.</div> <a class="link_return" title="Go back to text" href="#Note71_return">↵</a></div><div class="note" id="Note72"><span class="noteLabel">35 </span><div class="noteBody">The term was apparently first proposed by <a class="citlink" href="BIB.html#TS-BIBL-7">Loman and Jørgensen (1971)</a>, where it is defined as follows: <span class="q">‘A text can be analysed as a sequence of segments which are internally connected by a network of syntactic relations and externally delimited by the absence of such relations with respect to neighbouring segments. Such a segment is a syntactic unit called a macrosyntagm’</span> (trans. S. Johansson).</div> <a class="link_return" title="Go back to text" href="#Note72_return">↵</a></div></div><div class="stdfooter autogenerated"><p>
    [<a href="../../en/html/TS.html">English</a>]
    [<a href="../../de/html/TS.html">Deutsch</a>]
    [<a href="../../es/html/TS.html">Español</a>]
    [<a href="../../it/html/TS.html">Italiano</a>]
    [<a href="../../fr/html/TS.html">Français</a>]
    [<a href="../../ja/html/TS.html">日本語</a>]
    [<a href="../../ko/html/TS.html">한국어</a>]
    [<a href="../../zh-TW/html/TS.html">中文</a>]
    </p><hr /><div class="footer"><a class="plain" href="http://www.tei-c.org/About/">TEI Consortium</a> | <a class="plain" href="http://www.tei-c.org/About/contact.xml">Feedback</a></div><hr /><address><br />TEI Guidelines <a class="link_ref" href="AB.html#ABTEI4">Version</a> <a class="link_ref" href="../../readme-3.1.1.html">3.1.1a</a>. Last updated on <span class="date">10th May 2017</span>, revision <a class="link_ref" href="https://github.com/TEIC/TEI/commit/bd8dda3">bd8dda3</a>. This page generated on 2017-05-12T12:30:09Z.</address></div></div></body></html>