Web archiving

Titel: Web archiving : with 6 tables / Julien Masanès
Beteiligt:
Veröffentlicht: Berlin : Springer, 2006
Umfang: VII, 234 Seiten : Illustrationen, Diagramme ; 24 cm
Format: Buch
Sprache: Englisch
RVK-Notation:
Schlagworte:
ISBN: 9783540233381 ; 3540233385
  • 1
  • Web Archiving: Issues and Methods
  • p. 1
  • 1.1
  • Introduction
  • p. 1
  • 1.2
  • Heritage, Society, and the Web
  • p. 2
  • 1.3
  • Web Characterization in Relation to Preservation
  • p. 11
  • 1.4
  • New Methods for a New Medium
  • p. 18
  • 1.5
  • Current Initiatives Overview
  • p. 40
  • 1.6
  • Conclusion
  • p. 46
  • References
  • p. 46
  • 2
  • Web Use and Web Studies
  • p. 55
  • 2.1
  • Summary
  • p. 55
  • 2.2
  • Content Analysis
  • p. 56
  • 2.3
  • Surveys
  • p. 58
  • 2.4
  • Rhetorical Analysis
  • p. 59
  • 2.5
  • Discourse Analysis
  • p. 60
  • 2.6
  • Visual Analysis
  • p. 61
  • 2.7
  • Ethnography
  • p. 63
  • 2.8
  • Network Analysis
  • p. 64
  • 2.9
  • Ethical Considerations
  • p. 65
  • 2.10
  • Conclusion
  • p. 66
  • References
  • p. 67
  • 3
  • Selection for Web Archives
  • p. 71
  • 3.1
  • Introduction
  • p. 71
  • 3.2
  • Defining a Selection Policy
  • p. 72
  • 3.3
  • Issues and Concepts
  • p. 76
  • 3.4
  • Selection Process
  • p. 82
  • 3.5
  • Documentation
  • p. 89
  • 3.6
  • Conclusion
  • p. 89
  • References
  • p. 90
  • 4
  • Copying Websites
  • p. 93
  • 4.1
  • Introduction - The Art of Copying Websites
  • p. 93
  • 4.2
  • The Parser
  • p. 95
  • 4.3
  • Fetching Document
  • p. 102
  • 4.4
  • Create an Autonomous, Navigable Copy
  • p. 107
  • 4.5
  • Handling Updates
  • p. 109
  • 4.6
  • Conclusion
  • p. 112
  • Reference
  • p. 112
  • 5
  • Archiving the Hidden Web
  • p. 115
  • 5.1
  • Introduction
  • p. 115
  • 5.2
  • Finding At Least One Path to Documents
  • p. 116
  • 5.3
  • Characterizing the Hidden Web
  • p. 119
  • 5.4
  • Client Side Hidden Web Archiving
  • p. 121
  • 5.5
  • Crawler-Server Collaboration
  • p. 123
  • 5.6
  • Archiving Documentary Gateways
  • p. 125
  • 5.7
  • Conclusion
  • p. 127
  • References
  • p. 128
  • 6
  • Access and Finding Aids
  • p. 131
  • 6.1
  • Introduction
  • p. 131
  • 6.2
  • Registration
  • p. 133
  • 6.3
  • Indexing and Search Engines
  • p. 135
  • 6.4
  • Access Tools and User Interface
  • p. 137
  • 6.5
  • Case Studies
  • p. 146
  • 6.6
  • Acknowledgements
  • p. 151
  • References
  • p. 151
  • 7
  • Mining Web Collections
  • p. 153
  • 7.1
  • Introduction
  • p. 153
  • 7.2
  • Material for Web Archives
  • p. 155
  • 7.3
  • Other Types of Information
  • p. 160
  • 7.4
  • Use Cases
  • p. 161
  • 7.5
  • Conclusion
  • p. 172
  • References
  • p. 174
  • 8
  • The Long-Term Preservation of Web Content
  • p. 177
  • 8.1
  • Introduction
  • p. 177
  • 8.2
  • The Challenge of Long-Term Digital Preservation
  • p. 178
  • 8.3
  • Developing Trusted Digital Repositories
  • p. 181
  • 8.4
  • Digital Preservation Strategies
  • p. 184
  • 8.5
  • Preservation Metadata
  • p. 189
  • 8.6
  • Digital Preservation and the Web
  • p. 193
  • 8.7
  • Conclusion
  • p. 194
  • 8.8
  • Acknowledgements
  • p. 194
  • References
  • p. 194
  • 9
  • Year-by-Year: From an Archive of the Internet to an Archive on the Internet
  • p. 201
  • 9.1
  • Introduction
  • p. 201
  • 9.2
  • Background: Early Internet Publishing
  • p. 202
  • 9.3
  • 1996: Launch of the Internet Archive
  • p. 202
  • 9.4
  • 1997: Link Structure and Tape Robots
  • p. 203
  • 9.5
  • 1998: Getting Archive Data Onto (Almost) Every Desktop
  • p. 204
  • 9.6
  • 1999: From Tape to Disk, A New Crawler, and Moving Images
  • p. 205
  • 9.7
  • 2000: Building Thematic Web Collections
  • p. 206
  • 9.8
  • 2001: Public Access with the Wayback Machine: The 9/11 Archive
  • p. 207
  • 9.9
  • 2002: The Library of Alexandria, The Bookmobile, and Copyrights
  • p. 208
  • 9.10
  • 2003: Extending Our Reach via National Libraries and Educational Institutions
  • p. 210
  • 9.11
  • 2004: And the European Archive and the Petabox
  • p. 211
  • 9.12
  • The Future
  • p. 211
  • References
  • p. 212
  • 10
  • Small Scale Academic Web Archiving: DACHS
  • p. 213
  • 10.1
  • Why Small Scale Academic Archiving?
  • p. 213
  • 10.2
  • Digital Archive for Chinese Studies
  • p. 214
  • 10.3
  • Lessons Learned: Summing Up
  • p. 223
  • 10.4
  • Useful Resources
  • p. 224
  • List of Acronyms
  • p. 227
  • Index
  • p. 229