Web Archiving: What We Have Done and What We Should Do

웹 아카이빙의 성과와 과제

  • 서혜란 (신라대학교 정보관리학부 문헌정보학과)
  • Received : 2004.05.31
  • Accepted : 2004.06.19
  • Published : 2004.06.01

Abstract

The purpose of this study is to review what we have done and to identify what we have to do to be successful with Web archiving which is important to preserve our cultural heritage for the next generation. Some characteristics of Web resources as information sources were identified and some difficulties with Web archiving were discussed. The outcome of national and/or international Web archiving projects including Kurturarw3, PANDORA and Internet Archive were reviewed. Policy issues and technological problems of Web archiving we have to solve were listed.

이 연구의 목적은 도서관들이 웹 아카이빙이라는 새로운 도전에 대응하여 어떻게 해결책을 모색해 왔으며 앞으로 어떤 과제를 해결해 나가야 할 것인지를 정리하는 것이다. 이 논문에서는 웹 정보자원의 특성을 양적 급성장, 심층 웹의 존재, 웹 정보의 신뢰성에 대한 의문과 역동성, 웹 출판의 무정부성으로 규정하고, 도서관 이 왜 웹 아카이빙을 해야 하는가에 대해서 논의하였다. Kurturarw3, PANDORA, Internet Archive를 중심으로 웹 아카이빙 프로젝트의 성과를 검토하였다. 그리고 효과적이고 성공적인 웹 아카이빙을 실현하기 위해서 해결해야 할 정책적 과제와 기술적 과제들을 점검하였다.

Keywords

References

  1. Arvidson, Allan. 2002. The Collection of Swedish Web Pages at the Royal Library: the Web Heritage of Sweden. IFLA Council and General Conference, 68th, Glasgow, August 18-24, 2002. http://www.ifla.org/IV/ifla68/papers/111-163e.pdf
  2. Arvidson, Allan, Krister Persson and Johan Mannerheim. 2001. The Royal Swedish Web Archive: a ‘Complete’ Collection of Web Pages. International Preservation News 26: 10-12.
  3. Bergman, Michael K. 2001. The Deep Web: Surfacing Hidden Value. Journal of Electronic Publishing 7(1) http://www.press.umich.edu/jep/0 7-01/bergman.html
  4. Charlesworth, Andrew. 2003. Legal Issues Relating to the Archiving of Internet Resources in the UK, EU, USA and Australia: a Study Undertaken for the JICS and Wellcome Trust. http://www.jisc.ac.uk/uploaded_documents/archiving_legal.pdf
  5. Day, Michael. 2003. Collecting and Preserving the World Wide Web: a Feasibility Study Undertaken for the JICS and Wellcome Trust. http://www.jisc.ac.uk/uploaded_documents/archiving_feasibility.pdf
  6. Dellavalle, Robert P. et al. 2003. Going,Going, Gone: Lost Internet References. Science 302(5646): 787-788. https://doi.org/10.1126/science.1088234
  7. Hendler, J. 2003. Science and the Semantic Web. Science 299(5606): 520-521. https://doi.org/10.1126/science.1078874
  8. Hirtle, P. B. 2000. Archival Authenticity in a Digital Age. In Authenticity in a Digital Environment. Washington, D.C.: Council on Library and Information Resources, 8-23. http://www.clir.org/pubs/abstract/pub92abst.html
  9. Howell, Allan G. 2003. Preserving Digital Information: Challenges and Solutions: Workbook. http://www.alanhowell.com.au/papers/pdi_wkb.pdf
  10. Kahle, B. 1997. Preserving the Internet.Scientific American 276(3):72-73.
  11. Khale, B. 2002. Editors' Interview: the Internet Archive. RLG DigiNews 6(3) http://www.rlg.org/preserv/diginews/diginews6-3.html#interview
  12. Kuny, Terry. 1998. The Digital Dark Ages?: Challenges in the Preser-vation of Electronic Information. International Preservation News 17: 8-13. http://www.ifla.org/VI/4/news/17-98.htm#2
  13. Lavoie, Brian F. 2004. The Open Archive Information System Reference Model: Introductory Guide. OCLC Online Computer Library Center, Inc. and Digital Preservation Coalition.
  14. Law, Cliff. 2001. PANDORA: the Australian Electronic Heritage in a Box. International Preservation News 26: 13-17.
  15. Lee, Kyung-Ho, et al. 2002. The State of the Art and Practice in Digital Preservation. Journal of Research of the National Institute of Standards and Technology 107(1):93-106. https://doi.org/10.6028/jres.107.010
  16. Lyman, Peter. 2002. Archiving the World Wide Web. In Preserving Our Digital History: Plan for the National Digital Information Infrastructure and Preservation Program. Washington, D.C.: Library of Congress.
  17. Lyman, Peter and Hal R. Varian. 2000. How Much Information? 2000. http://www.sims.berkeley.edu/research/projects/how-much-info/
  18. Lyman, Peter and Hal R. Varian. 2003.How Much Information? 2003. http://www.sims.berkeley.edu/research/projects/how-much-info-20 03/
  19. Mannerheim, Johan. 2001. The New Preservation Tasks of the Library Community. International Preservation News 26: 5-9.
  20. Olsen, Stefanie. 2004. Yahoo crawls deep into the Web. http://zdnet.com.com/2100-1104_2-5167931.html
  21. O'Neill, Edward T., Brian F. Lavoie, Rick Bennett. 2003.Trends in the Evolution of the Public Web: 1998-2002. D-Lib Magazine 9(4)http://www.dlib.org/dlib/april03/lavoie/04lavoie.html
  22. Persson, Krister, Allan Arvidson, Johan Mannerheim. 2000. The Kulturarw3 Project - The Royal Swedish Web Archiw3e. http://www.kb.se/kw3/articles/article000605.pdf
  23. Phillips, Margaret E. 1999. Ensuring Long-Term Access to Online Publications. Journal of Electronic Publishing 4(4) http://www.press.umich.edu/jep/04-04/phillips.html
  24. RLG/OCLC Working Group on Digital Archive Attributes. 2002. Trusted Digital Repositories: Attributes and Responsibilities. Mountain View, Calif.: Research Libraries Group. http://www.rlg.org/longterm/repositories.pdf
  25. Stata, Raymie. 2002. Presentation of the Internet Archive. ECDL Workshop on Web Archiving, 2nd., Rome, Italy, September 19, 2002. http://bibnum.bnf.fr/ecdl/2002/
  26. Warner, Dorothy. 2002. Why Do We Need This in Print? It's on the Web...: a Review of Electronic Archiving Issues and Problems.Progressive Librarian 19/20: 47-64.
  27. Webb. Colin. 2000. Towards a Preserved National Collection of Selected Australian Digital Publications.in Preservation 2000: an International Conference on thePreservation and Long Term Accessibility of Digital Materials, 7/8 December 2000, York, England, Conference Papers. http://www.rlg.org/events/pres-2000/webb.html