Literature on Digital Repository Policy Development

This week, I have been looking into the collections policies and other policy documents of digital repositories, specifically data repositories. The other day, I came across this First Monday (open access!) article, “A balancing act: The ideal and the realistic in developing Dryad’s preservation policy,” which I thought was worth summarizing here.

The authors report on their process for developing the preservation policy for Dryad, a general purpose scientific data repository. A Preservation Working Group consulted peer repositories and selected four which directly informed their process. The working group identified work already taking place and considered what a Preservation Policy should contain in developing their final document. In the article, the authors highlight important lessons learned such as the need to maintain realistic expectations and consider the constraints of the technology currently in place at the repository.

I have found few articles reporting on policy development in this way and thought this was a good example to share. Often in digital curation contexts, policy development is an afterthought or an individual process, rather than a collaborative effort with diverse inputs. While it can seem trite to go through the process of creating policy rather than “doing the work,” it is vitally important for the health of organizations to have meaningful and well-thought-out policies which can inform future practice and help introduce new members into ongoing work. Here’s to publishing articles like this in the future!


Hierarchy of Google Products

With all of the recent talk about Google’s updated logo, I thought I’d share an observation I had this week while using two Google products: image search and Google Scholar. Here’s a screenshot, taken today, of images.google.com:

Google Images screenshot

And here’s a screenshot, taken today, of scholar.google.com:

Google Scholar screenshot

As you can see, the Scholar landing page still displays the old logo. Does this signal a shift in Google’s commitment to their scholarly literature search product? I hope not! Scholar is one of my favorite Google projects and a regular part of my research process. Update the logo and show us that Google Scholar is still thriving!



In which something I co-authored appears on the internet

Last week, an article I co-authored with Ixchel Faniel and my dissertation advisor Beth Yakel was finally published in the Journal of the Association for Information Science and Technology (JASIST).  The article reports on the results of a survey we conducted of 1,480 academic authors who cited ICPSR data in peer-reviewed publications, and is part of the larger DIPIR project which I was a part of for more than two years as a research assistant while in graduate school.

In the paper, we present a literature-based model to represent the relationship between data quality and user satisfaction with data in a reuse context. We tested this model with our survey data, using multiple regression analysis. The results of our survey indicate that data completeness, data accessibility, data ease of operation, data credibility, and documentation quality all correspond significantly with data reuser satisfaction. These findings suggest that repository managers should look to these areas when creating or updating guidelines or policies for data deposit and evaluation.

The paper is live on the JASIST website here. It’s not open access 🙁 but I’m really proud of this work! Email me if you want to talk about it or any of my other work.


Linux in the Wild

One of my first posts on this site was about Linux, and I always love seeing examples of how Open Source software powers so many of the computing devices we interact with every day. On a recent plane trip back from Seattle (where I attended ASIS&T. It was awesome!) I settled into my seat and prepared to watch a movie on the seatback screen when it suddenly went black. Confused, I looked up and noticed that every other screen on the plane had gone dark as well. A few seconds later, much to my surprise, this appeared on screen as the software loaded:

Wild Linux

Can you see Tux in the upper left corner? That’s right: Delta’s seatback screens are powered by Linux! After another period of intense scrolling text as the system rebooted, I was eventually greeted by the clean welcome screen:

Delta Welcome

After this snafu, the system remained on for the rest of my flight, and I was able to watch Parks & Recreation while editing some files. I must have looked like a weirdo when I whipped out my phone to take pictures of the software loading on my seatback screen, but I always love witnessing moments like this. The experience of flying a commercial airline is designed to be sleek and streamlined, so seeing Linux boot on the seatback screen was a refreshing reminder that the polished face of Delta rests on a complex infrastructure.

I’ve also been thinking about the recent dustup between Groupon and the GNOME project. Groupon used the trademarked name of the popular Open Source desktop environment as the name of their new point-of-sale system, filing trademarks that infringed upon those already in place for decades. I was surprised to see Groupon making this move, since I assume that at least some of their developers and infrastructure rely on Linux. After some confusion, it looks like Groupon is pulling back and will change the name of their product. Score one for the OS lobby!

I wonder how many consumers know how important Linux is to so many aspects of our computational lives? How can we increase awareness of this software, and how would knowing more about Linux change conversations in society about the role and place of computing in everyday life?



Archival Articles on Wikipedia

My regular readers will know that I edit Wikipedia from time to time, and that I am a doctoral student in an iSchool who studies archives. I was therefore overjoyed to attend this session at the Society of American Archivists annual meeting this August. The session chairs, Dominic McDevitt-Parks and Sara Snyder, successfully signed up new editors for Wikipedia accounts and introduced them to the basics of editing. As a group, we even made some progress on a few articles relevant to archives. A wiki page documenting the session is located here.

I fully agree with the goals of this session: to increase the quality of Wikipedia articles which relate to archival concepts, archival institutions, and archivists. Since August, I have been looking for opportunities to edit archival articles. Now, this term I am working as a Graduate Student Instructor (also known as a TA at universities not named Michigan) in a course on archival access systems. During a recent lecture, my lead instructor provided an overview of many of the archival software platforms that exist today. Following along, I happened to google Archon and ended up on its Wikipedia page: Archon (software). I was dismayed to see that the article was not up to date and listed the tool as in active development when in fact it has merged with ArchivesSpace and is no longer maintained. I made a mental note to follow up and edit this page to reflect the most current information.

A few days later, when I returned to complete my edits, I noticed that someone else had come in and begun my work for me! A sentence indicating the inactive status of the project was tacked on to the end of the article. I still made some edits, cleaned up the page, and made sure that things were up-to-date. However, during this time I discovered that ArchivesSpace itself does not have an article yet. That’ll be a task for a later date.

You may be asking yourself: what is the point of this story? Well, if you see an archives-related article that needs work, edit it! I did a bit of writing on a small article and discovered a much larger task that I will tackle in the upcoming weeks.

What wiki-event is happening at SAA next year?? I’m there!


Upcoming Conference- Society of American Archivists

This week I will be attending and presenting at the joint annual meeting of the Council of State Archivists (CoSA), the National Association of Government Archivists and Records Administrators (NAGARA), and the Society of American Archivists (SAA). My presentation is on Saturday; the full conference schedule can be found here.

I am excited for this conference as I hope to have the opportunity to meet some of the state archivists from whom I have collected data over the past few months for my dissertation project. It should be a swelteringly good time in our nation’s capital!


Conference Next Week- Archival Education and Research Institute

Next week I’ll be heading to Pittsburgh for the Archival Education and Research Institute, hosted by the University of Pittsburgh. This will be my fourth year attending and I’m excited to share my research and see what others have been up to. I will be presenting preliminary results from my dissertation study.

The conference website is here and the schedule can be found here. I’m presenting on Monday afternoon so if you’re going to be at AERI come check out my session!


Upcoming Conference- International Conference on Digital Government Research

Just a quick note that I’ll be heading to Aguascalientes, Mexico for the 15th Annual International Conference on Digital Government Research this week. I am participating in the doctoral colloquium and presenting a poster at the poster session on Thursday (6/19).

The conference website and full schedule are here. If you’re at the conference, stop by and say hi!


Excellent Coverage on Wikipedia and Cultural Institutions

Let me start by saying that I love Wikipedia. I’m not just a consumer of information from the online encyclopedia but also an editor, having made my first contribution back in 2006. While I have not been a consistent Wikipedian throughout the years, I make an effort to edit regularly these days and maintain a deep belief in the importance of this website on today’s internet. In a world of corporate web systems and services, Wikipedia is a refreshing organization in which people come together in the service of creating new knowledge and increasing human understanding of complex topics. For me, it represents a possibly-naive ideal that if everyone works together on this project, in the end knowledge will meaningfully increase and contributors will learn something about each other and the process of creating a global resource for learning and enjoyment.

All of this is not to say that Wikipedia is without flaws. Perhaps chief among these is a deep gender bias and an under-representation of female editors as well as topics on prominent women across the encyclopedia. A brief and admittedly superficial comparison of the article length of Halo: First Strike, a novel based on the popular video game series, and Flight Behavior, a novel by Pulitzer-nominated author Barbara Kingsolver, demonstrates the results of the gender gap articulated in recent coverage of Wikipedia editors (e.g. this NYTimes article). The large number of male editors of Wikipedia articles has resulted in increased attention to male-centered topics such as video games. This leaves articles on novels by famous female novelists to languish as stubs, wiki-speak for articles which are too short to be of much value on the encyclopedia even though they cover notable or important topics (for more on stubs, see here). This is a disappointing trend as I would like to see more equal coverage of women in Wikipedia articles and would encourage more women to edit the encyclopedia and have a hand in its direction.

Which brings me to an article in today’s New York Times that I found refreshing. Noam Cohen gives a great description of what a Wikipedian-in-residence does, and highlights how edit-a-thons focused on women scientists, authors, and academics are attempting to address the gender gap issue through engagement with existing library, archival, and museum resources. It is always good to see coverage of Wikipedia in the national media that moves beyond the “can we trust Wikipedia??” baseline. The activities described in the article are positive developments as I see things and can only help improve the overall quality and usefulness of the encyclopedia over time. I’m all for long, detailed articles about Halo novels, but also think that Wikipedia should be a place where often overlooked but demonstrably important people can be included. All while adhering to proper Wikipedia formatting, citation guidelines, and style of course…