ASA Data Project Data Reports Cover LetterTo: Lynn Chmelir, Chair, Collection Development and Management Committee From: Nancy Nathanson April 6, 2005 Article Supplier Analysis Project: data reports We have received 17,668 entries for the Article Supplier Analysis study. All but one member institution (COCC) submitted a report, ranging from 34 entries (MHCC) to 2,201 (OSU). Each report was reviewed and corrected as necessary so that it could be used in a single file of combined data from all members. In most cases, we consulted with the submitting library to get corrected information or agree on how we would “fix” the data. The number of individual entries in the study is very large, and the data problems shouldn’t effect the outcome in a significant way. I have included some notes about the nature of the data in case it might influence how you interpret or factor a confidence level for the results Data from each report was loaded to a single Access table. Depending on the information desired, results were prepared from Access queries, or from data exported back to a single Excel file for other filtering and calculations. Notes about the data fieldsTitleDue to the varying data entry practices, some titles will be non-matchable (abbreviations; periods vs. no periods at end of title; etc.) A report specific to analyzing titles is not prepared at this time. There are only 857 records with a title and no ISSN. Of those, 710 are “unique” titles (in other words, some of the articles received were for the same title). There are probably a few more matches that we could find with additional review; this would reduce the number of unique titles by a little bit. ISSNA few entries might be monographs (suspicious ISBN’s; investigated and discarded a few, haven’t yet reviewed about a dozen more.). Some bad data was corrected (for example, first character omitted, bracket at end of field); 7 entries lacked both title and ISSN. Differing formats so that ISSN’s were not matching; many lack hyphen. We corrected some here, and asked for some to be re-submitted. We haven’t fixed them all. 16,804 entries with ISSN. 7,312 unique ISSN articles received with same ISSN or title: more than 1 Rec’d: 3,095 unique ISSN(18% of all received) more than 4 Rec’d: 751 unique ISSN
FormatSome entries in this field contained characters other than e, m, or p (commas, blanks, periods, other alpha and numeric characters). Most were corrected, only a few with extraneous data had to be deleted.
YearSome Year fields contained numeric data that had been reformatted and could not readily be interpreted as a “year”; sometimes we were able to guess at the year intended; also sometimes contained a span of years (e.g. 1973-74) or specific dates or months. No year field in 163 reported entries. Ultimately we had to eliminate the year field data in a couple dozen odd entries, and a few dozen others were changed by library staff and re-submitted. Bib recordsThe data in this field is not uniformly reliable for several reasons: lack of understanding about concept of bibliographic records and about catalog display screens, especially difference between Index browse and Title browse screens where number of entries is displayed; uncertainty about counting records owned only by CRL, etc. Borrowed FromBorrowed From institutions (Suppliers): instances of bad data were corrected where possible, such as “11” instead of “1”, “v” instead of “u”, etc.; and a small number (fewer than 10) were deleted in the last day. The attached reportsI have prepared several reports in a single Excel workbook. The source of data may have been Excel or an Access query.
[1] (Note: this is for all articles in the study. Limiting to only those titles received from outside the Alliance would require an additional report.) |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||