27
Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

Embed Size (px)

Citation preview

Page 1: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

Gathering Statistics at FCLA

ICOLC – PhiladelphiaMarch 28, 2006

Michele Newberry

Or ……

Page 2: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 2

Statistics Hell!!!

Page 3: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 3

ABC-CLIO Logon to the Stats interface at: http://serials.abc-clio.com/reports/ ID: <xxx>@ufl.edu PW: <nnnn> http://serials.abc-clio.com/reports/

start&_formname=loginfo&appname=reports&loginname=<xxx>@ufl.edu&password=<nnnn>&forgotten_password=0

Choose “Select All” for Institutions Choose a reporting period (one or multiple months) Choose an Output Type (I choose “Excel Friendly HTML”) Click “Run Report” http://serials.abc-clio.com/reports/go/ABC-Clio-Serials-Reports&_appname=

reports&_operation=DoReport&addlcid=I00271&addlcid=I00267&addlcid=I00268&addlcid=I00269&addlcid=I00270&addlcid=I00637&addlcid=I00725&addlcid=I00652&addlcid=I00697&addlcid=I00764&startmonth=20051001&stopmonth=20051101&outputtype=Coutputtype=C for CSV, E for Excel friendly html, H for HTMLstart,stop month in YYYYMMDD

Save Page As HTML file – “FCLAReportMMYY.html”, i.e. – FCLAReport0905.html.

Upload to FCLA website and transcribe total annual searches into the master spreadsheet.

Page 4: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 4

BePress BePress stats are VERY complicated to gather. Go to the admin

URL: http://www.bepress.com/cgi/myaccount.cgi Login with the user ID and Password for each school. Copy the URL for each school’s report and paste into the browser to

get an “Open” or “Save As…” popup window. Open the file in Excel, and make the following modifications:

Add a line above line one that says: BePress Usage Report (Arial, 14 point) Bold & Italicize & color purple the next line (Full-Text Downloads….) Delete the usage data for each title up to January of the current year (these

reports keep data from the creation of any given university’s account), and then move the data for the current year to the left so that the January data is in Column B.

Resize Column A so that the full title of each Journal is viewable Merge & Center these two header lines over the width of the report

Once finished, “Save As Web Page” naming by school & year, i.e. “famu_2005.html”

Repeat for each school. Upload to FCLA website and transcribe total annual searches into

the master spreadsheet.

Page 5: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 5

CSA Login to CSA Illumina Usage at:

http://mars3.csa.com/usage/ou_login.aspx Login with ID: <xxx> and PW: <nnnn> Choose either Live Reporting or Emailed Reports

(depending on how current the data needs to be (explained on the site) – normally choose Live for current data.

Under “Consortium Reports”, select a date range (one or multiple months), and run the report.

“Save As…” HTML, naming the file csa_fcla_MMYY.html, i.e., csa_fcla_0905.html

Upload to FCLA website and transcribe total annual searches into the master spreadsheet.

Page 6: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 6

EBSCO Login to EBSCO Admin at: http://eadmin.epnet.com/eadmin/login.aspx ID: <xxxx> PW: <nnnn> Click on the “Reports & Statistics” tab, then configure the report. Normally choose:

By Database, Consortium: ALL Level: Site Date range: one or multiple months Include: All Sites Fields to show: Sessions, Searches, Total Full Text Requests (any other fields are fine as well, but those are the required fields)

Then either Show, E-mail, or Schedule this report to be run. “Save As…” HTML, naming the file ebscoYYYY_MM.html,

i.e., ebsco2005_08.html. Upload to FCLA website and transcribe total annual searches into the

master spreadsheet.

Page 7: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 7

Gale / IAC InfoTrac Login to Gale InfoTrac Config at: http://infotrac.galegroup.com/itconfig/fcla_000 ID: <xxxx> PW: <nnnn> Click on Reports in the navigation bar. Under “Consortium”, select E-mail Gale or COUNTER reports, or setup a monthly

report. Normally we choose the Gale report, as the COUNTER report does not display stats in the searches by university & month style that FCLA prefers.

So choose Gale report, and select a date range. Under Gale Standard Use Reports, check “Usage Summary”, “Usage by

Database”, “Library Location”. Choose Format=“Comma Separated Values”; Compression=“None” and

Attachment= “Yes”. Recipient: enter E-mail address for the report and click “Get Report” to get it sent

via E-mail. Once received, format it to resemble the existing reports at: http://

www.fcla.edu/FCLAinfo/stats/iac/iac.html . Saved as HTML with a filename of gale_MM_YYYY.html, i.e., gale_09_2005.html. Upload to FCLA website and transcribe total annual searches into the

master spreadsheet.

Page 8: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 8

LexisNexis – Academic, Congressional & Statistical

LexisNexis stats can be gathered by accessing their website at: http://www3.lexisnexis.com/aur/signon.html

ID: <XXXX> PW: <NNNN> Once there, you can view HTML or download CSV reports. HTML

reports cannot be downloaded. FCLA worked with LexisNexis to get an FTP account through which we download the HTML reports.

FTP Setup Info: FTP Host - ftp.lexisnexis.com Login – <XXXX> Password – <NNNN>

Download all of the new reports for Academic, Congressional, Statistical, and the Rollup reports

Rename the files as institutionYYMM.html, i.e., famu0508.html . Upload to the FCLA website for each product and transcribe total

annual searches into master spreadsheet.

Page 9: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 9

ProQuest Login to ProQuest Local Admin at: http://lad.proquest.com/ladweb ID: <XXXX> PW: <NNNN> Click on the tab (or link) for Select Report Type:

Database Activity – Detail Delivery Method: Download or Email now Show items with zero usage: Yes Include sub-accounts in this report: Yes

Select a date range for the usage period (one or multiple months) Click Create Report. Save as HTML as pqYYMM.html, i.e., pq0905.html. Upload to FCLA website and transcribe total annual searches into the

master spreadsheet.NOTE: There is also a cumulative report for ProQuest Digital

Dissertations usage that is emailed once a month in HTML format and uploaded upon arrival as pqdd_stats.html.

Page 10: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 10

RLG Login to RLG Stats at: http://reports.rlg.org Invoice Account Code (IAC): <XXXX> Access via two reports:

6. Union Cat and Citation Files – Searches for Month 7. Other Info Resources – Search Activity for Month

Aggregate stats by institution manually due to the shared IAC.

Do not post on the FCLA website. Transcribe total annual searches into master

spreadsheet.

Page 11: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 11

Standard & Poor’s Login to S&P NetAdvantage at:

http://www.netadvantage.standardandpoors.com/NASApp/NetAdvantage/usage/Usage.do

ID: <XXXX> PW: <NNNN> S&P reports are only available for single month/single institution. 10 reports must be downloaded per month. If multiple institutions are selected, the report gives an aggregated

total, rather than a breakdown by site, which FCLA needs. Select Month, Year and Institution, then click “Show Report”. Click “Printer Friendly” to generate a new window with the full report,

then “Save Page As” HTML, as s&p_institution_MM_YYYY.html, i.e., s&p_famu_09_2005.html

Repeat for all 10 reports. Once all reports have been saved, all hyperlinks must all be removed

as they point to resources are not be available from the posted page. Upload to FCLA website and transcribe total annual searches into

master spreadsheet.

Page 12: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 12

ValueLine

ValueLine statistics are emailed to FCLA directly from the vendor rep.

Excel format. Save as a valueline_MM_YYYY.html,

i.e., valueline_09_2005.html Upload to the FCLA website and transcribe total

annual searches into master spreadsheet.

Page 13: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 13

Wilson Login to WilsonWeb at: http://www.hwwstats.com/ng/ Account Number: <NNNN> (then click Login) Password: <XXXX> (then click Continue) Click :Database Usage”

(COUNTER reports are available, but the reports under Database Usage are more appropriate to the kind of data that FCLA gathers monthly)

Select “Bill To Account” (Ship To Account will generate ZERO usage) Account: ALL (or you can run individual school reports) Product: ALL Detail Level: Complete Report Choose a date range (one or multiple months) Sort By: Number of Searches, then click “Submit”

Once the report is generated, save as a file (HTML) or email the file. Save the HTML file as wilson_MM_YYYY.html, i.e., wilson_09_2005.html.

Upload to the FCLA website and transcribe total annual searches into master spreadsheet.

Page 14: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 14

ACM ACM provides usage stats only twice per year, after

June 30 and after December 31. Data is provided personally by ACM rep.

<inject humorous story here> Files are delivered in HTML format, by school. Add the following header:

ACM Digital Library Usage Report for (school name) (Date Range of report)

Save the file as HTML (acm_school_year.html – i.e. acm_famu_2005.html).

Upload to FCLA website and transcribe total annual searches into master spreadsheet.

Page 15: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 15

  ABC-CLIO   IAC InfoTrac

  ACM   LexisNexis: Academic | Congressional

Statistical

  BePress   ProQuest

  CSA Usage Statistics   RLG

  Current Contents Connect   Standard & Poors

  EBSCOHost   ValueLine

  FirstSearch PerSearch Blocks   WebLUIS Databases

  FirstSearch   WilsonWeb

  Galenet

http://www.fcla.edu/system/intro.html

Page 16: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 16

EBSCOHost Usage Reports

2006

  January   February  

 

2005

  December 2005     June 2005

  November 2005     May 2005

   October 2005     April 2005

   September 2005     March 2005

   August 2005     February 2005

   July 2005     January 2005 

Page 17: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 17

EBSCOadmin Database Usage ReportJanuary 2006

Site Database Name SessionsTurn-aways Searches

Total Full Text

Image/ Video

Smart

LinkCustom

Link

FLORIDA GULF COAST UNIV CINAHL 763 0 2089 62 0 374 334

FLORIDA GULF COAST UNIV Econlit 435 0 473 12 0 0 1

FLORIDA GULF COAST UNIV Pre-CINAHL 399 0 516 0 0 3 2

FLORIDA GULF COAST UNIVRILM Abstracts of Music Literature

428 0 458 0 0 0 0

FLORIDA INTL UNIV CINAHL 44 0 204 0 0 9 18

FLORIDA INTL UNIV Econlit 91 0 340 2 0 6 117

FLORIDA INTL UNIV Pre-CINAHL 48 0 313 0 0 1 13

FLORIDA INTL UNIVRILM Abstracts of Music Literature

22 0 112 0 0 0 20

FLORIDA STATE UNIV CINAHL 267 0 908 46 0 0 343

FLORIDA STATE UNIV Econlit 100 0 378 5 0 0 32

FLORIDA STATE UNIV Pre-CINAHL 222 0 787 0 0 0 79

FLORIDA STATE UNIVRILM Abstracts of Music Literature

205 0 620 0 0 0 74

Page 18: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 18

Searches/Database/Institution FAMU FAU FGCU FIU FSU UCF UF UNF USF UWF

TOTAL

ABCAmerHistory&Life 5383 204 1634 705 1976 2599 2729 1611 3683 967 21491

ABC HistAbstracts 1966 122 747 83 1438 1116 1524 1169 1472 790 10427

ACM ACM Dig Coll 539 2822 181 2305 7391 8625 4967 2333 3753 743 33659

APA PsycInfo 2945 28188 2628 141342 149735 151612 154734 47226 186262 19757 854429

BEP BEPress 45 204 17 346 317 532 401 123 260 31 2276

Bowkr BIP 684 17840 5444 6509 21100 10336 13094 8368 17171 4443 104989

Bowkr Ulrichs 639 2907 2647 2641 7413 9123 5900 3297 20762 4375 59704

BowkrBowker

TOTAL 1323 20747 8091 9150 28513 19459 18994 11665 37933 8818 164693

CSA CSA Package 22057 264347 91046 815871 1371814 262353 583005 225981 423753 212520 4272747

CSA ATLA Religion 817 6403 1190 14468 36013 4330 11681 3695 10087 1424 90108

The “SPREADSHEET”

Page 19: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 19

Automated COUNTER Harvester Assumptions:

Data can be retrieved either via ftp or via html. If via ftp, only the address is needed. If via html, one or more pages may need to be traversed.

Design: Each COUNTER source has a harvesting script

associated with it. Each script is maintained in a separate file.

There is a COUNTER database that holds the retrieved data. It also holds COUNTER source information.

Page 20: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 20

ID SOURCE SCRIPT LAST_ACCESS

STATUS PERIOD

Table key The name of the COUNTER source. This is the prime key of the table

Fully qualified path to the script to be used to retrieve data from the source

Date and time stamp of completion of the last harvesting of this source.

Active – currently being harvestedError – last harvest failedWait – last harvest succeeded, next

harvest not yet initiated.

Frequency of harvest, in days.

SOURCE Table:

RESOURCES Table:

ID NAME TYPE PRINT_ISSN

ONLINE_ISSN

PUBLISHER PLATFORM

Table key The name of the resource, such as journal title, database name, service name

Journal, Database, Service

ISSN of print version

ISSN of online version

publisher ?

STATISTICS Table:

SOURCE ID START_DATE END_DATE TYPE COUNT

ID of source from SOURCE table

Start date of reporting period, as yyyy-mm-dd

End date of reporting period, as yyyy-mm-dd

ft_pdf – pdfft_html - html

Number of requests.

COUNTER Harvester Preliminary Design

Page 21: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 21

COUNTER Harvester Preliminary Design Scripts

Are XML documents that specify a set of actions to be performed and the information necessary to perform the action.

Comments are formatted as XML, i.e., <!-- ...-->.

Special characters must be escaped.

Page 22: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 22

Example: Gale<CounterHarvesterScript>

<debug/> <navigate>http://infotrac.galegroup.com/itconfig/fcla_000</navigate>

<pause> 2 </pause><navigate>http://infotrac.galegroup.com/itconfig/fcla_000?id=fclareports&amp;pass=fclareports</navigate><pause> 2 </pause><navigate>http://web6.infotrac.galegroup.com/infotrac_config/session/191/591/76110636w6/pg=ir!166&amp;ui3=CONSORT&amp;ui4=fcla&amp;ul=3&amp;un=0&amp;uo=9&amp;uiy=Y1&amp;uin=1&amp;uif=CSV&amp;uic=None&amp;uil=YES&amp;[email protected]&amp;uz=++Get+Report++</navigate><alert> Statisitcs will be sent by email to [email protected]. </alert>

</CounterHarvesterScript>

Page 23: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 23

Problems encountered COUNTER data is treated as secure data, so

creation of a web-walking robot is problematic.

Session data must be extracted at some point in the session and then inserted into URLs to be sent later in the session.

Reporting periods are defined in columns, one column per period. The column headers must be walked, looking for Total YTD after the monthly data column(s).

Page 24: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 24

COUNTER compliance is based on an Excel worksheet geared for human consumption, not computer processing. The various reports don’t start reporting periods in the same column: In Journal Report 1, periods start in column F In Journal Report 2, periods start in column G

Institution name may be in a single cell row or a column

Some csv files have information that must be skipped: Headings at the top of the sheet Subtotal and total lines interspersed and/or at the bottom of

the sheet

Page 25: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 25

Counter Harvester Conclusion COUNTER compliance is claimed by many but

delivered by few.

In fact, as of this writing, we haven’t found a single csv that fully conforms.

Many are very different from the standard format.

Others, like EBSCO, are very close but since they are not exact, are still not ameniable to machine processing.

Page 26: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 26

Hope for the future - SUSHI The SUSHI schema needs more specificity,e.g.:

the format of dates is not specified in the schema but different date formats may cause different servers to reject the request or, worse still, to fail.

Integrating SUSHI and csv-based data: In SUSHI, the reporting period is generalized with a

start/end date. In the Excel-based standard, the column headings are of the form mm-yyyy. To place both into a common database requires normalization.

Page 27: Gathering Statistics at FCLA ICOLC – Philadelphia March 28, 2006 Michele Newberry Or ……

March 28, 2006 ICOLC, Philadelphia, Newberry 27

Tim J Stats- SUSHI.ppt