What is fair use of seven terabytes? Paul Fyfe


1. The history of a problem


h-API-ness


* Please see your own copyright librarian or library counsel before engaging with this or any other strenuous scholarly communications program.


2. Big data, little access


• restricted or licensed use

• conventional fair use

• non-expressive or non-consumptive use

• transformative use


Fair Use

• Copyright Act of 1976
• Statutory factors in determining “fair use”
  – purpose and character
  – what are you using
  – “amount and substantiality”
  – market harm

“Fair Use FAQ.” Copyright and Digital Scholarship Center, NCSU Libraries. Accessed 2015. Web. http://www.lib.ncsu.edu/cdsc/resources/faqs/fairuse


“the digitization of books for text-mining purposes is […] to be regarded as fair use as long as the end product is also nonexpressive or otherwise non-infringing” Jockers, Matthew L., Matthew Sag, and Jason Schultz. Brief of Digital Humanities and Law Scholars as Amici Curiae in Authors Guild v. Google. Rochester, NY: Social Science Research Network, 2012. papers.ssrn.com. Web. 25 Feb. 2015. http://papers.ssrn.com/abstract=2102542

“library digitization for the purpose of text mining and similar non-expressive uses present no legally cognizable conflict” Jockers, Matthew L., Matthew Sag, and Jason Schultz. Brief of Digital Humanities and Law Scholars as Amici Curiae in Authors Guild v. Hathitrust. Rochester, NY: Social Science Research Network, 2013. papers.ssrn.com. Web. http://papers.ssrn.com/abstract=2274832


“non-consumptive research paradigm”

HathiTrust Research Center http://www.hathitrust.org/htrc

“Non-consumptive research is defined in the settlement as: ‘ …research in which computational analysis is performed on one or more Books, but not research in which a researcher reads or displays substantial portions of a Book to understand the intellectual content presented within the Book’” Unsworth, John. “Computational Work with Very Large Text Collections.” Journal of the Text Encoding Initiative Issue 1 (2011): n. pag. jtei.revues.org. Web. http://jtei.revues.org/215


“Today’s digital-minded literary scholar is shackled in time; we are all, or are all soon to become, nineteenth centuryists.” Jockers, Matthew. Macroanalysis: Methods for Digital Literary History. University of Illinois Press, 2013. 173.


Standardization of English and American orthography, circa 1800

Moving wall of public domain / copyrighted materials, 1923

LONG NINETEENTH CENTURY FTW


Public domain is not the only problem.


“The solution to the problem of heterogeneous access to licensed material is not scalable”

Unsworth, John. “Computational Work with Very Large Text Collections.” Journal of the Text Encoding Initiative Issue 1 (2011): n. pag. jtei.revues.org. Web. http://jtei.revues.org/215


Fair use of seven terabytes

VARIETIES. Time is like a creditor, who allows an ample space to make up accounts, but is inexorable at last.—Time is like a verb that can only be used in the present tense.—Time well employed, gives that health and vigour to the soul which rest and retirement afford to the body.—Time never sits heavily on us, but when it is badly employed.—Time is a grateful friend; use it well, and it never fails to make a suitable requital.

Berrow’s Worcester Journal. 3 January 1822, p4.


• does the use affect the provider’s ability to commercialize, or substitute for the original

• do they actually care / does it really make an impact (“de minimis”)


“Campbell [v. Acuff-Rose Music]’s most enduring contribution to fair use jurisprudence has been its emphatic embrace of the ‘transformative use’ paradigm”

Butler, Brandon. “Transformative Teaching and Educational Fair Use after Georgia State.” Rochester, NY: Social Science Research Network, 2015. papers.ssrn.com. Web. (forthcoming in Connecticut Law Review) http://papers.ssrn.com/abstract=2568936


• “courts have shown deference to uses successfully characterized as ‘transformative’”

BUT

• “education, has been mired for years in a minimalist, market-based vision of fair use that is largely out of touch with mainstream fair use jurisprudence”

Butler, “Transformative Teaching and Educational Fair Use,” 2, 1


1. scrape data and discard junk
2. parse spaces, hyphens, breaks
3. export to MongoDB
4. run correction protocols
5. score automatically and with human error checking
6. export into JSON, CSV, XML, or formats ready for queries or visualization
7. begin research
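
A rough sketch of what steps 2, 5, and 6 of this workflow might look like in Python, using only the standard library. Everything named here is an assumption for illustration and not the project's actual code: the function names, the tiny `WORDLIST` (standing in for a real dictionary), and the scoring rule (share of tokens found in that list). The MongoDB export and the correction protocols are left out.

```python
import csv
import io
import json
import re

# Hypothetical stand-in for a real dictionary used in scoring.
WORDLIST = {"time", "is", "like", "a", "creditor", "who",
            "allows", "an", "ample", "space"}


def normalize(raw: str) -> str:
    """Step 2: rejoin words hyphenated across line breaks, collapse whitespace."""
    joined = re.sub(r"(\w)-\s*\n\s*(\w)", r"\1\2", raw)
    return re.sub(r"\s+", " ", joined).strip()


def score(text: str, wordlist=WORDLIST) -> float:
    """Step 5 (automatic part): fraction of alphabetic tokens in the wordlist."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    return sum(t in wordlist for t in tokens) / len(tokens)


def to_record(doc_id: str, raw: str) -> dict:
    """Bundle the cleaned text and its score into one exportable record."""
    text = normalize(raw)
    return {"id": doc_id, "text": text, "score": round(score(text), 3)}


def to_json(records) -> str:
    """Step 6: serialize records as JSON."""
    return json.dumps(records, ensure_ascii=False, indent=2)


def to_csv(records) -> str:
    """Step 6: serialize records as CSV with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["id", "text", "score"])
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


if __name__ == "__main__":
    raw = "Time is like a cred-\nitor, who   allows an\nample space"
    records = [to_record("sample-1", raw)]
    print(to_json(records))
    print(to_csv(records))
```

Note that the hyphen rule is deliberately naive: it would also fuse a genuine compound such as "well-employed" if it happened to break across a line, which is one reason the human error checking in step 5 still matters.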


Manovich, Lev. “On Broadway - a new interactive urban data visualization from Selfiecity team.” Software Studies Initiative, MIT. March 2, 2015. Web. http://lab.softwarestudies.com/2015/03/on-broadway-new-interactive-urban-data.html


Manovich, Lev, et al. “Imageplots.” Selfiecity, 2015. Web. http://selfiecity.net/


“Let’s turn our non-consumptive use of digitized works into expressive use of digitized works.” Sample, Mark. “The Poetics of Non-Consumptive Reading.” SAMPLE REALITY. N.p., 22 May 2013. Web. http://www.samplereality.com/2013/05/22/the-poetics-of-non-consumptive-reading/


fair use is “an analytical tool that focuses on social and cultural patterns.”

Michael Madison, “A Pattern-Oriented Approach to Fair Use,” William & Mary Law Review 45 (2004)