Upload
booksai-rd-group
View
122
Download
0
Embed Size (px)
Citation preview
booksai
Artificial Intelligence for Book Publishers
Predicting the Success
Every publisher gets a pile of manuscripts everyday they need to read and choose the right one. For now, rather than going through thousands of unsolicited work, editors
can rely on AI to select work that would at least interest them.
Dear Sir or Madam, will you read my book? It took me years to write, will you take a look?
"The Beatles"
Analyze Differently
It is quite possible to digitize all the books in the world.
It is so easy to apply all the existing NLP and semantic techniques to these books and extract all the facts, concepts and events “scene by scene”.
Unfortunately, the only thing one can achieve this way is to make all these books
slightly searchable.
But this is absolutely not enough, because ...
Words Don't Matter
when it comes to analyzing a literary work...surprisingly
what matter is the author's personality, style, tone and attitude
Style is a very simple matter; it is all rhythm. Once you get that, you can't use the wrong words.
Now this is very profound, what rhythm is, and goes far deeper than any words. A sight, an emotion, creates this wave in the mind, long before it makes words to fit it.
Virginia Woolf on Writing and Consciousness
Every author has own style that makes his or her writing recognizable.
When you read several books by the same author, you become accustomed to the author’s style of writing and sometimes you look for authors with a
similar style.
It is rather often that a tone, mood, intonation and a lots of other hidden elements of style happen to be much more important than the genre or even
the plot.
Words Don't Matter
It doesn't matter precisely what words the story is told in - all we have are transcriptions of oral renderings, which themselves may differ greatly from one storyteller to another, even when telling the same story.
Words Don't Matter
Traditional linguistic techniques are helpless when it comes to the analysis of
a literary work in its beauty and integrity.
They do not understand the "spirit of a book", but we as human beings love books exactly for their spirit.
There are lots of novels on entrepreneurship.But “The Financier” by Theodore Dreiser is unique
Theodore Dreiser is NOT a “semantic field”or the set of keywords. He Was a Genius
There are lots of books about magic, but only one “Harry Porter”.
There are lots of fantasy books, but only one "The Lord of the Rings"
Theory & PracticeBeyond The Semantic Fields
Theory
At booksai, we take a more holistic and neuroscience-inspired approach to teaching machines how to understand books, that is not based on traditional semantic technologies.
On theoretical levelSome of our ideas inspired by V. Nalimov's probabilistic concept of the semantic fields. (In The Labyrinths of Language: A Mathematician's Journey). (Russian philosopher and mathematician - 4 November 1910 - 19 January 1997).
Theory
Continuity of consciousness vs discreteness of language
Nalimov stresses the continuous nature of consciousness, with which a person is always in contact, but which cannot be reduced to the discreteness of language (except partially through rhythmical texts). Phrases constructed over discrete symbol-words are always interpreted at the continuous level.
"The continuous nature of everyday language finds its expression in the limitless divisibility of the verbal meanings, while the continuous nature of the morphology of the animate world is expressed by the impossibility of constructing a discrete taxonomy".
V. Nalimov
Theory
...rhythm is something much more significant; rhythm probably means the dissolving of word meanings, their merging into a continuous, inwardly indissoluble stream of images. In other words, rhythm provides an opportunity for non-Bayesian reading of the texts...
V. Nalimov(Russian philosopher and mathematician)
Now this is very profound, what rhythm is, and goes far deeper than any words. A sight, an emotion, creates this wave in the mind, long before it makes words to fit it.
Virginia Woolf on Writing and Consciousness
Any creative activity is based on rhythm
...creates this wave in the mind, long before it makes words to fit it.
We are trying to detect this wave.
Practice
1. We have made some innovations in the field of machine learning (since traditional techniques are not effective enough to meet our needs).
2. We apply these adapted algorithms to different levels of semantics that really exist.
3. We analyze different level of semantic - melody of the speech, semantic of the rhythm and other nonverbal language elements.
Practice
Unlike traditional methods of the semantic analysis, our algorithm does not analyze the words or syntax.
This is the tool that can extract the latent signals and motivations behind a particular story...
and create “hologram-like”, literary ‘fingerprint’, containing lots of elements of author's text which hard to define, but vital for understanding the book essence.
Core Technology
- Self-learning
- Language-independent
- High speed of data processing
How It Works
It then generates a report comparing the story to other titles and giving you a breakdown of how much a manuscript compare to the top several similar writers and titles (10 000 bestsellers).
What you will get
GSES (Genre Specific Elements of Style)
This is not a genre as we know it, but aspects of style, tone and mood usually associated with particular genres.
- Imagine a futuristic sci-fi saga written with elements of Buddhist philosophy (in terms of mood and style).
- Imagine a Self-help title written with great sense of humor.
- Imagine an autobiography written as a YA fiction:)
What you will get
ASES (Author Specific Elements of Style)
Similar authors in different genres.You will have an ability to bring to the surface a modern but not known to anyone author, who, probably, writes like...
What you will get
List of most similar titles (based on our database of 10 000 bestselling books or 100 000 new titles).
It's just what you need, and nothing you don't.
Marketing ideas
Needless to say, it can lead to some interesting marketing ideas ...
“In addition to author comparisons, booksai also compared my book, Extreme Unction, to specific books by Sue Grafton and Agatha Christie, which is more helpful than what I got from Helix, who compared my writing to Parnell Hall. I can market my story to fans of Sue Grafton and Agatha Christie. Parnell Hall, not so much. The fan bases would be too distinct.
(Lupa Schwartz -Writer)
“I pasted a selection from my new ‘The Whiskey Bottle in the Wall’ and got 2 books by Donald Harrington. I was reading Donald Harrington
books while I was working on my book.”
Kathleen Valentine -Mass-based self-published author
so, you have got an idea
Tested and Proven
- 10 000 best-selling English titles- 50 000 new English titles (2013-2015)- 6000 self-published English (titles)
as well as
Spanish, German and Russian titles.
About Us:
We are an internationally distributed R&D team of computer scientists, developers and linguists who care about future of book publishing and discovery.
We believe that our breakthrough technology will benefit the entire ecosystem of book publishing.
Contact: [email protected]
booksai
Forbes: How Do You Discover New Books?
Project blog: http://booksai.blogspot.com/
Press